Scaling transformers (more params + data) led to:
Answer options
A
Emergence of strong few/zero-shot capabilities
B
Smaller vocabulary always
C
Removal of attention
D
Instant training
Correct answer: Emergence of strong few/zero-shot capabilities
Explanation
The correct answer is: Emergence of strong few/zero-shot capabilities.