Which is a limitation of RNNs compared to Transformers?
Answer options
A
Parallelization and long-range learning
B
Ability to handle small sequences
C
Usefulness on text
D
Being neural networks
Correct answer: Parallelization and long-range learning
Explanation
Transformers process all tokens in parallel using self-attention (unlike sequential RNNs), making them faster to train and better at capturing long-range dependencies.