Which mechanism allows Transformers to weigh the importance of different words in a sequence?
Answer options
A
LSTM cells
B
CNN layers
C
RNN cells
D
None of the options given
E
Self Attention Mechanism
Correct answer: Self Attention Mechanism
Explanation
The source marks the correct answer as: Self Attention Mechanism.