What is the first step in training a Transformer model for a specific task?
Answer options
A
Initialization
B
Pre-training
C
None of the options given
D
Backpropagation
E
Fine-tuning
Correct answer: Pre-training
Explanation
The source marks the correct answer as: Pre-training.