Scheduled sampling mixes:
Answer options
A
Only reinforcement signals
B
Model predictions and ground-truth tokens during training
C
ELBO and adversarial losses
D
Two discriminators
Correct answer: Model predictions and ground-truth tokens during training
Explanation
The correct answer is: Model predictions and ground-truth tokens during training.