Which activation is most used to mitigate vanishing gradients?
Answer options
A
Sigmoid
B
Tanh
C
ReLU
D
Linear
Correct answer: ReLU
Explanation
The correct answer is: ReLU.
Correct answer: ReLU
The correct answer is: ReLU.
PrimerDumps has 1400+ primer questions, 2026 mocks and coding hands-on — all free.