MS-DS Master of Data science Practice Test
MS-DS Master of Data science Deep Learning and Neural Networks 2
What is the role of the attention mechanism in transformer-based deep learning models?
Select your answer
A
Regularize weights
B
Allow the model to weigh the relevance of each input token when producing an output
C
Reduce the number of parameters
D
Replace activation functions
Hint