자연어 처리 과정

Quotient rule for derivative of softmax with respect to fk(x)

chanuu 2023. 8. 9. 19:22

Transformer: Scaled dot-product attention (0)	2023.08.13
Transformer: Multi-head attention (0)	2023.08.13
What does "linear in parameters" mean in linear regression? (0)	2023.07.31
Backpropagation (0)	2023.07.31
Regularization (0)	2023.07.26

바보 같지만 괜찮아.