'자연어 처리 과정' 카테고리의 다른 글
Autograd explained diagram (0) | 2024.02.14 |
---|---|
The differences between InstructGPT and ChatGPT (0) | 2023.11.01 |
Transformer: Scaled dot-product attention (0) | 2023.08.13 |
Transformer: Multi-head attention (0) | 2023.08.13 |
Quotient rule for derivative of softmax with respect to fk(x) (0) | 2023.08.09 |