728x90
https://mlexplained.com/2017/12/29/attention-is-all-you-need-explained/
Paper Dissected: “Attention is All You Need” Explained
“Attention is All You Need”, is an influential paper with a catchy title that fundamentally changed the field of machine translation. Previously, RNNs were regarded as the go-to archite…
mlexplained.com
http://jalammar.github.io/illustrated-transformer/
The Illustrated Transformer
Discussions: Hacker News (65 points, 4 comments), Reddit r/MachineLearning (29 points, 3 comments) Translations: Chinese (Simplified), Korean Watch: MIT’s Deep Learning State of the Art lecture referencing this post In the previous post, we looked at Atten
jalammar.github.io
728x90
'Dic' 카테고리의 다른 글
PCA (0) | 2019.10.24 |
---|---|
Spectral Clustering (0) | 2019.10.24 |
Knowledge distillation (0) | 2019.09.27 |
Clustering Evaluation (0) | 2019.09.23 |
F-measure (0) | 2019.09.23 |