
Sequence to Sequence Learning with Neural Networks(NIPS 2014)

Efficient Estimation of Word Representations in Vector Space 상편

Efficient Estimation of Word Representations in Vector Space 하편

Attention is all you need (NIPS 2017)

Deep Resiaual Learning for Image Recognition

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding