An introduction to Deep Feed Forward Networks.
A review of the paper "Attention Is All You Need".
A review of the paper "Improving Language Understanding by Generative Pre-Training".
A review of the paper "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding".
A review of the paper "Language Models are Unsupervised Multitask Learners".
An implementation of the paper "An Improved Baseline for Sentence-level Relation Extraction".