Papers

1. ViT: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

2. Attention Is All You Need
