[Paper Review] Vision Transformer for Image Classification]

1.[2020 arXiv][ViT] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

post-thumbnail

2.[2021 ICCV] [Simple Review] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

post-thumbnail

3.[2021 ICCV] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

post-thumbnail