Review] Self-supervised Vision Transformers for Land-cover Segmentation and Classification
1. Motivation
- In this paper, it proposed the method to combine vision transformer architecture and self-supervied learning
2. Method
- Overall structure
![](https://velog.velcdn.com/images/lake_park_0706/post/7f9709ef-a181-4820-b657-1f8b8714408f/image.png)
2.1. Self-supervised learning
2.2. SwinUNet
- This work proposed two separate SwinUNet streams for contrastive learning process as Fig 2.
3. Result
![](https://velog.velcdn.com/images/lake_park_0706/post/e623276d-e562-4ed5-a2ca-1c78f1a895a9/image.png)
Reference