action recognition 논문들

FSA·2024년 2월 23일
0

1. InternVideo: General Video Foundation Models via Generative and Discriminative Learning


2. Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning


3. Unmasked Teacher: Towards Training-Efficient Video Foundation Models


4. Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning


5. UNIFORMERV2: SPATIOTEMPORAL LEARNING BY ARMING IMAGE VITS WITH VIDEO UNIFORMER


6. Masked Feature Prediction for Self-Supervised Visual Pre-Training


7. Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning


8. Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles


9. CoCa: Contrastive Captioners are Image-Text Foundation Models


10. Multiview Transformers for Video Recognition


11. MERLOT: Multimodal Neural Script Knowledge Models

profile
모든 의사 결정 과정을 지나칠 정도로 모두 기록하고, 나중에 스스로 피드백 하는 것

0개의 댓글