[video] foundation model

1. [22, 12/317] InternVideo: General Video Foundation Models via Generative and Discriminative Learning

2. Awesome video GitHub repo

3. [24,3][110] InternVideo2: Scaling Foundation Models for Multimodal Video Understanding

4. [21][2600] ViViT: A Video Vision Transformer

5. [25,1] VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

6. Purpose and plan for running VideoLLaMA 3

7. VideoLLaMA 3 README

8. [VideoLLaMA 3] action recognition

9. [VideoLLaMA 3] processor

10. [VideoLLaMA 3][processor] load_video
