[Paper Review] VLM to LMM

1. VLM (Vision-Language Model) to LMM (Large Multimodal Model)

2. [2021 CVPR] (Simple Review) VirTex: Learning Visual Representations from Textual Annotations

3. [2022 MLHC] [Simple Review] (ConVIRT) Contrastive Learning of Medical Visual Representations from Paired Images and Text

5. How Did CLIP Simplify ConVIRT?

6. [2021 ICML] [simple review] (ALIGN) Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

7. [2022 ICML] (Simple Review) BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

8. [2022 TMLR] (simple review) CoCa: Contrastive Captioners are Image-Text Foundation Models

9. [2022 NeurIPS] (simple review) 🦩Flamingo: a Visual Language Model for Few-Shot Learning
