Paper Review

1.[Paper Review] KOSMOS-2.5: A Multimodal Literate Model

post-thumbnail

2.Multimodal LLM Study

post-thumbnail

3.[Paper Review] A Survey on Multimodal Large Language Models

post-thumbnail

4.[Paper Short Review] Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization (LAVIT)

post-thumbnail