[KOSMOS-2.5: A Multimodal Literate Model Tengchao Lv∗, Yupan Huang∗, Jingye Chen∗, Lei Cui∗ †, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, L
Overview of Multimodal LLMs. Contents: Evaluation( - Paper summary( - Instruction Tuning(https://github.com/BradyFU/Awesome-Multimodal-Large-Languag
[A Survey on Multimodal Large Language Models Shukang Yin1∗, Chaoyou Fu2∗‡†, Sirui Zhao1∗‡, Ke Li2, Xing Sun2, Tong Xu1, Enhong Chen School of CST.,
[Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Q