Multimodal Deep Learning

1. [Related Work] VQA - Papers with Code

2. [Related Work] Object detection using text

3. [Related Work] State-of-the-Art model in VQA

4. [Paper Review] EXplainable AI (XAI) approach to image captioning

5. [Paper Review] MDETR - Modulated Detection for End-to-End Multi-Modal Understanding

7. [Paper Review] From Show to Tell: A Survey on Deep Learning-based Image Captioning (1) (Focusing on the Visual Encoder Part)

8. [Paper Review] From Show to Tell: A Survey on Deep Learning-based Image Captioning (2) (Focusing on the Language Model)

9. [Paper Review] X-Linear Attention Networks for Image Captioning
