논문

1.Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

post-thumbnail

2.Learning Transferable Visual Models From Natural Language Supervision (CLIP)

post-thumbnail

3.Flamingo: a Visual Language Model for Few-Shot Learning(작성중)

post-thumbnail