Multimodal

1.[Multimodal #1] ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks (NeurIPS, 2019)

post-thumbnail

2.[Multimodal #2] Found in Translation: Learning Robust Joint Representations by Cyclic Translations between Modalities (AAAI, 19)

post-thumbnail

3.[Vision] DDIM: Denoising Diffusion Implicit Models (ICLR, 2021)

post-thumbnail