Paper Review

1.Going Deeper with Convolutions(InceptionV1) - 논문 구현

post-thumbnail

2.Very Deep Convolutional Networks for Large-Scale Image Recognition(VGGNet) - 논문 구현

post-thumbnail

3.TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering - 논문 정리

post-thumbnail

4.A Joint Sequence Fusion Model for Video Question Answering and Retrieval - 논문 정리

post-thumbnail

5.MERLOT: Multimodal Neural Script Knowledge Models - 논문 정리 (+BERT 정리)

post-thumbnail

6.MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound - 논문 정리

post-thumbnail

7.Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training - 논문 정리

post-thumbnail