논문 리뷰

1.3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection

post-thumbnail

2.Virtual Occlusions Through Implicit Depth [CVPR 2023]

post-thumbnail

3.Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation(RALF)[CVPR 2024]

post-thumbnail

4.SAM (Segment Anything)

post-thumbnail

5.PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation [CVPR 2017]

post-thumbnail

6.Object pop-up: Can we infer 3D objects and their poses from human interactions alone? [CVPR 2023]

post-thumbnail

7.InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions [CVPR 2023 Highlight]

post-thumbnail

8.CAST: Cross-Attention in Space and Time for Video Action Recognition [NeurIPS 2023]

post-thumbnail

9.CLIP: Learning Transferable Visual Models From Natural Lanugage Supervision

post-thumbnail

10.In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation [ECCV 2024]

post-thumbnail

11.DDIM : Denoising Diffusion Implicit Models [ICLR2021]

post-thumbnail

12.Hierarchical Open-vocabulary Universal Image Segmentation [NeurIPS 2023]

post-thumbnail

13.Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs [CVPR 2016]

post-thumbnail

14.BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

post-thumbnail

15.Emergent Visual-Semantic Hierarchies in Image-Text Representations [ECCV 2024(Oral)]

post-thumbnail

16.Recognize Anything: A Strong Image Tagging Model

post-thumbnail