논문리뷰

1.[ 논문 리뷰 ] DETR ( DEtection TRansformer )

post-thumbnail

2.[ 논문리뷰 ] Co-DETR ( DETR with Collaborative Hybrid Assignments Training )

post-thumbnail

3.[ 논문 리뷰 ] Attention Is All You Need

post-thumbnail

4.[ 논문 리뷰 ] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

post-thumbnail

5.[ 논문 리뷰 ] SA ( segment Anything )

post-thumbnail

6.[ 논문 리뷰 ] End to End Learning for Self-Driving cars

post-thumbnail

7.[ 논문 리뷰 ] Show and Tell: A Neural Image Caption Generator

post-thumbnail

8.[ 논문 리뷰 ] Show, Attend and Tell : Neural Image caption Generation with Visual Attention

post-thumbnail

9.[ 논문 리뷰 ] ViLBERT : Pretraining Task-Agnostic Visiolinguistic Representations

post-thumbnail

10.[ 논문 리뷰 ] CLIPSeg : Image Segmentation Using Text and Image Prompts

post-thumbnail

11.[ 논문 리뷰 ] DeepPose : Human Pose Estimation Via Deep Neural Networks

post-thumbnail

12.[ 논문 리뷰 ] Stacked Hourglass Networks for Human Pose Estimation

post-thumbnail

13.[ 논문 리뷰 ] Distilling the Knowledge in a Neural Network

post-thumbnail

14.[ 논문 리뷰 ] On the Efficacy of Knowledge Distillation

post-thumbnail

15.[ 논문 리뷰 ] HRNet : Deep High-Resolution Representation Learning for Human Pose Estimation

post-thumbnail

16.[ 논문 리뷰 ] HRFormer : High-Resolution Transformer for Dense Prediction

post-thumbnail

17.[ 논문 리뷰 ] PPT : token-Pruned Pose Transformer for monocular and multi-view human pose estimation

post-thumbnail

18.[ 논문 리뷰 Renewal ] DEtection TRansformer

post-thumbnail

19.[ 논문 리뷰 ] Diffusion Model

post-thumbnail

20.BLIP-2 : Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

post-thumbnail

21.EventVL: Understand Event Streams via Multimodal Large Language Model

post-thumbnail

22.NeRF : Representing Scenes as Neural Radiance Fields for View Synthesis

post-thumbnail

23.[computer vision] SwinTransformer

post-thumbnail

24.[ computer vision ] DINO : Emerging Properties in Self-Supervised Vision Transformers

post-thumbnail