논문리뷰

1.목표

post-thumbnail

2.Distinctive Image Features from Scale-Invariant Keypoints

post-thumbnail

3.Histograms of Oriented Gradients for Human Detection

post-thumbnail

4.Photo Tourism: Exploring Photo Collections in 3D

post-thumbnail

5.ImageNet Classification with Deep Convolutional Neural Networks

post-thumbnail

6.Visualizing and Understanding Convolutional Networks

post-thumbnail

7.Going deeper with convolutions

post-thumbnail

8.Very Deep Convolutional Networks for Large-Scale Image Recognition

post-thumbnail

9.Deep Residual Learning for Image Recognition

post-thumbnail

10.Rich feature hierarchies for accurate object detection and semantic segmentation

post-thumbnail

11.Disentangling Visual and Written Concepts in CLIP

post-thumbnail

12.You Only Look Once: Unified, Real-Time Object Detection

post-thumbnail

13.Generative Adversarial Nets

post-thumbnail

14.Network In Network

post-thumbnail

15.Selective Search for Object Recognition

post-thumbnail

16.Squeeze-and-Excitation Networks

post-thumbnail

17.U-Net: Convolutional Networks for Biomedical Image Segmentation

post-thumbnail

18.A Neural Algorithm of Artistic Style

post-thumbnail

19.Efficient Estimation of Word Representations in Vector Space

post-thumbnail

20.Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation

post-thumbnail

21.Sequence to Sequence Learning with Neural Networks

post-thumbnail

22.Attention Is All You Need

post-thumbnail

23.Image-to-Image Translation with Conditional Adversarial Networks

post-thumbnail

24.A Discriminatively Trained, Multiscale, Deformable Part Model

post-thumbnail

25.OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

post-thumbnail

26.Fast R-CNN

post-thumbnail

27.Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

post-thumbnail

28.SSD: Single Shot MultiBox Detector

post-thumbnail

29.Feature Pyramid Networks for Object Detection

post-thumbnail

30.Mask R-CNN

post-thumbnail

31.High-resolution image reconstruction with latent diffusion models from human brain activity

post-thumbnail

32.UNSUPERVISED REPRESENTATION LEARNING WITH DEEP CONVOLUTIONAL GENERATIVE ADVERSARIAL NETWORKS

post-thumbnail

33.InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

post-thumbnail

34.Wasserstein GAN

post-thumbnail

35.Improved Training of Wasserstein GANs

post-thumbnail

36.Least Squares Generative Adversarial Networks

post-thumbnail

37.ENERGY-BASED GENERATIVE ADVERSARIAL NETWORKS

post-thumbnail

38.BEGAN: Boundary Equilibrium Generative Adversarial Networks

post-thumbnail

39.Conditional Generative Adversarial Nets

post-thumbnail

40.Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

post-thumbnail

41.Semantic Image Synthesis with Spatially-Adaptive Normalization

post-thumbnail

42.StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

post-thumbnail

43.Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

post-thumbnail

44.MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

post-thumbnail

45.MobileNetV2: Inverted Residuals and Linear Bottlenecks

post-thumbnail

46.Searching for MobileNetV3

post-thumbnail

47.SPECTRAL NORMALIZATION FOR GENERATIVE ADVERSARIAL NETWORKS

post-thumbnail

48.Self-Attention Generative Adversarial Networks

post-thumbnail

49.LARGE SCALE GAN TRAINING FOR HIGH FIDELITY NATURAL IMAGE SYNTHESIS

post-thumbnail

50.PROGRESSIVE GROWING OF GANS FOR IMPROVED QUALITY, STABILITY, AND VARIATION

post-thumbnail

51.A Style-Based Generator Architecture for Generative Adversarial Networks

post-thumbnail

52.Analyzing and Improving the Image Quality of StyleGAN

post-thumbnail

53.StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

post-thumbnail

54.A Survey of Vision-Language Pre-Trained Models

post-thumbnail

55.A survey on Self Supervised learning approaches for improving Multimodal representation learning

post-thumbnail

56.A survey of multimodal deep generative models

post-thumbnail

57.Multimodal Learning with Transformers: A Survey

post-thumbnail

58.Multimodal Machine Learning: A Survey and Taxonomy

post-thumbnail

59.Self-Supervised Multimodal Learning: A Survey

post-thumbnail

60.EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

post-thumbnail

61.EfficientDet: Scalable and Efficient Object Detection

post-thumbnail

62.Auto-Encoding Variational Bayes

post-thumbnail

63.GloVe: Global Vectors for Word Representation

post-thumbnail

64.Improving Language Understanding by Generative Pre-Training

post-thumbnail

65.Language Models are Unsupervised Multitask Learners

post-thumbnail

66.BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

post-thumbnail

67.Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

post-thumbnail

69.RoBERTa: A Robustly Optimized BERT Pretraining Approach

post-thumbnail

70.STRUCTBERT: INCORPORATING LANGUAGE STRUCTURES INTO PRE-TRAINING FOR DEEP LANGUAGE UNDERSTANDING

post-thumbnail

71.XLNet: Generalized Autoregressive Pretraining for Language Understanding

post-thumbnail

72.ALBERT: A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS

post-thumbnail

73.ELECTRA: PRE-TRAINING TEXT ENCODERS AS DISCRIMINATORS RATHER THAN GENERATORS

post-thumbnail

74.DEBERTA: DECODING-ENHANCED BERT WITH DISENTANGLED ATTENTION

post-thumbnail

75.Reducing the Dimensionality of Data with Neural Networks

post-thumbnail

76.Neural Turing Machines

post-thumbnail

77.Hybrid computing using a neural network with dynamic external memory

post-thumbnail

78.Pixel Recurrent Neural Networks

post-thumbnail

79.Conditional Image Generation with PixelCNN Decoders

post-thumbnail

80.PIXELCNN++: IMPROVING THE PIXELCNN WITH DISCRETIZED LOGISTIC MIXTURE LIKELIHOOD AND OTHER MODIFICATIONS

post-thumbnail

81.VideoBERT: A Joint Model for Video and Language Representation Learning

post-thumbnail

82.AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE

post-thumbnail

83.ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

post-thumbnail

84.FLAVA: A Foundational Language And Vision Alignment Model

post-thumbnail

85.ActBERT: Learning Global-Local Video-Text Representations

post-thumbnail

86.Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

post-thumbnail

87.VISUALBERT: A SIMPLE AND PERFORMANT BASELINE FOR VISION AND LANGUAGE

post-thumbnail

88.UNITER: UNiversal Image-TExt Representation Learning

post-thumbnail

89.LXMERT: Learning Cross-Modality Encoder Representations from Transformers

post-thumbnail

90.VL-BERT: PRE-TRAINING OF GENERIC VISUALLINGUISTIC REPRESENTATIONS

post-thumbnail

91.MDETR - Modulated Detection for End-to-End Multi-Modal Understanding

post-thumbnail

92.End-to-End Object Detection with Transformers

post-thumbnail

93.BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

post-thumbnail

94.Deep contextualized word representations

post-thumbnail

95.Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

post-thumbnail

96.Cross-lingual Language Model Pretraining

post-thumbnail

97.YOLO9000: Better, Faster, Stronger

post-thumbnail

98.YOLOv3: An Incremental Improvement

post-thumbnail

99.YOLOv4: Optimal Speed and Accuracy of Object Detection

post-thumbnail

100.Universal Language Model Fine-tuning for Text Classification

post-thumbnail

101.Focal Loss for Dense Object Detection

post-thumbnail

102.Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

post-thumbnail

103.Fully Convolutional Networks for Semantic Segmentation

post-thumbnail

104.SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS

post-thumbnail

105.Distilling the Knowledge in a Neural Network

post-thumbnail

106.VLP: A Survey on Vision-Language Pre-training

post-thumbnail