YOLOv2

정강민·2022년 1월 1일

Computer-Vision

목록 보기

4/9

YOLO의 목적

Make it better
Do it faster
Makes us stronger

Make it better

Better는 정확도를 올리기 위한 방법

Batch Normalization

High Resolution Classifier

Convolutional with Anchor boxes

Dimension Clusters

Direct location prediction

Fine-Grained Features

Multi-Scale Training를 사용

Makes us stronger

stronger는 더 많은 범위의 class를 예측하기 위한 방법

Hierarchical classification

Dataset combination with WordTree

Joint classification and detection를 사용

[Deeplearning] YOLO9000: Better, Faster, Stronger

YOLO v2 는 실제로 YOLO9000: Better, Faster, Stronger이라는 논문 이름으로 발표되었습니다. 9000개의 class를 classification하면서 detection까지 해내는 놀라움을 다시 한번 보여주는데요. 9000개의 클래스를 구성하는 방법까지는 다루지 않겠습니다. 궁금하신 분은 아래 링크를 참고

◇ Yolo v2 의 특징 ◇

Batch Normalization
High Resolution Classifier : 네트웍의 Classifier 단을 보다 높은 resolution(448x448)로 fine tuning
13 x 13 feature map 기반에서 개별 Grid cell 별 5개의 Anchor box에서 Object Detection
– anchor box의 크기와 ratio는 K-Means Clustering으로 설정.
예측 Bbox의 x,y 좌표가 중심 Cell 내에서 벗어나지 않도록 Direct Location Prediction 적용
Darknet-19 Classification 모델 채택
Classification layer를 fully Connected layer에서 Fully Convolution 으로 변경하고 서로 다른 크기의 image들로 네트웍 학습

◇ Yolo v2 Anchor Box로 1 Cell에서 여러 개 Object Detection ◇

SSD와 마찬가지로 1개의 Cell에서 여러 개의 Anchor를 통해 개별 Cell에서 여러 개 Object Detection 가능 K-Means Clustering 을 통해 데이터 세트의 이미지 크기와 Shape Ratio 따른 5개의 군집화 분류를 하여 Anchor Box를 계산