[부스트캠프 AI tech CV] week09 (2022.03.14) Instance/Panoptic Segmentation and Landmark Localization

redgreen·2022년 3월 14일

부스트캠프 AI tech 3기

목록 보기

25/40

1 Instance segmentation

https://towardsdatascience.com/review-deepmask-instance-segmentation-30327a072339

개체(instance) 별로 사물을 segmentation하는 방법

1.1 mask R-CNN

https://lilianweng.github.io/lil-log/2017/12/31/object-recognition-for-dummies-part-3.html

ROI pooling 대신 ROI Align을 사용

2개의 head로 나누어지고 예측된 class와 동일한 mask 추출함

다른 R-CNN과 마찬가지로 2-stage 모델

1.2 one-stage

YOLACT(You Only Look At CoefficienTs)
mask coefficients와 prototypes의 연산을 통해 detection하는 방법인듯 하다.

YolactEdge
조금 더 빠른 detection을 위해 개발된 방법

2 Panoptic segmentation

semantic segmentation: 개체별 구분없이 class 구분

instance segmentation: 개체별 구분 + class 구분 + 배경x

panoptic segmentation: 배경 + 개체 + class 구분

2.1 UPSNet

semantic Head: sementic segmentation에서 처럼 FCN를 통해 semantic logit을 얻음

instance Head: 물체 detection + mask logit추출

panoptic Head: 앞의 head들을 종합

$Y$ $i$ : instance mask

$X$ $thing$ : 물체를 나타내는 mask

$X$ $stuff$ : 배경을 나타내는 mask

$Y$ $i$ 와 $X$ $thing$ 를 더해 물체의 위치를 더 잘보이게 함 / unknown물체를 배제하기 위해 빼줌

2.2 VPSNet

3 Landmark localization

Landmark localization = keypoint estimation

얼굴이나 사람의 pose를 추정, 트래킹

coordinate regression: 포인트의 x,y 위치를 regression하는 방법. 부정확하고 일반화 오류가 있음

Heatmap classification: sementic segmentation처럼 하나의 채널의 하나의 key point를 담당, 각 key point마다 하나의 class로 생각. key point가 발생할 확률을 각 픽셀별로 classification. 계산량이 많다는 단점