🔥 Paper Review - EfficientNet

esc247 · September 22, 2023

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks



Abstract

๋ชจ๋ธ์˜ ์„ฑ๋Šฅ๊ณผ ํšจ์œจ์„ฑ์„ ๊ท ํ˜•์žˆ๊ฒŒ ์กฐ์ •ํ•˜๊ธฐ ์œ„ํ•ด ๋„คํŠธ์›Œํฌ์˜ ๊ทœ๋ชจ(scale)๋ฅผ ์กฐ์ ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์‹œ

→ Compound Scaling

Introduction

Previously, when scaling up a ConvNet, only one of depth, width, or resolution was adjusted

  • Why not scale them together: arbitrary scaling requires tedious manual tuning and still often yields sub-optimal accuracy and efficiency

Scale width/depth/resolution at a constant ratio → the Compound Scaling Method

  • uniformly scales network width, depth and resolution with a set of fixed scaling coefficients
  • if computational resources increase by $2^N$ → increase depth, width, and image size by $\alpha^N, \beta^N, \gamma^N$
    • Here $\alpha, \beta, \gamma$ are found by a small grid search on the original small model
  • The larger the input image, the more layers and channels the network needs
    • More layers are needed to increase the receptive field
      • The receptive field, the region of the input each output unit "sees", must grow along with the image size (see the sketch after this list)
    • More channels are needed to capture fine-grained patterns
      • Larger images contain more small details and fine-grained patterns
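
As a rough illustration (my own sketch, not from the paper): the receptive field of a stack of 3×3 convolutions grows with the number of layers, so each output unit of a deeper network "sees" a larger patch of a larger input. The standard receptive-field arithmetic below shows why bigger images call for more layers.

```python
# Illustration only (not from the paper): receptive field of stacked convs.
def receptive_field(num_layers: int, kernel_size: int = 3, stride: int = 1) -> int:
    """Receptive field of one output unit after `num_layers` identical convs."""
    rf, jump = 1, 1          # current receptive field and cumulative stride
    for _ in range(num_layers):
        rf += (kernel_size - 1) * jump
        jump *= stride       # each stride multiplies the step between units
    return rf

for n in (5, 10, 20):
    print(f"{n} layers -> {receptive_field(n)}x{receptive_field(n)} px")
# 5 layers -> 11x11 px, 10 layers -> 21x21 px, 20 layers -> 41x41 px
```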

Compound Model Scaling

Problem Formulation

$Y_i = \mathcal{F}_i(X_i)$

  • $Y_i$ : output tensor
  • $\mathcal{F}_i$ : operator
  • $X_i$ : input tensor with shape $[H_i, W_i, C_i]$ (height, width, channels)

ConvNet $\mathcal{N} = \bigodot_{i=1 \ldots s} \mathcal{F}_i^{L_i}\big(X_{\langle H_i, W_i, C_i \rangle}\big)$

$\mathcal{F}_i^{L_i}$ : the operator $\mathcal{F}_i$ is repeated $L_i$ times in stage $i$

Scaling fixes $\mathcal{F}_i$ and expands $L_i$, $C_i$, and $(H_i, W_i)$; a sketch follows below
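
A minimal PyTorch sketch of this formulation, with hypothetical stage configurations (the `(L_i, C_i)` values below are illustrative, not the paper's): each stage repeats a fixed operator $\mathcal{F}_i$, and scaling only changes the repeat count $L_i$ and the channel width $C_i$.

```python
import torch
import torch.nn as nn

def conv_bn_act(c_in: int, c_out: int) -> nn.Module:
    """A fixed operator F_i: 3x3 conv + BN + ReLU (illustrative choice)."""
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1, bias=False),
        nn.BatchNorm2d(c_out),
        nn.ReLU(inplace=True),
    )

def build_net(stages, c_in=3):
    """stages: list of (L_i, C_i) pairs; F_i is repeated L_i times at width C_i."""
    layers = []
    for L_i, C_i in stages:
        for _ in range(L_i):
            layers.append(conv_bn_act(c_in, C_i))
            c_in = C_i
    return nn.Sequential(*layers)

baseline = build_net([(2, 16), (3, 32)])        # hypothetical baseline (L_i, C_i)
scaled = build_net([(4, 24), (6, 48)])          # depth x2, width x1.5
print(scaled(torch.randn(1, 3, 64, 64)).shape)  # torch.Size([1, 48, 64, 64])
```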

Scaling Dimensions

Depth $d$

  • Deeper → capture richer and more complex features, generalize well on new tasks
  • However, deeper networks are harder to train because of the vanishing gradient problem
  • Mitigations: skip connections, batch normalization, etc.; even so, the accuracy gain of very deep networks diminishes

Width $w$

  • Wider → capture more fine-grained features and are easier to train
  • However, extremely wide but shallow networks tend to have difficulty capturing higher-level features

Resolution $r$

  • Higher resolution → capture more fine-grained patterns
  • However, the accuracy gain diminishes for very high resolutions

Compound Scaling

์ง๊ด€์ ์œผ๋กœ

Higher resolution images → increase depth & increase width

⇒ Need to coordinate and balance the different scaling dimensions rather than rely on conventional single-dimension scaling

Observation 2

  • balance all dimensions of network width, depth, and resolution during ConvNet scaling

  • Uniformly scale all three dimensions with a single compound coefficient $\phi$: depth $d = \alpha^\phi$, width $w = \beta^\phi$, resolution $r = \gamma^\phi$, subject to $\alpha \cdot \beta^2 \cdot \gamma^2 \approx 2$ and $\alpha, \beta, \gamma \ge 1$
  • $\alpha, \beta, \gamma$ are determined by a small grid search

Doubling depth → FLOPS ×2

Doubling width or resolution → FLOPS ×4

→ In this paper, total FLOPS increase by approximately $2^\phi$
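
A small numeric sketch of this rule. The coefficients $\alpha = 1.2$, $\beta = 1.1$, $\gamma = 1.15$ are the grid-search values the paper reports for EfficientNet-B0; since FLOPS scale with $d \cdot w^2 \cdot r^2$ and $\alpha \cdot \beta^2 \cdot \gamma^2 \approx 1.92 \approx 2$, total FLOPS grow by roughly $2^\phi$:

```python
# α, β, γ are the grid-search values the paper reports for EfficientNet-B0.
ALPHA, BETA, GAMMA = 1.2, 1.1, 1.15

def compound_scale(phi: int):
    """Multipliers for depth, width, and resolution at compound coefficient φ."""
    return ALPHA ** phi, BETA ** phi, GAMMA ** phi

for phi in range(4):
    d, w, r = compound_scale(phi)
    flops = d * w ** 2 * r ** 2            # FLOPS scale with d · w² · r²
    print(f"phi={phi}: depth x{d:.2f}, width x{w:.2f}, resolution x{r:.2f}, "
          f"FLOPS x{flops:.2f} (target 2^phi = {2 ** phi})")
```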

EfficientNet Architecture

  • Architecture similar to MnasNet
  • Uses MBConv blocks (mobile inverted bottleneck convolution); a sketch follows this list
  • Applies squeeze-and-excitation optimization
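
Below is a minimal PyTorch sketch of an MBConv block with squeeze-and-excitation, simplified from the MnasNet/MobileNetV2 design (no drop-connect, fixed 3×3 depthwise kernel; `expand=6` and `se_ratio=0.25` are common defaults, not tied to any specific EfficientNet stage):

```python
import torch
import torch.nn as nn

class MBConv(nn.Module):
    """Simplified MBConv: 1x1 expand -> depthwise conv -> SE -> 1x1 project,
    with a skip connection when the shape is preserved."""

    def __init__(self, c_in: int, c_out: int, expand: int = 6,
                 stride: int = 1, se_ratio: float = 0.25):
        super().__init__()
        c_mid = c_in * expand
        se_ch = max(1, int(c_in * se_ratio))
        self.use_skip = stride == 1 and c_in == c_out
        self.expand = nn.Sequential(        # 1x1 expansion conv
            nn.Conv2d(c_in, c_mid, 1, bias=False),
            nn.BatchNorm2d(c_mid), nn.SiLU(),
        )
        self.depthwise = nn.Sequential(     # 3x3 depthwise conv
            nn.Conv2d(c_mid, c_mid, 3, stride, 1, groups=c_mid, bias=False),
            nn.BatchNorm2d(c_mid), nn.SiLU(),
        )
        self.se = nn.Sequential(            # squeeze-and-excitation gate
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(c_mid, se_ch, 1), nn.SiLU(),
            nn.Conv2d(se_ch, c_mid, 1), nn.Sigmoid(),
        )
        self.project = nn.Sequential(       # 1x1 linear projection conv
            nn.Conv2d(c_mid, c_out, 1, bias=False),
            nn.BatchNorm2d(c_out),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.depthwise(self.expand(x))
        h = h * self.se(h)                  # channel-wise reweighting
        h = self.project(h)
        return x + h if self.use_skip else h

print(MBConv(16, 16)(torch.randn(1, 16, 32, 32)).shape)  # [1, 16, 32, 32]
```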

Conclusion

Balancing network depth, width, and resolution with a single compound coefficient outperforms conventional single-dimension scaling: scaling up the EfficientNet-B0 baseline yields the EfficientNet-B1 through B7 family, with EfficientNet-B7 reaching state-of-the-art ImageNet top-1 accuracy while being 8.4× smaller and 6.1× faster at inference than the best existing ConvNet.
