Week 2 - 8일차 학습정리

이호영·2021년 8월 11일

Boostcamp AI Tech 2기

목록 보기

11/32

Number of parameters
(kernel_width x kernel_height) x input_channel x output_channel
Dense Layer(Fully Connected Layer)
Parameter sharing이 발생해서 parameter 개수가 많음
점점 Fully Connected Layer를 줄여 나가는 추세 ( parameter 수가 많아지기 때문에)
1x1 Convolution
Dimension reduction
Reduce the number of parameters while increasing the depth

AlexNet
ReLU activation - overcome vanishing gradient
GPU Implementations (2 GPUs)
Data augmentation
Dropout
VGGNet
Increasing depth with 3x3 convolution filters
3x3 convolution filter 2개를 적용한 것과 5x5 convolution filter를 하나 적용한 것은 receptive field 크기가 같지만 3x3 convolution filter의 parameter 수가 더 적다.
ex) 128 channel - 3x3 filter 2개: receptive field 5x5, parameter 수: 3x3x128x128x2 = 294912개
128 channel - 5x5 filter 1개: receptive field 5x5, parameter 수: 5x5x128x128 = 409600개
GoogLeNet
Inception block
Feature을 효율적으로 추출하기 위해서 1x1,3x3,5x5 convolution layer를 따로 계산해서 sparse한 network를 만든다. 하지만 그렇게 되면 연산량이 많아져서 1x1 convolution을 이용해서 dimension reduction을 한다. 결과적으로 parameter 개수가 줄어든다.

기존에 overfitting 문제로 layer가 더 깊은 network가 더 성능이 안 좋았는데, residual 구조로 인해서 layer를 더 많이 쌓았을 때 test 성능이 증가하게 되었다.

Bottleneck architecture
1x1 convolution layer를 앞 뒤로 붙여서 channel만 조절함

Fully Convolution Network
Dense Layer에서 진행하는 flat과 dense 과정을 convolution으로 바꾼 네트워크
Deconvolution

R-CNN
SPPNet
Fast R-CNN
Faster R-CNN
Fast R-CNN + Region Proposal Network
YOLO
Extremely fast object detection
Simultaneously predicts multiple boundng boxes class probabilities