Big Transfer (BiT): General Visual Representation Learni

DYN.kim·2025년 1월 21일

pretrained된 대규모 이미지 데이터셋에 전이학습하여 성능 올리자

key idea
upstream:
component-1: scale 데이터셋의 크기 클수록 모델 크기도 커야 하고 성능이 향상
comonent-2: BN대신 GN, weight standardization 사용, 데이터셋 크기 크면 배치 사이즈 작아야 하고 그럼 BN시 성능 저하됨

downstream:
하나의 task당 하느이 파라미터만
훈련시 데이터 전처리 및 증강
해상도 훈련과 테스트시 같거나 테스트가 조금 더 크게
전이학습에서 데이터 혼합 유용

DYN.kim

AI 개발자를 목표로 하고 있는 꿈 많은 공대생입니다. a deo vocatus rite paratus

이전 포스트

Bi-Directional Attention Flow for Machine Comprehension

다음 포스트

Big Transfer (BiT): General Visual Representation Learni

Bi-Directional Attention Flow for Machine Comprehension

Improving Language Understanding by Generative Pre-Training

0개의 댓글