SL Foundation

최서영·2024년 1월 14일

LG AImers module4

Module4

목록 보기

1/1

Supervised Learning

기계 학습 중 컴퓨터가 입력값과 그에 따른 출력값이 있는 데이터를 이용하여 주어진 입력에 맞는 출력을 찾는 학습 방법

Learning Pipeline

Training 단계
학습 sample의 label이 주어지고 error를 줄여나가면서 학습이 진행된다.
Test 단계
model이 실제 환경에 적용되는 것으로, Test 단계에서는 새로운 입력 영상을 이용한다.

머신러닝은 데이터의 결핍으로 인한 불확실성을 포함하므로 일반화가 중요하다. Supervised Learning은 Generalization Error를 최소화 하도록 노력한다.

Generalization (Goal): a model needs to perform well on unseen data
-Generalization error; the goal is to minimize this error, but it is impractical to compute in the real world

Errors

Training Error: model을 주어진 data set에 맞추어 학습하는데 사용하는 error이다.
Test Error: 전체 data set에서 일부 sample을 따로 빼 test sample을 정의하는데 test sample에서 나타나는 error가 test error이다. (=generalization error)

How to achieve the goal in practice?

Avoid overfitting

Data Argumentation
- Regularization
- Ensemble

최서영