Category of Machine Learning
![](https://velog.velcdn.com/images/psy4072/post/523bc9aa-e937-44fc-a5ce-cfd7fa48c05a/image.png)
지도학습
- Classification
- Regreesion
비지도학습
강화학습
Supervised Learning
- Output y is categorical variable : Classification
- Output y is continuous : Regression
Regression
- Ouput(y) , Input(x)
- x = n*p matrix 형태
![](https://velog.velcdn.com/images/psy4072/post/e7e2de8d-4c3c-4d6e-93f5-eda6c480ae45/image.png)
Classification
- Output(y), Input(x)
- Each image is an input and corresponding class is output
![](https://velog.velcdn.com/images/psy4072/post/d5de5150-bf4c-44fc-9f65-199420a3c7eb/image.png)
- We must train the function for appropriately demonstrating the relationship between input and output variables
- The word 'appropriately' means the difference between results from real data and model's prediction is small
![](https://velog.velcdn.com/images/psy4072/post/46e78f3d-a0c4-4e6a-a25f-7155f6f3e4f6/image.png)
![](https://velog.velcdn.com/images/psy4072/post/72f95a9a-2939-409f-9fda-f450de86adb9/image.png)
-> which model is better than the other? => A
Machine Learning and Optimization
- Machine learning models optimize the function by minimizing the loss
- The process of optimization for the machine learning functions represents learning
![](https://velog.velcdn.com/images/psy4072/post/f64025a7-79cd-4cba-86a4-bdd284abafbf/image.png)
Generalization Error and Hyperparameter
-
Training error
- prediction error to training data(Loss)
-
Validation error
- prediction error to validation data(Generalizaion error)
-
As the complexity of the model increases,
the training error decreases but the validation error increases from a critical point ![](https://velog.velcdn.com/images/psy4072/post/b0f3cd9e-2686-4059-8a1f-d5b90571d654/image.png)
-
Machine learning model's goal is predicting unseen data accurately
![](https://velog.velcdn.com/images/psy4072/post/317baf46-6ca2-482c-bef0-526fe59ddb68/image.png)
Model Validation with Data
- Validation with training data only (Bad)
- Training, Testing data (After parameter tuning)
- Training, Validation, Testing data (Best)
- Cross-Validation(with small number of data)
![](https://velog.velcdn.com/images/psy4072/post/eaf84d81-7b1d-4d12-ae4a-7d8f2bd19583/image.png)
reference : K-MOOC 실습으로 배우는 머신러닝