💡 Machine Learning
: a branch of Artificial Intelligence, concerned with the design and development of algorithms that allow computers to evolve behaviors based on empirical data
Improvement on task T, with respect to Performance Metric P, based on experience E
The key idea of ML is "Generalization".
No Free Lunch Theorem for ML
: No machine learning algorithm is universally any better than any other
1. Supervised learning
- given input and output data (x, y) - learn a function f(x)
- Classification: y is categorical
- Regression: y is continous (real number)
2. Unsupervised learning
- only given x to find hidden structure (e.g. clusters)
- Classification
- Clustering, Anomaly detection, Density estimation, ...
3. Semi-supervised learning
- some of training data includes desired outputs
- label only a small number of examples & make use of a large number of unlabeled examples to learn
- LU learning: small set of Labeled examples, large set of Unlabeled examples
- PU learning: Positive and Unlabeled examples
- can label unlabeled data probabilistically (soft label)
4. Reinforcement learning
- feedback loop between learning system and environment
- No supervisors, but only rewards
- (-) feedback could be delayed = take longer time
Generalization in ML
Generalization Error
Model's Capacity
Occam's Razor: a principle of Parsimony (based on probability and experience)
Typical Relation between Capacity and Error
Regularization
Bias/Variance
GPT-3
InstructGPT
ChatGPT: similar model of InstructGPT with conversational UI
GPT-4: a large multimodal language model
Timeline of Large Language Models
Conclusion