
Provides both problems (features) and answers (labels) to the machine learning model.
Definition : Predicts continuous outputs
Types
Simple Linear Regression : Examines the linear relationship between two variables. Where is the dependent variable, is the independent variable, is the intercept, is the slope, and is the error term.
Multiple Linear Regression : Extends simple linear regression to include multiple independent variables. Where , , ..., are the independent variables.
Polynomial Regression : Fits a polynomial equation to the data. It can model non-linear relationships.
Logistic Regression : Used for binary classification problems, where the dependent variable is categorical. The model predicts the probability of the outcome:
Ridge and Lasso Regression : These are types of regularized regression techniques. They add a penalty to the model to prevent overfitting.
A type of machine learning where the model is trained on unlabeled data. The primary goal is to find hidden patterns, groupings, or features in the data.
| Feature | Supervised Learning | Unsupervised Learning |
|---|---|---|
| Data Type | Labeled data (input-output pairs) | Unlabeled data (only inputs) |
| Goal | Predict outcomes for new data | Discover hidden patterns or structures |
| Algorithms | Regression, classification | Clustering, association, dimensionality reduction |
| Training Process | Guided by labels, iterative improvement | Self-organized based on data structure |
| Examples | Image classification, spam detection, price prediction | Customer segmentation, anomaly detection, market basket analysis |
| Applications | When labeled data is available and prediction is needed | When labeled data is not available, or the goal is to explore data |