-
Notifications
You must be signed in to change notification settings - Fork 8
Open
Description
Once the students appreciate the world of ML, and the general scikit-learn ecosystem, let's dig into detail into linear regression, the goal is to train and evaluate a regression model using scikit-learn.
- Discussion of linear regression: what is it, how is it connected to linear correlation from week 1.
- Use scikit learn to create model, do model fit, and model prediction (using the standard three-part scikit-learn api). Lots of data points with noisy data.
- Evaluate model fit with visualizations, and metrics like MSE and R2 (explain these metrics)
- Break data into training/test data to illustrate generalization
Use the extremelyl important test_train_split function to break data into test/train data and explain why we are doing this (to avoid overfitting!)
Show polynomial regression to illustrate overfitting (here let's do it with few data points to illustrate the point of lack of generalization).
Metadata
Metadata
Assignees
Labels
No labels