Linear Regression ECE5424 G CS5824 JiaBin Huang Virginia

Linear Regression ECE-5424 G / CS-5824 Jia-Bin Huang Virginia Tech Spring 2019

BRACE YOURSELVES WINTER IS COMING

BRACE YOURSELVES HOMEWORK IS COMING

Administrative • Office hour • Chen Gao • Shih-Yang Su • Feedback (Thanks!) • Notation? • More descriptive slides? • Video/audio recording? • TA hours (uniformly spread over the week)?

Recap: Machine learning algorithms Discrete Continuous Supervised Learning Unsupervised Learning Classification Clustering Regression Dimensionality reduction

Recap: Nearest neighbor classifier •

Recap: Instance/Memory-based Learning 1. A distance metric • Continuous? Discrete? PDF? Gene data? Learn the metric? 2. How many nearby neighbors to look at? • 1? 3? 5? 15? 3. A weighting function (optional) • Closer neighbors matter more 4. How to fit with the local points? • Kernel regression Slide credit: Carlos Guestrin

Validation set • Spliting training set: A fake test set to tune hyper-parameters Slide credit: CS 231 @ Stanford

Cross-validation • 5 -fold cross-validation -> split the training data into 5 equal folds • 4 of them for training and 1 for validation Slide credit: CS 231 @ Stanford

Things to remember • Supervised Learning • Training/testing data; classification/regression; Hypothesis • k-NN • Simplest learning algorithm • With sufficient data, very hard to beat “strawman” approach • Kernel regression/classification • Set k to n (number of data points) and chose kernel width • Smoother than k-NN • Problems with k-NN • Curse of dimensionality • Not robust to irrelevant features • Slow NN search: must remember (very large) dataset for prediction

Today’s plan: Linear Regression • Model representation • Cost function • Gradient descent • Features and polynomial regression • Normal equation

Linear Regression • Model representation • Cost function • Gradient descent • Features and polynomial regression • Normal equation

Regression Training set real-valued output Learning Algorithm Size of house Hypothesis Estimated price

House pricing prediction Price ($) in 1000’s 400 300 200 100 500 1000 1500 2000 Size in feet^2 2500

Training set Size in feet^2 (x) 2104 1416 1534 Price ($) in 1000’s (y) 460 232 315 852 … 178 … Slide credit: Andrew Ng

Model representation • Training set Learning Algorithm Price ($) in 1000’s 400 300 200 100 Size of house Hypothesis Estimated price 500 1000 1500 2000 2500 Size in feet^2 Univariate linear regression Slide credit: Andrew Ng

Linear Regression • Model representation • Cost function • Gradient descent • Features and polynomial regression • Normal equation

Training set • Size in feet^2 (x) Price ($) in 1000’s (y) 2104 1416 1534 460 232 315 852 … 178 … Slide credit: Andrew Ng

3 3 3 2 2 2 1 1 2 3 Slide credit: Andrew Ng

Cost function • Price ($) in 1000’s 400 300 200 100 500 1000 1500 2000 Size in feet^2 2500 Cost function Slide credit: Andrew Ng

Simplified • Slide credit: Andrew Ng

3 3 2 2 1 1 1 2 3 0 1 2 3 Slide credit: Andrew Ng

• Slide credit: Andrew Ng

Cost function Slide credit: Andrew Ng

• Slide credit: Andrew Ng

Linear Regression • Model representation • Cost function • Gradient descent • Features and polynomial regression • Normal equation

Gradient descent • Slide credit: Andrew Ng

Slide credit: Andrew Ng

Gradient descent • Slide credit: Andrew Ng

Gradient descent Slide credit: Andrew Ng

3 2 1 0 1 2 3 Slide credit: Andrew Ng

Learning rate

Gradient descent for linear regression • Slide credit: Andrew Ng

Computing partial derivative • Slide credit: Andrew Ng

Gradient descent for linear regression • Slide credit: Andrew Ng

Batch gradient descent • Slide credit: Andrew Ng

Linear Regression • Model representation • Cost function • Gradient descent • Features and polynomial regression • Normal equation

Training dataset Size in feet^2 (x) 2104 1416 1534 Price ($) in 1000’s (y) 460 232 315 852 … 178 … Slide credit: Andrew Ng

Multiple features (input variables) Price ($) in 1000’s (y) 2104 1416 1534 852 … 5 3 3 2 1 2 2 1 45 40 30 36 460 232 315 178 … Slide credit: Andrew Ng