ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –(Finish) Regression –Model selection, Cross-validation –Error decomposition.

Slides:



Advertisements
Similar presentations
Chapter 5 One- and Two-Sample Estimation Problems.
Advertisements

9.3 Equations as Relations CORD Math Mrs. Spitz Fall 2006.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
BALANCING 2 AIM: To solve equations with variables on both sides.
MACHINE TOOLS – I.
The x- and y-Intercepts
25 seconds left…...
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –SVM –Multi-class SVMs –Neural Networks –Multi-layer Perceptron Readings:
1 Linear Methods for Regression Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Setup for linear regression model. Analyzing “EPE” (fixed X) 0 0 (independence) =  2 (irreducible) bias 2 variance (=  2 p/n)
1 Econometrics 1 Lecture 6 Multiple Regression -tests.
ECE 5984: Introduction to Machine Learning
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
ECE 5984: Introduction to Machine Learning
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Naïve Bayes Readings: Barber
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Probability Review –Statistical Estimation (MLE) Readings: Barber 8.1, 8.2.
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –Neural Networks –Backprop –Modular Design.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Statistical Estimation (MLE, MAP, Bayesian) Readings: Barber 8.6, 8.7.
Introduction to Machine Learning Supervised Learning 姓名 : 李政軒.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Unsupervised Learning: Kmeans, GMM, EM Readings: Barber
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –(Finish) Nearest Neighbour Readings: Barber 14 (kNN)
Christoph Eick: Learning Models to Predict and Classify 1 Learning from Examples Example of Learning from Examples  Classification: Is car x a family.
Concept learning, Regression Adapted from slides from Alpaydin’s book and slides by Professor Doina Precup, Mcgill University.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Logistic Regression –NB & LR connections Readings: Barber.
INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN © The MIT Press, Lecture.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Supervised Learning –General Setup, learning from data –Nearest Neighbour.
1 CSCI 3202: Introduction to AI Decision Trees Greg Grudic (Notes borrowed from Thomas G. Dietterich and Tom Mitchell) Intro AIDecision Trees.
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –Recurrent Neural Networks (RNNs) –BackProp Through Time (BPTT) –Vanishing / Exploding.
Machine Learning with Discriminative Methods Lecture 05 – Doing it 1 CS Spring 2015 Alex Berg.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Regression Readings: Barber 17.1, 17.2.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Ensemble Methods: Bagging, Boosting Readings: Murphy 16.4; Hastie 16.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Regression. We have talked about regression problems before, as the problem of estimating the mapping f(x) between an independent variable x and a dependent.
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –LSTMs (intuition and variants) –[Abhishek:] Lua / Torch Tutorial.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Decision/Classification Trees Readings: Murphy ; Hastie 9.2.
ECE 6504: Deep Learning for Perception Dhruv Batra Virginia Tech Topics: –Deconvolution.
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –(Finish) Expectation Maximization –Principal Component Analysis (PCA) Readings:
Evaluating Classifiers. Reading for this topic: T. Fawcett, An introduction to ROC analysis, Sections 1-4, 7 (linked from class website)
LECTURE 13: LINEAR MODEL SELECTION PT. 3 March 9, 2016 SDS 293 Machine Learning.
CSE573 Autumn /13/98 Finished Administrative –Final exam Tuesday, Mar. 17, 2:30-4:20 p.m., here –Additional help today after class Saturday
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –(Finish) Model selection –Error decomposition –Bias-Variance Tradeoff –Classification:
Why do we do formative assessment?
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
Dhruv Batra Georgia Tech
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
BACK SOLUTION:
ECE 5424: Introduction to Machine Learning
Leave-one-out cross-validation
ECE 5424: Introduction to Machine Learning
Start Finish Find the solution 42 out of out of 2200
The Research Proposal How to Write It.
Recitation #2 Tel Aviv University 2017/2018 Slava Novgorodov
Recitation #2 Tel Aviv University 2016/2017 Slava Novgorodov
Technical Communication
Shih-Yang Su Virginia Tech
INTRODUCTION TO Machine Learning
Types of Errors And Error Analysis.
Presentation transcript:

ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –(Finish) Regression –Model selection, Cross-validation –Error decomposition –Bias-Variance Tradeoff Readings: Barber 17.1, 17.2

Administrativia HW1 –Solutions available Project Proposal –Due: Tue 02/24, 11:55 pm –<=2pages, NIPS format –Show Igor’s proposal HW2 –Due: Friday 03/06, 11:55pm –Implement linear regression, Naïve Bayes, Logistic Regression (C) Dhruv Batra2

Recap of last time (C) Dhruv Batra3

Regression (C) Dhruv Batra4

5Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra6Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra7Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra8Slide Credit: Greg Shakhnarovich

Why sum squared error??? Gaussians, Watson, Gaussians… But, why? 9(C) Dhruv Batra

10Slide Credit: Greg Shakhnarovich

Is OLS Robust? Demo – Bad things happen when the data does not come from your model! How do we fix this? (C) Dhruv Batra11

Robust Linear Regression y ~ Lap(w’x, b) On paper (C) Dhruv Batra12

Plan for Today (Finish) Regression –Bayesian Regression –Different prior vs likelihood combination –Polynomial Regression Error Decomposition –Bias-Variance –Cross-validation (C) Dhruv Batra13

Robustify via Prior Ridge Regression y ~ N(w’x, σ 2 ) w ~ N(0, t 2 I) P(w | x,y) = (C) Dhruv Batra14

Summary LikelihoodPriorName GaussianUniformLeast Squares Gaussian Ridge Regression GaussianLaplaceLasso LaplaceUniformRobust Regression StudentUniformRobust Regression (C) Dhruv Batra15

(C) Dhruv Batra16Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra17Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra18Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra19Slide Credit: Greg Shakhnarovich

Example Demo – (C) Dhruv Batra20

What you need to know Linear Regression –Model –Least Squares Objective –Connections to Max Likelihood with Gaussian Conditional –Robust regression with Laplacian Likelihood –Ridge Regression with priors –Polynomial and General Additive Regression (C) Dhruv Batra21

New Topic: Model Selection and Error Decomposition (C) Dhruv Batra22

Example for Regression Demo – How do we pick the hypothesis class? (C) Dhruv Batra23

Model Selection How do we pick the right model class? Similar questions –How do I pick magic hyper-parameters? –How do I do feature selection? (C) Dhruv Batra24

Errors Expected Loss/Error Training Loss/Error Validation Loss/Error Test Loss/Error Reporting Training Error (instead of Test) is CHEATING Optimizing parameters on Test Error is CHEATING (C) Dhruv Batra25

(C) Dhruv Batra26Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra27Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra28Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra29Slide Credit: Greg Shakhnarovich

(C) Dhruv Batra30Slide Credit: Greg Shakhnarovich

Typical Behavior (C) Dhruv Batra31 a

Overfitting Overfitting: a learning algorithm overfits the training data if it outputs a solution w when there exists another solution w’ such that: 32(C) Dhruv BatraSlide Credit: Carlos Guestrin

Error Decomposition (C) Dhruv Batra33 Reality Modeling Error model class Estimation Error Optimization Error

Error Decomposition (C) Dhruv Batra34 Reality Estimation Error Optimization Error Modeling Error model class

Error Decomposition (C) Dhruv Batra35 Reality Modeling Error model class Estimation Error Optimization Error Higher-Order Potentials

Error Decomposition Approximation/Modeling Error –You approximated reality with model Estimation Error –You tried to learn model with finite data Optimization Error –You were lazy and couldn’t/didn’t optimize to completion (Next time) Bayes Error –Reality just sucks (C) Dhruv Batra36