Classification
Derek Hoiem
CS 598, Spring 2009
Jan 27, 2009
Outline
Principles of generalization
Survey of classifiers
Project discussion
Discussion of Rosch
Pipeline for Prediction
Imagery → Representation → Classifier → Predictions
No Free Lunch Theorem
Bias and Variance
[Figure: error vs. model complexity; simple models have high bias and low variance, complex models have low bias and high variance]
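For squared-error prediction, this trade-off is usually written as the standard bias-variance decomposition (stated here for reference; the formula is not text from the slide):

    $\mathbb{E}\big[(y - \hat{f}(x))^2\big] = \big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2 + \mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big] + \sigma^2 = \text{bias}^2 + \text{variance} + \text{noise}$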
Overfitting
Need a validation set
The validation set is not the same as the test set
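A minimal sketch of the train/validation/test protocol, written with scikit-learn as an illustrative assumption (not the lecture's code): the validation split is used to choose model complexity, and the test split is touched only once at the end.

    import numpy as np
    from sklearn.model_selection import train_test_split
    from sklearn.tree import DecisionTreeClassifier
    from sklearn.metrics import accuracy_score

    # Toy data: X are features, y are labels.
    rng = np.random.RandomState(0)
    X = rng.randn(600, 5)
    y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

    # Hold out a test set first, then carve a validation set out of the rest.
    X_trainval, X_test, y_trainval, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
    X_train, X_val, y_train, y_val = train_test_split(X_trainval, y_trainval, test_size=0.25, random_state=0)

    # Use the validation set to pick model complexity (tree depth here).
    best_depth, best_acc = None, -1.0
    for depth in [1, 2, 4, 8, 16]:
        clf = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X_train, y_train)
        acc = accuracy_score(y_val, clf.predict(X_val))
        if acc > best_acc:
            best_depth, best_acc = depth, acc

    # Report test accuracy only once, with the chosen depth.
    final = DecisionTreeClassifier(max_depth=best_depth, random_state=0).fit(X_trainval, y_trainval)
    print("test accuracy:", accuracy_score(y_test, final.predict(X_test)))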
Bias-Variance View of Features
More compact = lower variance, potentially higher bias
More features = higher variance, lower bias
More independence among features = simpler classifier, lower variance
How to reduce variance
Parameterize model (e.g., linear vs. piecewise)
How to measure complexity?
VC dimension
Upper bound on generalization error (holds with probability 1 - η):
    $\text{expected error} \leq \text{training error} + \sqrt{\frac{h\left(\ln\frac{2N}{h} + 1\right) - \ln\frac{\eta}{4}}{N}}$
N: size of training set; h: VC dimension; η: 1 - probability
How to reduce variance
Parameterize model
Regularize
Increase number of training examples
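As an illustration of the regularization point above (a sketch with scikit-learn's Ridge, an assumption rather than the lecture's example): increasing the L2 penalty shrinks the weights of a linear model, lowering variance at the cost of some bias.

    import numpy as np
    from sklearn.linear_model import Ridge

    rng = np.random.RandomState(0)
    X = rng.randn(50, 20)                      # few examples, many features
    w_true = np.zeros(20)
    w_true[:3] = 1.0
    y = X @ w_true + 0.1 * rng.randn(50)

    # Larger alpha = stronger L2 regularization = smaller weights, lower variance.
    for alpha in [0.01, 1.0, 100.0]:
        model = Ridge(alpha=alpha).fit(X, y)
        print(alpha, round(float(np.linalg.norm(model.coef_)), 3))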
Effect of Training Size
[Figure: error vs. number of training examples]
Risk Minimization
Margins
[Figure: x's and o's in the (x1, x2) plane with a margin around the decision boundary]
Classifiers
Generative methods: Naïve Bayes, Bayesian Networks
Discriminative methods: Logistic Regression, Linear SVM, Kernelized SVM
Ensemble methods: Randomized Forests, Boosted Decision Trees
Instance based: K-nearest neighbor
Unsupervised: K-means
Components of classification methods
Objective function
Parameterization
Regularization
Training
Inference
Classifiers: Naïve Bayes
Objective / Parameterization / Regularization / Training / Inference
[Figure: graphical model with class label y and conditionally independent features x1, x2, x3]
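A minimal Gaussian Naïve Bayes sketch using scikit-learn (an illustrative assumption, not the lecture's code): training fits per-class feature distributions, and inference applies Bayes' rule under the assumption that features are conditionally independent given y, as in the graphical model above.

    import numpy as np
    from sklearn.naive_bayes import GaussianNB

    rng = np.random.RandomState(0)
    # Two classes with different feature means; columns play the role of x1, x2, x3.
    X = np.vstack([rng.randn(100, 3) + 1.0, rng.randn(100, 3) - 1.0])
    y = np.array([1] * 100 + [0] * 100)

    clf = GaussianNB().fit(X, y)              # training: per-class means and variances
    print(clf.predict_proba(X[:2]))           # inference: P(y | x1, x2, x3) via Bayes' rule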
Classifiers: Logistic Regression
Objective / Parameterization / Regularization / Training / Inference
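A hedged sketch of L2-regularized logistic regression with scikit-learn (an assumption, not the lecture's code): C is the inverse regularization strength, so smaller C means stronger regularization.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.RandomState(0)
    X = rng.randn(200, 2)
    y = (X[:, 0] + X[:, 1] > 0).astype(int)

    # Objective: regularized negative log-likelihood; parameters: weights w and bias b.
    clf = LogisticRegression(C=1.0).fit(X, y)
    print(clf.coef_, clf.intercept_)           # learned w and b
    print(clf.predict_proba(X[:3]))            # inference: P(y | x) = sigmoid(w.x + b)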
Classifiers: Linear SVM
Objective / Parameterization / Regularization / Training / Inference
[Figure: x's and o's in the (x1, x2) plane separated by a max-margin line; when the data are not separable, the SVM needs slack variables]
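A soft-margin linear SVM sketch with scikit-learn (an illustrative assumption): C controls the slack penalty, trading margin width against training errors on non-separable data.

    import numpy as np
    from sklearn.svm import LinearSVC

    rng = np.random.RandomState(0)
    X = rng.randn(200, 2)
    # Noisy labels around a linear boundary, so the classes overlap and slack is needed.
    y = (X[:, 0] - X[:, 1] + 0.5 * rng.randn(200) > 0).astype(int)

    # Small C: wide margin, more slack (lower variance); large C: narrow margin, less slack.
    for C in [0.01, 1.0, 100.0]:
        clf = LinearSVC(C=C, max_iter=20000).fit(X, y)
        print(C, round(clf.score(X, y), 3))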
Classifiers: Kernelized SVM
Objective / Parameterization / Regularization / Training / Inference
[Figure: 1-D data on x1 that is not linearly separable becomes separable after mapping to (x1, x1²)]
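The slide's example maps 1-D data x1 to (x1, x1²) so it becomes linearly separable; a kernel achieves a similar effect implicitly. A minimal sketch with scikit-learn (an illustrative assumption):

    import numpy as np
    from sklearn.svm import SVC

    # 1-D data where the positive class sits in the middle: not linearly separable on x1 alone.
    x1 = np.linspace(-3, 3, 200).reshape(-1, 1)
    y = (np.abs(x1[:, 0]) < 1).astype(int)

    # Explicit feature map (x1, x1^2) with a linear kernel...
    X_mapped = np.hstack([x1, x1 ** 2])
    print(SVC(kernel="linear").fit(X_mapped, y).score(X_mapped, y))

    # ...or an RBF kernel on the original 1-D input, which separates it implicitly.
    print(SVC(kernel="rbf", gamma=1.0).fit(x1, y).score(x1, y))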
Classifiers: Decision Trees
Objective / Parameterization / Regularization / Training / Inference
[Figure: x's and o's in the (x1, x2) plane partitioned by axis-aligned splits]
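A small decision-tree sketch with scikit-learn (an illustrative assumption): training greedily chooses axis-aligned splits on x1 and x2, and max_depth acts as the regularizer.

    import numpy as np
    from sklearn.tree import DecisionTreeClassifier

    rng = np.random.RandomState(0)
    X = rng.rand(300, 2)                                   # features x1, x2 in [0, 1]
    y = ((X[:, 0] > 0.5) ^ (X[:, 1] > 0.5)).astype(int)    # XOR-style pattern, not linearly separable

    clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
    print(clf.score(X, y))                                 # inference: route each point down the tree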
Ensemble Methods: Boosting
[Figure from Friedman et al. 2000]
Boosted Decision Trees
[Figure: ensemble of shallow decision trees with splits such as "High in image?", "Gray?", "Smooth?", "Green?", "Many long lines?", "Blue?", "Very high vanishing point?", combined into P(label | good segment, data) for the ground / vertical / sky labels; Collins et al. 2002]
Boosted Decision Trees
How to control the bias/variance trade-off:
Size of trees
Number of trees
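A hedged sketch of these two knobs using scikit-learn's gradient-boosted trees (an assumption, not the lecture's implementation): max_depth sets the size of each tree and n_estimators the number of trees; deeper and more numerous trees lower bias but raise variance, so a validation split is used to pick the trade-off.

    import numpy as np
    from sklearn.ensemble import GradientBoostingClassifier
    from sklearn.model_selection import train_test_split

    rng = np.random.RandomState(0)
    X = rng.randn(1000, 10)
    y = (X[:, 0] * X[:, 1] + X[:, 2] > 0).astype(int)
    X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

    # Vary tree size (max_depth) and number of trees (n_estimators), validate each setting.
    for depth in [1, 3]:
        for n_trees in [50, 400]:
            clf = GradientBoostingClassifier(max_depth=depth, n_estimators=n_trees, random_state=0)
            clf.fit(X_train, y_train)
            print(depth, n_trees, round(clf.score(X_val, y_val), 3))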
K-nearest neighbor
Objective / Parameterization / Regularization / Training / Inference
[Figure: x's and o's in the (x1, x2) plane; a query point takes the label of its nearest neighbors]
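A k-nearest-neighbor sketch with scikit-learn (an illustrative assumption): there is no real training step beyond storing the data, and k plays the role of the regularizer (larger k gives a smoother, higher-bias decision boundary).

    import numpy as np
    from sklearn.neighbors import KNeighborsClassifier

    rng = np.random.RandomState(0)
    X = rng.randn(200, 2)                                  # points in the (x1, x2) plane
    y = (X[:, 0] ** 2 + X[:, 1] ** 2 < 1).astype(int)

    clf = KNeighborsClassifier(n_neighbors=5).fit(X, y)    # "training" just stores X, y
    print(clf.predict([[0.0, 0.0], [2.0, 2.0]]))           # inference: vote among the 5 nearest neighbors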
Clustering
[Figure: unlabeled points in the (x1, x2) plane grouped into clusters]
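The deck lists K-means as its unsupervised method; a minimal sketch with scikit-learn (an illustrative assumption) that groups unlabeled points in the (x1, x2) plane:

    import numpy as np
    from sklearn.cluster import KMeans

    rng = np.random.RandomState(0)
    # Two unlabeled blobs in the (x1, x2) plane.
    X = np.vstack([rng.randn(100, 2) + 2.0, rng.randn(100, 2) - 2.0])

    km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
    print(km.cluster_centers_)                 # learned cluster centers
    print(km.labels_[:5], km.labels_[-5:])     # cluster assignments for the points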
References
General:
Tom Mitchell, Machine Learning, McGraw Hill, 1997
Christopher Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 1995
Adaboost:
Friedman, Hastie, and Tibshirani, "Additive logistic regression: a statistical view of boosting", Annals of Statistics, 2000
SVMs:
Project ideas?
Discussion of Rosch