Introduction to Machine Learning 236756 Nir Ailon Lecture 11: Probabilistic Models.

Slides:



Advertisements
Similar presentations
: INTRODUCTION TO Machine Learning Parametric Methods.
Advertisements

Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Pattern Recognition and Machine Learning
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
LECTURE 11: BAYESIAN PARAMETER ESTIMATION
Applied Probability Lecture 5 Tina Kapur
Chapter 4: Linear Models for Classification
Laboratory for Social & Neural Systems Research (SNS) PATTERN RECOGNITION AND MACHINE LEARNING Institute of Empirical Research in Economics (IEW)
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Machine Learning CMPT 726 Simon Fraser University CHAPTER 1: INTRODUCTION.
Today Linear Regression Logistic Regression Bayesians v. Frequentists
Kernel Methods Part 2 Bing Han June 26, Local Likelihood Logistic Regression.
CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation X = {
CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation Given.
Thanks to Nir Friedman, HU
Bayes Classifier, Linear Regression 10701/15781 Recitation January 29, 2008 Parts of the slides are from previous years’ recitation and lecture notes,
Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.
Crash Course on Machine Learning
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Naïve Bayes Readings: Barber
Bayesian Inference Ekaterina Lomakina TNU seminar: Bayesian inference 1 March 2013.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
CS Statistical Machine learning Lecture 10 Yuan (Alan) Qi Purdue CS Sept
CSE 446 Logistic Regression Winter 2012 Dan Weld Some slides from Carlos Guestrin, Luke Zettlemoyer.
1 Generative and Discriminative Models Jie Tang Department of Computer Science & Technology Tsinghua University 2012.
Machine Learning CUNY Graduate Center Lecture 4: Logistic Regression.
Optimal Bayes Classification
ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Logistic Regression –NB & LR connections Readings: Barber.
KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN © The MIT Press, Lecture.
Machine Learning 5. Parametric Methods.
Lecture 3: MLE, Bayes Learning, and Maximum Entropy
Generative classifiers: The Gaussian classifier Ata Kaban School of Computer Science University of Birmingham.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
KNN & Naïve Bayes Hongning Wang
COMP24111 Machine Learning Naïve Bayes Classifier Ke Chen.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
1 1)Bayes’ Theorem 2)MAP, ML Hypothesis 3)Bayes optimal & Naïve Bayes classifiers IES 511 Machine Learning Dr. Türker İnce (Lecture notes by Prof. T. M.
Crash course in probability theory and statistics – part 2 Machine Learning, Wed Apr 16, 2008.
Bayesian Learning Reading: Tom Mitchell, “Generative and discriminative classifiers: Naive Bayes and logistic regression”, Sections 1-2. (Linked from.
Oliver Schulte Machine Learning 726
Matt Gormley Lecture 3 September 7, 2016
Probability Theory and Parameter Estimation I
Variational Bayes Model Selection for Mixture Distribution
Ch3: Model Building through Regression
Linear Regression (continued)
3(+1) classifiers from the Bayesian world
Summary Tel Aviv University 2016/2017 Slava Novgorodov
Bayes Net Learning: Bayesian Approaches
ECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning
Oliver Schulte Machine Learning 726
ECE 5424: Introduction to Machine Learning
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
with observed random variables
Probabilistic Models with Latent Variables
Revision (Part II) Ke Chen
Generative Models and Naïve Bayes
Pattern Recognition and Machine Learning
LECTURE 07: BAYESIAN ESTIMATION
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
Generative Models and Naïve Bayes
Recap: Naïve Bayes classifier
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Naïve Bayes Classifier
Presentation transcript:

Introduction to Machine Learning Nir Ailon Lecture 11: Probabilistic Models

Most of the Course So Far: Discriminative Approach “Bayes Optimal”

ERM Can Sometimes Be Viewed as Discriminative Approach for a ``Made Up’’ Probabilistic Method Gaussian Density

Class-Conditional Density Class Prior

Why Not Generative Approach

Why Generative Approach?

Stats 101: Maximum Likelihood Estimator (MLE)

Example: MLE For Biased Coin

Abuse of notation! Should be density… MLE for Continuous R.V.’s

Naïve Bayes Approach Conditional Independence

Naïve Bayes Classifier (Binary Case) It’s a linear model!

Depends on coordinate only Depends on coordinate & label Naïve Bayes Classifier (Gaussian Case) It’s a linear model!

(Gaussian) Naïve Bayes vs Linear Regression

Bayesian Reasoning

Bayesian Priors vs SRM

Because of conditional independence Posterior Bayesian Reasoning Bayes Average Laplace Smoothing

Difficulties in Bayes Reasoning

MAP

Summary