Introduction to Machine Learning 236756 Nir Ailon Lecture 11: Probabilistic Models.

Slides:

Advertisements

Similar presentations

: INTRODUCTION TO Machine Learning Parametric Methods.

Advertisements

Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.

Pattern Recognition and Machine Learning

ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.

LECTURE 11: BAYESIAN PARAMETER ESTIMATION

Applied Probability Lecture 5 Tina Kapur

Chapter 4: Linear Models for Classification

Laboratory for Social & Neural Systems Research (SNS) PATTERN RECOGNITION AND MACHINE LEARNING Institute of Empirical Research in Economics (IEW)

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

Machine Learning CMPT 726 Simon Fraser University CHAPTER 1: INTRODUCTION.

Today Linear Regression Logistic Regression Bayesians v. Frequentists

Kernel Methods Part 2 Bing Han June 26, Local Likelihood Logistic Regression.

CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation X = {

CHAPTER 4: Parametric Methods. Lecture Notes for E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Parametric Estimation Given.

Thanks to Nir Friedman, HU

Bayes Classifier, Linear Regression 10701/15781 Recitation January 29, 2008 Parts of the slides are from previous years’ recitation and lecture notes,

Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.

Crash Course on Machine Learning

ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:

ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Naïve Bayes Readings: Barber

Bayesian Inference Ekaterina Lomakina TNU seminar: Bayesian inference 1 March 2013.

CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.

CS Statistical Machine learning Lecture 10 Yuan (Alan) Qi Purdue CS Sept

CSE 446 Logistic Regression Winter 2012 Dan Weld Some slides from Carlos Guestrin, Luke Zettlemoyer.

1 Generative and Discriminative Models Jie Tang Department of Computer Science & Technology Tsinghua University 2012.

Machine Learning CUNY Graduate Center Lecture 4: Logistic Regression.

Optimal Bayes Classification

ECE 5984: Introduction to Machine Learning Dhruv Batra Virginia Tech Topics: –Classification: Logistic Regression –NB & LR connections Readings: Barber.

KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN © The MIT Press, Lecture.

Machine Learning 5. Parametric Methods.

Lecture 3: MLE, Bayes Learning, and Maximum Entropy

Generative classifiers: The Gaussian classifier Ata Kaban School of Computer Science University of Birmingham.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.

KNN & Naïve Bayes Hongning Wang

COMP24111 Machine Learning Naïve Bayes Classifier Ke Chen.

ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.

1 1)Bayes’ Theorem 2)MAP, ML Hypothesis 3)Bayes optimal & Naïve Bayes classifiers IES 511 Machine Learning Dr. Türker İnce (Lecture notes by Prof. T. M.

Crash course in probability theory and statistics – part 2 Machine Learning, Wed Apr 16, 2008.

Bayesian Learning Reading: Tom Mitchell, “Generative and discriminative classifiers: Naive Bayes and logistic regression”, Sections 1-2. (Linked from.

Oliver Schulte Machine Learning 726

Matt Gormley Lecture 3 September 7, 2016

Probability Theory and Parameter Estimation I

Variational Bayes Model Selection for Mixture Distribution

Ch3: Model Building through Regression

Linear Regression (continued)

3(+1) classifiers from the Bayesian world

Summary Tel Aviv University 2016/2017 Slava Novgorodov

Bayes Net Learning: Bayesian Approaches

ECE 5424: Introduction to Machine Learning

ECE 5424: Introduction to Machine Learning

Oliver Schulte Machine Learning 726

ECE 5424: Introduction to Machine Learning

Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)

with observed random variables

Probabilistic Models with Latent Variables

Revision (Part II) Ke Chen

Generative Models and Naïve Bayes

Pattern Recognition and Machine Learning

LECTURE 07: BAYESIAN ESTIMATION

Multivariate Methods Berlin Chen

Multivariate Methods Berlin Chen, 2005 References:

Generative Models and Naïve Bayes

Recap: Naïve Bayes classifier

Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)

Naïve Bayes Classifier

Presentation transcript:

Introduction to Machine Learning Nir Ailon Lecture 11: Probabilistic Models

Most of the Course So Far: Discriminative Approach “Bayes Optimal”

ERM Can Sometimes Be Viewed as Discriminative Approach for a ``Made Up’’ Probabilistic Method Gaussian Density

Class-Conditional Density Class Prior

Why Not Generative Approach

Why Generative Approach?

Stats 101: Maximum Likelihood Estimator (MLE)

Example: MLE For Biased Coin

Abuse of notation! Should be density… MLE for Continuous R.V.’s

Naïve Bayes Approach Conditional Independence

Naïve Bayes Classifier (Binary Case) It’s a linear model!

Depends on coordinate only Depends on coordinate & label Naïve Bayes Classifier (Gaussian Case) It’s a linear model!

(Gaussian) Naïve Bayes vs Linear Regression

Bayesian Reasoning

Bayesian Priors vs SRM

Because of conditional independence Posterior Bayesian Reasoning Bayes Average Laplace Smoothing

Difficulties in Bayes Reasoning

MAP

Summary