Evaluating Hypotheses

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Evaluating Classifiers
Sampling: Final and Initial Sample Size Determination
POINT ESTIMATION AND INTERVAL ESTIMATION
Class notes for ISE 201 San Jose State University
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Evaluating Classifiers Lecture 2 Instructor: Max Welling Read chapter 5.
Evaluating Hypotheses
Evaluating Classifiers Lecture 2 Instructor: Max Welling.
Chapter 8 Estimation: Single Population
Experimental Evaluation
Inferences About Process Quality
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Note 9 of 5E Statistics with Economics and Business Applications Chapter 7 Estimation of Means and Proportions Point Estimation, Interval Estimation/Confidence.
Standard error of estimate & Confidence interval.
Review of normal distribution. Exercise Solution.
1 Machine Learning: Lecture 5 Experimental Evaluation of Learning Algorithms (Based on Chapter 5 of Mitchell T.., Machine Learning, 1997)
Chapter 7 Estimation: Single Population
AP Statistics Chapter 9 Notes.
1 Machine Learning: Experimental Evaluation. 2 Motivation Evaluating the performance of learning systems is important because: –Learning systems are usually.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 9 Samples.
Determination of Sample Size: A Review of Statistical Theory
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Estimation Chapter 8. Estimating µ When σ Is Known.
CpSc 881: Machine Learning Evaluating Hypotheses.
机器学习 陈昱 北京大学计算机科学技术研究所 信息安全工程研究中心. 课程基本信息  主讲教师:陈昱 Tel :  助教:程再兴, Tel :  课程网页:
Machine Learning Chapter 5. Evaluating Hypotheses
1 CSI5388 Current Approaches to Evaluation (Based on Chapter 5 of Mitchell T.., Machine Learning, 1997)
Chapter5: Evaluating Hypothesis. 개요 개요 Evaluating the accuracy of hypotheses is fundamental to ML. - to decide whether to use this hypothesis - integral.
1 Evaluation of Learning Models Literature: Literature: T. Mitchel, Machine Learning, chapter 5 T. Mitchel, Machine Learning, chapter 5 I.H. Witten and.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
© Copyright McGraw-Hill 2004
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Empirical Evaluation (Ch 5) how accurate is a hypothesis/model/dec.tree? given 2 hypotheses, which is better? accuracy on training set is biased – error:
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Chapter 6: Sampling Distributions
Evaluating Hypotheses
Lecture Slides Essentials of Statistics 5th Edition
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
ESTIMATION.
STATISTICAL INFERENCE
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Evaluating Classifiers
Elementary Statistics
Empirical Evaluation (Ch 5)
Introduction to Probability and Statistics
Problems: Q&A chapter 6, problems Chapter 6:
MATH 2311 Section 4.4.
Computational Learning Theory
CHAPTER 15 SUMMARY Chapter Specifics
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter Outline Inferences About the Difference Between Two Population Means: s 1 and s 2 Known.
Evaluating Hypothesis
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Warmup Which of the distributions is an unbiased estimator?
Chapter 8 Estimation.
Chapter 8: Estimating with Confidence
Machine Learning: Lecture 5
How Confident Are You?.
MATH 2311 Section 4.4.
Presentation transcript:

Evaluating Hypotheses Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal distribution, Central Limit Theorem Paired t-tests Comparing Learning Methods CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Problems Estimating Error 1. Bias: If S is training set, errorS(h) is optimistically biased For unbiased estimate, h and S must be chosen independently 2. Variance: Even with unbiased S, errorS(h) may still vary from errorD(h) CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Two Definitions of Error The true error of hypothesis h with respect to target function f and distribution D is the probability that h will misclassify an instance drawn at random according to D. The sample error of h with respect to target function f and data sample S is the proportion of examples h misclassifies How well does errorS(h) estimate errorD(h)? CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Chapter 5 Evaluating Hypotheses Example Hypothesis h misclassifies 12 of 40 examples in S. What is errorD(h)? CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Chapter 5 Evaluating Hypotheses Estimators Experiment: 1. Choose sample S of size n according to distribution D 2. Measure errorS(h) errorS(h) is a random variable (i.e., result of an experiment) errorS(h) is an unbiased estimator for errorD(h) Given observed errorS(h) what can we conclude about errorD(h)? CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Chapter 5 Evaluating Hypotheses Confidence Intervals If S contains n examples, drawn independently of h and each other Then With approximately N% probability, errorD(h) lies in interval CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Chapter 5 Evaluating Hypotheses Confidence Intervals If S contains n examples, drawn independently of h and each other Then With approximately 95% probability, errorD(h) lies in interval CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

errorS(h) is a Random Variable Rerun experiment with different randomly drawn S (size n) Probability of observing r misclassified examples: CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Binomial Probability Distribution CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Normal Probability Distribution CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Normal Distribution Approximates Binomial CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Normal Probability Distribution CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Confidence Intervals, More Correctly If S contains n examples, drawn independently of h and each other Then With approximately 95% probability, errorS(h) lies in interval equivalently, errorD(h) lies in interval which is approximately CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Calculating Confidence Intervals 1. Pick parameter p to estimate errorD(h) 2. Choose an estimator errorS(h) 3. Determine probability distribution that governs estimator errorS(h) governed by Binomial distribution, approximated by Normal when 4. Find interval (L,U) such that N% of probability mass falls in the interval Use table of zN values CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Chapter 5 Evaluating Hypotheses Central Limit Theorem CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Difference Between Hypotheses CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Paired t test to Compare hA,hB CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Comparing Learning Algorithms LA and LB CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Comparing Learning Algorithms LA and LB What we would like to estimate: where L(S) is the hypothesis output by learner L using training set S i.e., the expected difference in true error between hypotheses output by learners LA and LB, when trained using randomly selected training sets S drawn according to distribution D. But, given limited data D0, what is a good estimator? Could partition D0 into training set S and training set T0 and measure even better, repeat this many times and average the results (next slide) CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses

Comparing Learning Algorithms LA and LB Notice we would like to use the paired t test on to obtain a confidence interval But not really correct, because the training sets in this algorithm are not independent (they overlap!) More correct to view algorithm as producing an estimate of instead of but even this approximation is better than no comparison CS 5751 Machine Learning Chapter 5 Evaluating Hypotheses