Type I & Type II Errors, Power of a Statistical Test, & Effect Size four of the most confusing topics in introductory statistics packaged in a way that.

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

 When we perform a hypothesis test, we make a decision to either Reject the Null Hypothesis or Fail to Reject the Null Hypothesis.  There is always the.
CHAPTER 21 More About Tests: “What Can Go Wrong?”.
Inference Sampling distributions Hypothesis testing.
Chapter 10: Hypothesis Testing
Hypothesis testing & Inferential Statistics
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Warm-up Day of 8.1 and 8.2 Quiz and Types of Errors Notes.
Intermediate Statistical Analysis Professor K. Leppel.
Hypothesis Testing.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
STT 315 This lecture is based on Chapter 6. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Using Inference to Make Decisions
Chapter 3 Investigating Independence Objectives Students will be able to: 1) Understand what it means for attempts to be independent 2) Determine when.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Presentation on Type I and Type II Errors How can someone be arrested if they really are presumed innocent? Why do some individuals who really are guilty.
Chapter 11 Testing Hypotheses about Proportions © 2010 Pearson Education 1.
Warm-up Day of 8.1 and 8.2 Review. 8.2 P#20, 23 and 24 P#20 a. and b. c. Since the p-hat is along the line for reasonably likely events.
Hypotheses tests for means
1 Lecture note 4 Hypothesis Testing Significant Difference ©
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Errors & Power. 2 Results of Significant Test 1. P-value < alpha 2. P-value > alpha Reject H o & conclude H a in context Fail to reject H o & cannot conclude.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
The use & abuse of tests Statistical significance ≠ practical significance Significance ≠ proof of effect (confounds) Lack of significance ≠ lack of effect.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Errors in Hypothesis Tests Notes: P When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility.
Errors in Hypothesis Tests. When you perform a hypothesis test you make a decision: When you make one of these decisions, there is a possibility that.
Type I and Type II Error AP Stat Review, April 18, 2009.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Type I and Type II Errors. An analogy may help us to understand two types of errors we can make with inference. Consider the judicial system in the US.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Welcome to MM570 Psychological Statistics
Sampling Distributions Statistics Introduction Let’s assume that the IQ in the population has a mean (  ) of 100 and a standard deviation (  )
1 Section 8.2 Basics of Hypothesis Testing Objective For a population parameter (p, µ, σ) we wish to test whether a predicted value is close to the actual.
If we fail to reject the null when the null is false what type of error was made? Type II.
Significance Tests: The Basics Textbook Section 9.1.
Tests about a Population Proportion Textbook Section 9.2.
Errors in Hypothesis Tests
Unit 4 – Inference from Data: Principles
Section Testing a Proportion
Errors in Hypothesis Tests
Type I and Type II Errors
CHAPTER 9 Testing a Claim
Welcome to Week 08 College Statistics
CHAPTER 9 Testing a Claim
Errors in Hypothesis Tests
Errors in Hypothesis Tests
P-value Approach for Test Conclusion
Using Inference to Make Decisions
Type I and Type II Error AP Stat February 28th 2011.
Errors in Hypothesis Tests
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
Errors in Hypothesis Tests
Errors in Hypothesis Tests
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Errors in Hypothesis Tests
Power and Error What is it?.
Presentation transcript:

Type I & Type II Errors, Power of a Statistical Test, & Effect Size four of the most confusing topics in introductory statistics packaged in a way that will hopefully make them clear Nicole Radziwill, James Madison / Feel free to use & share with citation!

Type I & Type II Errors When you perform a statistical test, you’re only taking ONE SAMPLE from a population – and there are tons of different samples you could potentially be collecting. You have to think about your sample in the context of ALL the potential samples you could have collected… fortunately this is made easy thanks to the sampling distributions of proportions and means. Type I Error, also called , is the likelihood that you incorrectly reject the null hypothesis Type II Error, also called , is the likelihood that you fail to detect the effect that your alternative hypothesis is trying to uncover

The Pregnancy Test Example Reality What the Pregnancy Test Said H 0 True (Really Are Pregnant) H 0 False (Really NOT pregnant) POS (Reject H 0 ) NEG (Fail to Reject H 0 ) H 0 : You are not pregnant. H A : You ARE pregnant. There are four things that can happen when you take the pregnancy test: The test can be ACCURATE, and say that: 1) you’re pregnant when you actually are, OR 2) you’re not pregnant when you actually AREN’T Or it can be INACCURATE, and say that: 3) you’re pregnant when you’re NOT (a FALSE ALARM), or 4) you’re not pregnant when you actually ARE…. (a FAILURE TO DETECT the effect) H 0 True (Really NOT pregnant) H 0 False (Really ARE pregnant)

The Pregnancy Test Example Reality What the Pregnancy Test Said H 0 True (Really Are Pregnant) H 0 False (Really NOT pregnant) POS (Reject H 0 ) NEG (Fail to Reject H 0 ) H 0 : You are not pregnant. H A : You ARE pregnant. There are four things that can happen when you take the pregnancy test: The test can be ACCURATE, and say that: 1) you’re pregnant when you actually are, OR 2) you’re not pregnant when you actually AREN’T Or it can be INACCURATE, and say that: 3) you’re pregnant when you’re NOT (a FALSE ALARM), or 4) you’re not pregnant when you actually ARE…. (a FAILURE TO DETECT the effect)  = probability of this test raising a FALSE ALARM  = probability of this test FAILING TO DETECT your pregnancy! H 0 True (Really NOT pregnant) H 0 False (Really ARE pregnant)

The Pregnancy Test Example Reality What the Pregnancy Test Said H 0 True (Really Are Pregnant) H 0 False (Really NOT pregnant) POS (Reject H 0 ) NEG (Fail to Reject H 0 ) H 0 : You are not pregnant. H A : You ARE pregnant. If Type II Error is the probability that a statistical test performed on this particular sample FAILS TO DETECT THE EFFECT (pregnancy), then what is the probability that a test using this sample will SUCCESSFULLY DETECT THE EFFECT? P(successfully detecting the effect) = 1 – P(NOT detecting the effect) = 1 -  This (1 -  ) is called The POWER OF THE TEST!  = probability of this test raising a FALSE ALARM  = probability of this test FAILING TO DETECT your pregnancy! H 0 True (Really NOT pregnant) H 0 False (Really ARE pregnant)

Consequences of Pregnancy Test Errors Type I Error – The test raised a false alarm and got you either very worried or excited. You may have already run to the store to buy baby supplies, incurring unnecessary costs. You may have spent days or weeks panicking until you realized that the test was faulty. Type II Error - The test failed to detect your pregnancy and you didn’t stop drinking or smoking, therefore potentially harming a life. You didn’t seek prenatal medical attention. In a case like this, the test designers need to think about how to minimize BOTH Type I and Type II Errors. There are psychological and cost ramifications if either kind happens.

Getting Picked Up By the Cops Reality What the Cop Decided H 0 True (Really NOT Guilty; it was a SKUNK) H 0 False (Really ARE Guilty) I’m Taking You to Jail (Reject H 0 ) I’m Letting You Go (Fail to Reject H 0 ) The scenario: A cop pulls you over and says he smells pot in your car. He is trying to figure out whether or not to take you to jail. The cop needs to do a little hypothesis test in his head to figure out whether to bring you in. H 0 : You are not guilty. H A : You ARE guilty. You have been smoking pot in your car. There are four things that can happen here: The cop can be ACCURATE, and say that: 1) you’re guilty when you actually are, OR 2) you’re not guilty and you haven’t been smoking pot Or he can be INACCURATE, and say that: 3) you’re guilty when you’re NOT (a FALSE ARREST), or 4) you’re not guilty when you actually ARE…. (a FAILURE TO DETECT the effect and a GET OUT OF JAIL FREE card)

Getting Picked Up By the Cops Reality What the Cop Decided H 0 True (Really NOT Guilty; it was a SKUNK) H 0 False (Really ARE Guilty) I’m Taking You to Jail (Reject H 0 ) I’m Letting You Go (Fail to Reject H 0 ) The scenario: A cop pulls you over and says he smells pot in your car. He is trying to figure out whether or not to take you to jail. The cop needs to do a little hypothesis test in his head to figure out whether to bring you in. H 0 : You are not guilty. H A : You ARE guilty. You have been smoking pot in your car. There are four things that can happen here: The cop can be ACCURATE, and say that: 1) you’re guilty when you actually are, OR 2) you’re not guilty and you haven’t been smoking pot Or he can be INACCURATE, and say that: 3) you’re guilty when you’re NOT (a FALSE ARREST), or 4) you’re not guilty when you actually ARE…. (a FAILURE TO DETECT the effect and a GET OUT OF JAIL FREE card)  = probability of this test leading to FALSE ARREST  = probability of this cop FAILING TO DETECT your pot smoking!

Consequences of Cop Errors Type I Error – The test raised a false alarm and got you sent to jail even though you didn’t deserve it. It cost you time and maybe even cost you money – bailing yourself out, or fighting court fees!! Type II Error - The cop let you go when he had reason to send you to jail! Probably good for you, bad for the cop (who may be trying to make his arrest quota for the month). Possibly bad for society, but other factors would have to be considered. Probably best to focus on keeping the Type I Error as low as possible in these cases – it’s more problematic to have a lot of false alarms than to let a few pot smokers off free here and there.

Getting Picked Up By the Cops #2 Reality What the Cop Decided H 0 True (Really NOT Guilty; you just look sketchy) H 0 False (Really ARE Guilty) I’m Taking You to Jail (Reject H 0 ) I’m Letting You Go (Fail to Reject H 0 ) The scenario: A cop pulls you over and suspects that you have just committed a murder. He is trying to figure out whether or not to take you to jail. The cop needs to do a little hypothesis test in his head to figure out whether to bring you in. H 0 : You are not guilty. H A : You ARE guilty. You are hiding evidence and a body in your trunk. There are four things that can happen here: The cop can be ACCURATE, and say that: 1) you’re guilty when you actually are, OR 2) you’re not guilty - you haven’t just killed someone Or he can be INACCURATE, and say that: 3) you’re guilty when you’re NOT (a FALSE ARREST), or 4) you’re not guilty when you actually ARE…. (a FAILURE TO DETECT the murder and eluding the law)

Getting Picked Up By the Cops #2 Reality What the Cop Decided H 0 True (Really NOT Guilty; you just look sketchy) H 0 False (Really ARE Guilty) I’m Taking You to Jail (Reject H 0 ) I’m Letting You Go (Fail to Reject H 0 )  = probability of this test leading to FALSE ARREST  = probability of this cop FAILING TO DETECT the murder! The scenario: A cop pulls you over and suspects that you have just committed a murder. He is trying to figure out whether or not to take you to jail. The cop needs to do a little hypothesis test in his head to figure out whether to bring you in. H 0 : You are not guilty. H A : You ARE guilty. You are hiding evidence and a body in your trunk. There are four things that can happen here: The cop can be ACCURATE, and say that: 1) you’re guilty when you actually are, OR 2) you’re not guilty – you haven’t just killed someone Or he can be INACCURATE, and say that: 3) you’re guilty when you’re NOT (a FALSE ARREST), or 4) you’re not guilty when you actually ARE…. (a FAILURE TO DETECT the murder and eluding the law)

Consequences of Cop Errors (#2) Type I Error – The test raised a false alarm and got you sent to jail and into a HUGE legal mess even though you didn’t deserve it. It costs you time and will probably cost you tons of money – bailing yourself out, court fees, lawyers… a trial!! Type II Error - The cop let you go when you had a body in the trunk! Probably good for you, bad for the cop, and VERY BAD for society. I’d want to keep the Type II Error as low as possible in this case, and risk some false alarms to AVOID letting killers go. Would you?? (Notice that keeping the Type II Error low ALSO keeps the power of the test pretty high.)

Power of the Test Power = 1 -  It’s the probability of successfully detecting an effect The power of the test INCREASES as the effect size increases… So girls, if you are just ONE day late, the effect size is small – it will be difficult for that pregnancy test to give you an accurate result. You’ll have a high Type II Error at this time. If you are a week late, the effect size is bigger – the Type II Error will be lower, because the pregnancy is easier to detect. If you are a month late, the effect size is HUGE and the Type II Error will be even smaller.

Interrelationships P 0 is the proportion that you’re assuming is true – often a standard or “known” value P* is the TRUE population proportion The bigger the difference between P 0 and P*, the bigger the EFFECT SIZE (the bottom curve shifts to the right as effect size gets bigger) If the proportion you measured from your sample is BIGGER than the real proportion P*, you’ll incorrectly reject the null and incur a Type I Error. If the proportion you measured from your sample is LESS than the real proportion P*, but STILL GREATER than the values in the top curve that are way out to the left all on their own, You’ll fail to reject the null and incur a Type II Error.