Chapter 21: More About Tests

Slides:



Advertisements
Similar presentations
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Advertisements

Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Significance Tests About
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Copyright © 2010 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Chapter 9 Hypothesis Testing.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
Warm-up Day of 8.1 and 8.2 Quiz and Types of Errors Notes.
Overview Definition Hypothesis
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 9 Introduction to Hypothesis Testing.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 20 Testing Hypotheses About Proportions.
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
More About Tests and Intervals Chapter 21. Zero In on the Null Null hypotheses have special requirements. To perform a hypothesis test, the null must.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 21: More About Tests “The wise man proportions his belief to the evidence.” -David Hume 1748.
Copyright © 2009 Pearson Education, Inc. Chapter 21 More About Tests.
Chapter 11 Testing Hypotheses about Proportions © 2010 Pearson Education 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Warm-up Day of 8.1 and 8.2 Review. 8.2 P#20, 23 and 24 P#20 a. and b. c. Since the p-hat is along the line for reasonably likely events.
Chapter 20 Testing hypotheses about proportions
Hypotheses tests for means
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
Copyright © 2012 Pearson Education. Chapter 13 Testing Hypotheses.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Significance Test A claim is made. Is the claim true? Is the claim false?
Chapter 8 Testing Hypotheses about Proportions Part II: Significance Levels, Type I and Type II Errors, Power 1.
Copyright © 2010 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Chapter 21: More About Test & Intervals
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 20, Slide 1 Chapter 20 More about Tests and Intervals.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
Type I and Type II Errors. An analogy may help us to understand two types of errors we can make with inference. Consider the judicial system in the US.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Slide 21-1 Copyright © 2004 Pearson Education, Inc.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Chapter 21 More About Hypothesis Tests Using a Single Sample.
Chapter 20 Testing Hypotheses About Proportions. confidence intervals and hypothesis tests go hand in hand:  A confidence interval shows us the range.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
More about tests and intervals CHAPTER 21. Do not state your claim as the null hypothesis, instead make what you’re trying to prove the alternative. The.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Copyright © 2010 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 21 More About Tests and Intervals.
Section Testing a Proportion
Chapter 21 More About Tests.
More about Tests and Intervals
More About Tests and Intervals
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 9: Testing a Claim
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Chapter 21: More About Tests AP Statistics

Null Hypothesis Stating it can sometimes be tricky If event is random or result of “Guessing”, then many times your null is unusual, such as or Especially when testing to see if “coin is fair” or to see if “someone has ESP and can predict which closed hand contains a prize” or to see if “die is fair”

Alpha Levels (Significant Levels) If P-Value is small, it tells us that our data is rare given the null hypothesis How rare is rare? How low must it be to reject the null hypothesis? We arbitrarily set a threshold for P-value and if it is below that value, we reject the null hypothesis This threshold is called the ALPHA LEVEL

Alpha Levels (Significant Levels) We denote the Alpha Level: Also called the significant level because values under the are considered statistically significant. When we reject the null, “the test is significant at the =.05 level.” Always select before you look at data Always report P-value and level in conclusion

Common Alpha Levels  1-sided 2-sided 0.05 1.645 1.96 0.01 2.28 2.575 0.001 3.09 3.29

When the alternative is one-sided, the critical value puts all of  on one side: When the alternative is two-sided, the critical value splits  equally into two tails:

Practical vs Statistical Significance For larger sample sizes, small unimportant deviations from the null can be statistically significant. For smaller sample sizes, large seemingly important deviations from the null may not be statistically significant. Also, always do a reality check—what is the big deal about that difference?

Confident Intervals You can approximate a hypothesis by examining a confidence interval Just ask whether the null is consistent with a CI for the parameter at the corresponding confidence interval A 95% Confidence interval corresponds to a two-sided hypothesis test at (sum of the two tails) A 95% Confidence interval corresponds to a one-sided hypothesis test at

Errors Here’s some shocking news for you: nobody’s perfect. Even with lots of evidence we can still make the wrong decision. When we perform a hypothesis test, we can make mistakes in two ways: The null hypothesis is true, but we mistakenly reject it. (Type I error) The null hypothesis is false, but we fail to reject it. (Type II error)

Errors

Errors Type I: You are healthy, but a test says you have a disease (false positive) Type II: You are not healthy, but the test says you do not have a disease (false negative) ___________________________________ Type I: Jury convicts a innocent person Type II: Jury fails to convict a guilty person

Type I Errors How often does a Type I Error Occur? Happens when null hypothesis is true, but you have the misfortune to draw the unusual sample. To reject null hypothesis, P-value must fall below When the null hypothesis is true, but we reject it the P-value is exactly When you set , you are setting the probability of a Type I error Remember, you can only have a Type I error if the null hypothesis is true

Type II Error What happens if Null Hypothesis is not true? If null hypothesis is false and we reject if, we have done the correct thing. If the null hypothesis is false and we fail to reject it, we have committed a Type II Error. The probability that this error occurs is denoted by

Errors in General Review Probability of Type I Error = Probability of Type II Error =

Reducing Error Neither Error is good Difficulty is reducing one, without increasing the other. Imagine: To reduce Type I error, I reduce my ; however, then my , or Type II error would increase. The only way to reduce both errors is to collect more evidence—more data Many times studies fail because sample sizes are too small to detect the change they are looking for When designing a survey or experiment it is a good idea to calculate , for a reasonable .

Power—related to reducing error It is natural to think that if we failed to reject the null hypothesis we did not look hard enough and made the wrong decision. Is the null hypothesis really false and our test was too weak to detect the strength of the difference? We want a test that is strong enough to make the right decision when should be rejecting the null hypothesis when it really is false The POWER of the test tells us how strong our test is in rejecting a false null hypothesis.

Power When power is high, we can be confident that we looked hard enough. High Power tells us that our test is strong and has a very good chance of detecting a false null hypothesis (very good chance of NOT making a Type II Error) Power is calculated by: This is the complement of making a Type II Error

Power Whenever a study fails to reject the Null Hypothesis, the Power of the test comes into play. When we calculate Power, we imagine the null hypothesis is FALSE. The value of Power depends on how far the truth lies from the null hypothesis—this distance is called the “effect size”

Power

Notice from visual: Power = Reducing to lower Type I error will move the critical value, to the right and have the effect of increasing the probability of a Type II error and consequentially reducing Power Notice that the large the effect size, the smaller the chance of making a Type II Error and the greater that power of the test.

Reducing Both Type I and Type II Error This was discussed earlier—increase the sample size. The effect of a larger sample size can be seen below. An increased sample size will reduce the standard deviation, thus making the curves narrower.

Reducing Both Type I and Type II Error (cont.) Original comparison of errors: Comparison of errors with a larger sample size: