The Relationship between Confidence Intervals & Hypothesis Tests

Slides:



Advertisements
Similar presentations
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Advertisements

Final Jeopardy $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 LosingConfidenceLosingConfidenceTesting.
Chapter 9 Hypothesis Testing.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Dan Piett STAT West Virginia University
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
M33 Confidence intervals 1  Department of ISM, University of Alabama, Confidence Intervals Estimation.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Hypotheses tests for means
Large sample CI for μ Small sample CI for μ Large sample CI for p
M33 Confidence intervals 1  Department of ISM, University of Alabama, Confidence Interval Estimation.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Testing the Differences between Means Statistics for Political Science Levin and Fox Chapter Seven 1.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Assumptions 1) Sample is large (n > 30) a) Central limit theorem applies b) Can.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-3 Inferences About Two Means:
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 9 Introduction to the t Statistic
Ex St 801 Statistical Methods Part 2 Inference about a Single Population Mean (HYP)
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Hypothesis Testing – Two Population Variances
Tests About a Population Proportion
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 8 Estimating with Confidence
Unit 5 – Chapters 10 and 12 What happens if we don’t know the values of population parameters like and ? Can we estimate their values somehow?
CHAPTER 9 Testing a Claim
Chapter 9 Hypothesis Testing.
Inference about Two Means - Independent Samples
Lecture Nine - Twelve Tests of Significance.
Hypothesis Testing I The One-sample Case
Assumptions For testing a claim about the mean of a single population
Exercises #8.74, 8.78 on page 403 #8.110, on page 416
Chapter 9: Inferences Involving One Population
Chapter 21 More About Tests.
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 9 Testing a Claim
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Hypothesis Testing for Proportions
Chapter 8 Hypothesis Testing with Two Samples.
CHAPTER 9 Testing a Claim
Hypothesis Tests for 1-Sample Proportion
Significance Tests: A Four-Step Process
Hypothesis Testing: Hypotheses
More about Tests and Intervals
Inferential Statistics
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Hypothesis Tests for a Population Mean in Practice
Confidence Intervals for Proportions
Chapter Review Problems
Elementary Statistics
Elementary Statistics
STATISTICS INFORMED DECISIONS USING DATA
Chapter 9: Hypothesis Tests Based on a Single Sample
Interval Estimation and Hypothesis Testing
Comparing Two Populations
Chapter 9 Testing a Claim
CHAPTER 9 Testing a Claim
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Section 10.2 Comparing Two Means.
Unit 5: Hypothesis Testing
Section 8.2 Day 2.
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Inferences from Matched Pairs
STA 291 Summer 2008 Lecture 21 Dustin Lueker.
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Presentation transcript:

The Relationship between Confidence Intervals & Hypothesis Tests DUALITY STICKS The Relationship between Confidence Intervals & Hypothesis Tests John Krickl Glenbrook North High School Northbrook, IL jkrickl@glenbrook225.org

Motivation

How can we make this STATEMENT more meaningful to students? “Using the same sample, a two-tailed hypothesis test with significance level α will generally* make the same decision about a hypothesized value of a parameter as will a 100(1 - α)% confidence interval.” *The difference between does not make this relationship perfect, but I do not believe this detracts from the big picture of the activity. Also, this difference makes for a great discussion in class. (See slide 16 for more reflection on this)

Construction Popsicle stick Paper clip Pipe cleaners Red pipe cleaner: cut to (just over) 29 cm Green pipe cleaner: cut to (just under) 25 cm

RESULT

Logistics For the sake of simplicity and multiple uses, this activity is constructed visualizing a 95% confidence interval for a population proportion in conjunction with the rejection region of a two-tailed, one proportion z-test with alpha = .05.

LOGISTICS One standard deviation will be represented by 7.5 cm when drawn on the board. Thus: Red pipe cleaner (29.4 cm) represents a 95% confidence interval Green pipe cleaner (24.7 cm) represents a 90% confidence interval Having this consistent scale allows you to quickly recreate a sketch of the standard normal distribution that will match up with the sticks we have created

I start by having the students put the popsicle stick at the center of the distribution. While this is a sketch of the standard normal distribution, I put p0 in parentheses below z = 0 to remind students than an obtained p-hat equal to p0 corresponds with a test statistic of 0.

Both our CI and our hypothesis test do not find evidence against p0.

The A-HA moment for many students. Students notice that when our obtained p-hat (or corresponding test statistic) is on the cusp of the rejection region, our 95% CI just barely captures p0. The instant we move further right…

In action Students work in pairs. Person A focuses on the result of the hypothesis test. I reject H0 (p = p0). I fail to reject H0. Person B focuses on the result of the confidence interval. “I do not contain p0, so I reject p0 as a plausible value for p” “I contain p0, so I do not reject p0 as a plausible value for p”

BEFORE AND AFTER Typical student answer in the past for why this duality occurs: “Since the hypothesis test rejected p=.48, the confidence interval also rejects .48.” After this activity: “If we rejected H0 , we had a test statistic larger than 1.96 or smaller than -1.96. Since the 95% confidence interval spans 1.96 standard deviations in either direction from our obtained result, we will be too far from p0 to capture it in the interval.”

What about a one-tailed test? For a one-tailed hypothesis test with alpha = .05, the test will (generally) make the same decision about a hypothesized value of the parameter that a 90% confidence interval will make. While it is not too difficult to note that the rejection region of a one-tailed, one proportion z-test will either be z < -1.645 or z > 1.645, and 1.645 is the critical value for a 90% CI for a population proportion, I think it packs a lot more punch for the students to understand this relationship in a more hands-on way…

Students can see that the 90% CI backs up the decision of the hypothesis test, but the 95% CI does not, as it still captures p0.

CLOSING COMMENTS The statement on slide 3 includes the word “generally” because in a one proportion z-test, we use p0 when calculating the standard deviation of the sampling distribution of p hat. When we construct the confidence interval, we estimate p0 with p hat, and therefore the standard deviation assuming H0 is true & the standard error used in the confidence interval will not perfectly match. While this distinction is worth mentioning, I don’t believe it detracts from the message of the activity. As mentioned previously, this made for a great discussion in class. The same activity could be done in the context of inference for a population mean, but then degrees of freedom are involved (unless we know σ, which is unlikely). This is an activity that I use often & spontaneously in class, so I didn’t want to make a new scale/rejection region each time we had a new sample.

Thank you! Please feel free to contact me with any questions or suggestions for improvement… jkrickl@glenbrook225.org