Confidence intervals for the difference between two proportions

Slides:



Advertisements
Similar presentations
Estimating a Population Variance
Advertisements

Statistics Estimates and Sample Sizes
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.4.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Section 7.3 Confidence intervals for a population proportion
Confidence Intervals about a Population Proportion Section 8.3
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.3 Estimating a Population mean µ (σ known) Objective Find the confidence.
Chapter 19: Confidence Intervals for Proportions
Horng-Chyi HorngStatistics II_Five43 Inference on the Variances of Two Normal Population &5-5 (&9-5)
Estimating a Population Proportion
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
1 Chapter 9 Inferences from Two Samples In this chapter we will deal with two samples from two populations. The general goal is to compare the parameters.
Section 7-4 Estimating a Population Mean: σ Not Known.
Confidence Intervals and Two Proportions Presentation 9.4.
Sections 6-1 and 6-2 Overview Estimating a Population Proportion.
Chapter 9 Inferences from Two Samples
Highlighting Standards for Mathematical Practice in an Intro Stats Course by Fides Ushe Springfield College, MA.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8.
Confidence Interval Estimation for a Population Proportion Lecture 31 Section 9.4 Wed, Nov 17, 2004.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Two-sample Proportions Section Starter One-sample procedures for proportions can also be used in matched pairs experiments. Here is an.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 9-1 Review and Preview.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
AP Statistics Chapter 19 Notes “Confidence Intervals for Sample Proportions”
Section 6-3 Estimating a Population Mean: σ Known.
Section 7-3 Estimating a Population Mean: σ Known.
Introduction Studying the normal curve in previous lessons has revealed that normal data sets hover around the average, and that most data fits within.
STT 315 Ashwini maurya This lecture is based on Chapter 5.4 Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil.
Section 8.4 Estimating Population Proportions HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Confidence Interval Estimation for a Population Proportion Lecture 33 Section 9.4 Mon, Nov 6, 2006.
Estimating a Population Mean. Student’s t-Distribution.
Confidence Interval for a Single Proportion p-hat, not phat.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.3.
Chapter 22 Comparing Two Proportions. Comparing 2 Proportions How do the two groups differ? Did a treatment work better than the placebo control? Are.
Introduction to Inference Confidence Intervals Sample means to sample proportions parameterstatistic mean proportion.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Lesson Testing the Significance of the Least Squares Regression Model.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 8.3.
Confidence Interval Estimation for a Population Proportion Lecture 31 Section 9.4 Tue, Mar 27, 2007.
Confidence Interval Estimation for a Population Proportion Lecture 33 Section 9.4 Mon, Nov 7, 2005.
CHAPTER 8 Estimating with Confidence
Chapter 7 Lecture 2 Section: 7.2.
Inference about Two Means - Independent Samples
Confidence Interval Estimation for a Population Proportion
Confidence Interval Estimation for a Population Mean
Confidence Intervals for a Population Proportion
Elementary Statistics
Inference about Two Means: Dependent Samples
Interval Estimation Part II
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Elementary Statistics
Independent Samples: Comparing Proportions
Confidence Intervals for a Population Mean, Standard Deviation Known
Confidence Interval Estimation for a Population Proportion
Hypothesis tests for the difference between two proportions
One-Way Analysis of Variance
Hypothesis Tests for Proportions
Confidence Interval Estimation for a Population Mean
Estimating a Population Proportion
Hypothesis Tests for Two Population Standard Deviations
Confidence intervals for the difference between two means: Independent samples Section 10.1.
Independent Samples: Confidence Intervals
Determining Which Method to use
Chapter 12 Inference for Proportions
2/3/ Estimating a Population Proportion.
Confidence Interval Estimation for a Population Proportion
Presentation transcript:

Confidence intervals for the difference between two proportions Section 10.2

Objectives Construct confidence intervals for the difference between two proportions

Objective 1 Construct confidence intervals for the difference between two proportions

Notation Consider the following example. In a study of the effect of air pollution on lung function, a random sample of 50 children living in a community with a high level of ozone pollution had their lung capacities measured, and 14 of them had capacities that were below normal for their size. A second random sample of 80 children was drawn from a community with a low level of ozone pollution, and 12 of them had lung capacities that were below normal for their size. We are interested in studying the difference between the proportions of individuals in two different categories (communities). We begin by associating some notation for the population proportions, the numbers of individuals in each category, and the sample sizes. 𝑝 1 and 𝑝 2 are the population proportions of the category of interest in the two populations. 𝑥 1 and 𝑥 2 are the numbers of individuals in the category of interest in the two samples. 𝑛 1 and 𝑛 2 are the two sample sizes.

Point Estimate and Standard Error The point estimates for the population proportions are the sample proportions 𝑝 1 = 𝑥 1 𝑛 1 𝑝 2 = 𝑥 2 𝑛 2 It is relatively straightforward to see that the point estimate of 𝑝 1 − 𝑝 2 is 𝑝 1 − 𝑝 2 . The sample proportions 𝑝 1 and 𝑝 2 have variances 𝑝 1 1− 𝑝 1 𝑛 1 and 𝑝 2 1− 𝑝 2 𝑛 2 respectively. When the samples are independent, the variance of the difference 𝑝 1 − 𝑝 2 can be shown to be the sum of the variances of the sample proportions. In other words, the variance of 𝑝 1 − 𝑝 2 is 𝑝 1 1− 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2 , therefore, the standard error is 𝑝 1 1− 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2 .

Critical Value and Margin of Error When the sample sizes are large enough 𝑝 1 and 𝑝 2 are approximately normally distributed, so the critical value is 𝑧 𝛼 2 . We multiply the standard error by the critical value to obtain the margin of error: Margin of error = 𝑧 𝛼 2 𝑝 1 1 − 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2 The level 100(1 − 𝛼)% confidence interval is Point estimate ± Margin of Error 𝑝 1 − 𝑝 2 ± 𝑧 𝛼 2 𝑝 1 1 − 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2

Assumptions The method for constructing a confidence interval for the difference of population proportions requires that the sampling distribution be approximately normal. The following assumptions ensure this: Assumptions: We have two independent simple random samples. Each population is at least 20 times as large as the sample drawn from it. The individuals in each sample are divided into two categories. Both samples contain at least 10 individuals in each category.

Example A random sample of 50 children living in a community with a high level of ozone pollution had their lung capacities measured, and 14 of them had capacities that were below normal. A second random sample of 80 children was drawn from a community with a low level of ozone pollution, and 12 of them had lung capacities that were below normal. Construct a 95% confidence interval for the difference between the proportions of children with diminished lung capacity differ between the two communities. Solution: We have two independent random samples. The populations of children are more than 20 times as large as the samples. The individuals are divided into two categories with at least 10 individuals in each category. The assumptions are satisfied. We summarize the information: High Pollution Low Pollution Sample size 𝑛 1 =50 𝑛 2 =80 Number with below-normal lung capacity 𝑥 1 =14 𝑥 2 =12 Population proportion 𝑝 1 (unknown) 𝑝 2 (unknown)

Solution The point estimate is: 𝑝 1 − 𝑝 2 = 𝑥 1 𝑛 1 − 𝑥 2 𝑛 2 = 14 50 − 12 80 =0.130 The critical value for a 95% confidence interval is 𝑧 𝛼 2 =1.96. The standard error is 𝑝 1 1 − 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2 = 0.280(1 −0.280) 50 + 0.150(1 −0.150) 80 = 0.075005. The margin of error is 𝑧 𝛼 2 𝑝 1 1 − 𝑝 1 𝑛 1 + 𝑝 2 1− 𝑝 2 𝑛 2 =1.96∙ 0.280(1 −0.280) 50 + 0.150(1 −0.150) 80 = 0.14701. The 95% confidence interval is Point estimate ± Margin of Error 0.130 ± 0.14701 0.130 – 0.14701< 𝑝 1 – 𝑝 2 < 0.130 + 0.14701 – 0.017 < 𝑝 1 – 𝑝 2 < 0.277 We are 95% confident that the difference between the proportions is between −0.017 and 0.277. This confidence interval contains 0. Therefore, we cannot be sure that the proportions of children with diminished lung capacity differ between the two communities. Remember: 𝑥 1 =14 𝑛 1 =50 𝑥 2 =12 𝑛 2 =80

Confidence Intervals on the TI-84 PLUS The 2-PropZInt command constructs confidence intervals for the difference between two proportions. This command is accessed by pressing STAT and highlighting the TESTS menu. Input the values of 𝑥 1 , 𝑛 1 , 𝑥 2 , and 𝑛 2 and the confidence level.

Example (TI-84 PLUS) A random sample of 50 children living in a community with a high level of ozone pollution had their lung capacities measured, and 14 of them had capacities that were below normal. A second random sample of 80 children was drawn from a community with a low level of ozone pollution, and 12 of them had lung capacities that were below normal. Construct a 95% confidence interval for the difference between the proportions of children with diminished lung capacity differ between the two communities. Solution: We have two independent random samples. The populations of children are more than 20 times as large as the samples. The individuals are divided into two categories with at least 10 individuals in each category. The assumptions are satisfied. We summarize the information: High Pollution Low Pollution Sample size 𝑛 1 =50 𝑛 2 =80 Number with below-normal lung capacity 𝑥 1 =14 𝑥 2 =12 Population proportion 𝑝 1 (unknown) 𝑝 2 (unknown)

Example (TI-84 PLUS) Solution: We press STAT and highlight the TESTS menu and select 2-PropZInt. Select Stats as the input method and enter the following values: Enter 0.95 as the confidence level and select Calculate. The confidence interval is (–0.17, 0.277). We are 95% confident that the difference between the proportions is between −0.017 and 0.277. This confidence interval contains 0. Therefore, we cannot be sure that the proportions of children with diminished lung capacity differ between the two communities. 𝑥 1 =14 𝑛 1 =50 𝑥 2 =12 𝑛 2 =80

You Should Know… The assumptions for constructing a confidence interval for the difference between proportions How to construct confidence intervals for the difference between two proportions