Chapter 7 Confidence Intervals for 2 Proportions and 2 Means © 2006 W.H. Freeman and Company.

Slides:



Advertisements
Similar presentations
Confidence Intervals for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Section
Advertisements

1 Selected Sections of Chapters 22 and 24 Confidence Intervals for p 1 - p 2 and µ 1 - µ 2.
Chapter 22 Comparing 2 Proportions © 2006 W.H. Freeman and Company.
Chapter 22 Comparing 2 Proportions © 2006 W.H. Freeman and Company.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Announcements Homework 2 will be posted on the web by tonight.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
Two-sample problems for population means BPS chapter 19 © 2006 W.H. Freeman and Company.
Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Comparing 2 proportions BPS chapter 21 © 2006 W. H. Freeman and Company These PowerPoint files were developed by Brigitte Baldi at the University of California,
Inferences Based on Two Samples
Example 1: a) Describe the shape, center, and spread of the sampling distribution of. Because n 1 p 1 = 100(0.7) = 70, n 1 (1 − p 1 ) = 100(0.3) = 30,
8.1 Inference for a Single Proportion
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Inference for  1  2 1.
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for  1  Independent Samples.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Lecture 5 Two population tests of Means and Proportions.
STATISTICAL INFERENCE PART VIII HYPOTHESIS TESTING - APPLICATIONS – TWO POPULATION TESTS 1.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
Two sample problems:  compare the responses in two groups  each group is a sample from a distinct population  responses in each group are independent.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 1.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
Comparing 2 Proportions © 2006 W.H. Freeman and Company.
1 Confidence Intervals for Two Proportions Section 6.1.
Confidence Intervals for µ 1 - µ 2 and p 1 - p 2 1.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 24 Independent Samples Chapter.
Inference for Distributions 7.2 Comparing Two Means © 2012 W.H. Freeman and Company.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Inference for distributions: - Comparing two means.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for the difference Between.
AP STATISTICS COMPARING TWO PROPORTIONS Chapter 22.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Objectives (Chapter 20) Comparing two proportions  Comparing 2 independent samples  Confidence interval for 2 proportion  Large sample method  Plus.
Chapter 10 Comparing Two Populations or Groups Sect 10.1 Comparing two proportions.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 22 Part 2 CI HT for m 1 -
Yandell – Econ 216 Chap 8-1 Chapter 8 Confidence Interval Estimation.
20. Comparing two proportions
Inference about a Population Mean
Testing the Difference between Means, Variances, and Proportions
Chapter 10: Comparing Two Populations or Groups
Objectives (PSLS Chapter 18)
AP Statistics Comparing Two Proportions
Confidence Intervals for p1 - p2 and µ1 - µ2
Inference about Comparing Two Populations
18. Two-sample problems for population means (σ unknown)
Chapter 10: Comparing Two Populations or Groups
Chapter 8 Inference for Proportions
Inference for distributions: - Optional topics in comparing distributions IPS chapter 7.3 © 2006 W.H. Freeman and Company.
Inference for Distributions
Inference for Distributions 7.2 Comparing Two Means
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Objectives (Section 7.2) Two sample problems:
Chapter 8 Inference for Proportions
AP Stat Mock Exam Registration Deadline:
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 7 Estimation: Single Population
Chapter 10: Comparing Two Populations or Groups
Presentation transcript:

Chapter 7 Confidence Intervals for 2 Proportions and 2 Means © 2006 W.H. Freeman and Company

Objectives (Chapter 7) Comparing two proportions  Comparing two independent samples  Large-sample CI for two proportions  Test of statistical significance

3 Sample 1 Sample size n 1 Number of successes x 1 Sample proportion Sample 1 Sample size n 1 Number of successes x 1 Sample proportion  Two random samples are drawn from two populations.  The number of successes in each sample is recorded.  The sample proportions are computed. Sample 2 Sample size n 2 Number of successes x 2 Sample proportion Sample 2 Sample size n 2 Number of successes x 2 Sample proportion x n 1 1 ˆ  p 1 Point Estimator of p 1 – p 2 :

Comparing two independent samples We often need to estimate the difference p 1 – p 2 between two unknown population proportions based on independent samples. We can compute the difference between the two sample proportions and compare it to the corresponding, approximately normal sampling distribution for

Large-sample CI for two proportions For two independent SRSs of sizes n 1 and n 2 with sample proportion of successes 1 and 2 respectively, an approximate level C confidence interval for p 1 – p 2 is Use this method when C is the area under the standard normal curve between −z* and z*.

Cholesterol and heart attacks How much does the cholesterol-lowering drug Gemfibrozil help reduce the risk of heart attack? We compare the incidence of heart attack over a 5-year period for two random samples of middle-aged men taking either the drug or a placebo. So the 90% CI is ( − ) ± 1.645* = ± = (0.016, ) We are 90% confident that the interval 1.6% to 2.66% captures the true percentage difference in heart attack rates for middle-aged men when taking a placebo and the cholesterol-lowering drug. Standard error of the difference p 1 − p 2 : H. attackn Drug p % Placebo p %

7 Example: 95% confidence interval for p 1 – p 2 The age at which a woman gives birth to her first child may be an important factor in the risk of later developing breast cancer. An international study conducted by WHO selected women with at least one birth and recorded if they had breast cancer or not and whether they had their first child before their 30 th birthday or after. CancerSample Size Age at First Birth > % Age at First Birth <= , % The parameter to be estimated is p1 – p2. p1 = cancer rate when age at 1 st birth >30 p2 = cancer rate when age at 1 st birth <=30 We estimate that the cancer rate when age at first birth > 30 is between.05 and.082 higher than when age <= 30.

Beware!! Common Mistake !!! A common mistake is to calculate a one-sample confidence interval for p   a one-sample confidence interval for p   and to then conclude that p  and p  are equal if the confidence intervals overlap. This is WRONG because the variability in the sampling distribution for from two independent samples is more complex and must take into account variability coming from both samples. Hence the more complex formula for the standard error.

INCORRECT Two single-sample 95% confidence intervals: The confidence interval for the rightie BA and the confidence interval for the leftie BA overlap, suggesting no significant difference between Ryan Howard’s ABILITY to hit right- handed pitchers and his ABILITY to hit left-handed pitchers. Rightie interval: (0.274, 0.366) HitsABphat(BA) Rightie Leftie Leftie interval: (0.170, 0.280)

Reason for Contradictory Result 10

Two random samples are selected from the two populations of interest. Because we compare two population means, we use the statistic 11 Chapter 7 CI for the Difference m 1 - m 2 Between 2 Population Means: Independent Samples

12 Population 1Population 2 Parameters: µ 1 and  1 2 Parameters: µ 2 and  2 2 (values are unknown) (values are unknown) Sample size: n 1 Sample size: n 2 Statistics: x 1 and s 1 2 Statistics: x 2 and s 2 2 Estimate µ 1  µ 2 with x 1  x 2

Sampling distribution model for ? Sometimes used (not always very good) estimate of the degrees of freedom is min(n 1 − 1, n 2 − 1). Shape? Estimate using df 0

Confidence Interval for    –   14

Example: “Cameron Crazies”. Confidence interval for    –    Do the “Cameron Crazies” at Duke home games help the Blue Devils play better defense?  Below are the points allowed by Duke (men) at home and on the road for the conference games from a recent season. 15 Pts allowed at home Pts allowed on road

Example: “Cameron Crazies”. Confidence interval for    –   16 Calculate a 95% CI for  1 -  2 where  1 = mean points per game allowed by Duke at home.  2 = mean points per game allowed by Duke on road n 1 = 8, n 2 = 8; s 1 2 = (21.8) 2 = ; s 2 2 = (8.9) 2 = n 1 = 8, n 2 = 8; s 1 2 = (21.8) 2 = ; s 2 2 = (8.9) 2 = 79.41

To use the t-table let’s use df = 9; t 9 * = The confidence interval estimator for the difference between two means is … 17 Example: “Cameron Crazies”. Confidence interval for    –  

Interpretation The 95% CI for  1 -  2 is (-19.22, 18.46). Since the interval contains 0, there appears to be no significant difference between  1 = mean points per game allowed by Duke at home.  2 = mean points per game allowed by Duke on road The Cameron Crazies appear to have no affect on the ABILITY of the Duke men to play better defense. 18 How can this be?

Example: 95% confidence interval for    –   Example – Do people who eat high-fiber cereal for breakfast consume, on average, fewer calories for lunch than people who do not eat high-fiber cereal for breakfast? – A sample of 150 people was randomly drawn. Each person was identified as a consumer or a non-consumer of high- fiber cereal. – For each person the number of calories consumed at lunch was recorded. 19

Example: 95% confidence interval for    –   20 Solution: The parameter to be tested is the difference between two means. The claim to be tested is: The mean caloric intake of consumers (  1 ) is less than that of non-consumers (  2 ).

Example: 95% confidence interval for    –   Let’s use df = 122.6; t * = The confidence interval estimator for the difference between two means is… 21

Interpretation The 95% CI is ( , -1.56). Since the interval is entirely negative (that is, does not contain 0), there is evidence from the data that µ 1 is less than µ 2. We estimate that non-consumers of high-fiber breakfast consume on average between 1.56 and more calories for lunch. 22

Let’s use df = min(43-1, 107-1) = min(42, 106) = 42; t 42 * = The confidence interval estimator for the difference between two means is 23 Previous Example: (cont.) confidence interval for  1 –  2 using min(n 1 –1, n 2 -1) to approximate the df

Beware!! Common Mistake !!! A common mistake is to calculate a one-sample confidence interval for    a one-sample confidence interval for    and to then conclude that   and   are equal if the confidence intervals overlap. This is WRONG because the variability in the sampling distribution for from two independent samples is more complex and must take into account variability coming from both samples. Hence the more complex formula for the standard error.

INCORRECT Two single-sample 95% confidence intervals: The confidence interval for the male mean and the confidence interval for the female mean overlap, suggesting no significant difference between the true mean for males and the true mean for females. Male interval: (18.68, 20.12) MaleFemale mean st. dev. s n50 Female interval: (16.94, 18.86)

Reason for Contradictory Result 26

Does smoking damage the lungs of children exposed to parental smoking? Forced vital capacity (FVC) is the volume (in milliliters) of air that an individual can exhale in 6 seconds. FVC was obtained for a sample of children not exposed to parental smoking and a group of children exposed to parental smoking. We want to know whether parental smoking decreases children’s lung capacity as measured by the FVC test. Is the mean FVC lower in the population of children exposed to parental smoking? Parental smokingFVCsn Yes No

Parental smokingFVCsn Yes No We are 95% confident that lung capacity is between and 6.19 milliliters LESS in children of smoking parents. 95% confidence interval for (µ 1 − µ 2 ), with : df =  t* = :  1 = mean FVC of children with a smoking parent;  2 = mean FVC of children without a smoking parent

Do left-handed people have a shorter life-expectancy than right-handed people?  Some psychologists believe that the stress of being left- handed in a right-handed world leads to earlier deaths among left-handers.  Several studies have compared the life expectancies of left- handers and right-handers.  One such study resulted in the data shown in the table. We will use the data to construct a confidence interval for the difference in mean life expectancies for left- handers and right-handers. Is the mean life expectancy of left-handers less than the mean life expectancy of right-handers? HandednessMean age at deathsn Left Right left-handed presidents star left-handed quarterback Steve Young

We are 95% confident that the mean life expectancy for left- handers is between 3.27 and years LESS than the mean life expectancy for right-handers. 95% confidence interval for (µ 1 − µ 2 ), with : df =  t* = :  1 = mean life expectancy of left-handers;  2 = mean life expectancy of right-handers HandednessMean age at deathsn Left Right The “Bambino”,left-handed Babe Ruth, baseball’s all-time best player.