Confidence Intervals for p1 - p2 and µ1 - µ2

Slides:



Advertisements
Similar presentations
Confidence Intervals for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Section
Advertisements

1 Selected Sections of Chapters 22 and 24 Confidence Intervals for p 1 - p 2 and µ 1 - µ 2.
Chapter 22 Comparing 2 Proportions © 2006 W.H. Freeman and Company.
Chapter 9 Chapter 10 Chapter 11 Chapter 12
6/18/2015Two Sample Problems1 Chapter 18 Two Sample Problems.
1 Inference about Comparing Two Populations Chapter 13.
Announcements Homework 2 will be posted on the web by tonight.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 13 Inference About Comparing Two Populations.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
Lecture 10 Inference about the difference between population proportions (Chapter 13.6) One-way analysis of variance (Chapter 15.2)
1 Inference about Comparing Two Populations Chapter 13.
Lecture 9 Inference about the ratio of two variances (Chapter 13.5)
Economics 173 Business Statistics Lecture 9 Fall, 2001 Professor J. Petry
Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inferences Based on Two Samples
8.1 Inference for a Single Proportion
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Inference for  1  2 1.
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for  1  Independent Samples.
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Lecture 5 Two population tests of Means and Proportions.
Economics 173 Business Statistics Lecture 7 Fall, 2001 Professor J. Petry
Chapter 13 Inference About Comparing Two Populations.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
Chapter 13 Inference About Comparing Two Populations.
Two sample problems:  compare the responses in two groups  each group is a sample from a distinct population  responses in each group are independent.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 1.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
Comparing 2 Proportions © 2006 W.H. Freeman and Company.
1 Confidence Intervals for Two Proportions Section 6.1.
Confidence Intervals for µ 1 - µ 2 and p 1 - p 2 1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
1 Economics 173 Business Statistics Lectures 5 & 6 Summer, 2001 Professor J. Petry.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 24 Independent Samples Chapter.
Inference for Distributions 7.2 Comparing Two Means © 2012 W.H. Freeman and Company.
Inference for distributions: - Comparing two means.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for the difference Between.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 22 Part 2 CI HT for m 1 -
Chapter 7 Confidence Intervals for 2 Proportions and 2 Means © 2006 W.H. Freeman and Company.
Economics 173 Business Statistics
Chapter 10: Comparing Two Populations or Groups
Objectives (PSLS Chapter 18)
AP Statistics Comparing Two Proportions
Inference about Two Populations
Inference about Comparing Two Populations
18. Two-sample problems for population means (σ unknown)
Chapter 10: Comparing Two Populations or Groups
Chapter 8 Inference for Proportions
Confidence Intervals.
Inference for Distributions
Therefore, the Age variable is a categorical variable.
Inference for Distributions 7.2 Comparing Two Means
EQT 272 PROBABILITY AND STATISTICS ROHANA BINTI ABDUL HAMID
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Objectives (Section 7.2) Two sample problems:
Chapter 8 Inference for Proportions
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Chapter 10: Comparing Two Populations or Groups
Presentation transcript:

Confidence Intervals for p1 - p2 and µ1 - µ2

Inference about Two Populations We are interested in: Confidence intervals for the difference between two proportions. Confidence intervals for the difference between two means. 2

Confidence Intervals for the difference p1 – p2 between two population proportions In this section we deal with two populations whose data are qualitative. For qualitative data we compare the population proportions of the occurrence of a certain event. Examples Comparing the effectiveness of new drug versus older one Comparing market share before and after advertising campaign Comparing defective rates between two machines 3

Parameter and Statistic When the data are qualitative, we can only count the occurrences of a certain event in the two populations, and calculate proportions. The parameter we want to estimate is p1 – p2. Statistic An estimator of p1 – p2 is (the difference between the sample proportions). 4

Point Estimator: Two random samples are drawn from two populations. The number of successes in each sample is recorded. The sample proportions are computed. Sample 1 Sample size n1 Number of successes x1 Sample proportion Sample 2 Sample size n2 Number of successes x2 Sample proportion x n 1 ˆ  p 5

Confidence Interval for p1  p2 6

Example: confidence interval for p1 – p2 Estimating the cost of life saved Two drugs are used to treat heart attack victims: Streptokinase (available since 1959, costs $460) t-PA (genetically engineered, costs $2900). The maker of t-PA claims that its drug outperforms Streptokinase. An experiment was conducted in 15 countries. 20,500 patients were given t-PA 20,500 patients were given Streptokinase The number of deaths by heart attacks was recorded. 7

Example: confidence interval for p1 – p2 (cont.) Experiment results A total of 1497 patients treated with Streptokinase died. A total of 1292 patients treated with t-PA died. Estimate the difference in the death rates when using Streptokinase and when using t-PA. 8

Example: confidence interval for p1 – p2 (cont.) Solution The problem objective: Compare the outcomes of two treatments. The data are qualitative (a patient lived or died) The parameter to be estimated is p1 – p2. p1 = death rate with Streptokinase p2 = death rate with t-PA 9

Example: confidence interval for p1 – p2 (cont.) Compute: Manually Sample proportions: The 95% confidence interval estimate is 10

Example: confidence interval for p1 – p2 (cont.) Interpretation The interval (.0051, .0149) for p1 – p2 does not contain 0; it is entirely positive, which indicates that p1, the death rate for streptokinase, is greater than p2, the death rate for t-PA. We estimate that the death rate for streptokinase is between .51% and 1.49% higher than the death rate for t-PA. 11

Example: 95% confidence interval for p1 – p2 The age at which a woman gives birth to her first child may be an important factor in the risk of later developing breast cancer. An international study conducted by WHO selected women with at least one birth and recorded if they had breast cancer or not and whether they had their first child before their 30th birthday or after. Cancer Sample Size Age at First Birth > 30 683 3220 21.2% Age at First Birth <= 30 1498 10,245 14.6% The parameter to be estimated is p1 – p2. p1 = cancer rate when age at 1st birth >30 p2 = cancer rate when age at 1st birth <=30 We estimate that the cancer rate when age at first birth > 30 is between .05 and .082 higher than when age <= 30. 12

Confidence Intervals for the Difference between Two Population Means µ1 - µ2: Independent Samples Two random samples are drawn from the two populations of interest. Because we compare two population means, we use the statistic . 13

Population 1 Population 2 Parameters: µ1 and 12 Parameters: µ2 and 22 (values are unknown) (values are unknown) Sample size: n1 Sample size: n2 Statistics: x1 and s12 Statistics: x2 and s22 Estimate µ1 µ2 with x1 x2 14

Confidence Interval for – Note: when the values of 12 and 22 are unknown, the sample variances s12 and s22 computed from the data can be used. 15

Example: confidence interval for – Do people who eat high-fiber cereal for breakfast consume, on average, fewer calories for lunch than people who do not eat high-fiber cereal for breakfast? A sample of 150 people was randomly drawn. Each person was identified as a consumer or a non-consumer of high-fiber cereal. For each person the number of calories consumed at lunch was recorded. 16

Example: confidence interval for – Solution: The parameter to be tested is the difference between two means. The claim to be tested is: The mean caloric intake of consumers (1) is less than that of non-consumers (2). Use s12 = 4,103 for 12 and s22 = 10,670 for 22 17

Example: confidence interval for – The confidence interval estimator for the difference between two means is 18 18

Interpretation The 95% CI is (-56.59, -1.83). We are 95% confident that the interval (-56.59, -1.83) contains the true but unknown difference – Since the interval is entirely negative (that is, does not contain 0), there is evidence from the data that µ1 is less than µ2. We estimate that non-consumers of high-fiber breakfast consume on average between 1.83 and 56.59 more calories for lunch. 19

Does smoking damage the lungs of children exposed to parental smoking? Forced vital capacity (FVC) is the volume (in milliliters) of air that an individual can exhale in 6 seconds. FVC was obtained for a sample of children not exposed to parental smoking and a group of children exposed to parental smoking. Parental smoking FVC s n Yes 75.5 9.3 30 No 88.2 15.1 We want to know whether parental smoking decreases children’s lung capacity as measured by the FVC test. Is the mean FVC lower in the population of children exposed to parental smoking?

95% confidence interval for (µ1 − µ2): Parental smoking FVC s n Yes 75.5 9.3 30 No 88.2 15.1 95% confidence interval for (µ1 − µ2): 1 = mean FVC of children with a smoking parent; 2 = mean FVC of children without a smoking parent We are 95% confident that lung capacity in children of smoking parents is between 19.05 and 6.35 milliliters LESS than in children without a smoking parent.

Bunny Rabbits and Pirates on the Box The data below show the sugar content (as a percentage of weight) of 10 brands of cereal randomly selected from a supermarket shelf that is at a child’s eye level and 8 brands selected from the top shelf. Eye level 40.3 55 45.7 43.3 50.3 45.9 53.5 43 44.2 44 Top 20 2.2 7.5 4.4 22.2 16.6 14.5 10 Create and interpret a 95% confidence interval for the difference 1 – 2 in mean sugar content, where 1 is the mean sugar content of cereal at a child’s eye level and 2 is the mean sugar content of cereal on the top shelf. 22

Eye level 40.3 55 45.7 43.3 50.3 45.9 53.5 43 44.2 44 Top 20 2.2 7.5 4.4 22.2 16.6 14.5 10 23

Interpretation We are 95% confident that the interval (28.46, 40.22) contains the true but unknown value of 1 – 2. Note that the interval is entirely positive (does not contain 0); therefore, it appears that the mean amount of sugar 1 in cereal on the shelf at a child’s eye level is larger than the mean amount 2 on the top shelf. 24

star left-handed quarterback Steve Young Do left-handed people have a shorter life-expectancy than right-handed people? Some psychologists believe that the stress of being left-handed in a right-handed world leads to earlier deaths among left-handers. Several studies have compared the life expectancies of left-handers and right-handers. One such study resulted in the data shown in the table. left-handed presidents Handedness Mean age at death s n Left 66.8 25.3 99 Right 75.2 15.1 888 star left-handed quarterback Steve Young We will use the data to construct a confidence interval for the difference in mean life expectancies for left-handers and right-handers. Is the mean life expectancy of left-handers less than the mean life expectancy of right-handers?

Handedness Mean age at death s n Left 66.8 25.3 99 Right 75.2 15.1 888 95% confidence interval for (µ1 − µ2): The “Bambino”,left-handed hitter Babe Ruth, baseball’s all-time best hitter 1 = mean life expectancy of left-handers; 2 = mean life expectancy of right-handers We are 95% confident that the mean life expectancy for left-handers is between 3.32 and 13.48 years LESS than the mean life expectancy for right-handers.

Example: confidence interval for – An ergonomic chair can be assembled using two different sets of operations (Method A and Method B) The operations manager would like to know whether the assembly time under the two methods differ. 27

Example: confidence interval for – Two samples are randomly and independently selected A sample of 25 workers assembled the chair using method A. A sample of 25 workers assembled the chair using method B. The assembly times were recorded Do the assembly times of the two methods differs? 28

Assembly times in Minutes Example: confidence interval for – Assembly times in Minutes Solution The parameter of interest is the difference between two population means. The claim to be tested is whether a difference between the two methods exists. Use s12 = .848 for 12 and s22 = 1.303 for 22 29

Example: confidence interval for – A 95% confidence interval for 1 - 2 is calculated as follows: We are 95% confident that the interval (-0.3029 , 0.8469) contains the true but unknown 1 - 2 Notice: “Zero” is included in the confidence interval 30