Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Inference for  1  2 1.

Slides:



Advertisements
Similar presentations
Confidence Intervals for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Section
Advertisements

Objectives 7.1, 7.2Inference for comparing means of two populations  Matched pairs t confidence interval  Matched pairs t hypothesis test  Two-sample.
Chapter 22 Comparing 2 Proportions © 2006 W.H. Freeman and Company.
Objectives (BPS chapter 18) Inference about a Population Mean  Conditions for inference  The t distribution  The one-sample t confidence interval 
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Inference for distributions: - for the mean of a population IPS chapter 7.1 © 2006 W.H Freeman and Company.
Conditions with σ Unknown Note: the same as what we saw before.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Inference for Distributions - for the Mean of a Population IPS Chapter 7.1 © 2009 W.H Freeman and Company.
Chapter Goals After completing this chapter, you should be able to:
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
Inference for Distributions - for the Mean of a Population
BPS - 5th Ed. Chapter 171 Inference about a Population Mean.
Two-sample problems for population means BPS chapter 19 © 2006 W.H. Freeman and Company.
Hypothesis Testing Using The One-Sample t-Test
Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry
Inferences Based on Two Samples
Lesson Comparing Two Means.
Significance Tests in practice Chapter Tests about a population mean  When we don’t know the population standard deviation σ, we perform a one.
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for  1  Independent Samples.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
1 Happiness comes not from material wealth but less desire.
Inference for a population mean BPS chapter 18 © 2006 W.H. Freeman and Company.
STATISTICAL INFERENCE PART VIII HYPOTHESIS TESTING - APPLICATIONS – TWO POPULATION TESTS 1.
Chapter 13 Inference About Comparing Two Populations.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
AP Statistics Section 10.2 B. Comparative studies are more convincing than single-sample investigations. For that reason, one-sample inference is less.
Two sample problems:  compare the responses in two groups  each group is a sample from a distinct population  responses in each group are independent.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 1.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
Chapter 20 Confidence Intervals and Hypothesis Tests for a Population Mean  ; t distributions t distributions confidence intervals for a population mean.
Comparing 2 Proportions © 2006 W.H. Freeman and Company.
Essential Statistics Chapter 161 Inference about a Population Mean.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
1 Confidence Intervals for Two Proportions Section 6.1.
Inference for Distributions - for the Mean of a Population IPS Chapter 7.1 © 2009 W.H Freeman and Company.
Confidence Intervals for µ 1 - µ 2 and p 1 - p 2 1.
Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.
Chapter 7 Inference for Distributions. Inference for the mean of a population  So far, we have assumed that  was known.  If  is unknown, we can use.
STATISTICAL INFERENCE PART VI HYPOTHESIS TESTING 1.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 24 Independent Samples Chapter.
Inference for Distributions 7.2 Comparing Two Means © 2012 W.H. Freeman and Company.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Inference for distributions: - Comparing two means.
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Inference for the difference Between.
When  is unknown  The sample standard deviation s provides an estimate of the population standard deviation .  Larger samples give more reliable estimates.
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Analysis of Financial Data Spring 2012 Lecture 2: Statistical Inference 2 Priyantha Wijayatunga, Department of Statistics, Umeå University
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 22 Part 2 CI HT for m 1 -
Chapter 7 Confidence Intervals for 2 Proportions and 2 Means © 2006 W.H. Freeman and Company.
10/31/ Comparing Two Means. Does smoking damage the lungs of children exposed to parental smoking? Forced vital capacity (FVC) is the volume (in.
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 © 2009 W.H Freeman and Company.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 7: Tests of significance and confidence.
Inference for distributions: - for the mean of a population IPS chapter 7.1 © 2006 W.H Freeman and Company.
Objectives (PSLS Chapter 18)
Confidence Intervals for p1 - p2 and µ1 - µ2
Inference about Comparing Two Populations
Chapter 23 CI HT for m1 - m2: Paired Samples
The Practice of Statistics in the Life Sciences Fourth Edition
Inference for Distributions
Comparing Two Means: Paired Data
Lesson Comparing Two Means.
Inference for Distributions 7.2 Comparing Two Means
Comparing Two Means: Paired Data
Objectives 7.1 Inference for the mean of a population
Presentation transcript:

Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Inference for  1  2 1

Confidence Intervals for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Two random samples are drawn from the two populations of interest. Because we compare two population means, we use the statistic. 2

3 Population 1Population 2 Parameters: µ 1 and  1 2 Parameters: µ 2 and  2 2 (values are unknown) (values are unknown) Sample size: n 1 Sample size: n 2 Statistics: x 1 and s 1 2 Statistics: x 2 and s 2 2 Estimate µ 1  µ 2 with x 1  x 2

df 1-21-21-21-2 Sampling distribution model for ? An estimate of the degrees of freedom is min(n 1 − 1, n 2 − 1). Shape? Estimate using

Two sample t-confidence interval C t*t*t*t* −t*−t*−t*−t* Practical use of t: t*   C is the area between −t* and t*.   We find the value of t* in the line of the t-table for the correct df and the column for confidence level C.

Confidence Interval for    –   6 An estimate of the degrees of freedom is min(n 1 − 1, n 2 − 1).

Hypothesis test for    –   7 H 0 :    –   = 0 ; H a :    –   >0 (or 0 (or <0, or ≠0) Test statistic: An estimate of the degrees of freedom is min(n 1 − 1, n 2 − 1).

Example: confidence interval for  1 –  2 using min(n 1 –1, n 2 -1) to approximate the df Example – Do people who eat high-fiber cereal for breakfast consume, on average, fewer calories for lunch than people who do not eat high-fiber cereal for breakfast? – A sample of 150 people was randomly drawn. Each person was identified as a consumer or a non-consumer of high- fiber cereal. – For each person the number of calories consumed at lunch was recorded. 8

Example: confidence interval for    –   9 Solution: The parameter to be tested is the difference between two means. The claim to be tested is: The mean caloric intake of consumers (  1 ) is less than that of non-consumers (  2 ). Let’s use df = min(43-1, 107-1) = min(42, 106) = 42; t 42 * =

Example: confidence interval for    –   df = 42; t 42 * = The confidence interval estimator for the difference between two means using the formula is 10

Interpretation The 95% CI is (-57.40, -1.02). Since the interval is entirely negative (that is, does not contain 0), there is evidence from the data that µ 1 is less than µ 2. We estimate that non-consumers of high-fiber breakfast consume on average between 1.02 and more calories for lunch. 11

Beware!! Common Mistake !!! A common mistake is to calculate a one-sample confidence interval for    a one-sample confidence interval for    and to then conclude that   and   are equal if the confidence intervals overlap. This is WRONG because the variability in the sampling distribution for from two independent samples is more complex and must take into account variability coming from both samples. Hence the more complex formula for the standard error.

INCORRECT Two single-sample 95% confidence intervals: The confidence interval for the male mean and the confidence interval for the female mean overlap, suggesting no significant difference between the true mean for males and the true mean for females. Male interval: (18.68, 20.12) MaleFemale mean st. dev. s n50 Female interval: (16.94, 18.86)

Reason for Contradictory Result 14

Example: hypothesis test for    –   15 Solution: The parameter to be tested is the difference between two means. The claim to be tested is: The mean caloric intake of consumers (  1 ) is less than that of non-consumers (  2 ).

Example: hypothesis test for    –   (cont.) 16 H 0 :    –   = 0 ; H a :    –   < 0 Test statistic: Let’s use df = min(n 1 − 1, n 2 − 1) = min(43-1, 107-1) = min(42, 106) = 42 From t-table: for df=42, <t=-2.09 < .01 < P-value <.025 Conclusion: reject H 0 and conclude high-fiber breakfast eaters consume fewer calories at lunch

Does smoking damage the lungs of children exposed to parental smoking? Forced vital capacity (FVC) is the volume (in milliliters) of air that an individual can exhale in 6 seconds. FVC was obtained for a sample of children not exposed to parental smoking and a group of children exposed to parental smoking. We want to know whether parental smoking decreases children’s lung capacity as measured by the FVC test. Is the mean FVC lower in the population of children exposed to parental smoking? Parental smokingFVCsn Yes No

Parental smokingFVCsn Yes No We are 95% confident that lung capacity is between and 6.07 milliliters LESS in children of smoking parents. 95% confidence interval for (µ 1 − µ 2 ), with : df = min(30-1, 30-1) = 29  t* = :  1 = mean FVC of children with a smoking parent;  2 = mean FVC of children without a smoking parent

Do left-handed people have a shorter life-expectancy than right-handed people?   Some psychologists believe that the stress of being left- handed in a right-handed world leads to earlier deaths among left-handers.   Several studies have compared the life expectancies of left- handers and right-handers.   One such study resulted in the data shown in the table. We will use the data to construct a confidence interval for the difference in mean life expectancies for left- handers and right-handers. Is the mean life expectancy of left-handers less than the mean life expectancy of right-handers? HandednessMean age at deathsn Left Right left-handed presidents star left-handed quarterback Steve Young

We are 95% confident that the mean life expectancy for left- handers is between 3.26 and years LESS than the mean life expectancy for right-handers. 95% confidence interval for (µ 1 − µ 2 ), with : df = min(99-1, 888-1) = 98  t* = :  1 = mean life expectancy of left-handers;  2 = mean life expectancy of right-handers HandednessMean age at deathsn Left Right The “Bambino”,left-handed Babe Ruth, baseball’s all-time best player.

Matched pairs t procedures Sometimes we want to compare treatments or conditions at the individual level. These situations produce two samples that are not independent — they are related to each other. The members of one sample are identical to, or matched (paired) with, the members of the other sample. –Example: Pre-test and post-test studies look at data collected on the same sample elements before and after some experiment is performed. –Example: Twin studies often try to sort out the influence of genetic factors by comparing a variable between sets of twins. –Example: Using people matched for age, sex, and education in social studies allows canceling out the effect of these potential lurking variables.

Matched pairs t procedures The data: – “before”: x 11 x 12 x 13 … x 1n – “after”: x 21 x 22 x 23 … x 2n The data we deal with are the differences d i of the paired values: d 1 = x 11 – x 21 d 2 = x 12 – x 22 d 3 = x 13 – x 23 … d n = x 1n – x 2n A confidence interval for matched pairs data is calculated just like a confidence interval for 1 sample data: A matched pairs hypothesis test is just like a one- sample test: H 0 : µ difference = 0 ; H a : µ difference >0 (or <0, or ≠0) 22

Sweetening loss in colas The sweetness loss due to storage was evaluated by 10 professional tasters (comparing the sweetness before and after storage): Taster 95% Confidence interval: % Confidence interval:  (1.196/sqrt(10)) = 1.02  (.3782) 3 0.7= 1.02 .8556 =(.1644, ) − − Summary stats: = 1.02, s = We want to test if storage results in a loss of sweetness, thus: H 0 :  = 0 H 0 :  difference = 0 versus H a :  > 0 versus H a :  difference > 0 Before sweetness – after sweetness This is a pre-/post-test design and the variable is the cola sweetness before storage minus cola sweetness after storage. A matched pairs test of significance is indeed just like a one-sample test.

Sweetening loss in colas hypothesis test H 0 :  difference = 0vs H a :  difference > 0 Test statistic From t-table: for df=9, <t=2.6970< .01 < P-value <.025 ti83 gives P-value = … Conclusion: reject H 0 and conclude colas do lose sweetness in storage (note that CI was entirely positive. 24

Does lack of caffeine increase depression? Individuals diagnosed as caffeine-dependent are deprived of caffeine-rich foods and assigned to receive daily pills. Sometimes, the pills contain caffeine and other times they contain a placebo. Depression was assessed (larger number means more depression). – There are 2 data points for each subject, but we’ll only look at the difference. – The sample distribution appears appropriate for a t-test. 11 “difference” data points.

Hypothesis Test: Does lack of caffeine increase depression? For each individual in the sample, we have calculated a difference in depression score (placebo minus caffeine). There were 11 “difference” points, thus df = n − 1 = 10. We calculate that = 7.36; s = 6.92 H 0 :  difference = 0 ; H a :  difference > 0 For df = 10, p > ti83 gives P-value =.0027 Caffeine deprivation causes a significant increase in depression.