Published by Melinda Gaines. Modified over 9 years ago.
1
Topic 8 - Comparing two samples
Confidence intervals/hypothesis tests for two means - pages 246-261
Hypothesis test for two variances - pages 272-275
2
Comparing two populations
Sometimes we want to compare two populations rather than make decisions about a single population. For example, we might want to compare two population means or two population proportions to see if they are equal. Is the expected drying time for one type of paint lower than that of another type of paint? Is the proportion of Republicans who favor withdrawing from Iraq higher than the proportion of Democrats who favor withdrawal?
3
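Comparing two population means
Suppose we have two independent samples, $X_1, \ldots, X_m$ and $Y_1, \ldots, Y_n$, from two separate populations. A natural statistic for comparing the two population means, $\mu_X$ and $\mu_Y$, is $\bar{X} - \bar{Y}$. The distribution of $\bar{X} - \bar{Y}$ is approximately Normal for m and n both large, with mean $\mu_X - \mu_Y$ and variance $\sigma_X^2/m + \sigma_Y^2/n$.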
4
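Large samples test for comparing population means
To test $H_0: \mu_X - \mu_Y = 0$, use the test statistic
$$Z = \frac{\bar{X} - \bar{Y}}{\sqrt{S_X^2/m + S_Y^2/n}}$$
where $S_X^2$ and $S_Y^2$ are the sample variances.
If $H_A$ is $\mu_X - \mu_Y < 0$, reject $H_0$ if $Z < -z_\alpha$.
If $H_A$ is $\mu_X - \mu_Y > 0$, reject $H_0$ if $Z > z_\alpha$.
If $H_A$ is $\mu_X - \mu_Y \neq 0$, reject $H_0$ if $|Z| > z_{\alpha/2}$.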
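The large-sample test can be sketched in Python from summary statistics alone. This is a minimal illustration, assuming the standard error uses the two sample variances; the function names and the numbers in the example are hypothetical and are not taken from the home sales data.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_z(xbar, ybar, s_x, s_y, m, n):
    """Large-sample Z statistic for H0: mu_X - mu_Y = 0."""
    standard_error = sqrt(s_x**2 / m + s_y**2 / n)
    return (xbar - ybar) / standard_error


def reject_h0(z, alpha, alternative):
    """Apply the rejection rule for 'less', 'greater', or 'two-sided'."""
    std_normal = NormalDist()
    if alternative == "less":
        return z < -std_normal.inv_cdf(1 - alpha)
    if alternative == "greater":
        return z > std_normal.inv_cdf(1 - alpha)
    if alternative == "two-sided":
        return abs(z) > std_normal.inv_cdf(1 - alpha / 2)
    raise ValueError("alternative must be 'less', 'greater', or 'two-sided'")


# Hypothetical summary statistics (illustration only).
z = two_sample_z(xbar=105.0, ybar=100.0, s_x=10.0, s_y=12.0, m=50, n=60)
print(round(z, 3), reject_h0(z, alpha=0.01, alternative="greater"))
```

The same statistic can lead to different decisions under different alternatives, because the one-sided rule uses $z_\alpha$ while the two-sided rule uses the larger $z_{\alpha/2}$.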
5
Home sales data
A realtor in Albuquerque wants to argue that houses in the Northeast are more expensive on average than those in the rest of town. The data below contain sale prices (in hundreds of dollars) for homes in the city. NE = 1 indicates a home was in the Northeast. NE = 0 indicates a home was not in the Northeast. Test the appropriate hypotheses with $\alpha = 0.01$.
6
Large samples confidence interval for the difference between two population means
A large sample $(1-\alpha)100\%$ confidence interval for $\mu_X - \mu_Y$ is
$$\bar{X} - \bar{Y} \pm z_{\alpha/2}\sqrt{S_X^2/m + S_Y^2/n}$$
For the home sales data, what is a 99% confidence interval for the difference between sale prices in the Northeast and the rest of town?
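As a rough sketch, the large-sample interval can be computed from summary statistics as follows. The function name and the input values are hypothetical; they are not the home sales figures.

```python
from math import sqrt
from statistics import NormalDist


def large_sample_ci(xbar, ybar, s_x, s_y, m, n, confidence):
    """Large-sample confidence interval for mu_X - mu_Y."""
    alpha = 1 - confidence
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # z_{alpha/2}
    margin = z_crit * sqrt(s_x**2 / m + s_y**2 / n)
    diff = xbar - ybar
    return diff - margin, diff + margin


# Hypothetical summary statistics (illustration only).
lower, upper = large_sample_ci(105.0, 100.0, 10.0, 12.0, 50, 60, confidence=0.99)
print(round(lower, 3), round(upper, 3))
```

An interval that contains 0 corresponds to failing to reject $H_0$ in the two-sided test at the matching $\alpha$.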
7
Equal population variances
Suppose we assume that the two populations have a common variance $\sigma^2$. We can then estimate this common variance using the pooled sample variance:
$$S_p^2 = \frac{(m-1)S_X^2 + (n-1)S_Y^2}{m+n-2}$$
8
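Small samples test for comparing population means from Normal distributions with equal variances
To test $H_0: \mu_X - \mu_Y = 0$, use the test statistic
$$T = \frac{\bar{X} - \bar{Y}}{S_p\sqrt{1/m + 1/n}}$$
where $S_p^2$ is the pooled sample variance.
If $H_A$ is $\mu_X - \mu_Y < 0$, reject $H_0$ if $T < -t_{\alpha,\,n+m-2}$.
If $H_A$ is $\mu_X - \mu_Y > 0$, reject $H_0$ if $T > t_{\alpha,\,n+m-2}$.
If $H_A$ is $\mu_X - \mu_Y \neq 0$, reject $H_0$ if $|T| > t_{\alpha/2,\,n+m-2}$.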
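A minimal Python sketch of the pooled test follows. The two samples are made-up values of size 6 each, chosen only to mirror the sample sizes in the THC example; they are not the THC data set. The critical value $t_{0.005,\,10} = 3.169$ is read from a standard t table.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(x, y):
    """Return (T, degrees of freedom, pooled variance) for two samples."""
    m, n = len(x), len(y)
    s2_x, s2_y = variance(x), variance(y)  # sample variances (divisor size - 1)
    s2_p = ((m - 1) * s2_x + (n - 1) * s2_y) / (m + n - 2)
    t_stat = (mean(x) - mean(y)) / sqrt(s2_p * (1 / m + 1 / n))
    return t_stat, m + n - 2, s2_p


# Hypothetical measurements (illustration only).
x = [12.0, 15.0, 11.0, 14.0, 13.0, 13.0]
y = [10.0, 9.0, 12.0, 11.0, 8.0, 10.0]
t_stat, df, s2_p = pooled_t(x, y)

# Two-sided test at alpha = 0.01 with 10 degrees of freedom.
print(round(t_stat, 3), df, abs(t_stat) > 3.169)
```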
9
THC example with equal variances
The active component in marijuana is THC. An experiment was conducted to compare two slightly different configurations of this substance. The THC data set contains the time until the effect was perceived for 6 subjects exposed to each configuration. Using $\alpha = 0.01$, is there any evidence that the mean time to perception is different between the two configurations?
10
Small samples confidence interval for the difference between two population means
Assuming equal variances, a small sample $(1-\alpha)100\%$ confidence interval for $\mu_X - \mu_Y$ is
$$\bar{X} - \bar{Y} \pm t_{\alpha/2,\,n+m-2}\, S_p\sqrt{1/m + 1/n}$$
For the THC data, what is a 99% confidence interval for the mean difference between the detection times for the two configurations?
11
Unequal population variances
The pooled procedures discussed previously are fairly robust to violations of the equal variances assumption. In other words, if the two population variances are relatively close, the procedures perform well:
– The level of significance for the hypothesis test is close to what it should be
– The coverage probability for the confidence interval is close to what it should be
If the variances are quite different, then we need a different procedure.
12
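Small samples test for comparing population means from Normal distributions with unequal variances
To test $H_0: \mu_X - \mu_Y = 0$, use the test statistic
$$T = \frac{\bar{X} - \bar{Y}}{\sqrt{S_X^2/m + S_Y^2/n}}$$
with degrees of freedom
$$\nu = \frac{\left(S_X^2/m + S_Y^2/n\right)^2}{\dfrac{(S_X^2/m)^2}{m-1} + \dfrac{(S_Y^2/n)^2}{n-1}}$$
If $H_A$ is $\mu_X - \mu_Y < 0$, reject $H_0$ if $T < -t_{\alpha,\,\nu}$.
If $H_A$ is $\mu_X - \mu_Y > 0$, reject $H_0$ if $T > t_{\alpha,\,\nu}$.
If $H_A$ is $\mu_X - \mu_Y \neq 0$, reject $H_0$ if $|T| > t_{\alpha/2,\,\nu}$.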
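The unequal-variance statistic and its degrees of freedom can be sketched as below. This assumes the degrees of freedom follow the usual Welch-Satterthwaite approximation; the function name and the data are hypothetical, with the second sample deliberately much more variable than the first.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(x, y):
    """Return (T, approximate degrees of freedom) without pooling variances."""
    m, n = len(x), len(y)
    a = variance(x) / m  # S_X^2 / m
    b = variance(y) / n  # S_Y^2 / n
    t_stat = (mean(x) - mean(y)) / sqrt(a + b)
    nu = (a + b) ** 2 / (a**2 / (m - 1) + b**2 / (n - 1))
    return t_stat, nu


# Hypothetical measurements (illustration only).
x = [12.0, 15.0, 11.0, 14.0, 13.0, 13.0]
y = [4.0, 16.0, 7.0, 13.0, 10.0, 10.0]
t_stat, nu = welch_t(x, y)
print(round(t_stat, 3), round(nu, 2))
```

The approximate degrees of freedom always fall between the smaller of $m-1$ and $n-1$ and the pooled value $m+n-2$; they are usually not an integer and are commonly rounded down before using a t table.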
13
THC example with unequal variances
14
Small samples confidence interval for the difference between two population means
Assuming unequal variances, a small sample $(1-\alpha)100\%$ confidence interval for $\mu_X - \mu_Y$ is
$$\bar{X} - \bar{Y} \pm t_{\alpha/2,\,\nu}\sqrt{S_X^2/m + S_Y^2/n}$$
For the THC data, what is a 99% confidence interval for the mean difference between the detection times for the two configurations?
15
Paired data
Sometimes we have a third variable that connects elements from the X and Y samples. In this case, the assumption of independence between the two samples may be violated. For example, is there any evidence that the first twin and the second twin have different average weights among boy-boy twins? Here the twins are clearly connected by the mother. It might be better to base our test on the n pairwise differences, $D_i = X_i - Y_i$.
16
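Paired test for comparing population means
To test $H_0: \mu_X - \mu_Y = 0$, use the test statistic
$$T = \frac{\bar{D}}{S_D/\sqrt{n}}$$
where $\bar{D}$ and $S_D$ are the sample mean and standard deviation of the differences $D_i = X_i - Y_i$.
If $H_A$ is $\mu_X - \mu_Y < 0$, reject $H_0$ if $T < -t_{\alpha,\,n-1}$.
If $H_A$ is $\mu_X - \mu_Y > 0$, reject $H_0$ if $T > t_{\alpha,\,n-1}$.
If $H_A$ is $\mu_X - \mu_Y \neq 0$, reject $H_0$ if $|T| > t_{\alpha/2,\,n-1}$.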
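A short Python sketch of the paired statistic, computed from the pairwise differences, might look like this. The function name and the paired values are hypothetical and are not the StatCrunch Twins data.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(x, y):
    """Return (T, degrees of freedom) based on the differences D_i = X_i - Y_i."""
    if len(x) != len(y):
        raise ValueError("paired samples must have the same length")
    d = [xi - yi for xi, yi in zip(x, y)]
    n = len(d)
    t_stat = mean(d) / (stdev(d) / sqrt(n))
    return t_stat, n - 1


# Hypothetical paired measurements (illustration only).
x = [5.0, 6.0, 7.0, 8.0, 9.0]
y = [4.0, 6.0, 5.0, 7.0, 8.0]
t_stat, df = paired_t(x, y)
print(round(t_stat, 3), df)
```

Working with the differences turns the two-sample problem into a one-sample t test, so the degrees of freedom are $n - 1$ rather than $n + m - 2$.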
17
Twins example
Load the Twins data from the StatCrunch sample data sets. Using $\alpha = 0.1$, is there any evidence that Twin A and Twin B have different average weights among boy-boy twins?
18
Paired confidence interval for the difference between two population means
A small sample $(1-\alpha)100\%$ confidence interval for $\mu_X - \mu_Y$ is
$$\bar{D} \pm t_{\alpha/2,\,n-1}\,\frac{S_D}{\sqrt{n}}$$
For the twins data, what is a 90% confidence interval for the mean difference between the twin A and twin B weights?
19
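Comparing two population proportions
A natural statistic for comparing the two population proportions, $p_X$ and $p_Y$, is $\hat{p}_X - \hat{p}_Y$, the difference between the sample proportions from samples of sizes m and n. The distribution of $\hat{p}_X - \hat{p}_Y$ is approximately Normal for m and n both large.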
20
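Large samples test for comparing population proportions
To test $H_0: p_X - p_Y = 0$, use the test statistic
$$Z = \frac{\hat{p}_X - \hat{p}_Y}{\sqrt{\hat{p}(1-\hat{p})(1/m + 1/n)}}$$
where
$$\hat{p} = \frac{m\hat{p}_X + n\hat{p}_Y}{m+n}$$
is the pooled sample proportion.
If $H_A$ is $p_X - p_Y < 0$, reject $H_0$ if $Z < -z_\alpha$.
If $H_A$ is $p_X - p_Y > 0$, reject $H_0$ if $Z > z_\alpha$.
If $H_A$ is $p_X - p_Y \neq 0$, reject $H_0$ if $|Z| > z_{\alpha/2}$.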
21
Polio example
The following table summarizes a study of the efficacy of the Salk vaccine. Was the vaccine effective? Test at $\alpha = 0.05$.
Treatment | Total patients | Cases of polio
Vaccine | 201,229 | 33
Placebo | 200,745 | 110
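Using the counts from the Salk vaccine table, the two-proportion test can be sketched in Python. This is one possible setup, assuming X is the vaccine group, Y is the placebo group, and "effective" means the alternative $p_X - p_Y < 0$; the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z(x_cases, m, y_cases, n):
    """Z statistic for H0: p_X - p_Y = 0 using the pooled proportion."""
    p_x = x_cases / m
    p_y = y_cases / n
    p_pooled = (x_cases + y_cases) / (m + n)
    standard_error = sqrt(p_pooled * (1 - p_pooled) * (1 / m + 1 / n))
    return (p_x - p_y) / standard_error


# Counts from the table: vaccine 33 of 201,229; placebo 110 of 200,745.
z = two_proportion_z(33, 201_229, 110, 200_745)
z_crit = NormalDist().inv_cdf(1 - 0.05)  # z_alpha for a one-sided test
print(round(z, 2), z < -z_crit)
```

The statistic lands far below $-z_{0.05}$, so under these assumptions the test rejects $H_0$ in favor of a lower polio rate in the vaccine group.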
22
Large samples confidence interval for the difference between two population proportions
A large sample $(1-\alpha)100\%$ confidence interval for $p_X - p_Y$ is
$$\hat{p}_X - \hat{p}_Y \pm z_{\alpha/2}\sqrt{\frac{\hat{p}_X(1-\hat{p}_X)}{m} + \frac{\hat{p}_Y(1-\hat{p}_Y)}{n}}$$
For the Polio data, what is a 95% confidence interval for the difference between the proportion who contract the disease under each treatment?
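A sketch of the interval for the Polio counts (vaccine 33 of 201,229; placebo 110 of 200,745) is given below. It assumes the usual unpooled standard error for the interval, with the vaccine group as X; the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def proportion_diff_ci(x_cases, m, y_cases, n, confidence):
    """Large-sample confidence interval for p_X - p_Y (unpooled standard error)."""
    p_x = x_cases / m
    p_y = y_cases / n
    z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_crit * sqrt(p_x * (1 - p_x) / m + p_y * (1 - p_y) / n)
    diff = p_x - p_y
    return diff - margin, diff + margin


lower, upper = proportion_diff_ci(33, 201_229, 110, 200_745, confidence=0.95)
print(f"{lower:.6f} {upper:.6f}")
```

Both endpoints are negative, which agrees with rejecting $H_0$ in the corresponding two-sided test at $\alpha = 0.05$.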
23
Comparing two population variances
Suppose two chemical companies can supply a raw material, but we suspect the variability in concentration may differ between the two. The standard deviation of concentration in a random sample of 15 batches from company 1 was found to be 4.7 g/l. A sample of 21 batches from company 2 yielded a standard deviation of 5.8 g/l. Is there sufficient evidence to conclude that the variability in concentration differs for the two companies?
24
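Test for comparing population variances from Normal distributions
To test $H_0: \sigma_X^2 = \sigma_Y^2$, use the test statistic
$$F = \frac{S_X^2}{S_Y^2}$$
If $H_A$ is $\sigma_X^2 > \sigma_Y^2$, reject $H_0$ if $F > F_{\alpha,\,m-1,\,n-1}$.
If $H_A$ is $\sigma_X^2 < \sigma_Y^2$, reject $H_0$ if $F < F_{1-\alpha,\,m-1,\,n-1}$.
If $H_A$ is $\sigma_X^2 \neq \sigma_Y^2$, reject $H_0$ if $F > F_{\alpha/2,\,m-1,\,n-1}$ or $F < F_{1-\alpha/2,\,m-1,\,n-1}$.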
25
Chemical example
With $\alpha = 0.05$, is there sufficient evidence to conclude that the variability in concentration differs for the two companies?
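For the chemical example, the F statistic and its degrees of freedom follow directly from the reported standard deviations (4.7 g/l from 15 batches, 5.8 g/l from 21 batches). The sketch below computes them and leaves the two critical values as inputs to be read from an F table or calculator; the function names are hypothetical, and company 1 is assumed to play the role of X.

```python
def variance_ratio(s_x, s_y, m, n):
    """Return (F, numerator df, denominator df) for H0: sigma_X^2 = sigma_Y^2."""
    return s_x**2 / s_y**2, m - 1, n - 1


def reject_two_sided(f_stat, f_upper, f_lower):
    """Two-sided rule: reject if F > F_{alpha/2} or F < F_{1 - alpha/2}.

    f_upper and f_lower are critical values supplied by the caller.
    """
    return f_stat > f_upper or f_stat < f_lower


f_stat, df1, df2 = variance_ratio(4.7, 5.8, 15, 21)
print(round(f_stat, 4), df1, df2)
```

To finish the test, look up $F_{0.025,\,14,\,20}$ and $F_{0.975,\,14,\,20}$ and pass them to `reject_two_sided` together with `f_stat`.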
26
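Confidence interval for the ratio of two Normal population variances
A $(1-\alpha)100\%$ confidence interval for $\sigma_X^2/\sigma_Y^2$ is
$$\left(\frac{S_X^2}{S_Y^2}\cdot\frac{1}{F_{\alpha/2,\,m-1,\,n-1}},\ \frac{S_X^2}{S_Y^2}\cdot\frac{1}{F_{1-\alpha/2,\,m-1,\,n-1}}\right)$$
For the THC example, what is a 95% confidence interval for the ratio of the variances?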