Download presentation
Presentation is loading. Please wait.
Published byReynard Merritt Modified over 8 years ago
2
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.4 Analyzing Dependent Samples
3
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 3 Each observation in one sample has a matched observation in the other sample. The observations are called matched pairs. Dependent Samples
4
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 4 The cell phone analysis presented earlier in this text used independent samples: One group used cell phones A separate control group did not use cell phones Example: Cell Phones and Driving
5
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 5 An alternative design used the same subjects for both groups. Reaction times are measured when subjects performed the driving task without using cell phones and then again while using cell phones. Example: Cell Phones and Driving
6
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 6 Data: Table 10.9 Reaction Times on Driving Skills Before and While Using Cell Phone The difference score is the reaction time using the cell phone minus the reaction time not using it, such as 636 - 604 = 32 milliseconds. Example: Cell Phones and Driving
7
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 7 Benefits of using dependent samples (matched pairs): Many sources of potential bias are controlled so we can make a more accurate comparison. Using matched pairs keeps many other factors fixed that could affect the analysis. Often this results in the benefit of smaller standard errors. Example: Cell Phones and Driving
8
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 8 To Compare Means with Matched Pairs, Use Paired Differences: For each matched pair, construct a difference score. d = (reaction time using cell phone) – (reaction time without cell phone). Calculate the sample mean of these differences: To Compare Means with Matched Pairs, Use Paired Differences
9
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 9 The difference between the means of the two samples equals the mean of the difference scores for the matched pairs. The difference between the population means is identical to the parameter that is the population mean of the difference scores. To Compare Means with Matched Pairs, Use Paired Differences
10
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 10 Let n denote the number of observations in each sample. This equals the number of difference scores. The 95 % CI for the population mean difference is: Confidence Interval For Dependent Samples
11
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 11 To test the hypothesis of equal means, we can conduct the single-sample test of with the difference scores. The test statistic is: Paired Difference Inferences
12
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 12 These paired-difference inferences are special cases of single-sample inferences about a population mean so they make the same assumptions. The sample of difference scores is a random sample from a population of such difference scores. The difference scores have a population distribution that is approximately normal. This is mainly important for small samples (less than about 30) and for one- sided inferences. Paired Difference Inferences
13
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 13 Confidence intervals and two-sided tests are robust: They work quite well even if the normality assumption is violated. One-sided tests do not work well when the sample size is small and the distribution of differences is highly skewed. Paired Difference Inferences
14
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 14 The box plot shows skew to the right for the difference scores. Two- sided inference is robust to violations of the assumption of normality. The box plot does not show any severe outliers. Example: Cell Phones and Driver Reaction Time Figure 10.9 MINITAB Box Plot of Difference Scores from Table 10.9. Question: How is it that some of the scores plotted here are negative?
15
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 15 Significance test: (and hence equal population means for the two conditions) Test statistic: Example: Cell Phones and Driver Reaction Time
16
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 16 Table 10.10 Software Output for Matched-Pairs Analysis With Table 10.9 Example: Cell Phones and Driver Reaction Time
17
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 17 The P-value displayed in the output is approximately 0. There is extremely strong evidence that the population mean reaction times are different. Example: Cell Phones and Driver Reaction Time
18
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 18 A 95% CI for : Example: Cell Phones and Driver Reaction Time
19
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 19 We infer that the population mean when using cell phones is between about 32 and 70 milliseconds higher than when not using cell phones. The confidence interval is more informative than the significance test, since it predicts possible values for the difference. Example: Cell Phones and Driver Reaction Time
20
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 20 A recent GSS asked subjects whether they believed in Heaven and whether they believed in Hell: Belief in Hell Belief in HeavenYesNoTotal Yes9551621117 No9188197 Total9643501314 Comparing Proportions with Dependent Samples
21
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 21 We can estimate as: Note that the data consist of matched pairs. Recode the data so that for belief in heaven or hell, 1=yes and 0=no. HeavenHellInterpretationDifference, dFrequency 11believe in Heaven and Hell1-1=0955 10believe in Heaven, not Hell1-0=1162 01believe in Hell, not Heaven0-1=-19 00do not believe in Heaven or Hell0-0=0188 Comparing Proportions with Dependent Samples
22
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 22 Sample mean of the 1314 difference scores is: Note that this equals the difference in proportions, We have converted the two samples of binary observations into a single sample of 1314 difference scores. We can now use single-sample methods with the differences as we did for the matched-pairs analysis of means. Comparing Proportions with Dependent Samples
23
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 23 Use the fact that the sample difference,, is the mean of difference scores of the re-coded data. We can find a confidence interval for by finding a confidence interval for the population mean of difference scores. Confidence Interval Comparing Proportions with Matched-Pairs Data
24
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 24 We’ve already seen that the sample mean of the 1314 difference scores is. A 95% confidence interval for equals Sample mean difference, which is Table 10.13 Software Output for Analyzing Difference Scores from Table 10.12 to Compare Beliefs in Heaven and Hell. Confidence Interval Comparing Proportions with Matched-Pairs Data
25
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 25 Hypotheses:, can be one or two sided. Test Statistic: For the two counts for the frequency of “yes” on one response and “no” on the other, the z test statistic equals their difference divided by the square root of their sum. (Assumption: The sum of the counts should be at least 30, but in practice the two-sided test works well even if this is not true). P-value: For, two-tail probability of z test statistic values more extreme than observed z, using standard normal distribution. SUMMARY: McNemar Test for Comparing Proportions with Matched-Pairs Data
26
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 26 Research in comparing the quality of different speech recognition systems uses a series of isolated words as a benchmark test, finding for each system the proportion of words for which an error of recognition occurs. Example: McNemar’s Test: Speech Recognition Systems Table 10.14 Results of Test Using 2000 Words to Compare Two Speech Recognition Systems (GMDS and CDHMM)
27
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 27 The test statistic for McNemar’s Test: The two-sided P-value is 0.000001, approximately 0. There is extremely strong evidence against the null hypothesis that the correct detection rates are the same for the two systems. Note that this result agrees with the confidence interval for calculated earlier. Example: McNemar’s Test: Speech Recognition Systems
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.