Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Eleven Part 3 (Sections & 11.5) Chi-Square and F Distributions
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 2 Testing Two Variances Use independent samples from two populations to test the claim that the variances are equal.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 3 Assumptions for Testing Two Variances The two populations are independent The two populations each have a normal probability distribution.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 4 Notations Used:
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 5 Define population I as the population with the larger (or equal) sample variance
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 6 Set Up Hypotheses
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 7 Set Up Hypotheses
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 8 Equivalent hypotheses may be stated about standard deviations.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 9 Use the F Statistic
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 10 The F Distribution Not symmetrical Skewed right Values are always greater than or equal to zero. A specific F distribution is determined from two degrees of freedom.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 11 An F Distribution
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 12 Degrees of Freedom for Test of Two Variances Degrees of freedom for the numerator = d.f. N = n Degrees of freedom for the denominator = d.f. D = n 2 - 1
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 13 Values of the F Distribution Given in Table 8 of Appendix II
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 14 Some Values of the F Distribution
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 15 Find critical value of F from Table 8 Appendix II d.f. N = 3 d.f. D = 5 Right tail area = = 0.025
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 16 F Distribution d.f. N = 3, d.f. D = 5, = 0.025
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 17 Testing Two Variances
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 18 Assume we have the following data and wish to test the claim that the population variances are not equal.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 19 Hypotheses
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 20 The Sample Test Statistic
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 21 Degrees of Freedom for Test of Two Variances Degrees of freedom for the numerator = d.f. N = n = = 8 Degrees of freedom for the denominator = d.f. D = n = = 9
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 22 Critical Values of F Distribution Use = 0.05 For a two-tailed test, the area in the right tail of the distribution should be /2 = With d.f. N = 8 and d.f. D = 9 the critical value of F is 4.10.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 23 Critical Value of F: Two-Tailed Test F = 4.10 Area = /2
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 24 Our Test Statistic Does not fall in the Critical Region F = 4.10 Area = /2 F = 1.108
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 25 Conclusion At 5% level of significance, we cannot reject the claim that the variances are the same.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 26 P Value Approach Our sample test statistic was F = Looking in the block of entries in table 8 where d.f. N = 8 and d.f. D = 9, we find entries ranging from 3.23 to 5.47 for ranging from to F = is less than even the smallest of these results.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 27 P Value Conclusion For a two-tailed test, double the area in the right tail. Therefore P is greater than
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 28 Analysis of Variance A technique used to determine if there are differences among means for several groups.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 29 One Way Analysis of Variance Groups are based on values of only one variable.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 30 ANOVA Analysis of Variance
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 31 Assumptions for ANOVA Each of k groups of measurements is from a normal population. Each group is randomly selected and is independent of all other groups. Variables from each group come from distributions with approximately the same standard deviation.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 32 Purpose of ANOVA To determine the existence (or nonexistence) of a statistically significant difference among the group means
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 33 Null Hypothesis All the group populations are the same. All sample groups come from the same population.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 34 Alternate Hypothesis Not all the group populations are equal.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 35 Hypotheses H 0 : 1 = 2 =... = k H 1 :At least two of the means 1, 2,..., k are not equal.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 36 Steps in ANOVA Determine null and alternate hypotheses Find SS TOT = the sum of the squares of entire collection of data Find SS BET which measures variability between groups Find SS W which measures variability within groups
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 37 Steps in ANOVA Find the variance estimates within groups: MS W Find the variance estimates between groups: MS BET Find the F ratio and complete the ANOVA test
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 38 Data for three groups:
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 39 Sample sizes The sample sizes for the groups may be the same or different from one another. In our example, each sample has four items.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 40 Population Means Let 1, 2, and 3 represent the population means of groups 1, 2, and 3.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 41 Hypotheses and Level of Significance H 0 : 1 = 2 = 3 H 1 :At least two of the means 1, 2, and 3 are not equal. Use = 0.05.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 42 Find SS TOT = the sum of the squares of entire collection of data
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 43 Find SS TOT = the sum of the squares of entire collection of data
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 44 Find SS TOT = the sum of the squares of entire collection of data
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 45 Find SS TOT = the sum of the squares of entire collection of data
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 46 Squares and Sums
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 47 Finding SS TOT
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 48 Find SS TOT
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 49 Find SS BET
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 50 Finding SS BET
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 51 Variability Within ith Group
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 52 Variation Within First Group
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 53 Variation Within Second Group
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 54 Variation Within Third Group
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 55 Variability Within All Groups SS W = SS 1 + SS 2 + SS 3
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 56 Finding SS W SS W = SS 1 + SS 2 + SS 3 = = 9.50
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 57 Note: SS TOT = SS BET + SS W
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 58 We can check: SS TOT = SS BET + SS W = 94 =
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 59 Degrees of Freedom Degrees of freedom between groups = d.f. BET = k - 1, where k is the number of groups. Degrees of freedom within groups = d.f. W = N - k, where N is the total sample size. d.f. BET + d.f. W = N - 1.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 60 Variance Estimate Between Groups Mean Square Between Groups =
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 61 Variance Estimate Within Groups Mean Square Within Groups =
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 62 Mean Square Between Groups
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 63 Mean Square Within Groups
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 64 Find the F Ratio
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 65 Finding the F Ratio
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 66 Null Hypothesis: All the groups are samples from the same distribution If H 0 is true MS BET and MS W would estimate the same quantity. The F ratio should be approximately equal to one.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 67 Using Table 8, Appendix II d.f. BET = number of groups - 1 = d.f. for numerator. d.f. W = total sample size - number of groups = d.f. for denominator. Rejection region is the right tail of the distribution.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 68 Using Table 8, Appendix II d.f. BET = number of groups - 1 = d.f. for numerator = 2. d.f. W = total sample size - number of groups = d.f. for denominator = 9. = Critical F value = 4.26
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 69 Conclusion Since our observed value of F (40.009) is greater than the critical F value (4.26) we reject the null hypothesis. We conclude that not all of the means are equal.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 70 P Value Approach For d.f. BET = d.f. for numerator = 2 and d.f. W = d.f. for denominator = 9, our observed F value (42.009) exceeds even the largest critical F value. Thus P < We conclude that we would reject the null hypothesis for any 0.01.