Chapter 17 Analysis of Variance

Slides:



Advertisements
Similar presentations
Chapter 11 Analysis of Variance
Advertisements

Chapter 16 Introduction to Nonparametric Statistics
Chapter 12 ANALYSIS OF VARIANCE.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 11 Analysis of Variance
Chapter 12 Chi-Square Tests and Nonparametric Tests
Statistics for Business and Economics
Chapter Topics The Completely Randomized Model: One-Factor Analysis of Variance F-Test for Difference in c Means The Tukey-Kramer Procedure ANOVA Assumptions.
Chap 11-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 11 Hypothesis Testing II Statistics for Business and Economics.
Chapter 3 Analysis of Variance
Statistics for Managers Using Microsoft® Excel 5th Edition
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 15 Analysis of Variance.
Chapter 11 Analysis of Variance
Copyright ©2011 Pearson Education 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel 6 th Global Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 12-1 Business Statistics: A Decision-Making Approach 8 th Edition Chapter 12 Analysis.
F-Test ( ANOVA ) & Two-Way ANOVA
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Statistics for Business and Economics Chapter 8 Design of Experiments and Analysis of Variance.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
INFERENTIAL STATISTICS: Analysis Of Variance ANOVA
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
Chapter 10 Two-Sample Tests and One-Way ANOVA
© 2002 Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 9 Analysis of Variance.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
CHAPTER 12 Analysis of Variance Tests
Chapter 10 Analysis of Variance.
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance & Post-ANOVA ANALYSIS IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Analysis of Variance.
Business Statistics: Contemporary Decision Making, 3e, by Black. © 2001 South-Western/Thomson Learning 11-1 Business Statistics, 3e by Ken Black Chapter.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
TOPIC 11 Analysis of Variance. Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Lecture 9-1 Analysis of Variance
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 15 Analysis of Variance.
Business Statistics: A First Course (3rd Edition)
Chapter 4 Analysis of Variance
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Chap 11-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 11 Analysis of Variance.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
ENGR 610 Applied Statistics Fall Week 8 Marshall University CITE Jack Smith.
DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
Chapter 11 Analysis of Variance
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Statistics for Managers Using Microsoft Excel 3rd Edition
Two-Way Analysis of Variance Chapter 11.
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Econ 3790: Business and Economic Statistics
Chapter 11 Analysis of Variance
Chapter 9 Analysis of Variance
Chapter 15 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Chapter 17 Analysis of Variance Statistics for Business and Economics 6th Edition Chapter 17 Analysis of Variance Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Chapter Goals After completing this chapter, you should be able to: Recognize situations in which to use analysis of variance Understand different analysis of variance designs Perform a one-way and two-way analysis of variance and interpret the results Conduct and interpret a Kruskal-Wallis test Analyze two-factor analysis of variance tests with more than one observation per cell Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Way Analysis of Variance Evaluate the difference among the means of three or more groups Examples: Average production for 1st, 2nd, and 3rd shift Expected mileage for five brands of tires Assumptions Populations are normally distributed Populations have equal variances Samples are randomly and independently drawn Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Hypotheses of One-Way ANOVA All population means are equal i.e., no variation in means between groups At least one population mean is different i.e., there is variation between groups Does not mean that all population means are different (some pairs may be the same) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Way ANOVA All Means are the same: The Null Hypothesis is True (No variation between groups) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Way ANOVA At least one mean is different: (continued) At least one mean is different: The Null Hypothesis is NOT true (Variation is present between groups) or Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Variability The variability of the data is key factor to test the equality of means In each case below, the means may look different, but a large variation within groups in B makes the evidence that the means are different weak A B A B C Group A B C Group Small variation within groups Large variation within groups Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Partitioning the Variation Total variation can be split into two parts: SST = SSW + SSG SST = Total Sum of Squares Total Variation = the aggregate dispersion of the individual data values across the various groups SSW = Sum of Squares Within Groups Within-Group Variation = dispersion that exists among the data values within a particular group SSG = Sum of Squares Between Groups Between-Group Variation = dispersion between the group sample means Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Partition of Total Variation Total Sum of Squares (SST) Variation due to random sampling (SSW) Variation due to differences between groups (SSG) = + Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Total Sum of Squares SST = SSW + SSG Where: SST = Total sum of squares K = number of groups (levels or treatments) ni = number of observations in group i xij = jth observation from group i x = overall sample mean Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Total Variation (continued) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Within-Group Variation SST = SSW + SSG Where: SSW = Sum of squares within groups K = number of groups ni = sample size from group i Xi = sample mean from group i Xij = jth observation in group i Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Within-Group Variation (continued) Summing the variation within each group and then adding over all groups Mean Square Within = SSW/degrees of freedom Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Within-Group Variation (continued) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Between-Group Variation SST = SSW + SSG Where: SSG = Sum of squares between groups K = number of groups ni = sample size from group i xi = sample mean from group i x = grand mean (mean of all data values) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Between-Group Variation (continued) Variation Due to Differences Between Groups Mean Square Between Groups = SSG/degrees of freedom Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Between-Group Variation (continued) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Obtaining the Mean Squares Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Way ANOVA Table Source of Variation MS (Variance) SS df F ratio Between Groups SSG MSG SSG K - 1 MSG = F = K - 1 MSW Within Groups SSW SSW n - K MSW = n - K SST = SSG+SSW Total n - 1 K = number of groups n = sum of the sample sizes from all groups df = degrees of freedom Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Factor ANOVA F Test Statistic H0: μ1= μ2 = … = μK H1: At least two population means are different Test statistic MSG is mean squares between variances MSW is mean squares within variances Degrees of freedom df1 = K – 1 (K = number of groups) df2 = n – K (n = sum of sample sizes from all groups) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Interpreting the F Statistic The F statistic is the ratio of the between estimate of variance and the within estimate of variance The ratio must always be positive df1 = K -1 will typically be small df2 = n - K will typically be large Decision Rule: Reject H0 if F > FK-1,n-K,  = .05 Do not reject H0 Reject H0 FK-1,n-K, Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Factor ANOVA F Test Example Club 1 Club 2 Club 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204 You want to see if three different golf clubs yield different distances. You randomly select five measurements from trials on an automated driving machine for each club. At the .05 significance level, is there a difference in mean distance? Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Factor ANOVA Example: Scatter Diagram Distance 270 260 250 240 230 220 210 200 190 Club 1 Club 2 Club 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204 • • • • • • • • • • • • • • • 1 2 3 Club Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Factor ANOVA Example Computations Club 1 Club 2 Club 3 254 234 200 263 218 222 241 235 197 237 227 206 251 216 204 x1 = 249.2 x2 = 226.0 x3 = 205.8 x = 227.0 n1 = 5 n2 = 5 n3 = 5 n = 15 K = 3 SSG = 5 (249.2 – 227)2 + 5 (226 – 227)2 + 5 (205.8 – 227)2 = 4716.4 SSW = (254 – 249.2)2 + (263 – 249.2)2 +…+ (204 – 205.8)2 = 1119.6 MSG = 4716.4 / (3-1) = 2358.2 MSW = 1119.6 / (15-3) = 93.3 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

One-Factor ANOVA Example Solution Test Statistic: Decision: Conclusion: H0: μ1 = μ2 = μ3 H1: μi not all equal  = .05 df1= 2 df2 = 12 Critical Value: F2,12,.05= 3.89 Reject H0 at  = 0.05  = .05 There is evidence that at least one μi differs from the rest Do not reject H0 Reject H0 F = 25.275 F2,12,.05 = 3.89 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

ANOVA -- Single Factor: Excel Output EXCEL: tools | data analysis | ANOVA: single factor SUMMARY Groups Count Sum Average Variance Club 1 5 1246 249.2 108.2 Club 2 1130 226 77.5 Club 3 1029 205.8 94.2 ANOVA Source of Variation SS df MS F P-value F crit Between Groups 4716.4 2 2358.2 25.275 4.99E-05 3.89 Within 1119.6 12 93.3 Total 5836.0 14   Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Test Use when the normality assumption for one-way ANOVA is violated Assumptions: The samples are random and independent variables have a continuous distribution the data can be ranked populations have the same variability populations have the same shape Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Test Procedure Obtain relative rankings for each value In event of tie, each of the tied values gets the average rank Sum the rankings for data from each of the K groups Compute the Kruskal-Wallis test statistic Evaluate using the chi-square distribution with K – 1 degrees of freedom Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Test Procedure (continued) The Kruskal-Wallis test statistic: (chi-square with K – 1 degrees of freedom) where: n = sum of sample sizes in all groups K = Number of samples Ri = Sum of ranks in the ith group ni = Size of the ith group Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Test Procedure (continued) Complete the test by comparing the calculated H value to a critical 2 value from the chi-square distribution with K – 1 degrees of freedom Decision rule Reject H0 if W > 2K–1, Otherwise do not reject H0  2 Do not reject H0 Reject H0 2K–1, Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Example Do different departments have different class sizes? Class size (Math, M) Class size (English, E) Class size (Biology, B) 23 45 54 78 66 55 60 72 70 30 40 18 34 44 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Example Do different departments have different class sizes? Class size (Math, M) Ranking Class size (English, E) Class size (Biology, B) 23 41 54 78 66 2 6 9 15 12 55 60 72 45 70 10 11 14 8 13 30 40 18 34 44 3 5 1 4 7  = 44  = 56  = 20 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Example (continued) The W statistic is Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Kruskal-Wallis Example (continued) Compare W = 6.72 to the critical value from the chi-square distribution for 3 – 1 = 2 degrees of freedom and  = .05: Since H = 6.72 > , reject H0 There is sufficient evidence to reject that the population means are all equal Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way Analysis of Variance Examines the effect of Two factors of interest on the dependent variable e.g., Percent carbonation and line speed on soft drink bottling process Interaction between the different levels of these two factors e.g., Does the effect of one particular carbonation level depend on which level the line speed is set? Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way ANOVA Assumptions Populations are normally distributed (continued) Assumptions Populations are normally distributed Populations have equal variances Independent random samples are drawn Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Randomized Block Design Two Factors of interest: A and B K = number of groups of factor A H = number of levels of factor B (sometimes called a blocking variable) Block Group 1 2 … K . H x11 x12 x1H x21 x22 x2H xK1 xK2 xKH Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way Notation Let xji denote the observation in the jth group and ith block Suppose that there are K groups and H blocks, for a total of n = KH observations Let the overall mean be x Denote the group sample means by Denote the block sample means by Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Partition of Total Variation SST = SSG + SSB + SSE Variation due to differences between groups (SSG) Total Sum of Squares (SST) = + Variation due to differences between blocks (SSB) + Variation due to random sampling (unexplained error) (SSE) The error terms are assumed to be independent, normally distributed, and have the same variance Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way Sums of Squares The sums of squares are Degrees of Freedom: n – 1 K – 1 H – 1 (K – 1)(K – 1) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way Mean Squares The mean squares are Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way ANOVA: The F Test Statistic F Test for Groups H0: The K population group means are all the same Reject H0 if F > FK-1,(K-1)(H-1), F Test for Blocks H0: The H population block means are the same Reject H0 if F > FH-1,(K-1)(H-1), Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

General Two-Way Table Format Source of Variation Sum of Squares Degrees of Freedom Mean Squares F Ratio Between groups blocks Error Total SSG SSB SSE SST K – 1 H – 1 (K – 1)(H – 1) n - 1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

More than One Observation per Cell A two-way design with more than one observation per cell allows one further source of variation The interaction between groups and blocks can also be identified Let K = number of groups H = number of blocks L = number of observations per cell n = KHL = total number of observations Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

More than One Observation per Cell (continued) SST = SSG + SSB + SSI + SSE Degrees of Freedom: SSG Between-group variation K – 1 SST Total Variation SSB Between-block variation H – 1 SSI Variation due to interaction between groups and blocks (K – 1)(H – 1) n – 1 SSE Random variation (Error) KH(L – 1) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Sums of Squares with Interaction Degrees of Freedom: n - 1 K – 1 H – 1 (K – 1)(H – 1) KH(L – 1) Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way Mean Squares with Interaction The mean squares are Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way ANOVA: The F Test Statistic F Test for group effect H0: The K population group means are all the same Reject H0 if F > FK-1,KH(L-1), F Test for block effect H0: The H population block means are the same Reject H0 if F > FH-1,KH(L-1), F Test for interaction effect H0: the interaction of groups and blocks is equal to zero Reject H0 if F > F(K-1)(H-1),KH(L-1), Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Two-Way ANOVA Summary Table Source of Variation Sum of Squares Degrees of Freedom Mean Squares F Statistic Between groups SSG K – 1 MSG = SSG / (K – 1) MSG MSE Between blocks SSB H – 1 MSB = SSB / (H – 1) MSB MSE Interaction SSI (K – 1)(H – 1) MSI = SSI / (K – 1)(H – 1) MSI MSE Error SSE KH(L – 1) MSE = SSE / KH(L – 1) Total SST n – 1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Features of Two-Way ANOVA F Test Degrees of freedom always add up n-1 = KHL-1 = (K-1) + (H-1) + (K-1)(H-1) + KH(L-1) Total = groups + blocks + interaction + error The denominator of the F Test is always the same but the numerator is different The sums of squares always add up SST = SSG + SSB + SSI + SSE Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Examples: Interaction vs. No Interaction Interaction is present: No interaction: Block Level 1 Block Level 1 Block Level 3 Mean Response Mean Response Block Level 2 Block Level 2 Block Level 3 A B C A B C Groups Groups Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.

Chapter Summary Described one-way analysis of variance The logic of Analysis of Variance Analysis of Variance assumptions F test for difference in K means Applied the Kruskal-Wallis test when the populations are not known to be normal Described two-way analysis of variance Examined effects of multiple factors Examined interaction between factors Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc.