Two Factor ANOVA Copyright (c) 2008 by The McGraw-Hill Companies. This material is intended solely for educational purposes by licensed users of LearningStats.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Analysis of Variance (ANOVA)
Chapter 11 Analysis of Variance
Chapter 17 Analysis of Variance
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
T WO W AY ANOVA W ITH R EPLICATION  Also called a Factorial Experiment.  Factorial Experiment is used to evaluate 2 or more factors simultaneously. 
T WO WAY ANOVA WITH REPLICATION  Also called a Factorial Experiment.  Replication means an independent repeat of each factor combination.  The purpose.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Factorial Experiments
1 Experimental Statistics - week 7 Chapter 15: Factorial Models (15.5) Chapter 17: Random Effects Models.
Analysis of Variance or ANOVA. In ANOVA, we are interested in comparing the means of different populations (usually more than 2 populations). Since this.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
CHAPTER 12 Analysis of Variance Tests
Chapter 10 Analysis of Variance.
ANOVA (Analysis of Variance) by Aziza Munir
Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Copyright © 2004 Pearson Education, Inc.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Blocking & Confounding in the 2 k Factorial Design Text reference, Chapter 7 Blocking is a technique for dealing with controllable nuisance variables.
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
Design Of Experiments With Several Factors
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 14 Experimental.
1 Always be contented, be grateful, be understanding and be compassionate.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 10.3.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 12-3 Two-Way ANOVA.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Three or More Factors: Latin Squares
Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.
Chapter 9 More Complicated Experimental Designs. Randomized Block Design (RBD) t > 2 Treatments (groups) to be compared b Blocks of homogeneous units.
10.12 Page 344 Two Sample t-test regarding (2) population means.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Analysis of Variance l Chapter 8 l 8.1 One way ANOVA
Chapter 11 Analysis of Variance
Process Control Charts
Regression and Correlation
Keller: Stats for Mgmt & Econ, 7th Ed Analysis of Variance
CHAPTER 4 Analysis of Variance (ANOVA)
SPSS Homework SPSS Homework 12.1 Practice Data from exercise ) Use linear contrasts to compare 5 days vs 20 and 35 days 2) Imagine you.
One-Sample Hypothesis Tests
Statistics for Managers Using Microsoft Excel 3rd Edition
Lecture Slides Elementary Statistics Twelfth Edition
Factorial Experiments
Two way ANOVA with replication
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
Applied Business Statistics, 7th ed. by Ken Black
Comparing Three or More Means
Two way ANOVA with replication
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Lecture Slides Elementary Statistics Twelfth Edition
Goodness-of-Fit Tests
More Complicated Experimental Designs
Two-Factor Studies with Equal Replication
Chapter 11 Analysis of Variance
Chapter 9 Hypothesis Testing.
More Complicated Experimental Designs
Two-Factor Studies with Equal Replication
Statistical Inference about Regression
More Complicated Experimental Designs
Analysis of Variance (ANOVA)
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Analysis of Variance Objective
Experimental Statistics - week 8
Chapter 15 Analysis of Variance
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Two Factor ANOVA Copyright (c) 2008 by The McGraw-Hill Companies. This material is intended solely for educational purposes by licensed users of LearningStats. It may not be copied or resold for profit.

When Do We Need Two-Factor ANOVA? When we believe that the response variable Y is affected by more than one factor. Y = response variable A = first factor B = second factor Y = f(A, B) Model: A Y B

Two-Factor ANOVA (Randomized Block) Linear Model Form: Yjk = m + tj + fk + ejk Hypotheses H0: tj = 0 (no treatment effect exists) H1: tj  0 (treatment effect exists) H0: fk = 0 (no block effect exists) H1: fk  0 (block effect exists) Definitions Yij = data in treatment j and block k m = common mean tj = effect due to treatment j fk = effect due to block k ejk = random error Note This notation is for a fixed effects model. If tj = 0 and fk = 0, the model collapses to Yjk = m + ejk which says that each observed data value is the mean perturbed by some random error.

Example of 3x3 Format (two-factor unreplicated) Data Format Example of 3x3 Format (two-factor unreplicated) Row-Column Format (Excel) Stacked Format (Minitab) Note: Often only one factor (treatment) is of research interest, while the other variable (block) serves only to control for a second factor. The calculations are the same, no matter how you view the design. Usually, the blocking factor is placed in the rows.

Example: Pollution

ANOVA Table: 2-Factor General format: Illustration: Definitions c = number of columns r = number of rows n = number of observations If F exceeds Fcrit there is a significant difference between treatment groups at the chosen a.

Interpretation F Statistics: p-Values: Bottom Line: For freeway, F = 24.903 exceeds Fcrit =3.490 (for d.f. = 3,12) so there is a significant difference between freeways at a = 0.05. For time of day, F = 21.506 exceeds Fcrit =3.259 (for d.f. = 4,12) so there is a significant difference between times of day at a = 0.05. p-Values: Both p-values are 0.000, which says that F statistics as large as these would not arise by chance at any common level of significance if the null hypothesis were true (i.e., if the treatment means were the same). We could reject the hypothesis of equal group means even at a = 0.001 Bottom Line: Both freeway and time of day affect the pollution level. Both effects are extremely significant.

Two-Factor ANOVA (replicated) Model Form Xijk = m + aj + bk + abjk+ eijk Note aj and bk are called the main effects. Hypotheses H0: aj = 0 (no row effect exists) H1: aj  0 (row effect exists) H0: bk = 0 (no column effect exists) H1: bk  0 (column effect exists) H0: abjk = 0 (no interaction exists) H1: abjk  0 (interaction effect exists) Definitions Xijk = ith obs. in row j and col. k m = common mean aj = effect due to row j bk = effect due to column k abjk = effect due to interaction eijk = random error

Two-Factor ANOVA (replicated) Example with 3 Groups Row-Column Format (Excel) Stacked Format (Minitab) For clarity, only two observations per cell are shown, but you can have as many as you want. Subscripts and symbols omitted for clarity.

Replicated: DVD Sales

ANOVA Table: Replicated General format: Illustration: Definitions c = number of columns r = number of rows m = number of observations per cell If F exceeds Fcrit there is a significant difference between treatment groups at the chosen a.

Interpretation F Statistics: P-Values: Bottom Line: Both main effects are significant difference at a = 0.05 since the F statistics (200.412 and 14.054) exceed their critical values (4.256 in both cases, using d.f. =2,9). For the interaction effect, F = 2.398 does not exceed the critical value Fcrit =3.633 (for d.f. = 4,9) at a = 0.05. P-Values: For both main effects, the p-values of 0.000 and 0.002 are highly significant, but the interaction p-value of 0.127 is significant at a = 0.20 but not at a = 0.10. Bottom Line: Both store size and display location affect weekly sales. Both effects are extremely significant, though store size is a stronger effect (smaller p-value). Interaction exists only at a very weak level of significance.

ANOVA Notation Textbooks and computer software use various symbols for treatments and subscripts. We follow Excel's practice of using r for rows and c for columns. In one-factor ANOVA, we use SSB for variation between columns and SSW for variation within columns. In two-factor ANOVA, we use SSA and SSB for the main effects and SSAB for the interaction.

Stacked Format Most computer packages expect the dependent variable to be in a column, and each factor to be in a column, like this: Obs Y Factor 1 Factor 2 1 y1 3 2 y2 y3 4 ... n yn

Example: One-Factor Stacked Key AmtPaid is the amount to be paid to provider by insurance carrier ProdID refers to the patient’s insurance type M=Medicare D=Medicaid S=Commercial 1 P=Commercial 2 Note These observations (n = 498) were chosen at random from a database containing 1,508 records.

Minitab Procedure Copyright Notice Portions of MINITAB Statistical Software input and output contained in this document are printed with permission of Minitab, Inc. MINITABTM is a trademark of Minitab Inc. in the United States and other countries and is used herein with the owner's permission.

Minitab Results: Hospital Charges P-value indicates no significant group difference at a = 0.10. Minitab shows overlapping C.I.'s for each group mean.

Stacked or Unstacked? Comment In Excel, the one-factor ANOVA test requires that the data be grouped into separate columns. In this case, there are 4 insurance types so we would need 4 separate data columns. This is awkward for large data sets. Minitab can convert unstacked data into stacked data (and vice versa). In databases, the variables usually are coded in stacked format (one column for each factor).

Excel Procedure: Hospital Charges

Excel Results: Hospital Charges P-value would not even be significant at a = 0.10. There are 4 insurance groups so d.f.=3 for the “treatment” effect. At the 5% level of significance, F does not exceed Fcrit.

More Than Two Factors? ANOVA can have any number of factors (main effects) and their interactions. For example: 3-factor ANOVA model with all possible interactions: X = f (A, B, C, AB, AC, BC, ABC) 4-factor ANOVA model with all possible interactions is: X = f (A, B, C, D, AB, AC, AD, BC, BD, CD, ABC, ABD, BCD, ACD, ABCD) Samples sizes are rarely large enough to estimate such models, so higher-order interactions often are not examined.

One factor often suffices! Summary ANOVA compares means in several groups. Each group (or combination of factors) is a treatment. 1-factor ANOVA is most common, comparing c groups. 2-factor ANOVA without replication omits interactions. 2-factor ANOVA with replication allows interactions. k-factor ANOVA is conceptually simple (but Excel doesn’t do it). One factor often suffices!