Issues in factorial design


No main effects but interaction present
Can I have a significant interaction without significant main effects? Yes. Consider the following table of means:

        B1    B2    Mean
A1      10    20    15
A2      20    10    15
Mean    15    15

No main effects but interaction present
We can see from the marginal means that there is no difference between the levels of A, nor between the levels of B. However, look at the graphical display.
[Figure: plot of the cell means]

No main effects but interaction present
In such a scenario we may have a significant interaction without any significant main effects. Again, the interaction tests for differences among the cell means after the main effects have been factored out. Interpret the interaction as normal.
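The point can be checked with a few lines of arithmetic. This is a minimal sketch, not part of the original slides: the A1 row (10, 20) comes from the table of means, while the A2 row (20, 10) is an assumption, chosen because it is the only pair that leaves all marginal means equal at 15.

```python
# Cell means for a 2 x 2 design.
# A1 row is from the slide; A2 row is an assumption (see the lead-in).
cell_means = {
    ("A1", "B1"): 10.0, ("A1", "B2"): 20.0,
    ("A2", "B1"): 20.0, ("A2", "B2"): 10.0,
}

def marginal_mean(factor_index, level):
    """Unweighted mean of the cell means at one level of one factor."""
    values = [m for key, m in cell_means.items() if key[factor_index] == level]
    return sum(values) / len(values)

row_means = {a: marginal_mean(0, a) for a in ("A1", "A2")}
col_means = {b: marginal_mean(1, b) for b in ("B1", "B2")}

# Interaction contrast: the effect of B at A1 minus the effect of B at A2.
effect_b_at_a1 = cell_means[("A1", "B2")] - cell_means[("A1", "B1")]
effect_b_at_a2 = cell_means[("A2", "B2")] - cell_means[("A2", "B1")]
interaction_contrast = effect_b_at_a1 - effect_b_at_a2

print(row_means, col_means, interaction_contrast)
```

The marginal means are identical, so neither main effect has anything to detect, yet the interaction contrast is far from zero because the effect of B reverses between A1 and A2.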

Robustitude
As in most statistical situations, there is a more robust way of going about factorial ANOVA. Instead of means, we might prefer trimmed means or medians, so that the tests are based on estimates not so heavily influenced by outliers. And there’s nothing to it.
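To see why a trimmed mean is less sensitive to outliers, here is a minimal sketch of a 20% trimmed mean in plain Python. The helper `trimmed_mean` and the scores are hypothetical illustrations, not taken from the slides.

```python
import math
from statistics import mean, median

def trimmed_mean(values, proportion=0.2):
    """Mean after dropping the lowest and highest `proportion` of the sorted values."""
    ordered = sorted(values)
    cut = math.floor(proportion * len(ordered))  # number dropped from each tail
    return mean(ordered[cut:len(ordered) - cut])

# Hypothetical scores with a single extreme value.
scores = [10, 11, 12, 12, 13, 13, 14, 14, 15, 60]

ordinary = mean(scores)
trimmed = trimmed_mean(scores)
middle = median(scores)
print(ordinary, trimmed, middle)
```

The single score of 60 pulls the ordinary mean well above the bulk of the data, while the trimmed mean and the median stay where most of the observations are.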

Between subjects factorial using trimmed means: interaction and main effects

Robustitude
Or, using R, you type in something along the lines of
t2way(A, B, x, grp=c(1:p), tr=.2, alpha=.05)
Again, don’t be afraid to try robust methods, as they are often easily implemented with appropriate software.

Unequal sample sizes
Along with the typical assumptions of ANOVA, we are in effect assuming equal cell sizes as well. In non-experimental situations, there will usually be unequal numbers of observations in each cell: the semester or collection period ends and you need to graduate, the design is quasi-experimental, participants fail to arrive for testing, data are lost, etc.
In factorial designs the solution to this problem is not simple. The factor and interaction effects are not independent, so their sums of squares do not total up to SS between treatments (SSb/t). Interpretation can be seriously compromised, and there is no general, agreed-upon solution.

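The problem (Howell example)
[Table of mean errors by state (rows: Michigan, Arizona, column means) and drinking condition (columns: Non-drinking, Drinking, row means); the values were not preserved in the transcript.]
Drinking participants made on average 6 more errors, regardless of whether they came from Michigan or Arizona. There were no differences between Michigan and Arizona participants in that regard.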

Example
However, there is a difference in the row means, as if there were a difference between states. This occurs because there are unequal numbers of participants in the cells. In general, we do not want sample sizes to influence how we interpret differences between means. What can be done?
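A small sketch of how unequal cell sizes alone can produce a difference in the row means. The cell means (14 and 20 errors in both states) are assumed values consistent with the 6-error drinking effect described above, and the cell sizes are hypothetical, since the table values are not in the transcript.

```python
# Assumed cell means: identical in both states, drinking adds 6 errors.
cell_means = {
    ("Michigan", "non-drinking"): 14.0, ("Michigan", "drinking"): 20.0,
    ("Arizona", "non-drinking"): 14.0, ("Arizona", "drinking"): 20.0,
}
# Hypothetical, deliberately unequal cell sizes.
cell_sizes = {
    ("Michigan", "non-drinking"): 5, ("Michigan", "drinking"): 10,
    ("Arizona", "non-drinking"): 11, ("Arizona", "drinking"): 5,
}

def weighted_state_mean(state):
    """Mean over all participants in a state, so larger cells count for more."""
    cells = [key for key in cell_means if key[0] == state]
    total = sum(cell_means[key] * cell_sizes[key] for key in cells)
    return total / sum(cell_sizes[key] for key in cells)

michigan = weighted_state_mean("Michigan")
arizona = weighted_state_mean("Arizona")
print(michigan, arizona)
```

With these numbers the Michigan row mean is higher only because more of the Michigan participants happen to be in the drinking cell. The cell means themselves are the same in both states.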

Another example
How do men and women differ in their reports of depression on the HADS (Hospital Anxiety and Depression Scale), and does this difference depend on ethnicity? There are 2 independent variables, Gender (Male/Female) and Ethnicity (White/Black/Other), and one dependent variable, HADS score.

Note the difference in gender: 2.47 vs. 4.73. A simple t-test would show this difference to be statistically significant and a noticeable effect.

Unequal sample sizes
Note that when the factorial ANOVA is conducted, the gender difference disappears. It reflects that there is no difference when the cell means alone are used to calculate the mean for each gender: (1.48+6.6+12.56)/3 vs. (2.71+6.26+11.93)/3, i.e. about 6.88 vs. 6.97.
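The calculation above can be reproduced directly. The cell means are the ones quoted on the slide. The transcript does not say which set of means belongs to which gender, so the names `group_1` and `group_2` are placeholders.

```python
from statistics import mean

# Cell means across the three ethnicity categories, as quoted on the slide.
group_1_cell_means = [1.48, 6.60, 12.56]
group_2_cell_means = [2.71, 6.26, 11.93]

# Unweighted (equally weighted) marginal means: each cell counts once,
# no matter how many participants it contains.
group_1 = mean(group_1_cell_means)
group_2 = mean(group_2_cell_means)

print(round(group_1, 2), round(group_2, 2))
```

The two unweighted means differ by less than a tenth of a point, whereas the means computed over all participants (2.47 vs. 4.73) are pulled apart by the unequal cell sizes.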

Unequal sample sizes
What do we do? One common method is the unweighted (i.e. equally weighted) means solution: average the cell means without weighting them by the number of observations. Note that in such situations SStotal is usually not shown in ANOVA tables, as the separate sums of squares do not usually sum to SStotal.

Unequal sample sizes
In the drinking example, the unweighted means solution gives the desired result. It uses the harmonic mean of our sample sizes, and there is no state difference: 17 v 17.
With the HADS data this was actually part of the problem: the t-test would be using the weighted means, the ANOVA the ‘unweighted’ means. However, with the HADS data the tests of simple effects would bear out the gender difference, and as these would be part of the analysis, such a result would not be missed. In fact the gender difference is largely confined to the white category, i.e. there really was no main effect of gender in the ANOVA design.
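As a rough illustration of the two ingredients of the unweighted means solution, the sketch below computes the harmonic mean of a set of cell sizes and the equally weighted state means. The cell sizes are hypothetical. The cell means (14 and 20) are assumed values that reproduce the 17 v 17 result quoted above.

```python
from statistics import harmonic_mean, mean

# Hypothetical cell sizes for the four state-by-drinking cells.
cell_sizes = [5, 10, 11, 5]

# Harmonic mean of the cell sizes: the common per-cell n that the
# unweighted means analysis uses in place of the actual, unequal sizes.
n_h = harmonic_mean(cell_sizes)

# Assumed cell means (non-drinking, drinking) for each state.
state_cell_means = {"Michigan": [14.0, 20.0], "Arizona": [14.0, 20.0]}

# Unweighted marginal means: each cell mean counts equally.
unweighted = {state: mean(cells) for state, cells in state_cell_means.items()}

print(n_h, unweighted)
```

The harmonic mean is always at most the arithmetic mean of the cell sizes, so small cells pull the effective sample size down.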

A note about proportionality
Unequal cells are not always a problem. Consider the following tables of sample sizes:

Table 1
        B1    B2    B3
A1       5    10    20
A2      10    20    40

Table 2
        B1    B2    B3
A1       5    20    10
A2      10    10    50

Table 2 is not proportional
The cell sizes in the first table are proportional because their relative values are constant across all rows (1:2:4) and columns (1:2). Table 2 is not proportional: Row 1 (1:4:2), Row 2 (1:1:5).

Proportionality
Equal cell sizes are a special case of proportional cell sizes. As long as we have proportional cell sizes, we are ok with the traditional analysis. With nonproportional cell sizes the factors become correlated, and the greater the departure from proportionality, the more the main effects overlap.
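Proportionality can be checked mechanically: cell sizes are proportional when every cell count equals its row total times its column total divided by the grand total. The sketch below applies that check to the two tables of sample sizes. The A2 rows (10, 20, 40 and 10, 10, 50) are assumptions, reconstructed from the row and column ratios stated above. The function name `is_proportional` is likewise just an illustration.

```python
def is_proportional(table):
    """True when every cell count equals (row total * column total) / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    # Compare cross-products so the check stays in exact integer arithmetic.
    return all(
        count * grand_total == row_totals[i] * col_totals[j]
        for i, row in enumerate(table)
        for j, count in enumerate(row)
    )

table_1 = [[5, 10, 20], [10, 20, 40]]   # rows 1:2:4, columns 1:2
table_2 = [[5, 20, 10], [10, 10, 50]]   # row 1 is 1:4:2, row 2 is 1:1:5
equal_cells = [[8, 8], [8, 8]]          # equal cells: a special case

print(is_proportional(table_1), is_proportional(table_2), is_proportional(equal_cells))
```

Both tables have the same row and column totals, so the marginal counts alone do not reveal the problem. Only the cell-by-cell check separates the proportional table from the nonproportional one.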

More complex design: the 3-way interaction
Before, we had the levels of one variable changing over the levels of another. So what’s going on with a 3-way interaction? How would a 3-way interaction be interpreted?

2 X 2 X 2 Example
*Sometimes you will see interactions referred to as ordinal or disordinal. With the latter, we have a reversal of the treatment effect within the range of some factor being considered (as in the left graph).

3 X 2 X 2

3 X 3 X 2

Interpretation
An interaction between 2 variables is changing over the levels of another (third) variable. The interaction is itself interacting with another variable: the AB interaction depends on C.
Recall that our main effects would have their interpretation limited by a significant interaction. The interpretation of a main effect is not exactly clear without an understanding of the interaction; in other words, because of the significant interaction, the effect we see for a factor would not be the same over the levels of another. In a similar manner, the interpretation of our 2-way interactions would be limited by a significant 3-way interaction.
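One way to make "the AB interaction depends on C" concrete is to compute the AB interaction contrast separately at each level of C and compare the two. The sketch below does this for a 2 X 2 X 2 design. All cell means are hypothetical, chosen so that there is no AB interaction at C1 and a crossover at C2.

```python
# Hypothetical cell means for a 2 x 2 x 2 design, keyed by (A, B, C).
means = {
    ("A1", "B1", "C1"): 10.0, ("A1", "B2", "C1"): 20.0,
    ("A2", "B1", "C1"): 10.0, ("A2", "B2", "C1"): 20.0,
    ("A1", "B1", "C2"): 10.0, ("A1", "B2", "C2"): 20.0,
    ("A2", "B1", "C2"): 20.0, ("A2", "B2", "C2"): 10.0,
}

def ab_contrast(c):
    """AB interaction at one level of C: effect of B at A1 minus effect of B at A2."""
    effect_b_at_a1 = means[("A1", "B2", c)] - means[("A1", "B1", c)]
    effect_b_at_a2 = means[("A2", "B2", c)] - means[("A2", "B1", c)]
    return effect_b_at_a1 - effect_b_at_a2

ab_at_c1 = ab_contrast("C1")
ab_at_c2 = ab_contrast("C2")

# Three-way contrast: how much the AB interaction changes from C1 to C2.
abc_contrast = ab_at_c1 - ab_at_c2

print(ab_at_c1, ab_at_c2, abc_contrast)
```

A nonzero three-way contrast means the two-way AB pattern is not the same at every level of C, which is why a significant 3-way interaction limits how the 2-way interactions can be interpreted.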

Simple effects
Same for the 2-way interactions. However, now we have simple, simple main effects (differences in the levels of A at each BC combination) and simple interaction effects.
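As an illustration of simple, simple main effects, the sketch below lists the difference between the two levels of A at every combination of B and C in a 2 X 2 X 2 design. The cell means are hypothetical. A real analysis would test each difference against an error term rather than just report it.

```python
from itertools import product

# Hypothetical cell means for a 2 x 2 x 2 design, keyed by (A, B, C).
means = {
    ("A1", "B1", "C1"): 10.0, ("A1", "B2", "C1"): 20.0,
    ("A2", "B1", "C1"): 10.0, ("A2", "B2", "C1"): 20.0,
    ("A1", "B1", "C2"): 10.0, ("A1", "B2", "C2"): 20.0,
    ("A2", "B1", "C2"): 20.0, ("A2", "B2", "C2"): 10.0,
}

# Simple, simple main effect of A: A2 minus A1 within each BC combination.
simple_simple_effects = {
    (b, c): means[("A2", b, c)] - means[("A1", b, c)]
    for b, c in product(("B1", "B2"), ("C1", "C2"))
}

for (b, c), difference in simple_simple_effects.items():
    print(f"A2 - A1 at {b}, {c}: {difference}")
```

Even in the smallest 3-way design there are four such comparisons for a single factor, one per BC combination.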

Simple effects
In this 3 X 3 X 2 example, the simple interaction of BC is nonsignificant, and that does not change over the levels of A (nonsignificant ABC interaction). Consider these other situations.

Simple effects
As mentioned previously, a nonsignificant interaction does not necessarily mean that the simple effects are nonsignificant, as simple effects are not just a breakdown of the interaction but of the interaction plus the main effect. In a 3-way design, one can test for simple interaction effects in the presence of a nonsignificant 3-way interaction. The issue now arises that in testing simple, simple effects one would have at minimum four comparisons (for a 2X2X2).
Some examples are provided on the website using both the GLM and MANOVA procedures. Here is another from our backyard: http://www.coe.unt.edu/brookshire/spss3way.htm#simpsimp