Handout Nine: Repeated Measures –Design, Analysis, & Assumptions.

Slides:



Advertisements
Similar presentations
ANALYSIS OF VARIANCE (ONE WAY)
Advertisements

MANOVA: Multivariate Analysis of Variance
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Statistics for Linguistics Students Michaelmas 2004 Week 6 Bettina Braun
Design of Experiments and Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
C82MST Statistical Methods 2 - Lecture 7 1 Overview of Lecture Advantages and disadvantages of within subjects designs One-way within subjects ANOVA Two-way.
Statistics for Business and Economics
Using Between-Subjects and Within-Subjects Experimental Designs
Lecture 9: One Way ANOVA Between Subjects
Two Groups Too Many? Try Analysis of Variance (ANOVA)
PSY 1950 Repeated-Measures ANOVA October 29, 2008.
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
One-way Between Groups Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Chapter 14 Inferential Data Analysis
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
F-Test ( ANOVA ) & Two-Way ANOVA
SPSS Series 1: ANOVA and Factorial ANOVA
ANOVA Greg C Elvers.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Single-Factor Experimental Designs
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
TAUCHI – Tampere Unit for Computer-Human Interaction ERIT 2015: Data analysis and interpretation (1 & 2) Hanna Venesvirta Tampere Unit for Computer-Human.
Chapter 7 Experimental Design: Independent Groups Design.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
ANOVA (Analysis of Variance) by Aziza Munir
Experimental Design: One-Way Correlated Samples Design
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Testing Hypotheses about Differences among Several Means.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Experimental Research Methods in Language Learning
EDCI 696 Dr. D. Brown Presented by: Kim Bassa. Targeted Topics Analysis of dependent variables and different types of data Selecting the appropriate statistic.
Smoking Data The investigation was based on examining the effectiveness of smoking cessation programs among heavy smokers who are also recovering alcoholics.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Stats Lunch: Day 8 Repeated-Measures ANOVA and Analyzing Trends (It’s Hot)
Two-Way (Independent) ANOVA. PSYC 6130A, PROF. J. ELDER 2 Two-Way ANOVA “Two-Way” means groups are defined by 2 independent variables. These IVs are typically.
ONE-WAY BETWEEN-GROUPS ANOVA Psyc 301-SPSS Spring 2014.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Smith/Davis (c) 2005 Prentice Hall Chapter Fifteen Inferential Tests of Significance III: Analyzing and Interpreting Experiments with Multiple Independent.
Handout Seven: Independent-Samples t Test Instructor: Dr. Amery Wu
Instructor: Dr. Amery Wu
Handout Five: Between-Subjects Design of Analysis of Variance- Planned vs. Post Hoc Comparisons EPSE 592 Experimental Designs and Analysis in Educational.
Handout Twelve: Design & Analysis of Covariance
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
Handout Eight: Two-Way Between- Subjects Design with Interaction- Assumptions, & Analyses EPSE 592 Experimental Designs and Analysis in Educational Research.
Research Methods and Data Analysis in Psychology Spring 2015 Kyle Stephenson.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Analysis of Variance STAT E-150 Statistical Methods.
Handout Ten: Mixed Design Analysis of Variance EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr. Amery Wu Handout Ten:
Within Subject ANOVAs: Assumptions & Post Hoc Tests.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.Introduction To Basic Ratios 3.Basic Ratios In Excel 4.Cumulative.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Multivariate vs Univariate ANOVA: Assumptions. Outline of Today’s Discussion 1.Within Subject ANOVAs in SPSS 2.Within Subject ANOVAs: Sphericity Post.
Presentation transcript:

Handout Nine: Repeated Measures –Design, Analysis, & Assumptions. EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr. Amery Wu

One-way Within-Subjects Factorial Design with Interaction Where We Are Today One-way Within-Subjects Factorial Design with Interaction Measurement of Data Quantitative Categorical Type of the Inference Descriptive A B Inferential C D Today, we will introduce the analysis and assumptions of a within-subjects design. This design explains/predicts quantitative data using one (implicit) independent variables, with two or more conditions under which the measurements of the DV are taken from the same individuals. 2

An Example to Contextualize the Learning One-Way Within-Subjects Design of ANOVA Eight Canadian students participated in a food dislike study where four different edible foods from different cultures were compared. The time until retching of the same 8 students was recorded after they were given each of the four treatment conditions (eating parts of 4 animals; Field, 2007). Student Stick Insect Kangaroo Testicle Fish Eye Witchetty Grub 1 8 7 6 2 9 5 3 4 10 12

An Example to Contextualize the Learning One-Way Within-Subjects Design of ANOVA The researcher would like to find out whether there is an effect of animal parts on the students’ food dislike (time till retch).

Quantitative Methodology Network Research Question Inference Descriptive vs. Inferential Relational vs. Causal Design Experimental Observational Model Descriptive/Summative Explanatory/Predictive Data Quantitative Categorical

What renders the validity of making a causal inference based on a randomized experiment research design?

Between-Subjects Design Within-Subjects Design Between- vs. Within Subjects Designs 1 2 3 4 5 6 8 Between-Subjects Design 12 individuals were randomly assigned to each of the three groups. Each group receives a different treatment (A, B, or C). Treatment A Treatment B 4 3 5 1 2 6 8 Within-Subjects Design The same 4 individuals receive all three treatments of A, B, and C. Treatment A Treatment B Treatment C

What are the major differences between the designs of One-Way Between-Subjects and One-Way Within-Subjects ANOVA?

Between- vs. Within Subjects Designs In a between-subjects design, groups separate individuals such that individuals in one group are different from those in another. In a within-subjects design, individuals are not grouped. Each individual is a group of his/her own. There are as many groups as individuals. In a between-subjects design, each individual receives one condition of the treatment, in a within-subjects design, each individual receives all conditions of the treatment. For a between-subjects design, the DV is only measured once for each individual. For a within-subjects-design, the same DV is measured repeatedly (under various conditions). The data of the two designs are also laid out differently (see next slide).

Layout of the Data Files between- vs. within- Subjects Designs Note that there are 18 data values for both designs. Subject Condition Score 1 114 2 79 3 89 4 91 5 94 6 95 7 92 8 9 10 11 96 12 13 81 14 85 15 87 16 88 17 90 18 Subject Condition 1 Condition 2 Condition 3 1 114 102 121 2 92 97 101 3 85 82 89 4 93 99 5 110 6 98

Within-Subjects Design & Analysis In a between-subjects (independent samples) experimental design, different treatments are given to different groups of individuals, e.g., individuals receiving treatment A will not receive treatment B or C. Their scores on the DV is then recorded once after the single treatment is given. An individual’s group membership is recorded as an IV. In a within-subjects (repeated measures or dependent samples) experimental design, the same individuals receives all the treatments, one treatment at a time, and the same DV is measured multiple times - each time after each treatment. Recording the treatment IV as a variable (i.e., control, treatment A, or treatment B) is unnecessary since everyone is in all treatment groups.

Within-Subjects Design & Analysis In between-subjects ANOVA, the treatment effect is investigated by comparing the mean DV scores among the different treatment groups (e.g., control vs. treatment). In a within-subjects ANOVA, the treatment effect is investigated by comparing the mean DV scores of the same individuals- comparing their own repeated DV scores under different treatment conditions. Repeated measures ANOVA is analogous to a dependent (paired) sample t-test in which it compares the means between two groups of the same individuals. When there are 2 or more dependent (related) samples, the repeated measures ANOVA is used.

Within-Subjects Design & Analysis Like the between-subjects ANOVA, within-subjects ANOVA is an inferential statistical technique used to compare and test the effect of a treatment (mean differences among two or more treatment groups). However, unlike the between-subjects ANOVA where the group membership is independent, individuals’ DV scores across the IV groups are dependent resulting from the same (or matched) individuals whose scores on the DV are repeatedly measured after each of the experiment conditions.

How Repeated Measures ANOVA Works? In a between-subjects design, each individual is given one of the treatments of the experiment and measured on the DV once. Cross-individual differences (within each group) can only be regarded as random errors (sample to sample errors due to chance). Remember that random variation can not be explained, although it can be quantified as error variance. With a repeated measures experimental design, systematic variation can be due to differences in the treatments and differences across the individuals. Another way of understanding a repeated measures design is that each individual forms a group, the number of groups is equal to the sample size.

How Repeated Measures ANOVA Works? For example, 30 people were asked to give their ratings on 8 ethnic dishes, the variation among the 240 scores (30X8) may be due to: 1. systematic treatment condition differences (e.g., differences in the taste of the dishes) 2. systematic individual differences (e.g., Tony and Kiki tended to give higher ratings but Chen and Wu tended to give lower ratings on western cuisine). 3. random differences (sample to sample variation due to chance) If each individual is measured K times on the DV, systematic variation across the K repeated measures for each individual can be calculated and partitioned out of the total sum of squares. That is, variation within a subject, can be explained and partitioned out of the error variance).

Advantages of a Within-Subjects Design With a repeated measures design, relatively fewer participants are needed. It is preferred when the recruitment of participants is difficult. A repeated measures design allows studying of changes of the same participants’ behavior over the conditions (temporal changes or changes due to the conditions of the treatment). In essence, with a repeated measures design, participants serve as their own comparison groups. This type of design may increase the power by controlling cross-individual differences. However, increase in power is not guaranteed, the actual observed power will depend on how large the individual differences are and in the degrees of freedom.

Limitations of a Within-Subjects Design This design will not work well if there is a carry-over effect, i.e., the effect of treatment on the DV is affected by practice and learning due to earlier exposure under other treatment conditions or participants’ natural maturation. For example, it is hard to conclude that the new intern training program leads to better nursing practices, if the same interns were exposed to the tasks/questions at the first exam and are already familiar with the contents and procedures of the exam when taking the exam again after completing the second and third training programs. Because there is only one person in each group, the “subject by condition interaction effect” can not be computed (modeled) and is treated as error.

Error Variance in a One-Way Within-Subjects Design The pattern of dislike of animal parts differs among the individuals. Variation due to individuals’ differences in the pattern over the repeated measures (subject by animal_parts interaction) is treated as the error variance.

One-way Within-Subjects Design Partitioning the Total Sum of Squares by the 1 Factors 1 Factor

See Excel spreadsheet “Animal Parts.xls” Calculating The Mean Squares for the F Summary Table One-way Within-Subjects Design See Excel spreadsheet “Animal Parts.xls”

Data Assumptions about Repeated Measures ANOVA Assumption 1: Normal Distribution The difference scores between any pairs of the repeatedly measured DVs is normally distributed. With K groups, there are K(K-1)/2 difference scores. This assumption is analogous to that for the dependent-sample t-test where the difference scores between the pair of DVs is normally distributed. This assumption can be tested by whether all the paired difference scores are normally distributed using the statistic of skewness and graphs of histogram.

Data Assumptions about Repeated Measures ANOVA Assumption 2: Sphericity The variances of the difference scores between all pairs of repeatedly measured DVs are equal (Sphericity). This assumption used to be hypothesis tested using the Mauchly’s W test, which follows Chi-square distributions. A significant result signals that the sphericity assumption is violated. However, using a inferential test for checking sphericity is considered to be too strict (always rejected). Epsilon, ἓ, is an alternative measure for sphericity; a maximum value of 1 indicates perfect sphericity. In SPSS, three Epsilon indices are reported. It’s been suggested that a Greenhouse-Geisser’s ἓ being less than 0.75 signals a violation of the sphericity.

Interpretation of the Repeated Measures ANOVA The omnibus F test The interpretation of a repeated measures ANOVA results is similar to that of an independent ANOVA. The focus is to conclude, using the F test, whether there is any statistically significant difference between the means in the DV under different IV conditions. If the sphericity assumption is violated (Greenhouse-Geisser’s ἓ less than 0.75), pick one of the alternative F-tests that corrects for lack of sphericity (i.e., Greenhouse-Geisser, Huynh-Feldt, or Lower-bound). Specifically, the numerator and denominator degrees of freedom in the F test are multiplied by the epsilon of the corresponding method. Greenhouse-Geisser F test is more conservative, especially when sample size is small. Huynh-Feldt’s F test less conservative but may assume epsilon values greater than 1.0, in which case it is set to 1.0. The lower bound method for F Test is the most conservative.

Lab Activity: One-Way Within-Subjects Design Using SPSS for Omnibus F Test Remember to hit “Add”

Lab Activity: One-Way Within-Subjects Design SPSS Outputs for Omnibus F Test Since Greenhouse-Geisser’s ἓ is less than 0.75 signaling a violation of the sphericity, one should choose one of the three alternative F tests. Note that they reached different conclusions.

Interpretation of the Repeated Measures ANOVA The post-hoc comparisons If the omnibus test is significant, the next task is to examine which paired differences is statistically significant. This post hoc test is analogous to the dependent (paired) sample t-tests. Note that violation of sphericity can also complicate the post hoc paired t-tests. In SPSS, there are three options: Tukey’s LSD test does not correct for violation of sphericity, hence is not recommended. Bonferroni’s test corrects for violation of sphericity. It is more conservative than the other two and is recommended in general. Sidak’s test corrects for violation of sphericity. It is more powerful than the other two and is considered when power is a concern.

Lab Activity: One-Way Within-Subjects Design Using SPSS for Post Hoc Comparisons

Lab Activity: One-Way Within-Subjects Design SPSS Outputs for Post hoc Comparisons.