Random Effects & Repeated Measures

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Within Subjects Designs
Mixed Designs: Between and Within Psy 420 Ainsworth.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Other Analysis of Variance Designs Chapter 15. Chapter Topics Basic Experimental Design Concepts  Defining Experimental Design  Controlling Nuisance.
Design Supplemental.
DOCTORAL SEMINAR, SPRING SEMESTER 2007 Experimental Design & Analysis Analysis of Covariance; Within- Subject Designs March 13, 2007.
Random Effects & Repeated Measures Alternatives to Fixed Effects Analyses.
N-way ANOVA. Two-factor ANOVA with equal replications Experimental design: 2  2 (or 2 2 ) factorial with n = 5 replicate Total number of observations:
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
C82MST Statistical Methods 2 - Lecture 7 1 Overview of Lecture Advantages and disadvantages of within subjects designs One-way within subjects ANOVA Two-way.
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
PSYC512: Research Methods PSYC512: Research Methods Lecture 19 Brian P. Dyre University of Idaho.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
Lecture 9: One Way ANOVA Between Subjects
Sampling and Experimental Control Goals of clinical research is to make generalizations beyond the individual studied to others with similar conditions.
Repeated Measures ANOVA Used when the research design contains one factor on which participants are measured more than twice (dependent, or within- groups.
ANOVA Chapter 12.
Factorial ANOVA 2 or More IVs. Questions (1)  What are main effects in ANOVA?  What are interactions in ANOVA? How do you know you have an interaction?
Calculations of Reliability We are interested in calculating the ICC –First step: Conduct a single-factor, within-subjects (repeated measures) ANOVA –This.
Chapter 14: Repeated-Measures Analysis of Variance.
ANOVA. Independent ANOVA Scores vary – why? Total variability can be divided up into 2 parts 1) Between treatments 2) Within treatments.
Inferential Statistics
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
IE341 Midterm. 1. The effects of a 2 x 2 fixed effects factorial design are: A effect = 20 B effect = 10 AB effect = 16 = 35 (a) Write the fitted regression.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
Stats/Methods II JEOPARDY. Jeopardy Compare & Contrast Repeated- Measures ANOVA Factorial Design Factorial ANOVA Surprise $100 $200$200 $300 $500 $400.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Correlated-Samples ANOVA The Univariate Approach.
Smith/Davis (c) 2005 Prentice Hall Chapter Fifteen Inferential Tests of Significance III: Analyzing and Interpreting Experiments with Multiple Independent.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
More repeated measures. More on sphericity With our previous between groups Anova we had the assumption of homogeneity of variance With repeated measures.
Factorial BG ANOVA Psy 420 Ainsworth. Topics in Factorial Designs Factorial? Crossing and Nesting Assumptions Analysis Traditional and Regression Approaches.
Differences Among Groups
Chapter 14 Repeated Measures and Two Factor Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh.
Design Lecture: week3 HSTS212.
Dependent-Samples t-Test
Statistics for Managers Using Microsoft Excel 3rd Edition
Between-Subjects, within-subjects, and factorial Experimental Designs
An Introduction to Two-Way ANOVA
i) Two way ANOVA without replication
Comparing Three or More Means
Statistics for the Social Sciences
Chapter 5 Introduction to Factorial Designs
Experimental Design.
Two-Factor Studies with Equal Replication
Statistics for the Social Sciences
Two Sample t-test vs. Paired t-test
Chapter 11 Analysis of Variance
Other Analysis of Variance Designs
Introduction to ANOVA.
Two-Factor Studies with Equal Replication
Main Effects and Interaction Effects
Reasoning in Psychology Using Statistics
Joanna Romaniuk Quanticate, Warsaw, Poland
Latin Square Designs KNNL – Sections
Reasoning in Psychology Using Statistics
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
12 Inferential Analysis.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Factorial ANOVA 2 or More IVs.
Statistics for the Social Sciences
What are their purposes? What kinds?
Psych 231: Research Methods in Psychology
Experimental Statistics - week 8
Graziano and Raulin Research Methods: Chapter 12
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Random Effects & Repeated Measures Alternatives to Fixed Effects Analyses

Questions What is the difference between fixed- and random-effects in terms of treatments? How are F tests with random effects different than with fixed effects? How is a repeated measures design different from a totally between subjects design in the collection of the data?

Questions (2) How does the significance testing change from the totally between to a design to one in which one or more factors are repeated measures (just the general idea, you don’t need to show actual F ratios or computations)? Describe one argument for using repeated measures designs and one argument against using such designs (or describe when you would and would not want to use repeated measures).

Fixed Effects Designs All treatment conditions of interest are included in the study All in cell get identical stimulus (treatment, IV combination) Interest is in specific means Expected mean squares are (relatively) simple; F tests are all based on common error term.

Random Effects Designs Treatment conditions are sampled; some or many conditions of interest are excluded. Replications of the experiment would get different treatments because treatments are sampled Interest in the variance produced by an IV rather than means Expected mean squares relatively complex; the denominator for F changes depending on the effect being tested.

Fixed vs. Random Random Fixed Examples Conditions Persuasiveness of commercials Treatment Sampled All of interest Sex of participant Experimenter effect Replication different Replication same Drug dosage Impact of team members Variance due to IV Means due to IV Training program effectiveness

Single Factor Random The expected mean squares and F-test for the single random factor are the same as those for the single factor fixed-effects design.

Experimenter effects (Hays Table 13.4.1) 2 3 4 5 5.8 6.0 6.3 6.4 5.7 5.1 6.1 5.5 5.9 6.6 6.5 5.6 6.2 5.4 5.3 6.7 5.2 6.21 6.33 6.16

Sum of Source DF Squares Mean Square F Value Pr > F Model 4 3.48150000 0.87037500 10.72 <.0001 Error 35 2.84250000 0.08121429 Corrected Total 39 6.32400000

Random Effects Significance Tests (A & B random/within) Source E(MS) F df A J-1, (J-1)(K-1) B K-1, AxB (J-1)(K-1), JK(n-1) Error

A fixed B random Source E(MS) F df A J-1, (J-1)(K-1) B K-1, AxB JK(n-1) Error

Why the Funky MS? Treatment effects for A, B, & AxB are the same for fixed & random in the population of treatments. In fixed, we have the population, in random, we just have a sample. Therefore, in a given (random) study, the interaction effects need not sum to zero. The AxB effects appear in the main effects.

Applications of Random Effects Reliability and Generalizability How many judges do I need to get a reliability of .8? How well does this score generalize to a particular universe of scores? Intraclass correlations (ICCs) Estimated variance components Meta-analysis Control (Randomized Blocks and Repeated Measures) Sample as many conditions as you can afford (5+ if possible)

Review What is the difference between fixed- and random-effects in terms of treatments? How are F tests with random effects different than with fixed effects?

Repeated Measures Designs In a repeated measures design, participants appear in more than one cell; use ALL the levels of a factor, not just some of them. Painfree study – note the design – AVOID a single group pre-post design Sports instruction Commonly used in psychology

Pros & Cons of RM Pro Con Individuals serve as own control – improved power Carry over effects May be cheaper to run Participant sees design - demand characteristics Scarce participants

RM – Participant ‘Factor’ Source df MS E(MS) F Between Subjects K-1 No test Within Subjects Treatments J-1 Subjects x Treatments (J-1)(K-1) Total JK-1 Generic representation of a single factor within design

Drugs on Reaction Time Order of drug randomized. All Ss, all drugs. Interest is drug. Note that the Trial effect is ignored. Person Drug 1 Drug 2 Drug 3 Drug 4 Mean 1 30 28 16 34 27 2 14 18 10 22 3 24 20 23 4 38 44 5 26 24.5 26.4 25.6 15.6 32 24.9 Drug is fixed; person is random. ‘1 Factor’ repeated measures design. Notice 1 person per cell. We can get 3 SS: row, column, and residual (interaction plus error).

Total SS Person Drug 1 Drug 2 Drug 3 Drug 4 Mean 1 30 28 16 34 27 2 14 18 10 22 3 24 20 23 4 38 44 5 26 24.5 26.4 25.6 15.6 32 24.9

Drug SS Person Drug M D*D 1 26.4 2.25 3 15.6 86.49 2 4 5 25.6 0.49 32 50.41 Total 698.20

Person SS Person Drug M D*D 1 27 4.41 3 2 16 79.21 23 3.61 4 34 82.81 5 24.5 0.16 Total 680.8

Summary Total = 1491.8; Drugs = 698.2, People=680.8. Residual = Total –(Drugs+People) = 1491.8-(698.2+680.8) =112.8 Source SS df MS F Between People 680.8 4 Nuisance variance Within people (by summing) 811.0 15 Drugs 698.2 3 232.73 24.76 Residual 112.8 12 9.40 Total 1491.8 19 Fcrit(.05) =3.95

R code Run the same problem using R.

2 Factor, 1 Repeated Subject B1 B2 B3 B4 M 1 5 3 2 A1 4 3.25 6 3.75 7 8 5.25 A2 9 5.75 3.83 2.5 6.17 3.33 4.56 DV=errors in control setting dials; IV(A) is dial calibration - between; IV(B) is dial shape - within. Observation is randomized over dial shape.

Summary Source SS df MS F Between people 68.21 5 A(calibration) 51.04 Note that different factors are tested with different error terms. Source SS df MS F Between people 68.21 5 A(calibration) 51.04 1 11.9 Subjects within groups 17.17 4 4.29 Within people 69.75 18 B (dial shape) 47.46 3 15.82 12.76 AB 7.46 2.49 2.01 BxSub within group 14.83 12 1.24

Graph Run the problem in R.

Post Hoc Tests Post hoc tests with repeated measures are tricky. You have to use the proper error term for each test. The error term changes depending on what you are testing. Be sure to look up the right error term. In R, you will have to look for examples of the kind of thing you want for post-hoc tests. Generally speaking, avoid ANOVA for repeated measures designs. Use MANOVA instead (see next slide).

Assumptions of RM Orthogonal ANOVA assumes homogeneity of error variance within cells. IVs are independent. With repeated measures, we introduce covariance (correlation) across cells. For example, the correlation of scores across subjects 1-3 for the first two calibrations is .89. Repeated measures designs make assumptions about the homogeneity of covariance matrices across conditions for the F test to work properly. If the assumptions are not met, you have problems and may need to make adjustments. You can avoid these assumptions by using multivariate techniques (MANOVA) to analyze your data. I suggest you do so. Howell likes corrections to df. If you use ANOVA, you need to look up your design to get the right F tests and check on the assumptions.

Review How is a repeated measures design different from a totally between subjects design in the collection of the data? How does the significance testing change from the totally between to a design to one in which one or more factors are repeated measures (just the general idea, you don’t need to show actual F ratios or computations)? Describe one argument for using repeated measures designs and one argument against using such designs (or describe when you would and would not want to use repeated measures).

If time R code simple judge reliability Judges and targets are crossed

If time.. Paired associate learning experiment: 8 randomly chosen participants were given 3 lists of 35 pairs of words to learn. Lists in random order to each participant. Score is number correctly recalled on first trial. Are these lists differently difficult?

Paired associate data Subject A B C 1 22 15 18 2 9 12 3 16 13 10 4 19 20 6 17 7 14 8