Using Multilevel Modeling to Analyze Longitudinal Data Mark A. Ferro, PhD Offord Centre for Child Studies Lunch & Learn Seminar Series January 22, 2013.

Slides:



Advertisements
Similar presentations
Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Inference for Regression
Linear regression models
Lecture 6 (chapter 5) Revised on 2/22/2008. Parametric Models for Covariance Structure We consider the General Linear Model for correlated data, but assume.
Objectives (BPS chapter 24)
Simple Linear Regression 1. Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable.
© Willett, Harvard University Graduate School of Education, 5/21/2015S052/I.3(b) – Slide 1 More details can be found in the “Course Objectives and Content”
HLM – ESTIMATING MULTI-LEVEL MODELS Hierarchical Linear Modeling.
Graphs in HLM. Model setup, Run the analysis before graphing Sector = 0 public school Sector = 1 private school.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 13 Multiple Regression
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Psychology 202b Advanced Psychological Statistics, II
Mixing it up: Mixed Models Tracy Tomlinson December 11, 2009 Tracy Tomlinson December 11, 2009.
LECTURE 13 ANALYSIS OF COVARIANCE AND COVARIANCE INTERACTION and ATI (Aptitude-Treatment Interaction)
Chapter 12 Multiple Regression
Chapter Topics Types of Regression Models
Sept. 29 th, 2005 Investigating Learning over Time Mingyu Feng Neil Heffernan Longitudinal Analysis on Assistment Data.
Introduction to Probability and Statistics Linear Regression and Correlation.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
G Lecture 111 SEM analogue of General Linear Model Fitting structure of mean vector in SEM Numerical Example Growth models in SEM Willett and Sayer.
Introduction to Regression Analysis, Chapter 13,
Analysis of Clustered and Longitudinal Data
Introduction to Multilevel Modeling Using SPSS
Multilevel Modeling: Other Topics
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
Simple Linear Regression
Multilevel Modeling 1.Overview 2.Application #1: Growth Modeling Break 3.Application # 2: Individuals Nested Within Groups 4.Questions?
Covariance structures in longitudinal analysis Which one to choose?
A Random Coefficient Model Example: Observations Nested Within Individuals Academy of Management, 2010 Montreal, Canada Jeffrey B. Vancouver Ohio University.
The Impact of Including Predictors and Using Various Hierarchical Linear Models on Evaluating School Effectiveness in Mathematics Nicole Traxel & Cindy.
Hierarchical Linear Modeling (HLM): A Conceptual Introduction Jessaca Spybrook Educational Leadership, Research, and Technology.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Introduction Multilevel Analysis
Growth Curve Models Using Multilevel Modeling with SPSS David A. Kenny January 23, 2014.
Modeling Repeated Measures or Longitudinal Data. Example: Annual Assessment of Renal Function in Hypertensive Patients UNITNOYEARAGESCrEGFRPSV
Vilija R. Joyce, MS January 11, 2012 Modeling Utilities over Time.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Multilevel Linear Modeling aka HLM. The Design We have data at two different levels In this case, 7,185 students (Level 1) Nested within 160 Schools (Level.
Multilevel Modeling Software Wayne Osgood Crime, Law & Justice Program Department of Sociology.
What is a “Launch” relationship? Definition: Individual differences in the rate of change of a target variable is predicted from individual differences.
Statistical Models for the Analysis of Single-Case Intervention Data Introduction to:  Regression Models  Multilevel Models.
BUSI 6480 Lecture 8 Repeated Measures.
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
Developing a Mixed Effects Model Using SAS PROC MIXED
Unit 3a: Introducing the Multilevel Regression Model © Andrew Ho, Harvard Graduate School of EducationUnit 3a – Slide 1
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Vilija R. Joyce, MS November 14, 2012 Modeling Health-Related Quality of Life over Time.
1 Regression Review Population Vs. Sample Regression Line Residual and Standard Error of Regression Interpretation of intercept & slope T-test, F-test.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
© Murnane & Willett, Harvard University Graduate School of Education, 6/28/2016S290/Class #09 – Slide 1 S052: Applied Data Analysis Everything in the.
Longitudinal Data & Mixed Effects Models Danielle J. Harvey UC Davis.
An Introduction to Latent Curve Models
B&A ; and REGRESSION - ANCOVA B&A ; and
Psychology 202a Advanced Psychological Statistics
Statistical Models for the Analysis of Single-Case Intervention Data
Lecture 4 - Model Selection
Statistical Models for the Analysis of Single-Case Intervention Data
From GLM to HLM Working with Continuous Outcomes
An Introductory Tutorial
Longitudinal Data & Mixed Effects Models
Statistical Models for the Analysis of Single-Case Intervention Data
Presentation transcript:

Using Multilevel Modeling to Analyze Longitudinal Data Mark A. Ferro, PhD Offord Centre for Child Studies Lunch & Learn Seminar Series January 22, 2013

Recommended Readings 1.Singer JD, Willett JB. Applied longitudinal data analysis. Modeling change and event occurrence. New York: Oxford University Press; Singer JD. Fitting individual growth models using SAS PROC MIXED. In: Moskowitz DS, Hershberger SL, editors. Modeling intraindividual variability with repeated measures data. Methods and applications. Mahwah: Lawrence Erlbaum Associates; Singer JD. Using SAS PROC MIXED to fit multilevel models, hierarchical models, and individual growth models. J Educ Behav Stat 1998;24:

Objectives 1.Explore longitudinal data a)Wrong approaches 2.Understand multilevel model for change a)Specify the level-1 and level-2 models b)Interpret estimated fixed effects and variance components 3.Data analysis with the multilevel model a)Adding level-2 predictors b)Comparing models

Research Questions Broadly speaking, we are interested in two types of questions: 1.Start by asking about systematic change over time for each individual 2.Next ask questions about variability in patterns of change over time (what factors may help us explain different patterns of growth?)

Wrong Approaches 1.Estimated correlation coefficients: Problem: only measures status, not change (tells whether rank order is similar at both time-points) 2.Use difference score to measure change and use this as an estimate of rate of change Problem: assumes linear growth over time, but change may be non-linear

Less-than-Ideal Approaches 1.Aggregate data Reduced power No intra-individual variation 2.Repeated Measures ANOVA Reduced power Equal linear change Compound symmetry Class 1 Patient 1 Time 1 Time 2 Time 3 Patient 2 Time 1 Time 2 Time 3 Class 2 Patient 3 Time 1 Time 2 Time 3 Patient 4 Time 1 Time 2 Time 3 Level 2 Level 1 Class 2 Patient 3 Time 1 Time 2 Patient 4 Time 1 Time 2 Time 3 Level 2 Level 1 Class 1 Patient 1 Time 1 Time 2 Time 3 Patient 2 Time 1 Time

Advantages of MLM Flexibility in research design Different data collection schedules Varying number of waves Identify temporal patterns in the data Inclusion of time-varying predictors Interactions with time Effects that get smaller or larger over time

Example Dataset Longitudinal Study of American Youth (LSAY) N=1322 Caucasian and African-American students Change in mathematics achievement between grades At what rate does mathematics achievement increase over time? 2.Is the rate of increase related to student race, controlling for the effects of SES and gender?

How to Answer the Questions? 1.Exploratory analysis 2.Fit taxonomy of progressively more complex models a)Unconditional means model (not shown) b)Unconditional linear growth model c)Add race as level-2 predictor of initial status and rate of change in match achievement d)Add SES as level-2 control variable, testing impact on initial status and rate (does effect of race change?) e)Add gender as level-2 control variable,… 3.Select final model and plot prototypical trajectories 4.Residual analysis to evaluate tenability of assumptions

Multilevel Model for Change Level-1 model: Level-2 model: Composite model: structural stochastic

Level-1 Model Within-individual Intercept of individual i’s trajectory (initial status) Centred at a time 0 Math achievement at time 0 Slope of individual i’s trajectory (rate of change) Change in math achievement between each time point Deviations of individual i’s trajectory from linearity on occasion j (error term) ~N(0,σ 2 )

Level-2 Model Between-individual Population average intercept and slope for math achievement for reference group (Caucasian) Difference in population average intercept and slope for math achievement between African-American and Caucasian Difference between population average and individual i’s intercept and slope for math achievement, controlling for race

Level-2 Model Residuals Variance-covariance matrix Population variance in intercept, controlling for race Population variance in slope, controlling for race Population covariance between intercept and slope, controlling for race

Exploratory Analysis - OLS

SAS Syntax proc mixed data=lsay noclprint noinfo covtest method=ml; title 'Model A: Unconditional Linear Growth Model'; class lsayid; model math = grade_c / solution ddfm=bw notest; random intercept grade_c /subject=lsayid type=un; run;

Unconditional Linear Growth – Fixed Effects Solution for Fixed Effects EffectEstimateStandard Error DFt ValuePr > |t| Intercept <.0001 grade_c <.0001 Estimated math achievement in 7 th grade Estimated yearly rate of change in math achievement t-test for null H0 of no average change in achievement in the population

Unconditional Linear Growth – Random Effects Covariance Parameter Estimates Cov ParmSubjectEstimateStandard Error Z ValuePr Z UN(1,1)LSAYID <.0001 UN(2,1)LSAYID <.0001 UN(2,2)LSAYID <.0001 Residual <.0001 Estimated variance in intercept Estimated variance in slope Estimated variance in level-1 residuals Estimated covariance between intercept and slope

SAS Syntax proc mixed data=lsay noclprint noinfo covtest method=ml; title 'Model B: Adding the Effect of Race'; class lsayid; model math = grade_c aa aa*grade_c / solution ddfm=bw notest; random intercept grade_c /subject=lsayid type=un; run;

Adding the Effect of Race – Fixed Effects Solution for Fixed Effects EffectEstimateSEDFt ValuePr > |t| Intercept <.0001 grade_c <.0001 aa <.0001 grade_c*aa Estimated math achievement in 7 th grade for Caucasians Estimated yearly rate of change in math achievement for Caucasians Estimated difference in yearly rate of change in math achievement between Caucasian and AA Estimated difference in math achievement in 7 th grade between Caucasians and AA

Adding the Effects of Race – Random Effects Covariance Parameter Estimates Cov ParmSubjectEstimateSEZ ValuePr Z UN(1,1)LSAYID <.0001 UN(2,1)LSAYID <.0001 UN(2,2)LSAYID <.0001 Residual <.0001 Estimated variance in intercept, controlling for race Estimated variance in slope, controlling for race Estimated variance in level-1 residuals Estimated covariance between intercept and slope, controlling for race

SAS Syntax proc mixed data=lsay noclprint noinfo covtest method=ml; title 'Model B: Adding the Effect of Race'; class lsayid; model math = grade_c aa aa*grade_c ses ses*grade_c / solution ddfm=bw notest; random intercept grade_c /subject=lsayid type=un; run;

Adding the Effects of SES – Fixed Effects EffectEstimateSEDFt ValuePr > |t| Intercept <.0001 grade_c <.0001 aa <.0001 ses <.0001 grade_c*aa grade_c*ses Estimated math achievement in 7 th grade for Caucasians of average SES Estimated yearly rate of change in math achievement for Caucasians of average SES Estimated difference in yearly rate of change in math achievement between Caucasian and AA, controlling for SES Estimated difference in math achievement in 7 th grade between Caucasians and AA, controlling for SES Estimated effect of SES on average 7 th grade achievement, controlling for race Estimated effect of SES on rate of change of achievement, controlling for race

Adding the Effects of SES – Random Effects Cov ParmSubjectEstimateStandard Error Z ValuePr Z UN(1,1)LSAYID <.0001 UN(2,1)LSAYID <.0001 UN(2,2)LSAYID <.0001 Residual <.0001 Estimated variance in intercept, controlling for race and SES Estimated variance in slope, controlling for race and SES Estimated variance in level-1 residuals Estimated covariance between intercept and slope, controlling for race and SES

SAS Syntax proc mixed data=lsay noclprint noinfo covtest method=ml; title 'Model B: Adding the Effect of Race'; class lsayid; model math = grade_c aa aa*grade_c ses ses*grade_c / solution ddfm=bw notest; random intercept grade_c /subject=lsayid type=un; run;

Removing the Effect of Race on Rate of Change Solution for Fixed Effects EffectEstimateStandard Error DFt ValuePr > |t| Intercept <.0001 grade_c <.0001 aa <.0001 ses <.0001 grade_c*ses

SAS Syntax proc mixed data=lsay noclprint noinfo covtest method=ml; title 'Model B: Adding the Effect of Race'; class lsayid; model math = grade_c aa ses ses*grade_c female / solution ddfm=bw notest; random intercept grade_c /subject=lsayid type=un; run;

Final Model with Gender Solution for Fixed Effects EffectEstimateStandard Error DFt ValuePr > |t| Intercept <.0001 grade_c <.0001 aa <.0001 ses <.0001 female grade_c*ses

Goodness-of-Fit Model AModel BModel CModel DModel EModel F Deviance AIC BIC Deviance -2LL statistic Worse fit = larger -2LL Can be compared in nested models χ 2 distribution, df = difference in number of parameters AIC & BIC Can be used for non-nested models AIC corrects for number of parameters estimated BIC corrects for sample size and number of parameters, so larger improvement needed for larger samples

Presenting Results Ferro & Boyle. Journal of Pediatric Psychology 2013;38(4):425-37

Plotting Trajectories for Prototypical Individuals RaceSESInitial StatusRate of Change CaucasianLow (0)+3.616(-0.693)+0.818(1)= (-0.693)=2.534 CaucasianHigh (0)+3.616(0.735)+0.818(1)= (0.735)=3.098 AALow (1)+3.616(-0.693)+0.818(1)= (-0.693)=2.534 AAHigh (1)+3.616(0.735)+0.818(1)= (0.735)=3.098 Estimates of initial status and rate of change for Caucasian and African- American girls of high and low SES

Prototypical Trajectories

Assumptions & Evaluation Assumption 1.Level-1 growth model is linear 2.Level-2, relationship between predictors and intercept and slope is linear 3.Level-1 and level-2 residuals are normal and homoscedastic Evaluation 1.Examine empirical growth plots for evidence of linearity 2.Plot OLS estimates of growth parameters vs. each predictor 3.Standard diagnostics for level-1 and level-2