Angela B. Bradford School of Family Life

Slides:



Advertisements
Similar presentations
Autocorrelation and Heteroskedasticity
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Experimental Design, Response Surface Analysis, and Optimization
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Objectives (BPS chapter 24)
Latent Growth Curve Modeling In Mplus:
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Chapter Eighteen MEASURES OF ASSOCIATION
Clustered or Multilevel Data
“Ghost Chasing”: Demystifying Latent Variables and SEM
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Introduction to Multilevel Modeling Using SPSS
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Inference for regression - Simple linear regression
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
Time-Series Analysis and Forecasting – Part V To read at home.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Introduction Multilevel Analysis
Sampling Distributions & Standard Error Lesson 7.
Growth Curve Models Using Multilevel Modeling with SPSS David A. Kenny January 23, 2014.
Modelling non-independent random effects in multilevel models Harvey Goldstein and William Browne University of Bristol NCRM LEMMA 3.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Multivariate Analysis. One-way ANOVA Tests the difference in the means of 2 or more nominal groups Tests the difference in the means of 2 or more nominal.
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Tutorial I: Missing Value Analysis
Quality Control  Statistical Process Control (SPC)
More on regression Petter Mostad More on indicator variables If an independent variable is an indicator variable, cases where it is 1 will.
Introduction to Multilevel Analysis Presented by Vijay Pillai.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Forecast 2 Linear trend Forecast error Seasonal demand.
Module II Lecture 1: Multiple Regression
Chapter 13 Simple Linear Regression
1. According to ______ the larger the sample, the closer the sample mean is to the population mean. (p. 251) Murphy’s law the law of large numbers the.
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Inference for Least Squares Lines
REGRESSION DIAGNOSTIC III: AUTOCORRELATION
Linear Regression.
AP Statistics Chapter 14 Section 1.
Basic Estimation Techniques
Statistics: The Z score and the normal distribution
PCB 3043L - General Ecology Data Analysis.
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
Linear and generalized linear mixed effects models
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Simple Linear Regression - Introduction
CHAPTER 29: Multiple Regression*
Sample vs Population comparing mean and standard deviations
10701 / Machine Learning Today: - Cross validation,
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Multitrait Scaling and IRT: Part I
CH2 Time series.
Autoregressive and Growth Curve Models
Presentation transcript:

Angela B. Bradford School of Family Life A Gentle Introduction to Dynamic Structural Equation Modeling (DSEM) for Intensive Longitudinal Data Angela B. Bradford School of Family Life

DSEM Used to analyze Intensive Longitudinal Data (ILD) Combines multilevel modeling, time series modeling, structural equation modeling, and time varying effects modeling Goal is to parse out and model these types of correlations, giving a fuller picture of the dynamics of ILD Unlike cross-classified modeling (i.e., long format growth model), it allows you to regress a variable on: other variables at the same timepoint itself at a previous timepoint other variables at previous timepoint ILD- Data with many repeated measurements (e.g., daily diary, experience sampling, etc.) Multilevel modeling- based on correlations that are due to individual-specific effects Time series modeling- based on correlations that are due to proximity of observations SEM- based on correlations between different variables Time-varying effects- based on correlations due to the same stage of evolution Asparouhov, T., Hamaker, E.L. & Muthén, B. (2017). Dynamic structural equation models. Technical Report. Version 3. Submitted for publication. Retrieved from https://www.statmodel.com/TimeSeries.shtml.

DSEM (cont.) Models intra-individual changes over time on level 1 and allows the parameters of these changes to vary across individuals on level 2 using random effects Samples with many subjects (e.g., 200) and few time points (e.g., 20-50) perform better than those with few subjects and many time points (in terms of biased SE and power) . N=200 is best until the total number of observations are between 10000 and 20000, where T=100 becomes better. N=T is better than T=100 up until N=T=100 after which T=100 is better than N=T. Note that this pattern is in line with the conclusion that large N is most important since after 20000 total number of observations, the T=100 case has larger N than the N=200 case. Schultzberg, M. & Muthén, B. (2017). Number of subjects and time points needed for multilevel time series analysis: A simulation study of dynamic structural equation modeling. Accepted for publication in Structural Equation Modeling. 

Lag Create a lag variable Correlation between these is the autocorrelation (regress T on T-1) Observation Variable (T) Lagged Variable (T-1) 1 2.68 2 2.90 3 3.12 4 3.11 5 3.67 6 3.84 Can do this in Mplus

Between vs. Within Between part: individuals’ mean over time on a variable (i.e., baseline); in other words, the over-time mean for each person Within part: an estimate of individuals’ score over time; within-person centered or cluster-mean centered score (deviation from the mean over time) Lines are between-person, and then there are within-person fluctuations over time

Within- and between- person parts Within-person/cluster part: Modeled with first-order autoregressive model T regressed on T-1 Phi is the estimated parameter, ranging between 0 and 1, and is called “inertia.” The closer phi is to 1, the longer it takes to recover from a change from the individual’s mean. Between-person/cluster part: Average across all individuals You have the mean and the phi, modeled as a random effects

EXAMPLE Interpersonal neurobiology suggests that one’s physiology “catches” Therapists should have the most regulatory influence in the room, thereby helping clients regulate Research on empathy in the medical profession suggest that the most empathetic physicians have physiology that synchronizes with their clients So, does therapist and client lagged physiology predict therapist and client physiology?

T physio (t-1) T physio (t) W physio (t-1) W physio (t) H physio (t) Each of these physio variables actually represents 3 indicators of physio functioning (EDA, PEP, and HRV) H physio (t-1) H physio (t)

Mplus input (for multivariate model) These are the autocorrelations of t regressed on t-1.

These are the cross-lagged effects. These are correlations of variables at time t-1. These are the random slopes modeled above with the | symbol, correlated with the random mean at time t.

Intraclass correlation (Type= twolevel basic), calculated as: between variance/(between+within variance) How much variability is between clusters (in means) vs. within Variable Correlation   THRV7_23 0.27 TEDA4_23 0.07 TIMP6_23 0.29 WHRV7_23 0.51 WEDA4_23 0.02 WIMP6_23 0.38 HHRV7_23 0.56 HEDA4_23 0.05 HIMP6_23 0.46

Convergence and model fit Potential Scale Reduction (PSR) as close to 1.00 as possible Normal posterior parameter distribution Trace plots show convergence Model fit is assessed with DIC (and relative fit with ΔDIC). Lower is better. Unstable/Difficult to compute if latent variables are treated as parameters Comparing sample statistics to model-estimated quantities Do I want to do this slide?

Parameter trace plots Should look like this:

Mine look like this

Posterior parameter distributions Should look like this:

Mine look like this

So I should be running more iterations until I have better evidence of convergence. Note: My psr is 1.002, which is good, but mplus always give this warning: tech8 tells you what the psr is at each 100 iterations

Results

Means are fixed effects Variances are random effects (individual differences in carryover from time to time, in other words the fixed effects)

Means are fixed effects Variances are random effects (individual differences in carryover from time to time, in other words the fixed effects)

Standardized results Can request standardized results Mplus uses within-person standardization (as you would if it were time-series analysis) Add this line to the input: Output: Standardized(cluster); Residual(cluster); Standardizes by cluster and then gives average of those standardized parameters R-square tells you average across clusters Also gives within-cluster results

Thank you!