Plausible Values of Latent Variables: A Useful Approach of Data Reduction for Outcome Measures in Pediatric Studies Jichuan Wang, Ph.D. Children’s National.

Slides:



Advertisements
Similar presentations
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Advertisements

SEM PURPOSE Model phenomena from observed or theoretical stances
Lesson 10: Linear Regression and Correlation
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Structural Equation Modeling
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Latent Growth Curve Modeling In Mplus:
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
GRA 6020 Multivariate Statistics The regression model OLS Regression Ulf H. Olsson Professor of Statistics.
Clustered or Multilevel Data
“Ghost Chasing”: Demystifying Latent Variables and SEM
Structural Equation Modeling
Before doing comparative research with SEM … Prof. Jarosław Górniak Institute of Sociology Jagiellonian University Krakow.
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Multiple Regression Research Methods and Statistics.
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
G Lecture 111 SEM analogue of General Linear Model Fitting structure of mean vector in SEM Numerical Example Growth models in SEM Willett and Sayer.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Simple Linear Regression Analysis
Relationships Among Variables
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Mixture Modeling Chongming Yang Research Support Center FHSS College.
Factor Analysis Psy 524 Ainsworth.
Advantages of Multivariate Analysis Close resemblance to how the researcher thinks. Close resemblance to how the researcher thinks. Easy visualisation.
The emotional distress of children with cancer in China: An Item Response Analysis of C-Ped-PROMIS Anxiety and Depression Short Forms Yanyan Liu 1, Changrong.
Measurement Invariance of the Pediatric Version of PROMIS Symptom Scales between American and Chinese Children with Cancer Changrong Yuan 1,* , Pamela.
SEM: Basics Byrne Chapter 1 Tabachnick SEM
ROB CRIBBIE QUANTITATIVE METHODS PROGRAM – DEPARTMENT OF PSYCHOLOGY COORDINATOR - STATISTICAL CONSULTING SERVICE COURSE MATERIALS AVAILABLE AT:
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
1 G Lect 7M Statistical power for regression Statistical interaction G Multiple Regression Week 7 (Monday)
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
CJT 765: Structural Equation Modeling Class 12: Wrap Up: Latent Growth Models, Pitfalls, Critique and Future Directions for SEM.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
G Lecture 81 Comparing Measurement Models across Groups Reducing Bias with Hybrid Models Setting the Scale of Latent Variables Thinking about Hybrid.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
G Lecture 91 Measurement Error Models Bias due to measurement error Adjusting for bias with structural equation models Examples Alternative models.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
ALISON BOWLING STRUCTURAL EQUATION MODELLING. WHAT IS SEM? Structural equation modelling is a collection of statistical techniques that allow a set of.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
CJT 765: Structural Equation Modeling Final Lecture: Multiple-Group Models, a Word about Latent Growth Models, Pitfalls, Critique and Future Directions.
Latent Class/Profile Analysis on Symptom Clusters in Pediatric Studies Jichuan Wang, Ph.D Shana Jacobs, MD Catriona Mowbray, PhD,RN Johanna Menard, FNP.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Approaches of Model Specification for Testing Factor Mean Differences in Multi-group Higher-order CFA Model Jichuan Wang, Ph.D. Children’s National Medical.
Structural Equation Modeling using MPlus
Multiple Imputation using SOLAS for Missing Data Analysis
Simple Linear Regression
Structural Equation Modeling (SEM) With Latent Variables
Structural Equation Modeling
Presentation transcript:

Plausible Values of Latent Variables: A Useful Approach of Data Reduction for Outcome Measures in Pediatric Studies Jichuan Wang, Ph.D. Children’s National Health System The George Washington University School of Medicine Washington, DC 1

2 Abstract Multi-item scales are widely used to measure outcomes in pediatric studies. The often used data reduction approaches are total scale scores or estimated factor or IRT scores. However, the total score does not take into account of measurement errors; and using factor scores or IRT scores as dependent variables in secondary analysis gives biased slope coefficients. This presentation introduces a better approach -- plausible values of latent variables -- for data reduction. Real data are used to demonstrate how to estimate and apply plausible values to analyze multi-item outcome measures.

Multi-item outcome measures Mental disorders Quality of life Symptom domains Functional domains … Challenge in using multi-item scales Too many variables are involved in a model, thus data reduction is needed. Measures of outcome scales often used in data analysis Total scores. Factor scores, IRT scores, or latent class membership (categorical latent variable). Plausible values of latent variables (continuous or categorical). 3

What are plausible values of latent variable? A set of generated values of latent variables using multiple imputations. Why using plausible values of latent variables? Total scores do not take into account of measurement errors. Factor analysis model often encounters estimation problem due to too many indicator (items) variables in a structural equation model. Using factor scores as dependent variables in secondary analysis gives biased slope coefficient estimates (Skrondal & Laake, 2001). Using plausible values can alleviate the bias. 4

How to use plausible values for secondary analysis Save the plausible values as “observed” variables and merge them with the original data set. When using plausible values in secondary statistical analysis, multiple data sets (e.g., 5) are needed just like multiple imputation (MI) data analysis using Rubin’s (1987) method. 5

Demonstration Sample: Drug users (N=303) in Changsha, China recruited using RDS, 2012 Outcome measures: BSI-18 Somatization: 6 items Depression: 6 items Anxiety: 6 items Predictors in SEM model: Age Education Marital status Employment status Meth use in the past 30 days 6

y1y1 y4y4 y7y7 y 13 y 10 y 16 SOM y5y5 y2y2 y8y8 y 14 y 11 y 17 DEP y3y3 y6y6 y9y9 y 15 y 12 y 18 ANX Figure 1. 3-factor CFA for BIS-18: SOM: Somatization; DEP: Depression; ANX: Anxiety 7

Figure 2. SEM model 8

9 WARNING: THE RESIDUAL COVARIANCE MATRIX (THETA) IS NOT POSITIVE DEFINITE. THIS COULD INDICATE A NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR AN OBSERVED VARIABLE, A CORRELATION GREATER OR EQUAL TO ONE BETWEEN TWO OBSERVED VARIABLES, OR A LINEAR DEPENDENCY AMONG MORE THAN TWO OBSERVEDVARIABLES. CHECK THE RESULTS SECTION FOR MORE INFORMATION.PROBLEM INVOLVING VARIABLE Y10.

SOM DEP ANX Meth30 Age Education Marital Status Employment Figure 3. Path analysis model: SOM, DEP, and ANX are either total score or traditional point estimates of latent variables in CFA or IRT. 10

Cross-loadings are specified; error covariance could be specified as well. Non-informative priors using a normal distribution with a mean of zero and a small variance (Muthén and Asparouhov, 2012). Model fit: Posterior Predictive P-Value (PPP) = Five sets of plausible values are imputed for each latent variable and saved for further analysis. 11

SOM DEP ANX Meth30 Age Education Marital Status Employment Figure 5. Example of path analysis model using five plausible values data sets * * * * *

13 References Asparouhov, T. & Muthén, B Plausible values for latent variables using Mplus.Technical Report. Mislevy, R. J Randomization-based inference about latent variables from complex samples. Psychometrika, 56, Muthen, B. & Asparouhov, T Bayesian SEM: A more flexible representation of substantive theory. Psychological Methods, 17, Rubin, D Multiple Imputation for Nonresponse in Survey. New York: Wiley. Skrondal, A. and Laake, P Regression among factor scores. Psychometrika, 66,