Testing factorial invariance in multilevel data: A Monte Carlo study Eun Sook Kim Oi-man Kwok Myeongsun Yoon.

Slides:



Advertisements
Similar presentations
Multilevel Models with Latent Variables Daniel J. Bauer Department of Psychology University of North Carolina 9/13/04 SAMSI Workshop.
Advertisements

Structural Equation Modeling
V.2 N.K. Bowen, GROUP 1 CONFIGURAL INVARIANCE AS TESTED WITH MPLUS INVARIANCE SHORTCUT (WLSMV) Loadings (except first) and thresholds free  = factor.
Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University of Surrey.
RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
IRT Equating Kolen & Brennan, IRT If data used fit the assumptions of the IRT model and good parameter estimates are obtained, we can estimate person.
Exploring the Full-Information Bifactor Model in Vertical Scaling With Construct Shift Ying Li and Robert W. Lissitz.
The DIF-Free-Then-DIF Strategy for the Assessment of Differential Item Functioning 1.
ASVAB: E Pluribus Unum? Martin J. Ippel, Ph.D. CogniMetrics Inc, San Antonio,TX Steven E. Watson, Ph.D. U.S. Navy Selection & Classification (CNO 132)
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
How to evaluate the cross-cultural equivalence of single items Melanie Revilla, Willem Saris RECSM, UPF Zurich – 15/16 July.
“Ghost Chasing”: Demystifying Latent Variables and SEM
David Kaplan & Heidi Sweetman University of Delaware Two Methodological Perspectives on the Development of Mathematical Competencies in Young Children:
Before doing comparative research with SEM … Prof. Jarosław Górniak Institute of Sociology Jagiellonian University Krakow.
Analyzing observed composite differences across groups: Is partial measurement invariance enough? Holger Steinmetz Faculty of Economics and Business Administration.
© UCLES 2013 Assessing the Fit of IRT Models in Language Testing Muhammad Naveed Khalid Ardeshir Geranpayeh.
Item response modeling of paired comparison and ranking data.
Measurement, Control, and Stability of Multiple Response Styles Using Reverse Coded Items Eric Tomlinson Daniel Bolt University of Wisconsin-Madison Ideas.
Chapter 1: Introduction to Statistics
Multiple Sample Models James G. Anderson, Ph.D. Purdue University.
Kayla Jordan D. Wayne Mitchell RStats Institute Missouri State University.
Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.
Sampling and Nested Data in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
2 nd Order CFA Byrne Chapter 5. 2 nd Order Models The idea of a 2 nd order model (sometimes called a bi-factor model) is: – You have some latent variables.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Multi-sample structural equation models with mean structures Hans Baumgartner Penn State University.
Rasch trees: A new method for detecting differential item functioning in the Rasch model Carolin Strobl Julia Kopf Achim Zeileis.
Investigating Faking Using a Multilevel Logistic Regression Approach to Measuring Person Fit.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
IRT Model Misspecification and Metric Consequences Sora Lee Sien Deng Daniel Bolt Dept of Educational Psychology University of Wisconsin, Madison.
Measurement Bias Detection Through Factor Analysis Barendse, M. T., Oort, F. J. Werner, C. S., Ligtvoet, R., Schermelleh-Engel, K.
All Hands Meeting 2005 The Family of Reliability Coefficients Gregory G. Brown VASDHS/UCSD.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Confirmatory Factor Analysis Psych 818 DeShon. Construct Validity: MTMM ● Assessed via convergent and divergent evidence ● Convergent – Measures of the.
Controlling for Common Method Variance in PLS Analysis: The Measured Latent Marker Variable Approach Wynne W. Chin Jason Bennett Thatcher Ryan T. Wright.
Evaluating Impacts of MSP Grants Hilary Rhodes, PhD Ellen Bobronnikov February 22, 2010 Common Issues and Recommendations.
Multigroup Models Byrne Chapter 7 Brown Chapter 7.
Multigroup Models Beaujean Chapter 4 Brown Chapter 7.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
G Lecture 81 Comparing Measurement Models across Groups Reducing Bias with Hybrid Models Setting the Scale of Latent Variables Thinking about Hybrid.
CFA: Basics Beaujean Chapter 3. Other readings Kline 9 – a good reference, but lumps this entire section into one chapter.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Evaluating Impacts of MSP Grants Ellen Bobronnikov Hilary Rhodes January 11, 2010 Common Issues and Recommendations.
MTMM Byrne Chapter 10. MTMM Multi-trait multi-method – Traits – the latent factors you are trying to measure – Methods – the way you are measuring the.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Sampling and Nested Data in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine.
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Developmental Models: Latent Growth Models Brad Verhulst & Lindon Eaves.
CJT 765: Structural Equation Modeling Final Lecture: Multiple-Group Models, a Word about Latent Growth Models, Pitfalls, Critique and Future Directions.
Multivariate Statistics Latent Growth Curve Modelling. Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University.
The Invariance of the easyCBM® Mathematics Measures Across Educational Setting, Language, and Ethnic Groups Joseph F. Nese, Daniel Anderson, and Gerald.
Project VIABLE - Direct Behavior Rating: Evaluating Behaviors with Positive and Negative Definitions Rose Jaffery 1, Albee T. Ongusco 3, Amy M. Briesch.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
IRT Equating Kolen & Brennan, 2004 & 2014 EPSY
Effects of unequal indicator intercepts on manifest composite differences Holger Steinmetz and Peter Schmidt University of Giessen / Germany.
Inferential Statistics Psych 231: Research Methods in Psychology.
CFA with Categorical Outcomes Psych DeShon.
Multiple Regression.
Using Multilevel Modeling in Institutional Research
Approaches of Model Specification for Testing Factor Mean Differences in Multi-group Higher-order CFA Model Jichuan Wang, Ph.D. Children’s National Medical.
Correlation, Regression & Nested Models
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
CJT 765: Structural Equation Modeling
The Trait Model of Change
2 independent Groups Graziano & Raulin (1997).
Multiple Regression.
Holger Steinmetz Dep. of Work and Organizational Psychology
Sean C. Wright Illinois Institute of Technology, APTMetrics
Sun-Joo CHO Allan COHEN Brian BOTTGE
Presentation transcript:

Testing factorial invariance in multilevel data: A Monte Carlo study Eun Sook Kim Oi-man Kwok Myeongsun Yoon

The purpose of the present study  Investigate the influence of data dependency on the test of factorial invariance in multilevel data.  Comparisons between multilevel SEM (the model- based or the designed-based multiple-group multilevel CFA) and multiple-group CFA

Measurement invariance studies in multilevel data  Multilevel SEM  The groups of interest are treated as random samples.  Good for measurement invariance testing in individual and organizational levels.  Multiple-group CFA  The number of groups is small or relatively finite (e.g., <20 or so).  Can be utilized to test measurement invariance over groups at the organizational level.

Multilevel measurement model  A conventional measurement model  A multilevel measurement model

In a multilevel measurement model  The variance of the factor  The variance of residual  The variance of observed scores

Examine measurement invariance in multilevel data  The general multilevel CFA model

Examine measurement invariance in multilevel data  Test at the between level  Test at the within level Study 1 Study 2

Data generation

Design  Noninvariance in the organizational (study 1) and the individual level (study 2).  Manipulated factors  The number of clusters (CN):30, 50, 80 and 160  Cluster size (CS): 10 and 20  The size of ICC:.09,.20, and.33  The total sample size was between 300 and 3,200, and 1,000 replications were in a condition.

 Data dependency on group membership:  Intraclass correlation (ICC)

Analysis  Tested a configural and a metric models.  The LRT was used to test factorial invariance in multiple-group CFA.  The Satorra-Bentler scaled chi-squared test was used in multiple-group multilevel CFA.  Examined power and type I error.  Conducted ANOVA to examine the influence of 3 manipulated factors on power and type I error.

Results (Study 1) Low admissible rates.

Analysis in study 2  Examined factorial invariance at the individual level using the design-based multilevel CFA, instead of the model-based one, due to the limitation of the SEM software.

Results (study 2)

Limitations and conclusions  The explanations of the discrepant findings between study 1 and study 2 might be confounded by the type of analysis.  It is necessary to take account of the data dependency in multilevel data.  A substantially large number of clusters is required in model-based multiple-group multilevel modeling.

Comments  Information of the accuracy of estimated parameters is not available.  MLR provides accurate factor loadings but underestimates the residual variances and standard errors at the between level. Having more clusters or higher ICC does not effectively compensate for this.  Groups of interest might occur at different levels simultaneously, and how to model them?

Comments (Cont.)  It is not clear how the model was constrained. This study seemed to constrain the variance of the latent factor to be equal across groups, which is not practical and reasonable in real data.  The large effect size in the factorial invariance with a large sample size yielded very high power, and it is recommended reducing the effect size to medium.

Comments (Cont.)  Contrast to the findings of Jones-Farmer (2010), the inflated type I error occurred in CFA even though data dependency was considered small. Further examinations are needed to understand the controversial findings.

Future studies  Further examinations in  Scalar and error variance models  Cross-level invariance  The issue of sample sizes using other estimation methods  In MLR, the min. sample size of the between level is 100. How about in FIML?  The assessment of model fit in ML SEM.

Future studies (Cont.)  Examine more than 1 items with different factor loadings.  Implement purification in this study.  Conduct equal means for the latent factor or equal intercepts of indicators to examine factorial invariance.  Examine scalar invariance to see if the findings are similar to the findings of French and Finch (2010).

Future studies (Cont.)  The possibility to develop a new indicator in SEM, such as adding person fit (as IRT approaches do) to SEM approaches, instead of model fit only.  Conduct the model-based multilevel CFA using other software to overcome the limitation in current SEM software.