CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.

Slides:



Advertisements
Similar presentations
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Advertisements

Structural Equation Modeling. What is SEM Swiss Army Knife of Statistics Can replicate virtually any model from “canned” stats packages (some limitations.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Structural Equation Modeling
Hypothesis Testing Steps in Hypothesis Testing:
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Ch11 Curve Fitting Dr. Deshi Ye
Structural Equation Modeling
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
Structural Equation Modeling
Chapter 11 Multiple Regression.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 13 Using Inferential Statistics.
The General (LISREL) SEM model Ulf H. Olsson Professor of statistics.
Chapter 14 Inferential Data Analysis
Simple Linear Regression Analysis
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Relationships Among Variables
Lecture 5 Correlation and Regression
Item response modeling of paired comparison and ranking data.
MEASUREMENT MODELS. BASIC EQUATION x =  + e x = observed score  = true (latent) score: represents the score that would be obtained over many independent.
Confirmatory factor analysis
Introduction to CFA. LEARNING OBJECTIVES: Upon completing this chapter, you should be able to do the following: Distinguish between exploratory factor.
AM Recitation 2/10/11.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Chapter 13: Inference in Regression
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
Kayla Jordan D. Wayne Mitchell RStats Institute Missouri State University.
Confirmatory Factor Analysis Psych 818 DeShon. Purpose ● Takes factor analysis a few steps further. ● Impose theoretically interesting constraints on.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Tests and Measurements Intersession 2006.
Estimation Kline Chapter 7 (skip , appendices)
CJT 765: Structural Equation Modeling Highlights for Quiz 2.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Confirmatory Factor Analysis Psych 818 DeShon. Construct Validity: MTMM ● Assessed via convergent and divergent evidence ● Convergent – Measures of the.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Chapter 13 Multiple Regression
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
MEASUREMENT. MeasurementThe assignment of numbers to observed phenomena according to certain rules. Rules of CorrespondenceDefines measurement in a given.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Structural Equation Modeling Mgmt 291 Lecture 3 – CFA and Hybrid Models Oct. 12, 2009.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Estimation Kline Chapter 7 (skip , appendices)
ALISON BOWLING CONFIRMATORY FACTOR ANALYSIS. REVIEW OF EFA Exploratory Factor Analysis (EFA) Explores the data All measured variables are related to every.
Evaluation of structural equation models Hans Baumgartner Penn State University.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
CJT 765: Structural Equation Modeling Class 9: Putting it All Together.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
Estimating standard error using bootstrap
Advanced Statistical Methods: Continuous Variables
CJT 765: Structural Equation Modeling
BPK 304W Correlation.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Simple Linear Regression and Correlation
Confirmatory Factor Analysis
Testing Causal Hypotheses
Presentation transcript:

CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis

Outline of Class Finishing up Model Testing Issues Finishing up Model Testing Issues Confirmatory Factor Analysis Confirmatory Factor Analysis Recent Readings Recent Readings

Comparison of Models Hierarchical Models: Hierarchical Models: Difference of χ 2 testDifference of χ 2 test Non-hierarchical Models: Non-hierarchical Models: Compare model fit indicesCompare model fit indices

Model Respecification Model trimming and building Model trimming and building Empirical vs. theoretical respecification Empirical vs. theoretical respecification Consider equivalent models Consider equivalent models

Sample Size Guidelines Small (under 100), Medium ( ), Large (200+) [try for medium, large better] Small (under 100), Medium ( ), Large (200+) [try for medium, large better] Models with 1-2 df may require samples of thousands for model-level power of.8. Models with 1-2 df may require samples of thousands for model-level power of.8. When df=10 may only need n of for model level power of.8. When df=10 may only need n of for model level power of.8. When df > 20 may only need n of 200 for power of.8 When df > 20 may only need n of 200 for power of.8 20:1 is ideal ratio for # cases/# free parameters, 10:1 is ok, less than 5:1 is almost certainly problematic 20:1 is ideal ratio for # cases/# free parameters, 10:1 is ok, less than 5:1 is almost certainly problematic For regression, N > m for overall R 2, with m = # IVs and N > m for individual predictors For regression, N > m for overall R 2, with m = # IVs and N > m for individual predictors

Statistical Power Use power analysis tables from Cohen to assess power of specific detecting path coefficient. Use power analysis tables from Cohen to assess power of specific detecting path coefficient. Saris & Satorra: use χ 2 difference test using predicted covariance matrix compared to one with that path = 0 Saris & Satorra: use χ 2 difference test using predicted covariance matrix compared to one with that path = 0 McCallum et al. (1996) based on RMSEA and chi-square distribution for close fit, not close fit and exact fit McCallum et al. (1996) based on RMSEA and chi-square distribution for close fit, not close fit and exact fit Small number of computer programs that calculate power for SEM at this point Small number of computer programs that calculate power for SEM at this point

Power Analysis for testing DATA-MODEL fit  H 0 : ε 0 ≥ 0.05 The Null hypothesis: The data-model fit is unacceptable The Null hypothesis: The data-model fit is unacceptable  H 1 : ε 1 < 0.05 The Alternative hypothesis: The data-model fit is acceptable The Alternative hypothesis: The data-model fit is acceptable If RMSEA from the model fit is less than 0.05, then the null hypothesis containing unacceptable population data-model fit is rejected If RMSEA from the model fit is less than 0.05, then the null hypothesis containing unacceptable population data-model fit is rejected

Post Hoc Power Analysis for testing Data-Model fit  If ε 1 is close to 0  Power increases  If N (sample size) increases  Power increases  If df ( degree of freedom) increases  Power increases

Post Hoc Power Analysis for testing Data-Model fit Examples Using Appendix B calculate power for ε 1 =0.02, df=55, N=400  Power ? for ε 1 =0.02, df=55, N=400  Power ? for ε 1 =0.04, df=30, N=400  Power ? for ε 1 =0.04, df=30, N=400  Power ?

Factor Analysis Single Measure in Path Analysis  Measurement error is higher Multiple Measures in Factor Analysis correspond to some type of HYPOTHETICAL CONSTRUCT  Reduce the overall effect of measurement error

Latent Construct Theory guides through the scale development process (DeVellis,1991; Jackson, 1970) Theory guides through the scale development process (DeVellis,1991; Jackson, 1970) Unidimensional vs Multidimensional constuct Unidimensional vs Multidimensional constuct Reliability and Validity of construct Reliability and Validity of construct

Reliability - consistency, precision, repeatability Reliability concerns with RANDOM ERROR Types of reliability:  test-retest  alternate form  interrater  split-half and internal consistency

Validity of construct Validity of construct 4 types of validity content content criterion-related criterion-related convergent and discriminant convergent and discriminant construct construct

Factor analysis Indicators: continuous Indicators: continuous Measurement error are independent of each other and of the factors Measurement error are independent of each other and of the factors All associations between the factors are unanalyzed All associations between the factors are unanalyzed

Identification of CFA Can estimate v*(v+1)/2 of parameters Can estimate v*(v+1)/2 of parameters Necessary Necessary # of free parameters <= # of observations# of free parameters <= # of observations Every latent variable should be scaledEvery latent variable should be scaled

Additional: fix the unstandardized residual path of the error to 1. (assign a scale of the unique variance of its indicator) Scaling factor: constrain one of the factor loadings to 1 ( that variables called – reference variable, the factor has a scale related to the explained variance of the reference variable) OR fix factor variance to a constant ( ex. 1), so all factor loadings are free parameters Both methods of scaling result in the same overall fit of the model

Identification of CFA Sufficient : Sufficient : At least three (3) indicators per factor to make the model identifiedAt least three (3) indicators per factor to make the model identified Two-indicator rule – prone to estimation problems (esp. with small sample size)Two-indicator rule – prone to estimation problems (esp. with small sample size)

Interpretation of the estimates Unstandardized solution Unstandardized solution Factor loadings =unstandardized regression coefficient Factor loadings =unstandardized regression coefficient Unanalyzed association between factors or errors= covariances Unanalyzed association between factors or errors= covariances Standardized solutionStandardized solution Unanalyzed association between factors or errors= correlations Unanalyzed association between factors or errors= correlations Factor loadings =standardized regression coefficient Factor loadings =standardized regression coefficient ( structure coefficient) ( structure coefficient) The square of the factor loadings = the proportion of the explained ( common) indicator variance, R 2 (squared multiple correlation) The square of the factor loadings = the proportion of the explained ( common) indicator variance, R 2 (squared multiple correlation)

Problems in estimation of CFA Heywood cases – negative variance estimated or correlations > 1. Heywood cases – negative variance estimated or correlations > 1. Ratio of the sample size to the free parameters – 10:1 ( better 20:1) Ratio of the sample size to the free parameters – 10:1 ( better 20:1) Nonnormality – affects ML estimation Nonnormality – affects ML estimation Suggestions by March and Hau(1999)when sample size is small: indicators with high standardized loadings( >0.6) indicators with high standardized loadings( >0.6) constrain the factor loadings constrain the factor loadings

Testing CFA models Test for a single factor with the theory or not Test for a single factor with the theory or not If reject H 0 of good fit - try two-factor model… If reject H 0 of good fit - try two-factor model… Since one-factor model is restricted version of the two -factor model, then compare one- factor model to two-factor model using Chi- square test. If the Chi-square is significant – then the 2-factor model is better than 1-factor model. Since one-factor model is restricted version of the two -factor model, then compare one- factor model to two-factor model using Chi- square test. If the Chi-square is significant – then the 2-factor model is better than 1-factor model. Check R 2 of the unexplained variance of the indicators. Check R 2 of the unexplained variance of the indicators.

Respecification of CFA IF IF lower factor loadings of the indicator (standardized<=0.2) lower factor loadings of the indicator (standardized<=0.2) High loading on more than one factor High loading on more than one factor High correlation of the residuals High correlation of the residuals High factor correlation High factor correlation THEN THEN Specify that indicator on a different factor Specify that indicator on a different factor Allow to load on one more than one factor Allow to load on one more than one factor (multidimensional vs unidimensional) (multidimensional vs unidimensional) Allow error measurements to covary Allow error measurements to covary Too many factors specified Too many factors specified

Other tests Indicators: Indicators: congeneric – measure the same construct congeneric – measure the same construct if model fits, then if model fits, then -tau-equivalent – constrain all unstandardized loadings to 1 -tau-equivalent – constrain all unstandardized loadings to 1 if model fit, then - parallelism – equality of error variances  All these can be tested by χ 2 difference test

Nonnormal distributions Normalize with transformations Normalize with transformations Use corrected normal theory method, e.g. use robust standard errors and corrected test statistics, ( Satorra-Bentler statistics) Use corrected normal theory method, e.g. use robust standard errors and corrected test statistics, ( Satorra-Bentler statistics) Use Asymptotic distribution free or arbitrary distribution function (ADF) - no distribution assumption - Need large sample Use Asymptotic distribution free or arbitrary distribution function (ADF) - no distribution assumption - Need large sample Use elliptical distribution theory – need only symmetric distribution Use elliptical distribution theory – need only symmetric distribution Mean-adjusted weighted least squares (MLSW) and variance-adjusted weighted least square (VLSW) - MPLUS with categorical indicators Mean-adjusted weighted least squares (MLSW) and variance-adjusted weighted least square (VLSW) - MPLUS with categorical indicators Use normal theory with nonparametric bootstrapping Use normal theory with nonparametric bootstrapping

Remedies to nonnormality Use a parcel which is a linear composite of the discrete scores, as continuous indicators Use a parcel which is a linear composite of the discrete scores, as continuous indicators Use parceling,when underlying factor is unidimentional. Use parceling,when underlying factor is unidimentional.

Noar Use of CFA in scale development Use of CFA in scale development Test of multiple factor models Test of multiple factor models

Quilty, Oakman and Risko “Correlates of the Rosenberg Self-Esteem Scale Method Effects” Multi-Trait, Multi-Method Multi-Trait, Multi-Method Comparison of Correlated Trait- Correlated Method versus Comparison of Correlated Trait- Correlated Method versus Correlated Uniqueness Models Correlated Uniqueness Models