1 All about variable selection in factor analysis and structural equation modeling Yutaka Kano Osaka University School of Human Sciences IMPS2001, July.

Slides:



Advertisements
Similar presentations
Autocorrelation and Heteroskedasticity
Advertisements

Structural Equation Modeling
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Outline 1) Objectives 2) Model representation 3) Assumptions 4) Data type requirement 5) Steps for solving problem 6) A hypothetical example Path Analysis.
Structural Equation Modeling
Chapter 4 Validity.
Psychology 202b Advanced Psychological Statistics, II April 7, 2011.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Factor Analysis Ulf H. Olsson Professor of Statistics.
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
Structural Equation Modeling
Class 2: Tues., Sept. 14th Correlation (2.2) Introduction to Measurement Theory: –Reliability of measurements and correlation –Example that demonstrates.
Further Inference in the Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Business Statistics - QBM117 Least squares regression.
Winnie mucherah ball state university FOUNDATIONS OF QUALITY RESEARCH DESIGN: RELIABILITY & VALIDITY.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Factor Analysis Psy 524 Ainsworth.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
MEASUREMENT MODELS. BASIC EQUATION x =  + e x = observed score  = true (latent) score: represents the score that would be obtained over many independent.
Data validation for use in SEM
1 Use of SEM programs to precisely measure scale reliability Yutaka Kano and Yukari Azuma Osaka University IMPS2001, July 15-19,2001 Osaka, Japan.
Statistics for Education Research Lecture 10 Reliability & Validity Instructor: Dr. Tung-hsien He
Kayla Jordan D. Wayne Mitchell RStats Institute Missouri State University.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
1 Variable selection for factor analysis and structural equation models Yutaka Kano & Akira Harada Osaka University International Symposium on Structural.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Factor validation of the Consideration of Future Consequences Scale: An Assessment and Review Tom R. EikebrokkEllen K. NyhusUniversity of Agder.
1 Exploratory & Confirmatory Factor Analysis Alan C. Acock OSU Summer Institute, 2009.
By Cao Hao Thi - Fredric W. Swierczek
Giovanna Brancato, Giorgia Simeoni Istat, Italy European Conference on Quality in Official Statistics – Q2008, Rome, 8-11 July 2008 Modelling Survey Quality.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Data preparation for use in SEM Ned Kock. Data in table format Each column corresponds to a manifest variable. Some groups of columns correspond to a.
Factor Analysis Revealing the correlational structure among variables Understanding & Reducing Complexity.
Factor Analysis ( 因素分析 ) Kaiping Grace Yao National Taiwan University
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
G Lecture 7 Confirmatory Factor Analysis
Academic Research Academic Research Dr Kishor Bhanushali M
MOI UNIVERSITY SCHOOL OF BUSINESS AND ECONOMICS CONCEPT MEASUREMENT, SCALING, VALIDITY AND RELIABILITY BY MUGAMBI G.K. M’NCHEBERE EMBA NAIROBI RESEARCH.
SOCW 671: #5 Measurement Levels, Reliability, Validity, & Classic Measurement Theory.
Research Report. Introduction Introduce the research problem Introduce the research problem Why is the study important and to whom Why is the study important.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
ALISON BOWLING CONFIRMATORY FACTOR ANALYSIS. REVIEW OF EFA Exploratory Factor Analysis (EFA) Explores the data All measured variables are related to every.
Advanced Statistics Factor Analysis, I. Introduction Factor analysis is a statistical technique about the relation between: (a)observed variables (X i.
This is a mess... How the hell can I validate the consumer behaviors’ scales of my SEM model? Maria Pujol-Jover 1, Irene Esteban-Millat 1 1 Marketing Research.
FACTOR ANALYSIS 1. What is Factor Analysis (FA)? Method of data reduction o take many variables and explain them with a few “factors” or “components”
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Principal Component Analysis
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
FACTOR ANALYSIS & SPSS. First, let’s check the reliability of the scale Go to Analyze, Scale and Reliability analysis.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
CFA: Basics Byrne Chapter 3 Brown Chapter 3 (40-53)
FACTOR ANALYSIS & SPSS.
Classical Test Theory Margaret Wu.
Chapter 5 STATISTICS (PART 4).
Two-Variable Regression Model: The Problem of Estimation
LESSON 24: INFERENCES USING REGRESSION
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Data validation for use in SEM
Structural Equation Modeling (SEM) With Latent Variables
Chapter 8 VALIDITY AND RELIABILITY
Regression Part II.
Presentation transcript:

1 All about variable selection in factor analysis and structural equation modeling Yutaka Kano Osaka University School of Human Sciences IMPS2001, July 15-19,2001 Osaka, Japan

2 Today ’ s talk Motivation for variable selection How SEFA (and SCoFA) works Derivation of the statistics Theoretical property What does variable selection with model fit mean? Summary

3 Needs for variable selection Variable selection in EFA is an important but time-consuming process Composite scale construction Reliability analysis Variable selection in SEM should be less important but … Indicator selection Improvement of model fit

4 Recent literature Little et. al. (1999). On selecting indicators for multivariate measurement and modeling with latent variables. Psychological Methods, 4, Fabrigar et. al. (1999). Evaluating the use of EFA in psychological research. Psychological Methods, 4, Kano et. al. (in press, 2000, 1994).

5 Procedures for variable selection in EFA Usual procedure Magnitude of communalities Interpretability Towards simple structures Our approach Model fit

6 Programs for variable selection in factor analysis Exploratory analysis SEFA(Stepwise variable selection in EFA) u.ac.jp/~harada/sefa2001/stepwise/ u.ac.jp/~harada/sefa2001/stepwise/ Confirmatory analysis SCoFA(Stepwise Confirmatory FA) u.ac.jp/~harada/scofa/input.html u.ac.jp/~harada/scofa/input.html

7 Example_1 A questionnaire on perception on physical exercise n=653, p=15, one-factor model Data was collected by Dr Oka (Waseda U.) Conclusion Remove X2, X9, X13, X14

8 Example_2

9 Example_3

10 Example_4

11 Example_5

12 Example_6

13 SCoFa: 24 Pschological variable

14 Original Model (p=24)

15 Theory of SEFA and SCoFA Obtain estimates for a current model Construct predicted chi-square for each one-variable-deleted model using the estimates, without tedious iterations Take a sort of LM approach

16 Known quantities and goal_1

17 Known quantities and goal_2

18 Basic idea We construct T 02’ as LM test

19 Final formula for T2 Note: This is Browne’s (Browne 1982) statistic of goodness-of-fit using general estimates

20 Properties_1

21 Question 1 Can T 2 work even if X1 is inconsistent? Estimate for Θ is biased.

22 Properties_2

23 Question 2 Can SEFA identify an uncorrelated variable? Unfortunately, no We have developed a way of testing zero communality in SEFA (see Harada-Kano, IMPS)

24 Question 3 What is the actual meaning of variable selection with model fit? The following shows an illustrative example:

25 Answer 3_1: Example again X2, X9, X13, X14 are to be removed

26 Answer 3_2: Example again Best fitted model with correlated errors SEFA conclusion: X2, X9, X13, X14 are to be removed

27 Answer 3_3: Example again Variables to be deleted are identified so as to break up the correlated errors Correlated errors may cause Different interpretation of FA results Common factors considered are not enough to explain correlations between observed variables Such variables are not good indicators (e.g., in SEM) Inaccurate reliability estimates Green-Hershberger (2000), Raykov (2001) Kano-Azuma (2001, IMPS)

28 Question 4 How one should do if SEFA or SCoFA identifies a variable with large factor loading estimate as inconsistent?

29 Answer 4_1: Reliability If one employs the alpha coefficient or (s)he has to delete it to have a good-fit model.

30 Answer 4_2: Reliability If one employs (s)he can remain it, and compare reliability between models.

31 Answer 4_3: Example ρ' 0.64 α 0.74 Bad-fitted One-factor Model based ρ 0.76

32 Answer 4_4: Example ρ' α

33 Answer 4_5: Example ρ' α

34 Summary_1 A new option for variable selection was introduced, which is based on model fit. You can easily access the programs on the internet SEFA(Stepwise variable selection in EFA) u.ac.jp/~harada/sefa2001/stepwise/ u.ac.jp/~harada/sefa2001/stepwise/ SCoFA(Stepwise Confirmatory FA) u.ac.jp/~harada/scofa/input.html u.ac.jp/~harada/scofa/input.html

35 Summary_2 It enjoys preferable theoretical properties Testing null communality is important Uncorrelated variables cannot be identified Variable selection with model fit can find out error correlations Traditional reliability coefficients based on a poor-fit model have serious bias

36 Summary_3 High communality variables can be inconsistent Whether such variables should be removed depends Reliability has to be figured out using nonstandard factor model

37 References Harada, A. and Kano, Y. (2001) Variable selection and test of communality in EFA. IMPS2001, Osaka Kano, Y. (in press). Variable selection for structural models. Journal of Statistical Inference and Planning. Kano, Y. and Harada, A. (2000). Stepwise variable selection in factor analysis. Psychometrika, 65, Kano, Y. and Ihara, M. (1994). Identification of inconsistent variates in factor analysis. Psychometrika, Vol.59, 5-20

38 Thank you for coming to Osaka and being at my talk TakoYaki performance will start soon You can understand how octopus relates to Osaka, if you see and taste it