LECTURE 16 STRUCTURAL EQUATION MODELING.

Slides:



Advertisements
Similar presentations
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Advertisements

Structural Equation Modeling. What is SEM Swiss Army Knife of Statistics Can replicate virtually any model from “canned” stats packages (some limitations.
SEM PURPOSE Model phenomena from observed or theoretical stances
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
General Structural Equation (LISREL) Models
Structural Equation Modeling
The Multiple Regression Model.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Structural Equation Modeling
Common Factor Analysis “World View” of PC vs. CF Choosing between PC and CF PAF -- most common kind of CF Communality & Communality Estimation Common Factor.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
LECTURE 5 TRUE SCORE THEORY. True Score Theory OBJECTIVES: - know basic model, assumptions - know definition of reliability, relation to TST - be able.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Structural Equation Modeling
Linear Regression and Correlation Analysis
Chapter 11 Multiple Regression.
G Lecture 51 Estimation details Testing Fit Fit indices Arguing for models.
G Lect 31 G Lecture 3 SEM Model notation Review of mediation Estimating SEM models Moderation.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Lecture 5 Correlation and Regression
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Structural Equation Modeling Continued: Lecture 2 Psy 524 Ainsworth.
Factor Analysis Psy 524 Ainsworth.
LECTURE 6 RELIABILITY. Reliability is a proportion of variance measure (squared variable) Defined as the proportion of observed score (x) variance due.
Multiple Sample Models James G. Anderson, Ph.D. Purdue University.
MEASUREMENT MODELS. BASIC EQUATION x =  + e x = observed score  = true (latent) score: represents the score that would be obtained over many independent.
Hypothesis Testing:.
Structural Equation Modeling 3 Psy 524 Andrew Ainsworth.
Confirmatory Factor Analysis Psych 818 DeShon. Purpose ● Takes factor analysis a few steps further. ● Impose theoretically interesting constraints on.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
7.1 Multiple Regression More than one explanatory/independent variable This makes a slight change to the interpretation of the coefficients This changes.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Ordinary Least Squares Regression.
Confirmatory Factor Analysis Psych 818 DeShon. Construct Validity: MTMM ● Assessed via convergent and divergent evidence ● Convergent – Measures of the.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Chapter 13 Multiple Regression
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 12 Making Sense of Advanced Statistical.
G Lecture 7 Confirmatory Factor Analysis
Academic Research Academic Research Dr Kishor Bhanushali M
G Lecture 81 Comparing Measurement Models across Groups Reducing Bias with Hybrid Models Setting the Scale of Latent Variables Thinking about Hybrid.
CFA: Basics Beaujean Chapter 3. Other readings Kline 9 – a good reference, but lumps this entire section into one chapter.
Question paper 1997.
SEM Basics 2 Byrne Chapter 2 Kline pg 7-15, 50-51, ,
CJT 765: Structural Equation Modeling Class 8: Confirmatory Factory Analysis.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
Evaluation of structural equation models Hans Baumgartner Penn State University.
Lesson 5.1 Evaluation of the measurement instrument: reliability I.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
The SweSAT Vocabulary (word): understanding of words and concepts. Data Sufficiency (ds): numerical reasoning ability. Reading Comprehension (read): Swedish.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
Classical Test Theory Psych DeShon. Big Picture To make good decisions, you must know how much error is in the data upon which the decisions are.
Lecture 2 Survey Data Analysis Principal Component Analysis Factor Analysis Exemplified by SPSS Taylan Mavruk.
Advanced Statistical Methods: Continuous Variables
Structural Equation Modeling using MPlus
Classical Test Theory Margaret Wu.
CJT 765: Structural Equation Modeling
Quantitative Methods Simple Regression.
Making Sense of Advanced Statistical Procedures in Research Articles
Writing about Structural Equation Models
Structural Equation Modeling
Simple Linear Regression
Statistics II: An Overview of Statistics
What are their purposes? What kinds?
Causal Relationships with measurement error in the data
Testing Causal Hypotheses
Structural Equation Modeling
Presentation transcript:

LECTURE 16 STRUCTURAL EQUATION MODELING

SEM PURPOSE Model phenomena from observed or theoretical stances Develop and test constructs not directly observed based on observed indicators Test hypothesized relationships, potentially causal, ordered, or covarying

Relationships to other quantitative methods

Decomposition of Covariance/Correlation Most hypotheses about relationship can be represented in a covariance matrix SEM is designed to reproduce the observed covariance matrix as closely as possible How well the observed matrix is fitted by the hypothesized matrix is Goodness of Fit Modeling can be either entirely theoretical or a combination of theory and revision based on imperfect fit of some parts.

Decomposition of Covariance/Correlation Example: Correlation between TAAS reading level at grades 3 and 4 in 1999 was .647 for 3316 schools that gave the test. Suppose this is taken as the theoretical value for year 2000. Thus, .768 error TAAS Grade3 Reading .647 TAAS Grade4 Reading

Decomposition of Covariance/Correlation Example: Correlation between TAAS reading level at grades 3 and 4 in 2000 was .674 for 3435 schools that gave the test. We then test the theory that the relationship is stable across years; error H: =.647 TAAS Grade3 Reading .674 TAAS Grade4 Reading

Decomposition of Covariance/Correlation In classical statistics this problem is solved through Fisher’s Z-transform Zr=tanh-1 r = 1/2 ln[1+  r  /(1 -  r |) And a normal statistic developed, z=Zr - ZH In SEM this is a covariance problem of fitting the observed covariance matrix to the theoretical matrix: .674 .674 1 r r 1 =

Decomposition of Covariance/Correlation The test is based on large sample multivariate normality under either maximum likelihood or generalized least squares estimation. In this case there is no estimation required, since all parameters are known. For the Fisher Z-transform, the statistic is z=1.044, p >.29. For the SEM method,

Decomposition of Covariance/Correlation Under SEM, the model is represented as F = log  + tr(S-1 ) - logS - (p – q) = log | | + tr{ -1 } - log | | - (2-1) = log (1-.6472) + tr{ } -log (1-.6742)-1 = -.23553 + 1.94 - .23552-1 = .94 = 1.94 X2 = .94 , df=1, p > .33 .647 .647 1 .674 .674 1 .647 .647 1 .674 .674 1 .674 .674 1 1/(1-.6472) - .647/(1-.6472) -.647/(1-.6472) 1/(1-.6472)

Developing Theories Previous research- both model and estimates can be used to create a theoretical basis for comparison with new data Logical structures- time, variable stability, construct definition can provide order 1999 reading in grade 3 can affect 2000 reading in grade 4, but not the reverse Trait anxiety can affect state anxiety, but not the reverse IQ can affect grade 3 reading, but grade 3 reading is unlikely to alter greatly IQ (although we can think of IQ measurements that are more susceptible to reading than others)

Developing Theories Experimental randomized design- can be part of SEM What-if- compare competing theories within a data set. Are all equally well explained by the data covariances? Danger- all just-identified models equally explain all the data (ie. If all degrees of freedom are used, any model reproduces the data equally well) Parsimony- generally simpler models are preferred; as simple as needed but not simple minded

MEASUREMENT MODELS

BASIC EQUATION x =  + e x = observed score  = true (latent) score: represents the score that would be obtained over many independent administrations of the same item or test e = error: difference between y and 

ASSUMPTIONS  and e are independent (uncorrelated) The equation can hold for an individual or a group at one occasion or across occasions: xijk = ijk + eijk (individual) x*** = *** + e*** (group) combinations (individual across time)

x x  e

RELIABILITY Reliability is a proportion of variance measure (squared variable) Defined as the proportion of observed score (x) variance due to true score ( ) variance: 2x = xx’ = 2 / 2x

Var() Var(e) Var(x) reliability

Reliability: parallel forms x1 =  + e1 , x2 =  + e2 (x1 ,x2 ) = reliability = xx’ = correlation between parallel forms

x1 x2 x x  e e xx’ = x * x

ASSUMPTIONS  and e are independent (uncorrelated) The equation can hold for an individual or a group at one occasion or across occasions: xijk = ijk + eijk (individual) x*** = *** + e*** (group) combinations (individual across time)

Reliability: Spearman-Brown Can show the reliability of the composite is kk’ = [k xx’]/[1 + (k-1) xx’ ] k = # times test is lengthened example: test score has rel=.7 doubling length produces rel = 2(.7)/[1+.7] = .824

Reliability: parallel forms For 3 or more items xi, same general form holds reliability of any pair is the correlation between them Reliability of the composite (sum of items) is based on the average inter-item correlation: stepped-up reliability, Spearman-Brown formula

COMPOSITES AND FACTOR STRUCTURE 3 MANIFEST VARIABLES REQUIRED FOR A UNIQUE IDENTIFICATION OF A SINGLE FACTOR PARALLEL FORMS REQUIRES: EQUAL FACTOR LOADINGS EQUAL ERROR VARIANCES INDEPENDENCE OF ERRORS

e e x1 x2 x x e  x x3 xx’ = xi * xj

RELIABILITY FROM SEM TRUE SCORE VARIANCE OF THE COMPOSITE IS OBTAINABLE FROM THE LOADINGS: K  =  2i i=1 K = # items or subtests = K2x

RELIABILITY FROM SEM RELIABILITY OF THE COMPOSITE IS OBTAINABLE FROM THE LOADINGS:  = K/(K-1)[1 - 1/  ] example 2x = .8 , K=11  = 11/(10)[1 - 1/8.8 ] = .975

CONGENERIC MODEL LESS RESTRICTIVE THAN PARALLEL FORMS OR TAU EQUIVALENCE: LOADINGS MAY DIFFER ERROR VARIANCES MAY DIFFER MOST COMPLEX COMPOSITES ARE CONGENERIC: WAIS, WISC-III, K-ABC, MMPI, etc.

e2 e1 x1 x2 x1 x2 e3 x3  x3 (x1 , x2 )= x1 * x2

COEFFICIENT ALPHA xx’ = 1 - 2E /2X = 1 - [2i (1 - ii )]/2X , since errors are uncorrelated  = K/(K-1)[(1 - s2i )/ s2X ] where X = xi (composite score) s2i = variance of subtest xi sX = variance of composite Does not assume knowledge of subtest ii

SEM MODELING OF CONGENERIC FORMS PROC CALIS COV CORR MOD; LINEQS X1 = L1 F1 + E1, X2 = L2 F1 + E2, … X10 = L10 F1 + E10; STD E1-E10=THE:, F1= 1.0;

MULTIFACTOR STRUCTURE Measurement Model: Does it hold for each factor? PARALLEL VS. TAU-EQUIVALENT VS. CONGENERIC How are factors related? What does reliability mean in the context of multifactor structure?

MINIMAL CORRELATED FACTOR STRUCTURE x1 x2 x11 x22 e3 1 2 x31 x3 x42 x4 12 e4

STRUCTURAL MODELS Path analysis for latent variables- but can include recursive models Begins with measurement model Theory-based model of relationships among all variables Modification of model at path level: LaGrange and Wald modification indices

Path analysis for latent variables 1 Y1 Y2 2 1 X1 1 1 Y5 5 X2 12 3 Y6 6 2 2 2 X3 Y3 Y4 X4 3 4 3 4 y = By + x + 

Measurement Model first Y1 1 Y2 2 1 X1 1 1 Y5 5 X2 12 3 Y6 6 2 2 2 X3 Y3 Y4 X4 3 4 3 4

Modify Measurement Model as needed Modification indices: Wald Index: release constrained parameter (usually 0 path) * chi square statistic with df=#releases LaGrange Multiplier Index: restrict to 0 a free parameter * chi square statistic with df =# restrictions

Test Full Model Examine overall Fit Examine Modification Indices Decide if there is evidence and theoretical justification for dropping or adding a path (Note- one path at a time- select most critical/theoretically important to start with) Liberal rule for keeping, conservative rule for adding (VW recommendation)

Computer Programs AMOS 5.0 – both drawing and syntax, SPSS based Mplus 3.0 text data input, syntax only EQS 7.8 both drawing and syntax, similar to SAS LISREL 8.7 both drawing and syntax, difficult to use SAS Proc Calis syntax based, easiest to integrate with other data analysis procedures