Paper presented at the American Evaluation Association 2010 conference PLEASE do not distribute and cite only with permission; work is under journal review.

Slides:



Advertisements
Similar presentations
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Advertisements

General Structural Equation (LISREL) Models
Hypothesis Testing Steps in Hypothesis Testing:
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Growth Curve Model Using SEM
Structural Equation Modeling
 Confounders are usually controlled with the “standard” response regression model.  The standard model includes confounders as covariates in the response.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
Structural Equation Modeling
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 11 Experimental and Quasi-experimental.
Today Concepts underlying inferential statistics
Chapter 14 Inferential Data Analysis
L1 Chapter 11 Experimental and Quasi- experimental Designs Dr. Bill Bauer.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Nonparametric or Distribution-free Tests
Inferential Statistics
Advantages of Multivariate Analysis Close resemblance to how the researcher thinks. Close resemblance to how the researcher thinks. Easy visualisation.
Multiple Sample Models James G. Anderson, Ph.D. Purdue University.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Chapter 11 Experimental Designs
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
Some terms Parametric data assumptions(more rigorous, so can make a better judgment) – Randomly drawn samples from normally distributed population – Homogenous.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Statistics Psych 231: Research Methods in Psychology.
Statistics (cont.) Psych 231: Research Methods in Psychology.
ANOVA and Linear Regression ScWk 242 – Week 13 Slides.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
CJT 765: Structural Equation Modeling Class 12: Wrap Up: Latent Growth Models, Pitfalls, Critique and Future Directions for SEM.
Classifying Designs of MSP Evaluations Lessons Learned and Recommendations Barbara E. Lovitts June 11, 2008.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Controlling for Baseline
ANOVA: Analysis of Variance.
Measurement Models: Identification and Estimation James G. Anderson, Ph.D. Purdue University.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Estimating and Testing Hypotheses about Means James G. Anderson, Ph.D. Purdue University.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
1 Study Design Issues and Considerations in HUS Trials Yan Wang, Ph.D. Statistical Reviewer Division of Biometrics IV OB/OTS/CDER/FDA April 12, 2007.
Chapter 10 The t Test for Two Independent Samples
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
T tests comparing two means t tests comparing two means.
Confirmatory Factor Analysis of Longitudinal Data David A. Kenny December
Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.
ANCOVA.
Statistics (cont.) Psych 231: Research Methods in Psychology.
CJT 765: Structural Equation Modeling Final Lecture: Multiple-Group Models, a Word about Latent Growth Models, Pitfalls, Critique and Future Directions.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Data analysis using regression modeling: visual display and setup of simple and complex statistical models 1 CIRA Center for Interdisciplinary Research.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Inferential Statistics Psych 231: Research Methods in Psychology.
Paper presented at the American Evaluation Association 2010 conference
CJT 765: Structural Equation Modeling
Emil Coman, Ph.D. 1 & Peg Weeks, Ph.D. 2
Chapter 2 Simple Comparative Experiments
12 Inferential Analysis.
12 Inferential Analysis.
Fixed, Random and Mixed effects
Product moment correlation
Structural Equation Modeling
Presentation transcript:

Paper presented at the American Evaluation Association 2010 conference PLEASE do not distribute and cite only with permission; work is under journal review Emil Coman 1 Ph.D., Marlene Berg, MA 2 1- Senior Research Scientist-Evaluator, 2 – Associate Director of Training Institute for Community Research, Hartford, CT USA

Statement of problem  Evaluation of intervention effects in community-based setting with quasi- experimental designs is challenging Structural equation modeling (SEM) or covariance structure analysis (CSA) is well equipped i. Much more flexible in modeling (e.g. Graham, 2008) ii. Allows for modeling unobserved (latent) variables iii. Can test and compare fit of alternative models Graham, J. M. (2008). The General Linear Model as Structural Equation Modeling. Journal of Educational and Behavioral Statistics, 33(4), 485.

The problem  SEM flexibility ‘causes’ conclusion flexibility but We want firm single answer to the question: ‘did it work’? i.e. has the intervention changed for the better the intervention group vs. the comparison group?

The YARP problem  Youth Action Research for Prevention ‘Did it work’?

The YARP  Youth Action Research for Prevention  A three-year summer and after-school research project designed to reduce drug and sexual risk behavior in urban minority adolescents in Hartford, CT.  Youth identified a youth-related problem in their community, developed a research model and an action plan addressing that issue, gathered and interpret data, and actively engaged in social action to promote changes in their community.  Youth learned to manage and negotiate their identities in various contexts  increase in Internal Locus of Control (ILC) Berg, M., Coman, E., & Schensul, J. (2009). Youth Action Research for Prevention: A Multi-level Intervention Designed to Increase Efficacy and Empowerment Among Urban Youth. American Journal of Community Psychology, 43(3),

SEM solutions  Regular analyses like ANOVA face the same challenges, but can’t deal with all of them; e.g. assumptions like (Prof. David Garson’s www)www i. homogeneity of variances across the groups ii. multivariate normality iii. adequate sample size iv. random sampling ~ SEM seems to handle these well. + SEM can test and “prune away” assumptions (Rhodes, 2010) Rhodes, W. (2010). Heterogeneous Treatment Effects: What Does a Regression Estimate? Evaluation review, 34(4), 334.

SEM reminder  Models are specified by replacing correlations (double headed arrows) with causal paths (one- directional), or deleting them (forcing them equal to zero). This produces degrees of freedom for testing the fit of the model.  Fit is simply the extent to which the model yields implied (estimated) means and covariances that are similar to the observed ones.  To test a specific parameter, one sets it to a value (zero, e.g.) and checks the worsening of fit.

SEM setup (simplest model) The problem is that the test of equal post-intervention means can be done assuming equality (or not) across groups of each of these parameters:  baseline means  baseline variances  the baseline to post-intervention causal path  the posttest error variances Pre-test Internal Locus of Control Post-test Internal Locus of Control vivi pipi eiei 1 mimi τiτi

One SEM result – paths equal, rest not Note: A: model implied intercepts; B: model implied means; C: model implied variances; *,= means parameter estimated but set equal across groups; first numbers are for comparison, second for intervention group.

5Tmvpe 4vmpe.003 SEM modeling for comparing post-intervention observed outcome means in multiple-group structured means models Intervention successful; but not all parameters may be equal.

5Tmvpe 1p_Paths= 1ErVars= 1v_Variances=1m_Pre- Means= 2Tm2Tv 2Tp 2Te 4vmpe.003 SEM modeling for comparing post-intervention observed outcome means in multiple-group structured means models Intervention successful 2/5 time; unsuccessful the other 3/5; not very good to report

AMOS 16 setup - all possible models ---

5Tmvpe 1p_Paths= 1ErVars= 1v_Variances= 1T_Post - Means Test 2vp 1m_Pre- Means= 2mv2mp 2ve 2me 2Tm2pe2Tv 2Tp 2Te 3vmp 3vme3vpe 3Tvm 3Tme 3Tmp 3Tvp 3Tve 3Tpe 4vmpe 4Tvmp 4Tvpe 4Tvme ALL MODELS Decision-tree SEM modeling for comparing post-intervention observed outcome means in multiple-group structured means models Note: Rounded rectangles and bold text: models with good fit; snipped interrupted rectangles: fit not adequate; numbers next to boxes: to right - fit ordered from best fitting (1) up; numbers on arrows: significance of worsening of fit (Δχ 2 ); m=means; v=variances; p=autoregressive path; e = error variances; T = test of post-test equal intercepts; W1 = wave 1, W2 =wave mpe 4Tmpe

Some fit better than others ---

Estimates of Intervention Effects on Internal Locus of Control as Standardized Effect Sizes from All Alternative SMM Models Note: Numbers above bars are p values for nested model comparisons for the equality of post-intervention outcome intercept constraint; F’s = fit rank order of non-constrained (baseline) model according to p(χ 2 ). Labels: m = means; v = variances; p = autoregressive path; e = error variance, and T = the test of equal post-intervention intercepts, τ C2 = τ T2.

Additional problems 1. Unreliability of the composite measure 2. Power of different alternative SEM models to detect that specific (focal) parameter, i.e. the equality between post-test intercepts. SOLUTIONS 1. Re-model with latent ILC instead of composite, with measurement error error variances equal to the variance of the observed ILC variables times one minus their reliability (MacKinnon, 2008, p.186) 2. Assess power of the tests for all models.

Solution 1  All models showed significant positive intervention effect.

5Tmvpe 1p_Paths= 1ErVars= 1v_Variances= 1T_Post - Means Test 2vp 1m_Pre- Means= 2mv2mp 2ve 2me 2Tm2pe2Tv 2Tp 2Te 3vmp 3vme3vpe 3Tvm 3Tme 3Tmp 3Tvp 3Tve 3Tpe 4vmpe 4Tvmp 4Tvpe 4Tvme Solution 2: Decision-Tree (DT) approach Note: Rounded rectangles and bold text: models with good fit; snipped interrupted rectangles: fit not adequate; numbers next to boxes: to left in interrupted pentagons– power of test, and to right - fit ordered from best fitting (1) up; numbers on arrows: significance of worsening of fit (Δχ 2 ); m=means; v=variances; p=autoregressive path; e = error variances; T = test of post-test equal intercepts; W1 = wave 1, W2 =wave mpe 4Tmpe

Conclusions 1. Importance of a priori specification of alternative models 2. The utility of post-hoc power analysis 3. The need to attend to the chi-square fit measure The utility of modeling measurement errors 4. The Decision-Tree (DT) approach helps with organizing, deciding upon, and presenting appropriate alternative SEM testing of intervention effects. Limitations over-fitting concerns; no measurement modeling; etc.

Thank you!   ---