Download presentation
Presentation is loading. Please wait.
Published byLucinda Lamb Modified over 8 years ago
1
Data analysis using regression modeling: visual display and setup of simple and complex statistical models 1 CIRA Center for Interdisciplinary Research on AIDS at Yale U.; 2 Ethel Donaghue TRIPP Center, UConn Health Center; 3 Eastern Connecticut State U.; 4 Transilvania U., Romania; 5 Wesleyan U. Emil Coman PhD 1,2, Maria A. Coman MS 1, Eugen Iordache PhD 4, Russell Barbour PhD 1, Lisa Dierker PhD 5 Yale Day of Data, Sept. 20, 2013 Abstract We present visual modeling solutions for testing simple and more advanced statistical hypotheses in any research field. All models can be directly specified in analytical software like SAS, Stata, Mplus, AMOS, or R. Modeling notes: All models use the intuitive linear regression approach. The model shown on the right translates into the regression equation: Y = α Y + β *X + 1*error Y One can then use various estimation approaches to derive parameter values and their significance levels. Statistical models are meant to explain and predict : 1. differences (between individual cases and groups); 2. changes (over time); 3. differences in changes; 4. changes in differences; 5. dynamic processes. Labels - σ 2 : variance; μ: mean; α: intercept; β (or γ, a, b, c’): regression coefficients; u: error; φ: correlation; τ: transition probability; interrupted line indicates a categorical variable; squares are observed, ellipses un- observed variables. References 1. Voelkle, M. C. (2007). Latent growth curve modeling as an integrative approach to the analysis of change. Psychology Science, 49(4), 375. doi: 10.1111/j.1469-8986.2007.00544.x 2. Coman, E. N., Picho, K., McArdle, J. J., Villagra, V., Dierker, L., & Iordache, E. (2013). The paired t-test as a simple latent change score model. Accepted by: Frontiers in Quantitative Psychology and Measurement 3. Coman, E. (2009). Recapturing Time in Evaluation of Causal Relations: Illustration of Latent Longitudinal and Nonrecursive SEM Models for Simultaneous Data. Paper presented at the American Evaluation Association convention, Nov. 14, 2009 Orlando FL. http://comm.eval.org/EVAL/EVAL/Resources/ViewDocument/Default.aspx?DocumentKey=5f351c13-1f91-42b7-85fe- 450d19f46fca http://comm.eval.org/EVAL/EVAL/Resources/ViewDocument/Default.aspx?DocumentKey=5f351c13-1f91-42b7-85fe- 450d19f46fca 4. Grimm, K. J., An, Y., McArdle, J. J., Zonderman, A. B., & Resnick, S. M. (2012). Recent Changes Leading to Subsequent Changes: Extensions of Multivariate Latent Difference Score Models. Structural Equation Modeling: A Multidisciplinary Journal, 19(2), 268-292. 5. Coman, E., Iordache, E., & Coman, M. A. (2013). Testing mediation the way it was meant to be: changes leading to changes then to other changes. Dynamic mediation implemented with latent change scores. Paper presented at the Modern Modeling Methods (M3) Conference, May 21-22, Storrs, CT. http://www.modeling.uconn.edu/m3c/assets/File/Coman_mediation%20the%20way%20it%20was%20meant.pdf http://www.modeling.uconn.edu/m3c/assets/File/Coman_mediation%20the%20way%20it%20was%20meant.pdf 6. Kenny, D. A. (1987). Statistics for the social and behavioral sciences: Little, Brown Boston. Available FREE online: davidakenny.net/doc/statbook/kenny87.pdf 7. Barreto, H., & Howland, F. (2005). Introductory Econometrics: Using Monte Carlo Simulation with Microsoft Excel: Cambridge University Press. Acknowlegements The first author thanks his causal modeling mentor David Kenny, and his community-based research mentor Marlene Berg. The crosstabs chi-squared model The crosstabs chi-square test is testing whether 2 categorical variables are independent of each other/not correlated. We show two binary categorical variables, but dummy coding can be used to expand the model. Such non-directional models can be extended to multi- equations log-linear models. Whether the regression or the correlation coefficients better capture associations between variables merits discussion (6, 7). Mediation model Mediation is testing whether a * b =/≠ 0. The existence of a φ≠0 biases the mediation estimate. Moderated mediation model 1. Continuous moderator 2. Categorical moderator Moderated mediation is testing: 1. β=/≠0; 2. a g1 * b g1 =/≠ a g2 * b g2. Latent longitudinal simultaneous regression The LL model (3) is testing how β’ changes with σ 2 X0Y0, γ X & γ Y (and σ 2 X0, σ 2 Y0, μ X0 & μ Y0 ). Two-level regression model The two-level regression has the β g and α g coefficients varying across groups (with between-group predictors (B) explaining their variability). Latent Class Analysis model LCA explains covariability between X, Y, Z by a common cause: an unobserved grouping variable; no co-variation is left within classes. Latent Transition Analysis model LTA explains covariabilities between variables, and estimates transition probabilities τ between latent classes. Latent Growth model LGM ‘forces’ linear (or nonlinear) slopes to each individual trajectory, and estimates a mean slope (and its variance). True Change (Latent Change Score) model The TC (LCS) model (4) explains variability in pairwise changes, including by prior states γ or prior changes β d. Dynamic Mediation model The Dynamic Mediation model (5) tests whether receiving treatment leads to changes in an intervening variable M, which then leads to subsequent changes in the outcome Y. Conclusion Data analysis in any substantive field can be easily accomplished by translating statistical tests in the intuitive language of regression-based path diagrams with observed and unobserved variables. All models we presented can be directly specified and estimated in analytical software. Students can particularly benefit from learning the simple regression modeling setup of the path analytical method, as it empowers them to apply the techniques to any data to test models of virtually any complexity. T-test model 1.2-group 1-variable 2. 1-group 1-DV variable with group as predictor The t-test is testing: 1. μ YA = μ YB or 2. β =/≠ 0 Paired t-test model 1. Auto-regressive model (1) tests α Y2 =/≠ 0 2.The True Change (Latent Change Score) is testing α Y2 =/≠ 0 (2) Notes: γ=|1-β| LGM can replicate the paired t-test too (1). Anova model – 3 groups example 1. 3-group 1-variable 2. 1-DV (dependent variable) with grouping predictor The Anova test is testing 1. μ YA = μ YB = μ YC or 2. β′s =/≠ 0 (different coding of the grouping predictor allows for specific desired tests).
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.