Dr. Richard Charnigo Professor of Statistics and Biostatistics 07 December 2015.

Slides:



Advertisements
Similar presentations
A Gentle Introduction to Linear Mixed Modeling and PROC MIXED
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Lesson 10: Linear Regression and Correlation
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Linear Regression t-Tests Cardiovascular fitness among skiers.
Correlation and regression Dr. Ghada Abo-Zaid
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Simple Regression. Major Questions Given an economic model involving a relationship between two economic variables, how do we go about specifying the.
Outline 1) Objectives 2) Model representation 3) Assumptions 4) Data type requirement 5) Steps for solving problem 6) A hypothetical example Path Analysis.
Overviews of Multifactor Dimensionality Reduction, Structural Equation Modeling, and Stochastic Modeling BST 764 – FALL 2014 – DR. CHARNIGO – UNIT FOUR.
Multiple regression analysis
The Simple Linear Regression Model: Specification and Estimation
Chapter 10 Simple Regression.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Simultaneous Equations Models
SIMPLE LINEAR REGRESSION
Structural Equation Modeling
Chapter 11 Multiple Regression.
SIMPLE LINEAR REGRESSION
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Linear Regression
SIMPLE LINEAR REGRESSION
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Correlation and Regression
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Understanding Statistics
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chapter 10 Lecture 2 Section: We analyzed paired data with the goal of determining whether there is a linear correlation between two variables.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Agresti/Franklin Statistics, 1 of 88  Section 11.4 What Do We Learn from How the Data Vary Around the Regression Line?
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
Dr. Richard Charnigo Professor of Statistics and Biostatistics 08 and 11 January 2016.
Richard Charnigo Professor of Statistics and Biostatistics University of Kentucky March 2016
Nonparametric Statistics
Chapter 17 STRUCTURAL EQUATION MODELING. Structural Equation Modeling (SEM)  Relatively new statistical technique used to test theoretical or causal.
F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.
Nonequivalent Groups: Linear Methods Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and practices (2 nd ed.). New.
Lecture Slides Elementary Statistics Twelfth Edition
Multiple Regression.
Structural Equation Modeling using MPlus
Simultaneous equation system
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
Multiple Regression.
Elementary Statistics
A Gentle Introduction to Linear Mixed Modeling and PROC MIXED
The Simple Linear Regression Model: Specification and Estimation
Introduction to Regression
12 Inferential Analysis.
SIMPLE LINEAR REGRESSION
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Testing Causal Hypotheses
Presentation transcript:

Dr. Richard Charnigo Professor of Statistics and Biostatistics 07 December 2015

Motivation Structural equation modeling is useful for data analysis entailing one or both of two features that render ordinary regression modeling inapplicable: 1.There is not a clear dichotomy of “independent variables” versus “dependent variables” because some variables both predict and are predicted by other variables in the modeling.

Motivation This may happen in a longitudinal study if the value of a variable at an intermediate time point both predicts its value at a later time point and is predicted by its value at an earlier time point. This may happen in a cross-sectional study or a longitudinal study if a variable is a “mediator”. For instance, rumination may be a mediator between stress and depression. So, rather than using “independent variable” and “dependent variable”, we refer to variables predicted by other variables in the modeling as “endogenous” and to variables not predicted by other variables in the modeling as “exogenous”.

Motivation 2. What we are mainly interested in is not directly observable but is measured, imperfectly, using one or more instruments. Such instruments are often scales obtained by summations of Likert items. In this context, that which is not directly observable is often called a “latent construct”, while a corresponding instrument is often referred to as an “observable indicator”.

A Hypothetical Example Consider a longitudinal study of college students who have a history of drinking in high school. The goal of the study is to ascertain whether sensation seeking may influence drinking, whether drinking may influence sensation seeking, or both. Let SS 1 denote sensation seeking at the beginning of college and DR 1 drinking at the beginning of college. These are exogenous variables in that they will not be predicted by any other variables that we will include in our structural equation model. They are also latent variables because we do not observe them.

A Hypothetical Example Let SS 2 denote sensation seeking after one year of college and DR 2 drinking after one year of college. Let SS 3 denote sensation seeking after two years of college and DR 3 drinking after two years of college. These are endogenous variables in that they will be predicted by other variables that we will include in our structural equation model. They are also latent variables because we do not observe them.

A Hypothetical Example Let NS 1 be the score observed on a scale of novelty seeking; IM 1 the score observed on a scale of impulsivity; DF 1 the score observed on a scale of drinking frequency; and DI 1 the score observed on a scale of drinking intensity, with all of these scales administered at the beginning of college. Thus, NS 1 and IM 1 are observable indicators of the latent construct SS 1, while DF 1 and DI 1 are observable indicators of the latent construct DR 1.

A Hypothetical Example Let NS 2 be the score observed on a scale of novelty seeking; IM 2 the score observed on a scale of impulsivity; DF 2 the score observed on a scale of drinking frequency; and DI 2 the score observed on a scale of drinking intensity, with all of these scales administered after one year of college. Let NS 3 be the score observed on a scale of novelty seeking; IM 3 the score observed on a scale of impulsivity; DF 3 the score observed on a scale of drinking frequency; and DI 3 the score observed on a scale of drinking intensity, with all of these scales administered after two years of college.

A Hypothetical Example The next slide depicts relationships among these quantities. Those labeled with “e” or “d” symbols represent random deviations from what is predicted and may be called, for lack of a better word, “errors”. Generally speaking, a given quantity is expressed as a linear combination of those quantities which contribute to it, not unlike what is seen in regression. Thus, for example, SS 2 = b 0 + b 1 SS 1 + b 2 DR 1 + e 7 and DF 3 = a 0 + a 1 DR 3 + d 5.

A Hypothetical Example SS 1 DR 1 SS 2 DR 2 SS 3 DR 3 DI 3 DF 3 DI 2 DF 2 DI 1 DF 1 IM 2 NS 2 IM 3 NS 3 IM 1 NS 1 e1e1 e2e2 e8e8 e7e7 e3e3 e4e4 e5e5 e6e6 d6d6 d5d5 d4d4 d3d3 d2d2 d1d1 d8d8 d7d7

An Actual Example Sometimes a structural equation model is not as complicated as indicated on the preceding page. For example, if there is only one observable indicator for each latent construct, then the observable indicator may be identified with its latent construct. An example of this occurs in my 2009 Public Health Nutrition paper with Huddleston-Casas and Simmons. We were trying to understand whether food insecurity predicted depression, or vice versa, or both, using three waves of survey data from low- income rural mothers. The diagram, derived from Figure 1 of the paper, appears next.

An Actual Example

Some results, from Table 2 of the paper, are as shown below. Also, testing the null hypothesis that α 5 = α 6 = 0 yields P=0.034 in the complete case analysis (P<0.001 in the imputed analysis), and we have P=0.003 (P<0.001) in testing α 7 = α 8 = 0.

An Actual Example The conclusion is that the relationship between food insecurity and depression among low-income rural mothers may be bidirectional. And, just as linear regression models have R 2 and other tools to assess goodness of fit, there are various indices of fit for structural equation models. Three of them are χ 2 / df, RMSEA, and CFI. In the complete case analysis from our 2009 paper, we had χ 2 / df = 1.835, RMSEA = 0.068, and CFI = These are favourable values for the indices.

Remarks Goodness of fit is often substantially affected by the assumptions one makes regarding which error terms are correlated. Since that is difficult to ascertain from subject matter theory, some iterative process (not unlike stepwise selection in linear regression modeling) is often helpful. Typically we assume that quantities involved are (close to) normally distributed. However, a categorical variable can be accommodated in the sense that model parameters may be estimated for and compared across multiple groups defined by the categorical variable. We did this, too, in our 2009 paper.

References Bollen and Long (1993). Testing Structural Equation Models. SAGE, Newbury Park CA. Hoyle (1995). Structural Equation Modeling: Concepts, Issues, and Applications. SAGE, Thousand Oaks CA. Huddleston-Casas, Charnigo, and Simmons (2009). Food insecurity and maternal depression in rural, low-income families: a longitudinal investigation. Public Health Nutrition, 12, Kline (2010). Principles and Practice of Structural Equation Modeling, 3 rd Edition. Guilford, New York NY. Mueller (1996). Basic Principles of Structural Equation Modeling. Springer-Verlag, New York NY.