Soc 3306a: Path Analysis Using Multiple Regression and Path Analysis to Model Causality.

Slides:



Advertisements
Similar presentations
Cause (Part II) - Causal Systems I. The Logic of Multiple Relationships II. Multiple Correlation Topics: III. Multiple Regression IV. Path Analysis.
Advertisements

Structural Equation Modeling
Structural Equation Modeling
Data Analysis: Relationships Continued Regression
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression single and multiple. Overview Defined: A model for predicting one variable from other variable(s). Variables:IV(s) is continuous, DV is continuous.
Variance and covariance M contains the mean Sums of squares General additive models.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Structural Equation Modeling
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Regression Analysis
Introduction to simple linear regression ASW, Economics 224 – Notes for November 5, 2008.
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Structural Equation Models – Path Analysis
Objectives of Multiple Regression
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Regression and Correlation Methods Judy Zhong Ph.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
Soc 3306a Lecture 8: Multivariate 1 Using Multiple Regression and Path Analysis to Model Causality.
Soc 3306a Lecture 10: Multivariate 3 Types of Relationships in Multiple Regression.
CHAPTER NINE Correlational Research Designs. Copyright © Houghton Mifflin Company. All rights reserved.Chapter 9 | 2 Study Questions What are correlational.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Alcohol consumption and HDI story TotalBeerWineSpiritsOtherHDI Lifetime span Austria13,246,74,11,60,40,75580,119 Finland12,524,592,242,820,310,80079,724.
Bivariate Regression Analysis The most useful means of discerning causality and significance of variables.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 23 Multiple Regression.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
SEM: Basics Byrne Chapter 1 Tabachnick SEM
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Multiple Regression Lab Chapter Topics Multiple Linear Regression Effects Levels of Measurement Dummy Variables 2.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Path Analysis and Structured Linear Equations Biologists in interested in complex phenomena Entails hypothesis testing –Deriving causal linkages between.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Lecture 10: Correlation and Regression Model.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Correlation & Regression Analysis
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 18 Multivariate Statistics.
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
Chapter 16 PATH ANALYSIS. Chapter 16 PATH ANALYSIS.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Lecture 10 Regression Analysis
Bivariate & Multivariate Regression Analysis
REGRESSION (R2).
Cause (Part II) - Causal Systems
Structural Equation Modeling
HW# : Complete the last slide
Product moment correlation
Individual Assignment 6
Correlation & Trend Lines
Regression Part II.
Presentation transcript:

Soc 3306a: Path Analysis Using Multiple Regression and Path Analysis to Model Causality

Causality Criteria:  Association (correlation)  Non-spuriousness  Time order  Theory (implied)

Causation Evidence for causation cannot be attributed from correlational data But can be found in: 1. the strength of the partial relationships (the bivariate relationship does not disappear when controlling for another variable) 2. assumed time order (derived from theory)

Path Analysis Can be used to test causality through the use of bivariate and multivariate regression Note that you are only finding evidence for causality, not proving it. Can use the standardized coefficients (the beta weights) to determine the strengths of the direct and indirect relationships in a multivariate model Is variability in DV stochastic (chance) or can it be explained by systematic components (correctly specified IV’s)

STEP 1 Specify a model derived from theory and a set of hypotheses Example: Model would predict that the variation in the dependent variable SEI can be explained by four independent variables, SEX, EDUC, INCOME, and AGE In other words, hypothesizes a causal relationship to explain SEI

SEI SEX AGE EDUC INC Exogenous VariablesEndogenous Variables Hypothetical Model For SEI

STEP 2 Test the bivariate correlations to determine which relationships are real. Initial correlation matrix showed that SEX was not significantly associated with any of the other variables except INCOME, which was a very weak negative relationship, so it was dropped from the model. Note: Bivariate scatterplots showed that all relationships were linear. Histograms and skewness statistics were within normal limits.

SEI AGE EDUC INC Exogenous VariablesEndogenous Variables Revised Hypothetical Model For SEI

Figure 1 Revised Bivariate Correlations Examine correlations between SEI and IV’s Moderately strong, positive relationship between SEI and Education, a weak- moderate relationship with INCOME and a very weak, non-significant one with AGE Look also at correlations between IV’s Strong correlations between IV’s ( >.700) can indicate multicollinearity. No problems observed in this model.

STEP 3: Find Path Coefficients The direct and indirect path coefficients are the standardized slopes or Beta Weights To find them, a series of multiple regression models are tested

Testing of Models Model 1  SEI = AGE + EDUC + INC + e  e = error or unexplained variance Model 2  INC = AGE + EDUC + e Model 3  EDUC = AGE + e

Figure 1: Model 1 This is a full multiple regression model to regress SEI on all IV’s Examine the scatterplots for linearity and homoscedasticity Interpret the model. Is it significant? Interpret R (multiple correlation coefficient) and Adj. R 2 (coefficient of determination) Interpret slopes, betas and significance. Check partial correlations. Add betas to model diagram

Figure 2: Model 2 Now we need to calculate the other relationships (Betas) in the model Regress INC on EDUC and AGE Add betas to path diagram.

Figure 3: Model 3 Regress EDUC on AGE Again, add beta to path diagram.

SEI AGE EDUC INC Exogenous VariablesEndogenous Variables Causal Model For SEI.049 ns.182***.175*** -.071**.226***.561***

STEP 4 Calculate Causal Effects Causal Effect of Age:  Indirect….. AGE-INC->SEI=.182x.175=.032 AGE-EDUC->SEI= -.071x.561= AGE-EDUC-INC->SEI= -.071x.226x.175 =  Direct…. Age->SEI =.049  Total Causal Effect Indirect + Direct= =.038

Causal Effect of EDUC and INC Causal Effect of EDUC:  Indirect….. EDUC-INC->SEI=.226x.175=.040  Direct…. EDUC->SEI =.561  Total Causal Effect Indirect + Direct= =.601 Causal Effect of INC:  Direct…. INC->SEI =.175 Total Causal Effect =.175

Issues Related to Path Analysis Very sensitive to model specification Failure to include relevant causal variables or inclusion of irrelevant variables can substantially affect the path coefficients Example: inclusion of AGE in above model Build your model one variable at a time (use Blocks and asking for R 2 change under statistics) to test for significant change in R 2 value until new additions do not significantly increase explanatory value of model further. But will not solve problem of irrelevant IV’s (i.e. when your model is overidentified)

SEM (Structural Equation Modeling) To avoid overidentification, the best strategy is to also examine alternative explanatory models One new technique is structural equation modeling (SEM) using specialized software (i.e. SPSS’s AMOS program) Can test several models simultaneously Although we will not cover SEM in class, it is something to keep in mind for future model building.

Comment on SEI Model (above) Model shown above had adj. R 2 =.396 Overall, INC, EDUC, AGE explained 39.6% of variation in SEI But, unexplained variance (error) was =.604 (stochastic component) 60.4% of variation in SEI still unexplained Furthermore, causal effect of AGE only.038 Specification error – this model is underidentified Could drop AGE and consider other important IV’s (i.e. CLASS, OCCUPATIONAL PRESTIGE)? See Figure 4 Revised Model Using CLASS