Correlation, Regression & Nested Models

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Structural Equation Modeling. What is SEM Swiss Army Knife of Statistics Can replicate virtually any model from “canned” stats packages (some limitations.
Lesson 10: Linear Regression and Correlation
Kin 304 Regression Linear Regression Least Sum of Squares
Structural Equation Modeling
Chapter 12 Simple Linear Regression
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Multiple Regression Research Methods and Statistics.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Simple Linear Regression Analysis
Linear Regression/Correlation
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Correlation & Regression
Objectives of Multiple Regression
Inference for regression - Simple linear regression
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 13 Multiple Regression
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Environmental Modeling Basic Testing Methods - Statistics III.
Multiple regression.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Stats Methods at IC Lecture 3: Regression.
Nonparametric Statistics
Lecture 11: Simple Linear Regression
Chapter 20 Linear and Multiple Regression
Regression Analysis AGEC 784.
AP Statistics Chapter 14 Section 1.
Basic Estimation Techniques
Inference for Regression
Essentials of Modern Business Statistics (7e)
CJT 765: Structural Equation Modeling
John Loucks St. Edward’s University . SLIDES . BY.
...Relax... 9/21/2018 ST3131, Lecture 3 ST5213 Semester II, 2000/2001
Basic Estimation Techniques
CHAPTER 29: Multiple Regression*
Nonparametric Statistics
CHAPTER 26: Inference for Regression
Linear Regression/Correlation
No notecard for this quiz!!
Linear regression Fitting a straight line to observations.
The greatest blessing in life is
Simple Linear Regression
Simple Linear Regression
Simple Linear Regression
Product moment correlation
Chapter 14 Inference for Regression
Inferences for Regression
Autoregressive and Growth Curve Models
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Correlation, Regression & Nested Models Rachael Bedford Mplus: Longitudinal Analysis Workshop 26/09/2017

Overview Victimization & Delinquency Scatterplots Correlation Simple Linear Regression Structural Equation Models Nested Models Testing Nested Models in Mplus

Victimization & Delinquency Can victimization lead to delinquency?

Association What is an association? Step 1: scatterplot Is there a relationship between the variables? Positive/negative relationship? Are there any outliers? Linear relationship: increased victimization relates to higher levels of delinquency Delinquency What is an association: Go up in one up in the other – symetrical association Cor is a standardised metric, which means that both variables have been transformed onto the same scale – r values show the strength of the correltion (low, medium, high) effect size Victimization

Correlation Step 2: correlation Correlation coefficient describes degree of relationship between two variables. Standardised metric: both variables transformed to same scale. R value: strength of the correlation Effect sizes Low ~0.3 Medium ~0.5 High ~0.8 Delinquency What is an association: Go up in one up in the other – symetrical association Cor is a standardised metric, which means that both variables have been transformed onto the same scale – r values show the strength of the correltion (low, medium, high) effect size Victimization

Regression Linear regression model: y = 0 + 1x +  Does one variable (i.e. victimization) predict the other variable (i.e. delinquency). Gives information about how a unit change in one variable will change the other. Linear regression model: y = 0 + 1x +  y dependent variable 0 intercept or constant: the value of y when x is 0 1 gradient or slope: the increase in y when x increases by one unit  is the error 0 + 1x is the equation for the predicted values of y which lie on a straight line and  is the difference between the observed and predicted values of y

Ordinary Least Squares Regression y = 0 + 1 x +  Error ε= yobs – ymodel Delinquency (Y) Assumptions Residuals should have: a normal distribution zero mean (linear relationship) constant variance 1 1 Mean delinquency score when victimization is 0 (i.e. not victimized) 0 Victimization (X)

SEM: Reminder Observed variable Latent factor Effects of one variable on another (e.g., regression) Covariance or correlation When the model only contains observed variables it is called path analysis.

Example 1: Victimization & Delinquency Question: Does victimization at 13 years predict delinquency at 15 years? Path diagram: Del 15 Victim 13 Regression: Y = Delinquency 15 yrs X = Victimization 13 yrs

Example 1: Mplus input Mplus code Regression: Y = Delinquency 15 yrs X = Victimization 13 yrs Analysis: Estimator = ML; Choose a model estimator: Here ML (maximum likelihood) OLS/MLR/others available Model: Del15 on vbull13; Output: Sampstat stand; Standardised sample statistics Regression

Example 1: Output (before results) Fit statistics (here, model is saturated) Do the means look reasonable? GOOD!

Example 1: Output (results) Main model results: Does victimization relate to later delinquency? What are the values for: intercept slope residuals…?

Practical 1 Mplus code Regression: Y = Delinquency 15 yrs X = Victimization 13 yrs Analysis: Estimator = ML; Choose a model estimator: Here ML (maximum likelihood) OLS/ML/others available Model: Del15 on vbull13; Output: Sampstat stand; Standardised sample statistics Regression

Model Fit Regression line has two parameters 0 + 1 (and error term) From these we can estimate delinquency based on victimization score A model is: An approximation A representation of the data A way to summarise many data points Important to assess how well our model ‘fits’ our data. i.e. do the 2 parameters in our model effectively represent the data? Summarise many point of data

Model Fit Look at the association Choose an appropriate model i.e. 2 parameters for simple linear relationship Check whether model fits the data Is there a relationship? Does a linear model fit? Need an extra parameter Delinquency Delinquency Summarise many point of data Victimization Victimization

Nested Models Two models are nested if one model contains all the terms in the other + one additional term. The larger model is the complete/full model The smaller is the reduced/restricted model By using model fit, nested models can be used to test whether a particular parameter is necessary. Imagine a model with one parameter per observation: The model perfectly fits the data. BUT would not be useful – doesn’t provide an approximation to extrapolate results. So we try a model with fewer (k-1) parameters. This reduced model will ALWAYS fit less perfectly but… Is the full model a significantly better fit? If no significant difference use reduced k-1 model What about k-2 parameters, k-3, etc. Until we reach k-x where fit is significantly worse.

Nested Models Can also usually think about this in terms of pathways. Full Model: Victimization 12 Delinquency 15 Victimization 13 An extra pathway in the model Reduced Model: Victimization 12 Delinquency 15 Victimization 13

Nested Models The relationship between victimization and delinquency might differ in girls and boys – we can test this with nested models Boys Girls Delinquency Delinquency If adding parameter increase the likelihood that the model best describes the data (is increase significant) Victimization Victimization

Nested Models 1) Fit a model allowing boys and girls to be different 2) Fit a model constraining them to be the same (i.e. show the same relationship between victimization & delinquency) Does the model with fewer parameters (which is…?) show significantly worse fit? If not then (from a statistical point of view) we can constrain boys and girls to be the same. If adding parameter increase the likelihood that the model best describes the data (is increase significant)

Nested Models: Comparing chi-sq Chi-squared difference test sometimes called a likelihood ratio test Test statistic follows a chi-squared distribution: Chi sq reduced model – chi sq full model DF = df reduced model – df full model Degrees of Freedom (DF) relate to the number of free parameters in the model Likelihood ratio Chi sq has diff dist depending on df If adding parameter increase the likelihood that the model best describes the data (is increase significant)

Example 2: Gender Del 15 Victim 13 Del 15 Victim 13 Question: Is the effect of victimization on delinquency the same in boys and girls? Females: Multiple groups Males: Y = Delinquency 15 yrs X = Victimization 13 yrs Females: Del 15 Victim 13 Males: Del 15 Victim 13

Example 2: Mplus input Question: Is effect of victimization on delinquency the same in boys & girls? Grouping are gender (1 = male, 2 = female) Model: Del15 on vbull13; Male: Female: Check chi sq model fit: Saturated model (i.e. 0 dfs) Full model (M1)

Example 2: Check fit statistics Constrained model Constrain relationship to be equal in males and females using syntax : (1) You can put anything in the brackets as long as it’s the same for both groups… Check Chi Squared fit statistics:

Example 2: Comparing model fit Check Chi Squared fit statistics: M2-M1 - 4.134 - 0 = 4.134 Check against chi square distribution 4.134 > 3.841 So… Model 2 is a significantly worse fit than model 1.

Practical 2 Question: Is effect of victimization on delinquency the same in boys & girls? Grouping are gender (1 = male, 2 = female) Constrained model Model: Del15 on vbull13; Male: Female: