Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.

Slides:



Advertisements
Similar presentations
OpenMx Frühling Rijsdijk.
Advertisements

1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
SEM PURPOSE Model phenomena from observed or theoretical stances
Bivariate analysis HGEN619 class 2007.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
General Structural Equation (LISREL) Models
Structural Equation Modeling
Structural Equation Modeling: An Overview P. Paxton.
Confirmatory Factor Analysis
Univariate Model Fitting
Intro to Quantitative Genetics HGEN502, 2011 Hermine H. Maes.
Outline 1) Objectives 2) Model representation 3) Assumptions 4) Data type requirement 5) Steps for solving problem 6) A hypothetical example Path Analysis.
Linear Regression and Binary Variables The independent variable does not necessarily need to be continuous. If the independent variable is binary (e.g.,
Structural Equation Modeling
Causal Modelling and Path Analysis. Some Notes on Causal Modelling and Path Analysis. “Path analysis is... superior to ordinary regression analysis since.
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
Multivariate Analysis Nick Martin, Hermine Maes TC21 March 2008 HGEN619 10/20/03.
Introduction to Multivariate Analysis Frühling Rijsdijk & Shaun Purcell Twin Workshop 2004.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Summarizing Variation Michael C Neale PhD Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
“Ghost Chasing”: Demystifying Latent Variables and SEM
Structural Equation Modeling
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
CHAPTER 30 Structural Equation Modeling From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach,
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
David M. Evans Sarah E. Medland Developmental Models in Genetic Research Wellcome Trust Centre for Human Genetics Oxford United Kingdom Twin Workshop Boulder.
REGRESSION AND CORRELATION
Path Analysis Frühling Rijsdijk SGDP Centre Institute of Psychiatry King’s College London, UK.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Structural Equation Modeling Intro to SEM Psy 524 Ainsworth.
Structural Equation Modeling Continued: Lecture 2 Psy 524 Ainsworth.
Structural Equation Models – Path Analysis
Path Analysis HGEN619 class Method of Path Analysis allows us to represent linear models for the relationship between variables in diagrammatic.
Karri Silventoinen University of Helsinki Osaka University.
Structural Equation Modeling (SEM) With Latent Variables James G. Anderson, Ph.D. Purdue University.
10 SEM is Based on the Analysis of Covariances! Why?Analysis of correlations represents loss of information. A B r = 0.86r = 0.50 illustration.
Institute of Psychiatry King’s College London, UK
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Chapter Three TWO-VARIABLEREGRESSION MODEL: THE PROBLEM OF ESTIMATION
Path Analysis and Structured Linear Equations Biologists in interested in complex phenomena Entails hypothesis testing –Deriving causal linkages between.
Gene-Environment Interaction & Correlation Danielle Dick & Danielle Posthuma Leuven 2008.
Univariate Analysis Hermine Maes TC21 March 2008.
Genetic Theory Pak Sham SGDP, IoP, London, UK. Theory Model Data Inference Experiment Formulation Interpretation.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Model building & assumptions Matt Keller, Sarah Medland, Hermine Maes TC21 March 2008.
Introduction to Genetic Theory
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Chapter 16 PATH ANALYSIS. Chapter 16 PATH ANALYSIS.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Invest. Ophthalmol. Vis. Sci ;57(1): doi: /iovs Figure Legend:
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Re-introduction to openMx
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Structural Equation Modeling
Univariate modeling Sarah Medland.
عنوان عنوان فرعی.
Pak Sham & Shaun Purcell Twin Workshop, March 2002
Structural Equation Modeling
Sarah Medland faculty/sarah/2018/Tuesday
Product moment correlation
Linear Panel Data Models
Structural Equation Modeling (SEM) With Latent Variables
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Multivariate Genetic Analysis: Introduction
Structural Equation Modeling
Presentation transcript:

Path Analysis Frühling Rijsdijk

Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance Algebra Predicted Var/Cov of the Model Path Tracing Rules System of Linear Equations Path Diagrams Observed Var/Cov of the Data Covariance Algebra Twin Model SEM model building

Method of Path Analysis Allows us to represent linear models for the relationship between variables in diagrammatic form, e.g. a genetic model; a factor model; a regression model Makes it easy to derive expectations for the variances and covariances of variables in terms of the parameters of the proposed linear model Permits easy translation into matrix formulation as used by programs such as Mx, OpenMx.

Conventions of Path Analysis I Squares or rectangles denote observed variables Circles or ellipses denote latent (unmeasured) variables Upper-case letters are used to denote variables Lower-case letters (or numeric values) are used to denote covariances or path coefficients Single-headed arrows or paths (–>) represent hypothesized causal relationships - where the variable at the tail is hypothesized to have a direct causal influence on the variable at the head a Y = aX + eE Y VXVX X E e V E =1

Conventions of Path Analysis II Double-headed arrows ( ) are used to represent a covariance between two variables, which may arise through common causes not represented in the model. Double-headed arrows may also be used to represent the variance of a variable. Y = aX + bZ + eE a Y VXVX X E e V E =1 Z VZVZ b r

Conventions of Path Analysis III Variables that do not receive causal input from any one variable in the diagram are referred to as independent, or predictor or exogenous variables. Variables that do, are referred to as dependent or endogenous variables. Only independent variables are connected by double- headed arrows. Single-headed arrows may be drawn from independent to dependent variables or from dependent variables to other dependent variables. a Y VXVX X E e V E =1 Z VZVZ b r

Conventions of Path Analysis IV Omission of a two-headed arrow between two independent variables implies the assumption that the covariance of those variables is zero Omission of a direct path from an independent (or dependent) variable to a dependent variable implies that there is no direct causal effect of the former on the latter variable

Path Tracing The covariance between any two variables is the sum of all legitimate chains connecting the variables The numerical value of a chain is the product of all traced path coefficients in it A legitimate chain is a path along arrows that follow 3 rules:

(i)Trace backward, then forward, or simply forward from one variable to another. NEVER forward then backward! Include double-headed arrows from the independent variables to itself. These variances will be 1 for latent variables (ii) Loops are not allowed, i.e. we can not trace twice through the same variable (iii)There is a maximum of one curved arrow per path. So, the double-headed arrow from the independent variable to itself is included, unless the chain includes another double-headed arrow (e.g. a correlation path)

Since the variance of a variable is the covariance of the variable with itself, the expected variance will be the sum of all paths from the variable to itself, which follow the path tracing rules The Variance

Cov AB = kl + mqn + mpl Cov BC = no Cov AC = mqo Var A = k 2 + m kpm Var B = l 2 + n 2 Var C = o 2 ABC DEF 1 p q k n o m l 1 1

Path Diagrams for the Classical Twin Model

Quantitative Genetic Theory There are two sources of Genetic influences: Additive (A) and non-additive or Dominance (D) There are two sources of environmental influences: Common or shared (C) and non-shared or unique (E) cd PHENOTYPE 1 1 DC a 1 A e 1 E P

In the preceding diagram… A, D, C, E are independent variables –A = Additive genetic influences –D = Non-additive genetic influences (i.e., dominance) –C = Shared environmental influences –E = Non-shared environmental influences –A, D, C, E have variances of 1 Phenotype is a dependent variable –P = phenotype; the measured variable a, d, c, e are parameter estimates

PTwin 1 E CA PTwin 2 A CE Model for MZ Pairs Reared Together 1 1 Note: a, c and e are the same cross twins e a ca e c

PTwin 1 E CA PTwin 2 A CE Note: a, c and e are also the same cross groups e a ca e c Model for DZ Pairs Reared Together

PTwin 1 E ec CA Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs) a

PTwin 1 E ec CA Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs) a

PTwin 1 E ec CA Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs) a

PTwin 1 E ec CA Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs) a*1*a = a 2 + a

PTwin 1 E ec CA Variance of Twin 1 AND Twin 2 (for MZ and DZ pairs) a*1*a = a 2 + c*1*c = c 2 e*1*e = e 2 + Total Variance = a 2 + c 2 + e 2 a

PTwin 1 E CA Covariance Twin 1-2: MZ pairs PTwin 2 A CE e a ca e c

PTwin 1 E CA Covariance Twin 1-2: MZ pairs PTwin 2 A CE e a ca e c

PTwin 1 E CA Covariance Twin 1-2: MZ pairs Total Covariance = a 2 + PTwin 2 A CE e a ca e c

PTwin 1 E CA Covariance Twin 1-2: MZ pairs Total Covariance = a 2 + c 2 PTwin 2 A CE e a ca e c

PTwin 1 E CA Covariance Twin 1-2: DZ pairs Total Covariance =.5a 2 + c 2 PTwin 2 A CE e a ca e c

Tw1Tw2 Tw1 Tw2 Tw1 Tw2 Tw1 Tw2 Predicted Var-Cov Matrices

PTwin 1 E DA PTwin 2 A DE ADE Model 1(MZ) / 0.25 (DZ) 1/.5 e a da e d

Tw1Tw2 Tw1 Tw2 Tw1 Tw2 Tw1 Tw2 Predicted Var-Cov Matrices

ACE or ADE Cov(mz) = a 2 + c 2 or a 2 + d 2 Cov(dz) = ½ a 2 + c 2 or ½ a 2 + ¼ d 2 V P = a 2 + c 2 + e 2 or a 2 + d 2 + e 2 3 unknown parameters (a, c, e or a, d, e), and only 3 distinct predictive statistics: Cov MZ, Cov DZ, Vp this model is just identified

The twin correlations indicate which of the two components is more likely to fit the data: Cor(mz) = a 2 + c 2 or a 2 + d 2 Cor(dz) = ½ a 2 + c 2 or ½ a 2 + ¼ d 2 If a 2 =.40, c 2 =.20 r mz = 0.60 r dz = 0.40 If a 2 =.40, d 2 =.20 r mz = 0.60 r dz = 0.25 Effects of C and D are confounded ADE ACE

ADCE: classical twin design + adoption data Cov(mz) = a 2 + d 2 + c 2 Cov(dz) = ½ a 2 + ¼ d 2 + c 2 Cov(adopSibs) = c 2 V P = a 2 + d 2 + c 2 + e 2 4 unknown parameters (a, c, d, e), and 4 distinct predictive statistics: Cov MZ, Cov DZ, Cov adopSibs, Vp this model is just identified

Path Tracing Rules are based on Covariance Algebra

Three Fundamental Covariance Algebra Rules Cov (aX,bY) = ab Cov(X,Y) Cov (X,Y+Z) = Cov (X,Y) + Cov (X,Z) Var (X) = Cov(X,X)

The variance of a dependent variable (Y) caused by independent variable A, is the squared regression coefficient multiplied by the variance of the independent variable Example 1 Y a Y = aA A 1

Example 2 Y a Y = aA A Z a Z = aA A 1 1.5

Summary Path Tracing and Covariance Algebra have the same aim : to work out the predicted Variances and Covariances of variables, given the specified model The Ultimate Goal is to fit Predicted Variances / Covariances to observed Variances / Covariances of the data in order to estimate model parameters : –regression coefficients, correlations