Multivariate Analysis Hermine Maes TC19 March 2006 HGEN619 10/20/03.

Slides:



Advertisements
Similar presentations
Multivariate Twin Analysis
Advertisements

1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Bivariate analysis HGEN619 class 2007.
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Using MX for SEM analysis. Using Lisrel Analysis of Reader Reliability in Essay Scoring Votaw's Data Tau-Equivalent Model DA NI=4 NO=126 LA ORIGPRT1 WRITCOPY.
Confirmatory Factor Analysis
Univariate Model Fitting
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
Factor analysis Caroline van Baal March 3 rd 2004, Boulder.
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
Multivariate Analysis Nick Martin, Hermine Maes TC21 March 2008 HGEN619 10/20/03.
Introduction to Multivariate Analysis Frühling Rijsdijk & Shaun Purcell Twin Workshop 2004.
Multivariate Genetic Analysis: Introduction(II) Frühling Rijsdijk & Shaun Purcell Wednesday March 6, 2002.
Summarizing Data Nick Martin, Hermine Maes TC21 March 2008.
Developmental models. Multivariate analysis choleski models factor models –y =  f + u genetic factor models –P j = h G j + c C j + e E j –common pathway.
Multiple raters March 7 th, 2002 Boulder, Colorado John Hewitt.
Longitudinal Modeling Nathan, Lindon & Mike LongitudinalTwinAnalysis_MatrixRawCon.R GenEpiHelperFunctions.R jepq.txt.
Heterogeneity Hermine Maes TC19 March Files to Copy to your Computer Faculty/hmaes/tc19/maes/heterogeneity  ozbmi.rec  ozbmi.dat  ozbmiysat(4)(5).mx.
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
Introduction to Linkage
Univariate Analysis in Mx Boulder, Group Structure Title Type: Data/ Calculation/ Constraint Reading Data Matrices Declaration Assigning Specifications/
Continuous heterogeneity Shaun Purcell Boulder Twin Workshop March 2004.
David M. Evans Sarah E. Medland Developmental Models in Genetic Research Wellcome Trust Centre for Human Genetics Oxford United Kingdom Twin Workshop Boulder.
Univariate Analysis Hermine Maes TC19 March 2006.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Introduction to Multivariate Genetic Analysis Kate Morley and Frühling Rijsdijk 21st Twin and Family Methodology Workshop, March 2008.
Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Multivariate Threshold Models Specification in Mx.
Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P.
Institute of Psychiatry King’s College London, UK
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Heterogeneity Hermine Maes TC21 March Files to Copy to your Computer Faculty/hmaes/tc19/maes/heterogeneity  ozbmi.rec  ozbmi.dat  ozbmiysat(4)(5).mx.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Power and Sample Size Boulder 2004 Benjamin Neale Shaun Purcell.
Univariate Analysis Hermine Maes TC21 March 2008.
Longitudinal Modeling Nathan & Lindon Template_Developmental_Twin_Continuous_Matrix.R Template_Developmental_Twin_Ordinal_Matrix.R jepq.txt GenEpiHelperFunctions.R.
ALISON BOWLING CONFIRMATORY FACTOR ANALYSIS. REVIEW OF EFA Exploratory Factor Analysis (EFA) Explores the data All measured variables are related to every.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
Continuous heterogeneity Danielle Dick & Sarah Medland Boulder Twin Workshop March 2006.
Frühling Rijsdijk & Kate Morley
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Linkage in Mx & Merlin Meike Bartels Kate Morley Hermine Maes Based on Posthuma et al., Boulder & Egmond.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Categorical Data HGEN
Genetic simplex model: practical
Intro to Mx HGEN619 class 2005.
Multivariate Analysis
Bivariate analysis HGEN619 class 2006.
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Heterogeneity HGEN619 class 2007.
Longitudinal Analysis
Univariate Analysis HGEN619 class 2006.
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Univariate modeling Sarah Medland.
(Re)introduction to Mx Sarah Medland
Longitudinal Modeling
Sarah Medland faculty/sarah/2018/Tuesday
\nathan\RaterBias.
Multivariate Genetic Analysis
Multivariate Genetic Analysis: Introduction
Rater Bias & Sibling Interaction Meike Bartels Boulder 2004
Presentation transcript:

Multivariate Analysis Hermine Maes TC19 March 2006 HGEN619 10/20/03

Files to Copy to your Computer Faculty/hmaes/tc19/maes/multivariate  *.rec  *.dat  *.mx  Multivariate.ppt

Multivariate Questions I Bivariate Analysis: What are the contributions of genetic and environmental factors to the covariance between two traits? Multivariate Analysis: What are the contributions of genetic and environmental factors to the covariance between more than two traits?

Phenotypic Cholesky F1

Phenotypic Cholesky F2

Phenotypic Cholesky

Cholesky Decomposition

Saturated Model Use Cholesky decomposition to estimate covariance matrix Fully saturated Model: Cov P = F*F’  F: Lower nvar nvar

Phenotypic Single Factor

Residual Variances

Factor Analysis Explain covariance by limited number of factors Exploratory / Confirmatory Model: Cov P = F*F’ + E*E’  F: Full nvar nfac  E: Diag nvar nvar Model: Cov P = F*I*F’ + E*E’

Twin Data

Genetic Single Factor

Common Environmental Single Factor

Specific Environmental Single Factor

Single [Common] Factor X: genetic  Full 4 x 1  Full nvar x nfac Y: shared environmental Z: specific environmental

Residuals partitioned in ACE

Residual Factors T: genetic U: shared environmental V: specific environmental  Diag 4 x 4  Diag nvar x nvar

Independent Pathway Model

IP Independent pathways  Biometric model  Different covariance structure for A, C and E

Independent Pathway I G1: Define matrices Calculation Begin Matrices; X full nvar nfac Free ! common factor genetic path coefficients Y full nvar nfac Free ! common factor shared environment paths Z full nvar nfac Free ! common factor unique environment paths T diag nvar nvar Free ! variable specific genetic paths U diag nvar nvar Free ! variable specific shared env paths V diag nvar nvar Free ! variable specific residual paths M full 1 nvar Free ! means End Matrices; Start … Begin Algebra; A= X*X' + T*T'; ! additive genetic variance components C= Y*Y' + U*U'; ! shared environment variance components E= Z*Z' + V*V'; ! nonshared environment variance components End Algebra; End indpath.mx

Independent Pathway II G2: MZ twins #include iqnlmz.dat Begin Matrices = Group 1; Means M | M ; Covariance A+C+E | A+C _ A+C | A+C+E ; Option Rsiduals End G3: DZ twins #include iqnldz.dat Begin Matrices= Group 1; H full 1 1 End Matrices; Matrix H.5 Means M | M ; Covariance A+C+E | _ | A+C+E ; Option Rsiduals End

Independent Pathway III G4: Calculate Standardised Solution Calculation Matrices = Group 1 I Iden nvar nvar End Matrices; Begin Algebra; R=A+C+E; ! total variance S=(\sqrt(I.R))~; ! diagonal matrix of standard deviations P=S*X_ S*Y_ S*Z; ! standardized estimates for common factors Q=S*T_ S*U_ S*V; ! standardized estimates for spec factors End Algebra; Labels Row P a1 a2 a3 a4 a5 a6 c1 c2 c3 c4 c5 c6 e1 e2 e3 e4 e5 e6 Labels Col P var1 var2 var3 var4 var5 var6 Labels Row Q as1 as2 as3 as4 as5 as6 cs1 cs2 cs3 cs4 cs5 cs6 es1 es2 es3 es4 es5 es6 Labels Col Q var1 var2 var3 var4 var5 var6 Options NDecimals=4 End

Practical Example Dataset: NL-IQ Study 6 WAIS-III IQ subtests  var1 = onvolledige tekeningen / picture completion  var2 = woordenschat / vocabulary  var3 = paren associeren / digit span  var4 = incidenteel leren / incidental learning  var5 = overeenkomsten / similarities  var6 = blokpatronen / block design N MZF: 27, DZF: 70

Dat Files iqnlmz.dat Data NInputvars=18 Missing=-1.00 Rectangular File=iqnl.rec Labels famid zygos age_t1 sex_t1 var1_t1 var2_t1 var3_t1 var4_t1 var5_t1 var6_t1 age_t2 sex_t2 var1_t2 var2_t2 var3_t2 var4_t2 var5_t2 var6_t2 Select if zygos < 3 ;!select mz's Select var1_t1 var2_t1 var3_t1 var4_t1 var5_t1 var6_t1 var1_t2 var2_t2 var3_t2 var4_t2 var5_t2 var6_t2 ; iqnldz.dat.... Select if zygos > 2 ;!select dz's....

Goodness-of-Fit Statistics -2LLdfP2P2 pAIC)P 2 dfp Saturated (iqnlsat2.mx) Cholesky (cholesky.mx) Independent Pathway Common Pathway

MATRIX M This is a FULL matrix of order 1 by MATRIX X This is a LOWER TRIANGULAR matrix of order 6 by MATRIX Y This is a LOWER TRIANGULAR matrix of order 6 by MATRIX Z This is a LOWER TRIANGULAR matrix of order 6 by

Goodness-of-Fit Statistics -2LLdfP2P2 pAIC)P 2 dfp Saturated Cholesky Independent Pathway Common Pathway

MATRIX M This is a FULL matrix of order 1 by MATRIX P This is a computed FULL matrix of order 18 by 6 [=S*X_S*Y_S*Z] VAR1 VAR2 VAR3 VAR4 VAR5 VAR6 A A A A A A C C C C C C E E E E E E

Independent Pathway Model

Path Diagram to Matrices Variance Component a2a2 c2c2 e2e2 Common Factors [X] 6 x 1 [Y] 6 x 1 [Z] 6 x 1 Residual Factors [T] 6 x 6 [U] 6 x 6 [V] 6 x 6 #define nvar 6 #define nfac 1

MATRIX M This is a FULL matrix of order 1 by MATRIX T This is a DIAGONAL matrix of order 6 by MATRIX U This is a DIAGONAL matrix of order 6 by MATRIX V This is a DIAGONAL matrix of order 6 by MATRIX X This is a FULL matrix of order 6 by MATRIX Y This is a FULL matrix of order 6 by MATRIX Z This is a FULL matrix of order 6 by

Goodness-of-Fit Statistics -2LLdfP2P2 pAIC)P 2 dfp Saturated Cholesky Independent Pathway Common Pathway

MATRIX M This is a FULL matrix of order 1 by MATRIX P 18 by 1 [=S*X_S*Y_S*Z] VAR1 A A A A A A C C C C C C E E E E E E MATRIX Q This is a computed FULL matrix of order 18 by 6 [=S*T_S*U_S*V] VAR1 VAR2 VAR3 VAR4 VAR5 VAR6 AS AS AS AS AS AS CS CS CS CS CS CS ES ES ES ES ES ES

Exercise I 1. Drop common factor for shared environment 2. Drop common factor for specific environment 3. Add second genetic common factor

Phenotypic Single Factor

Latent Phenotype

Twin Data

Factor on Latent Phenotype

Common Pathway Model

Common Pathway Model I G1: Define matrices Calculation Begin Matrices; X full nfac nfac Free ! latent factor genetic path coefficient Y full nfac nfac Free ! latent factor shared environment path Z full nfac nfac Free ! latent factor unique environment path T diag nvar nvar Free ! variable specific genetic paths U diag nvar nvar Free ! variable specific shared env paths V diag nvar nvar Free ! variable specific residual paths F full nvar nfac Free ! loadings of variables on latent factor I Iden 2 2 M full 1 nvar Free ! means End Matrices; Start.. Begin Algebra; A= F&(X*X') + T*T'; ! genetic variance components C= F&(Y*Y') + U*U'; ! shared environment variance components E= F&(Z*Z') + V*V'; ! nonshared environment variance components L= X*X' + Y*Y' + Z*Z'; ! variance of latent factor End Algebra; End

Common Pathway II G4: Constrain variance of latent factor to 1 Constraint Begin Matrices; L computed =L1 I unit 1 1 End Matrices; Constraint L = I ; End G5: Calculate Standardised Solution Calculation Matrices = Group 1 D Iden nvar nvar End Matrices; Begin Algebra; R=A+C+E; ! total variance S=(\sqrt(D.R))~; ! diagonal matrix of standard deviations P=S*F; ! standardized estimates for loadings on F Q=S*T_ S*U_ S*V; ! standardized estimates for specific factors End Algebra; Options NDecimals=4 End

CP Common pathway  Psychometric model  Same covariance structure for A, C and E

Common Pathway Model

Path Diagram to Matrices Variance Component a2a2 c2c2 e2e2 Common Factor [X] 1 x 1 [Y] 1 x 1 [Z] 1 x 1 [F] 6 x 1 Residual Factors [T] 6 x 6 [U] 6 x 6 [V] 6 x 6 #define nvar 6 #define nfac 1

MATRIX M This is a FULL matrix of order 1 by MATRIX T This is a DIAGONAL matrix of order 6 by MATRIX U This is a DIAGONAL matrix of order 6 by MATRIX V This is a DIAGONAL matrix of order 6 by MATRIX X This is a FULL matrix of order 1 by MATRIX Y This is a FULL matrix of order 1 by MATRIX Z This is a FULL matrix of order 1 by

Goodness-of-Fit Statistics -2LLdfP2P2 pAIC)P 2 dfp Saturated Cholesky Independent Pathway Common Pathway

MATRIX M This is a FULL matrix of order 1 by MATRIX P 6 by 1 [=S*F] 1 F F F F F F MATRIX Q This is a computed FULL matrix of order 18 by 6 [=S*T_S*U_S*V] AS AS AS AS AS AS CS CS CS CS CS CS ES ES ES ES ES ES

WAIS-III IQ Verbal IQ  var2 = woordenschat / vocabulary  var3 = paren associeren / digit span  var5 = overeenkomsten / similarities Performance IQ  var1 = onvolledige tekeningen / picture completion  var4 = incidenteel leren / incidental learning  var6 = blokpatronen / block design

Exercise II 1. Change to 2 Latent Phenotypes corresponding to Verbal IQ and Performance IQ

Independent Pathway Model  Biometric Factor Model  Loadings differ for genetic and environmental common factors Common Pathway Model  Psychometric Factor Model  Loadings equal for genetic and environmental common factor Summary

Pathway Model

Two Common Pathway Model

Two Independent CP Model

Two Reduced Indep CP Model

Common Pathway Model

Independent Pathway Model