Fitting Univariate Models to Continuous and Categorical Data

Slides:



Advertisements
Similar presentations
Bivariate analysis HGEN619 class 2007.
Advertisements

Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Phenotypic factor analysis
Elizabeth Prom-Wormley & Hermine Maes
Univariate Model Fitting
Heritability – “the fraction of the total variation in a trait that is due to variation in genes.” (Freeman and Herron, 2007) V p – the total variance.
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
(Re)introduction to OpenMx Sarah Medland. Starting at the beginning  Opening R Gui – double click Unix/Terminal – type R  Closing R Gui – click on the.
Extended sibships Danielle Posthuma Kate Morley Files: \\danielle\ExtSibs.
Longitudinal Modeling Nathan, Lindon & Mike LongitudinalTwinAnalysis_MatrixRawCon.R GenEpiHelperFunctions.R jepq.txt.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
Genetic Dominance in Extended Pedigrees: Boulder, March 2008 Irene Rebollo Biological Psychology Department, Vrije Universiteit Netherlands Twin Register.
Univariate Analysis Hermine Maes TC19 March 2006.
Gene x Environment Interactions Brad Verhulst (With lots of help from slides written by Hermine and Liz) September 30, 2014.
Karri Silventoinen University of Helsinki Osaka University.
“Beyond Twins” Lindon Eaves, VIPBG, Richmond Boulder CO, March 2012.
Karri Silventoinen University of Helsinki Osaka University.
Karri Silventoinen University of Helsinki Osaka University.
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
 Go to Faculty/marleen/Boulder2012/Moderating_cov  Copy all files to your own directory  Go to Faculty/sanja/Boulder2012/Moderating_covariances _IQ_SES.
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
Introduction to OpenMx Sarah Medland. What is OpenMx? Free, Open-source, full–featured SEM package Software which runs on Windows, Mac OSX, and Linux.
Cholesky decomposition May 27th 2015 Helsinki, Finland E. Vuoksimaa.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Practical SCRIPT: F:\meike\2010\Multi_prac\MultivariateTwinAnalysis_MatrixRaw.r DATA: DHBQ_bs.dat.
Longitudinal Modeling Nathan Gillespie & Dorret Boomsma \\nathan\2008\Longitudinal neuro_f_chol.mx neuro_f_simplex.mx jepq6.dat.
Combined Linkage and Association in Mx Hermine Maes Kate Morley Dorret Boomsma Nick Martin Meike Bartels Boulder 2009.
Threshold Liability Models (Ordinal Data Analysis) Frühling Rijsdijk MRC SGDP Centre, Institute of Psychiatry, King’s College London Boulder Twin Workshop.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Univariate Analysis Hermine Maes TC21 March 2008.
Longitudinal Modeling Nathan & Lindon Template_Developmental_Twin_Continuous_Matrix.R Template_Developmental_Twin_Ordinal_Matrix.R jepq.txt GenEpiHelperFunctions.R.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
SEM with Measured Genotypes NIDA Workshop VIPBG, October 2012 Maes, H. H., Neale, M. C., Chen, X., Chen, J., Prescott, C. A., & Kendler, K. S. (2011).
Model building & assumptions Matt Keller, Sarah Medland, Hermine Maes TC21 March 2008.
More on thresholds Sarah Medland. A plug for OpenMx? Very few packages can handle ordinal data adequately… OpenMx can also be used for more than just.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Categorical Data HGEN
Extended Pedigrees HGEN619 class 2007.
Genetic simplex model: practical
Ordinal Data Sarah Medland.
Bivariate analysis HGEN619 class 2006.
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Re-introduction to openMx
Heterogeneity HGEN619 class 2007.
Univariate Analysis HGEN619 class 2006.
Path Analysis Danielle Dick Boulder 2008
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
Behavior Genetics The Study of Variation and Heredity
Structural Equation Modeling
Threshold Liability Models (Ordinal Data Analysis)
More on thresholds Sarah Medland.
Univariate modeling Sarah Medland.
Pak Sham & Shaun Purcell Twin Workshop, March 2002
(Re)introduction to Mx Sarah Medland
Longitudinal Modeling
Sarah Medland faculty/sarah/2018/Tuesday
More on thresholds Sarah Medland.
Univariate Modeling in umx
Lucía Colodro Conde Sarah Medland & Matt Keller
Bivariate Genetic Analysis Practical
Including covariates in your model
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Heritability of Subjective Well-Being in a Representative Sample
Rater Bias & Sibling Interaction Meike Bartels Boulder 2004
Presentation transcript:

Fitting Univariate Models to Continuous and Categorical Data September 9, 2014 Elizabeth Prom-Wormley & Hermine Maes ecpromwormle@vcu.edu 804-828-8154

Patterns of Twin Correlation ??? rMZ = 2rDZ Additive DZ twins on average share 50% of additive effects rMZ = rDZ Shared Environment rMZ > 2rDZ Additive & Dominance DZ twins on average share 25% of dominance effects rDZ > ½ rMZ Additive & “Shared Environment” A = 2(rMZ-rDZ) C = 2rDZ – rMZ E = 1- rMZ

BMI Twin Correlations A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ 0.30 0.78 A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ A = 0.96 C = -0.20 E = 0.22 ADE or ACE?

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

ADE Model- A Few Considerations Dominance (D) refers to non-additive genetic effects resulting from interactions between alleles at the same locus ADE models also include effects of interactions between different loci (epistasis) In addition to sharing on average 50% of their DNA, DZ twins also share about 25% of effects due to dominance

ADE Model Deconstructed Path Coefficients covA <- mxAlgebra( expression=a %*% t(a), name="A" ) covD <- mxAlgebra( expression=d %*% t(d), name=“D" ) covE <- mxAlgebra( expression=e %*% t(e), name="E" ) a 1 x 1 matrix d 1 x 1 matrix e 1 x 1 matrix

ADE Model Deconstructed Variance Components covA <- mxAlgebra( expression=a %*% t(a), name="A" ) covD <- mxAlgebra( expression=d %*% t(d), name=“D" ) covE <- mxAlgebra( expression=e %*% t(e), name="E" ) a aT * 1 x 1 matrix d dT * 1 x 1 matrix e eT * 1 x 1 matrix

ADE Model Deconstructed Means & Covariances meanG <- mxMatrix( type="Full", nrow=1, ncol=ntv, free=TRUE, values= 20, label="mean", name="expMean" ) covMZ <- mxAlgebra( expression= rbind( cbind (A+D+E , A+D), cbind(A+D , A+D+E)), name="expCovMZ" ) covDZ <- mxAlgebra( expression= rbind( cbind(A+D+E, 0.5%x%A+0.25%x%D), cbind(0.5%x%A+0.25%x%D , A+D+E)), name="expCovDZ" ) expMean expMean= 1 x 2 matrix T1 T2 T1 A+D+E T2 covMZ & covDZ= 2 x 2 matrix

ADE Model Deconstructed Means & Covariances covMZ <- mxAlgebra( expression= rbind( cbind(A+D+E , A+D), cbind(A+D , A+D+E)), name="expCovMZ" ) covDZ <- mxAlgebra( expression= rbind( cbind(A+D+E, 0.5%x%A+0.25%x%D), cbind(0.5%x%A+0.25%x%D, A+0.25%x%D+E)), name="expCovDZ" ) T2 T1 T2 T1 A+D+E 0.5A T2

ADE Model Deconstructed Means & Covariances

ADE Model Deconstructed 4 Parameters Estimated ExpMean Variance due to A Variance due to D Variance due to E

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

Testing Specific Questions Please Open twinADEcon.R ‘Full’ ADE Model Comparison against Saturated Nested Models AE Model Test significance of D E Model vs AE Model Test significance of A E Model vs ADE Model Test combined significance of A & D

AE Model Deconstructed New Object Containing AE Model Object containing Full Model AeModel <- mxModel( AdeFit, name="AE" ) The d path will not be estimated AeModel <- omxSetParameters( AeModel, labels=“d11", free=FALSE, values=0) AeFit <- mxRun(AeModel) Fixing the value of the c path to zero

Goodness of Fit Stats ep -2ll df AIC diff -2ll diff p Saturated ADE AE no! E

Goodness-of-Fit Statistics ep -2ll df AIC diff -2ll diff p Saturated 10 4055.93 1767 521.93 - ADE 4 4063.45 1773 517.45 7.52 6 0.28 AE 3 4067.66 1774 519.66 4.21 1 0.06 E 2 4591.79 1775 1041.79 528.3 0.00

What about the magnitudes of genetic and environmental contributions to BMI?

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

Estimated Values a d e c a2 d2 e2 c2 ADE - AE E

Estimated Values a d e c a2 d2 e2 ADE 0.57 0.54 0.41 - 0.37 0.22 AE 0.78 0.42 E 0.88 1.00

Potential Complications Assortative Mating Gene-Environment Correlation Gene-Environment Interaction Sex Limitation Gene-Age Interaction

Trying Again with Ordinal Data Open twinAceOrdS.R

Okay, so we have a sense of which model best explains the data Okay, so we have a sense of which model best explains the data...or do we?

BMI Twin Correlations A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ 0.30 0.78 A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ ADE or ACE?

ACE/ADE Conclusions- Problem Set 1 Due Tuesday, 9/16 at 5pm Modify twinAdeCon.R and modify to reflect an ACE model BMI (zygosity groups 1 and 3) Are additive genetic factors significant? Are dominance factors significant? Are specific environmental factors significant? Are shared environmental factors significant? Provide results (as tables), provide brief explanations and guide me through your tables Can also send script used to produce result

Goodness-of-Fit Statistics Model ep -2ll df AIC diff -2ll diff p Saturated ADE ACE

Estimated Values a d e c a2 d2 e2 c2 ADE AE ACE E

Thank You!