Univariate Twin Analysis

Slides:



Advertisements
Similar presentations
15 The Genetic Basis of Complex Inheritance
Advertisements

Sex-limitation models Karri Silventoinen. Testing sex-specific genetic effect It is possible to test whether the size of genetic and environmental variations.
Bivariate analysis HGEN619 class 2007.
Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Elizabeth Prom-Wormley & Hermine Maes
Univariate Model Fitting
Sex-limitation Models Brad Verhulst, Elizabeth Prom-Wormley (Sarah, Hermine, and most of the rest of the faculty that has contributed bits and pieces to.
Quantitative genetics
Quantitative Genetics Theoretical justification Estimation of heritability –Family studies –Response to selection –Inbred strain comparisons Quantitative.
The Inheritance of Complex Traits
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
Extended sibships Danielle Posthuma Kate Morley Files: \\danielle\ExtSibs.
Heterogeneity Boulder, Heterogeneity Questions Are the contributions of genetic and environmental factors equal for different groups, such as sex,
Summarizing Data Nick Martin, Hermine Maes TC21 March 2008.
ACDE model and estimability Why can’t we estimate (co)variances due to A, C, D and E simultaneously in a standard twin design?
Heterogeneity Hermine Maes TC19 March Files to Copy to your Computer Faculty/hmaes/tc19/maes/heterogeneity  ozbmi.rec  ozbmi.dat  ozbmiysat(4)(5).mx.
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
Univariate Analysis Hermine Maes TC19 March 2006.
Gene x Environment Interactions Brad Verhulst (With lots of help from slides written by Hermine and Liz) September 30, 2014.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Path Analysis Frühling Rijsdijk. Biometrical Genetic Theory Aims of session:  Derivation of Predicted Var/Cov matrices Using: (1)Path Tracing Rules (2)Covariance.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Karri Silventoinen University of Helsinki Osaka University.
Karri Silventoinen University of Helsinki Osaka University.
Karri Silventoinen University of Helsinki Osaka University.
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
1 Phenotypic Variation Variation of a trait can be separated into genetic and environmental components Genotypic variance  g 2 = variation in phenotype.
Spousal Associations for Alcohol Dependence and Educational Attainment Andrew Williams University of North Carolina Support from NIH Grants: AA07728, AA11998,
Heterogeneity Hermine Maes TC21 March Files to Copy to your Computer Faculty/hmaes/tc19/maes/heterogeneity  ozbmi.rec  ozbmi.dat  ozbmiysat(4)(5).mx.
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Practical SCRIPT: F:\meike\2010\Multi_prac\MultivariateTwinAnalysis_MatrixRaw.r DATA: DHBQ_bs.dat.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Univariate Analysis Hermine Maes TC21 March 2008.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
Model building & assumptions Matt Keller, Sarah Medland, Hermine Maes TC21 March 2008.
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Introduction to Multivariate Genetic Analysis Danielle Posthuma & Meike Bartels.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Multivariate Genetic Analysis (Introduction) Frühling Rijsdijk Wednesday March 8, 2006.
Stats Methods at IC Lecture 3: Regression.
Categorical Data HGEN
University of Colorado at Boulder
Extended Pedigrees HGEN619 class 2007.
Genetic simplex model: practical
Bivariate analysis HGEN619 class 2006.
Introduction to Multivariate Genetic Analysis
Fitting Univariate Models to Continuous and Categorical Data
Re-introduction to openMx
Heterogeneity HGEN619 class 2007.
Univariate Analysis HGEN619 class 2006.
MRC SGDP Centre, Institute of Psychiatry, Psychology & Neuroscience
Path Analysis Danielle Dick Boulder 2008
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
Behavior Genetics The Study of Variation and Heredity
Structural Equation Modeling
Univariate modeling Sarah Medland.
Pak Sham & Shaun Purcell Twin Workshop, March 2002
(Re)introduction to Mx Sarah Medland
Sarah Medland & Nathan Gillespie
Sarah Medland faculty/sarah/2018/Tuesday
Lucía Colodro Conde Sarah Medland & Matt Keller
Simulating and Modeling Genetically Informative Data
Chapter 7 Beyond alleles: Quantitative Genetics
Power Calculation for QTL Association
BOULDER WORKSHOP STATISTICS REVIEWED: LIKELIHOOD MODELS
Heritability of Subjective Well-Being in a Representative Sample
Presentation transcript:

Univariate Twin Analysis OpenMx Tc26 2012 Hermine Maes, Elizabeth Prom-Wormley, Nick Martin

Overall Questions to be Answered Does a trait of interest cluster among related individuals? Can clustering be explained by genetic or environmental effects? What is the best way to explain the degree to which genetic and environmental effects affect a trait?

Family & Twin Study Designs Family Studies Classical Twin Studies Adoption Studies Extended Twin Studies

The Data Australian Twin Register 18-30 years old, males and females 569 total MZ pairs 351 total DZ pairs The Data Australian Twin Register 18-30 years old, males and females Work from this session will focus on Body Mass Index (weight/height2) in females only Sample size MZF = 534 complete pairs (zyg = 1) DZF = 328 complete pairs (zyg = 3)

A Quick Look at the Data

Classical Twin Studies Basic Background The Classical Twin Study (CTS) uses MZ and DZ twins reared together MZ twins share 100% of their genes DZ twins share on average 50% of their genes Expectation- Genetic factors are assumed to contribute to a phenotype when MZ twins are more similar than DZ twins

Classical Twin Study Assumptions MZ twins are genetically identical Equal Environments of MZ and DZ pairs

Basic Data Assumptions MZ and DZ twins are sampled from the same population, therefore we expect :- Equal means/variances in Twin 1 and Twin 2 Equal means/variances in MZ and DZ twins Further assumptions would need to be tested if we introduce male twins and opposite sex twin pairs

“Old Fashioned” Data Checking MZ DZ T1 T2 mean 21.35 21.34 21.45 21.46 variance 0.73 0.79 0.77 0.82 covariance(T1-T2) 0.59 0.25 Nice, but how can we actually be sure that these means and variances are truly the same?

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions (equal means & variances for twin 1/twin 2 and MZ/DZ pairs) Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ACE or ADE Models 3- Test ACE (ADE) submodels to identify and report significant genetic and environmental contributions AE or CE or E Only Models 10

Saturated Twin Model

Saturated Code Deconstructed Will not display but good for reference in understanding code mMZ1 mMZ2 mDZ1 mDZ2 mean MZ = 1 x 2 matrix mean DZ = 1 x 2 matrix

Saturated Code Deconstructed Will not display but good for reference in understanding code T1 T2 vMZ1 cMZ21 vMZ2 covMZ = 2 x 2 matrix T1 T2 vDZ1 cDZ21 vDZ2 covDZ = 2 x 2 matrix

Time to Play... with OpenMx Please Open the File twinSatCon.R

Estimated Values 10 Total Parameters Estimated Saturated Model mean MZ DZ cov 10 Total Parameters Estimated mMZ1, mMZ2, vMZ1,vMZ2,cMZ21 mDZ1, mDZ2, vDZ1,vDZ2,cDZ21 Standardize covariance matrices for twin pair correlations (covMZ & covDZ)

Estimated Values 10 Total Parameters Estimated Saturated Model mean MZ 21.34 21.35 DZ 21.45 21.46 cov 0.73 0.77 0.59 0.79 0.24 0.82 10 Total Parameters Estimated mMZ1, mMZ2, vMZ1,vMZ2,cMZ21 mDZ1, mDZ2, vDZ1,vDZ2,cDZ21 Standardize covariance matrices for twin pair correlations (covMZ & covDZ)

Fitting Nested Models Saturated Model likelihood of data without any constraints fitting as many means and (co)variances as possible Equality of means & variances by twin order test if mean of twin 1 = mean of twin 2 test if variance of twin 1 = variance of twin 2 Equality of means & variances by zygosity test if mean of MZ = mean of DZ test if variance of MZ = variance of DZ

Estimated Values T1 T2 Equate Means & Variances across Twin Order mean MZ DZ cov Equate Means Variances across Twin Order & Zygosity

Estimates T1 T2 Equate Means & Variances across Twin Order mean MZ 21.35 DZ 21.45 cov 0.76 0.79 0.59 0.24 Equate Means Variances across Twin Order & Zygosity 21.39 0.78 0.61 0.23

Stats No significant differences between saturated model ep -2ll df AIC diff -2ll diff p Saturated 10 4055.93 1767 521.93 mT1=mT2 8 4056 1769 518 0.07 2 0.97 mT1=mT2 & varT1=VarT2 6 4058.94 1771 516.94 3.01 4 0.56 Zyg MZ=DZ 4063.45 1773 517.45 7.52 0.28 No significant differences between saturated model and models where means/variances/covariances are equal by zygosity and between twins

Questions? This is all great, but we still don’t understand the genetic and environmental contributions to BMI

Patterns of Twin Correlation ??? rMZ = 2rDZ Additive DZ twins on average share 50% of additive effects rMZ = rDZ Shared Environment rMZ > 2rDZ Additive & Dominance DZ twins on average share 25% of dominance effects Additive & “Shared Environment” A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ

BMI Twin Correlations A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ 0.30 0.78 A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ A = 0.96 C = -0.20 E = 0.22 ADE or ACE?

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

ADE Model A Few Considerations Dominance refers to non-additive genetic effects resulting from interactions between alleles at the same locus ADE models also include effects of interactions between different loci (epistasis) In addition to sharing on average 50% of their DNA, DZ twins also share about 25% of effects due to dominance

ADE Model Deconstructed Path Coefficients 1 x 1 matrix d 1 x 1 matrix e 1 x 1 matrix

ADE Model Deconstructed Variance Components * aT 1 x 1 matrix d * dT 1 x 1 matrix e * eT 1 x 1 matrix

ADE Model Deconstructed Means & Covariances expMean expMean= 1 x 2 matrix T1 T2 T1 A+D+E T2 covMZ & covDZ= 2 x 2 matrix

ADE Model Deconstructed Means & Covariances

ADE Model Deconstructed Means & Covariances

ADE Model Deconstructed 4 Parameters Estimated ExpMean Variance due to A Variance due to D Variance due to E

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

Time to Play... with OpenMx Please Open the File twinACECon.R

Testing Specific Questions ‘Full’ ADE Model Comparison against Saturated Nested Models AE Model test significance of D E Model vs AE Model test significance of A E Model vs ADE Model test combined significance of A & D

AE Model Deconstructed New Object Containing AE Model Object containing Full Model AeModel <- mxModel( AdeFit, name="AE" ) The d path will not be estimated AeModel <- omxSetParameters( AeModel, labels="c11", free=FALSE, values=0) AeFit <- mxRun(AeModel) Fixing the value of the c path to zero

Goodness of Fit Stats ep -2ll df AIC diff -2ll diff p Saturated ADE AE no! E

Goodness-of-Fit Statistics ep -2ll df AIC diff -2ll diff p Saturated 10 4055.93 1767 521.93 - ADE 4 4063.45 1773 517.45 7.52 6 0.28 AE 3 4067.66 1774 519.66 4.21 1 0.06 E 2 4591.79 1775 1041.79 528.3 0.00

What about the magnitudes of genetic and environmental contributions to BMI?

Univariate Analysis A Roadmap 1- Use the data to test basic assumptions inherent to standard ACE (ADE) models Saturated Model 2- Estimate contributions of genetic and environmental effects on the total variance of a phenotype ADE or ACE Models 3- Test ADE (ACE) submodels to identify and report significant genetic and environmental contributions AE or E Only Models

Estimated Values a d e c a2 d2 e2 c2 ADE - AE E

Estimated Values a d e c a2 d2 e2 ADE 0.57 0.54 0.41 - 0.37 0.22 AE 0.78 0.42 E 0.88 1.00

Okay, so we have a sense of which model best explains the data Okay, so we have a sense of which model best explains the data...or do we?

BMI Twin Correlations A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ 0.30 0.78 A = 2(rMZ-rDZ) C = 2rDZ - rMZ E = 1- rMZ ADE or ACE?

Giving the ACE Model a Try (if there is time) I think I would like to have the class “help” me put the ADE model together and run. However, I do have code ready for them to work on.

Goodness-of-Fit Statistics Model ep -2ll df AIC diff -2ll diff p Saturated ADE ACE

Goodness-of-Fit Statistics Model ep -2ll df AIC diff -2ll diff p Saturated 10 4055.93 1767 521.93 - ADE 4 4063.45 1773 517.45 7.52 6 0.28 ACE 4067.66 521.66 11.73 0.07

Estimated Values a d e c a2 d2 e2 c2 ADE AE ACE E

Estimated Values a d e c a2 d2 e2 c2 ADE 0.57 0.54 0.41 - 0.37 0.22 AE 0.78 0.42 ACE 0.00 0.56 0.68 0.59 E 0.88 1.00

Conclusions BMI in young OZ females (age 18-30) additive genetic factors: highly significant dominance: borderline significant specific environmental factors: significant shared environmental factors: not

Potential Complications Assortative Mating Gene-Environment Correlation Gene-Environment Interaction Sex Limitation Gene-Age Interaction

Thank You!