Attention Problems – SNP association Dorret Boomsma Toos van Beijsterveldt Michel Nivard.

Slides:



Advertisements
Similar presentations
Bivariate analysis HGEN619 class 2007.
Advertisements

Fitting Bivariate Models October 21, 2014 Elizabeth Prom-Wormley & Hermine Maes
Univariate Model Fitting
Multivariate Mx Exercise D Posthuma Files: \\danielle\Multivariate.
The Inheritance of Complex Traits
Biometrical genetics Manuel Ferreira Shaun Purcell Pak Sham Boulder Introductory Course 2006.
Biometrical genetics Manuel Ferreira Shaun Purcell Pak Sham Boulder Introductory Course 2006.
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
Genetic Theory Manuel AR Ferreira Egmond, 2007 Massachusetts General Hospital Harvard Medical School Boston.
Path Analysis Danielle Dick Boulder Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Developmental models. Multivariate analysis choleski models factor models –y =  f + u genetic factor models –P j = h G j + c C j + e E j –common pathway.
Introduction to Linkage
Childhood Dysregulation is Associated with Brain-Derived Neurotrophic Factor and the Serotonin 2A Receptor Rachel K. McEntee, BA 1, Robert R. Althoff,
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
David M. Evans Sarah E. Medland Developmental Models in Genetic Research Wellcome Trust Centre for Human Genetics Oxford United Kingdom Twin Workshop Boulder.
Heterogeneity II Danielle Dick, Tim York, Marleen de Moor, Dorret Boomsma Boulder Twin Workshop March 2010.
Gene x Environment Interactions Brad Verhulst (With lots of help from slides written by Hermine and Liz) September 30, 2014.
Regressing SNPs on a latent variable Michel Nivard & Nick Martin.
Mx Practical TC18, 2005 Dorret Boomsma, Nick Martin, Hermine H. Maes.
Raw data analysis S. Purcell & M. C. Neale Twin Workshop, IBG Colorado, March 2002.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Linkage Analysis in Merlin
Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić (BioPsy VU) Boulder Twin Workshop March 4, 2014 Based on PPTs.
 Go to Faculty/marleen/Boulder2012/Moderating_cov  Copy all files to your own directory  Go to Faculty/sanja/Boulder2012/Moderating_covariances _IQ_SES.
Introduction to Multivariate Genetic Analysis (2) Marleen de Moor, Kees-Jan Kan & Nick Martin March 7, 20121M. de Moor, Twin Workshop Boulder.
Extending Simplex model to model Ph  E transmission JANNEKE m. de kort & C.V. DolAn Contact:
Univariate modeling Sarah Medland. Starting at the beginning… Data preparation – The algebra style used in Mx expects 1 line per case/family – (Almost)
Longitudinal Modeling Nathan Gillespie & Dorret Boomsma \\nathan\2008\Longitudinal neuro_f_chol.mx neuro_f_simplex.mx jepq6.dat.
The importance of the “Means Model” in Mx for modeling regression and association Dorret Boomsma, Nick Martin Boulder 2008.
Gene-Environment Interaction & Correlation Danielle Dick & Danielle Posthuma Leuven 2008.
Combined Linkage and Association in Mx Hermine Maes Kate Morley Dorret Boomsma Nick Martin Meike Bartels Boulder 2009.
Linkage and association Sarah Medland. Genotypic similarity between relatives IBS Alleles shared Identical By State “look the same”, may have the same.
Mx modeling of methylation data: twin correlations [means, SD, correlation] ACE / ADE latent factor model regression [sex and age] genetic association.
Mx Practical TC20, 2007 Hermine H. Maes Nick Martin, Dorret Boomsma.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Continuous heterogeneity Danielle Dick & Sarah Medland Boulder Twin Workshop March 2006.
Model building & assumptions Matt Keller, Sarah Medland, Hermine Maes TC21 March 2008.
A simple method to localise pleiotropic QTL using univariate linkage analyses of correlated traits Manuel Ferreira Peter Visscher Nick Martin David Duffy.
Welcome  Log on using the username and password you received at registration  Copy the folder: F:/sarah/mon-morning To your H drive.
Biometrical Genetics Shaun Purcell Twin Workshop, March 2004.
Developmental Models/ Longitudinal Data Analysis Danielle Dick & Nathan Gillespie Boulder, March 2006.
QTL Mapping Using Mx Michael C Neale Virginia Institute for Psychiatric and Behavioral Genetics Virginia Commonwealth University.
March 7, 2012M. de Moor, Twin Workshop Boulder1 Copy files Go to Faculty\marleen\Boulder2012\Multivariate Copy all files to your own directory Go to Faculty\kees\Boulder2012\Multivariate.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
Date of download: 11/12/2016 Copyright © 2016 American Medical Association. All rights reserved. From: Influence of Child Abuse on Adult DepressionModeration.
Extended Pedigrees HGEN619 class 2007.
Bivariate analysis HGEN619 class 2006.
Univariate Twin Analysis
Introduction to Multivariate Genetic Analysis
Re-introduction to openMx
Gene-Environment Interaction & Correlation
Heterogeneity HGEN619 class 2007.
Path Analysis Danielle Dick Boulder 2008
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
Univariate modeling Sarah Medland.
GxE and GxG interactions
Correlation for a pair of relatives
Biometrical model and introduction to genetic analysis
Why general modeling framework?
Heterogeneity Danielle Dick, Hermine Maes,
Pak Sham & Shaun Purcell Twin Workshop, March 2002
(Re)introduction to Mx Sarah Medland
Sarah Medland & Nathan Gillespie
Exercise: Effect of the IL6R gene on IL-6R concentration
Sarah Medland faculty/sarah/2018/Tuesday
Getting more from multivariate data: Multiple raters (or ratings)
Lucía Colodro Conde Sarah Medland & Matt Keller
Multivariate Genetic Analysis: Introduction
Presentation transcript:

Attention Problems – SNP association Dorret Boomsma Toos van Beijsterveldt Michel Nivard

SNP=single nucleotide polymorphism Test of association between SNP and trait = test of genetic association

Simple biometrical system: One SNP with two alleles (di-allelic): A and a; with frequencies p and q; (p+q = 1). Three genotypes: AA, Aa, aa; with frequencies p 2, 2pq and q 2 ; (p 2 + 2pq + q 2 = 1). AA, Aa and aa are coded as 0, 1, 2 (N of copies of e.g. a)

model aaAA a Aa d -a m With the 0,1,2 coding we assume that d = trait

Y = intercept + β1*SNP + β2*Sex + residual ADHD score = μ + β1*SNP + β2*Sex + ε SNP and Sex are observed and are treated as ‘definition variables’ We do not model their covariance structure, but only their effect on phenotypes

SNP1 AAP1 β Test of β = 0 gives a test of the significance of the SNP Are Attention Problems influenced by measured genotypes? The value of the SNP (0,1,2) MUST be present in the data file for each subject

SNP1 AAP1AAP2 SNP2 β1β1β1β1 Attention Problems in twin1 and twin2 are correlated (as a function of zygosity) Sex2 β2β2 Sex1 β2β2 r

Data file Note: - missing values for phenotypes are “.” - missing values for definition variables are -9 (!! Take care that Ss with missing definition variables also have missing phenotypes)

Genotyping in 4 candidate genes: *mono-aminergic system: -serotonin receptors (HTR) 2A (HTR2A) -catechol-O-methyltransferase (COMT) -tryptophane hydroxylase type 2 (TPH2) *neurogenesis: -brain derived neurotrophic factor (BDNF)

GenesSNPsMAF p-value HWE BDNFrs A<G15.6Ns rs C<G45.1Ns rs6265 T<C22.4Ns rs G<A24.8Ns rs A<G9.2Ns rs T<C40.6Ns rs A<C17.6Ns rs C<T31.6Ns COMTrs4680G<A43.8Ns HTR2Ars6311 T<C42.0Ns rs6314 A<G8.6Ns rs6313 A<G

COMT: Catechol-O-methyltransferase COMT is involved in the inactivation of the catecholamine neurotransmitters (dopamine, epinephrine, and norepinephrine). A functional single-nucleotide polymorphism (a common normal variant) of COMT results in a valine to methionine mutation at position 158 (Val158Met) [rs4680 ]. HTR2A 5-hydroxytryptamine (serotonin) receptor 2A This gene encodes one of the receptors for serotonin, a neurotransmitter with many roles

BDNF (Brain-derived neurotrophic factor) LD plot of D’ between the SNPs LD: linkage disequilibrium is the non-random association of alleles at two or more loci

TPH2rs G<T14.9Ns rs G<A42.0Ns rs T<C7.3Ns rs C<T43.2Ns rs C<T rs T<G25.4Ns rs A<G44.4Ns rs T<C41.7Ns rs A<G36.9Ns rs G<C41.6Ns rs A<G44.2Ns rs A<G48.7Ns rs C<T26.5Ns rs T<G17.7Ns

LD plot of TPH2 indicating D’ between the SNPs TPH2 Tryptophan hydroxylase is the rate- limiting enzyme in the synthesis of serotonin

SNP1 AAP1AAP2 SNP2 Attention Problems in twin1 and twin2 are correlated (as a function of zygosity) β1β1β1β1 Test of β = 0 gives the significance of the SNP Parameters to be estimated: -Intercept (T1 = T2) -Regression SNP (T1 = T2) -Regression Sex (T1=T2) -MZ and DZ covariance -Variance of AP Sex1Sex2 β2β2 β2β2

Exercise Fit the association model for 1 SNP per run (we consider 5 SNPs, script is for BDNF - SNP). Try to run for the other 4 SNPs. Test the significance of the regression coefficients (by comparing constrained and free model).

Univariate means modeling practical

Univariate mean moderation Copy the folder univariate means moderation to your drive (its in faculty/Michel) Open the R project in R studio. It is a five group script. (MZM,DZM,MZF,DZM,DOS) First we read in the data, run the script till line 20. Then we run “universal” matrices, these are the same in each group (till line 32)

Univariate mean moderation On line 41,65,91,116 and 142 we rad in the SNP rs4680 for twin 1 and twin 2. In openMX we tell the software to read actual data by adding the prefix ”data.” to the labels. Change the snp id into one of the following: rs6265, rs4680, rs6314, rs , rs Now run the model untill the mxRun command around line: 172.

Univariate mean moderation Around line 180 you are required to drop the SNP. Reset the starting value. What should this be according to you guys? Also reset the parameter to fixed!

Results rs6265 rs4680 rs6314 rs rs

-Genotyping was done in twin pairs (related) -Phenotypes : ratings by both parents (bivariate) -Parental ratings at ages 3, 7, 10, 12 (longitudinal) -Not all children reached at 12 yet (missing data)

Factorial association model for longitudinal Attention Problems in children Circle = latent (not observed) individual score; square / triangle= observed score; arrow = regression; double headed arrow = correlation

Implementation in Mx Factor loadings and correlations among latent phenotypes were obtained from running the model in a larger dataset of > 32,000 twins from 16,169 families, who participated at least once: 2,436 MZM, 2,856 DZM, 2,742 MZF, 2,556 DZF, 5,602 (DOS) twin pairs

AgeRaterFactor LoadingFactor loading Residual rMZ (residual) rDZ (residual) 3Mother Father Mother Father Mother Father Mother Father

Factorial association model : 16 phenotypes (2 twins, 2 raters, 4 time points) Parameters to be estimated: effect of SNP, effect of sex /age /rater, grand mean, twin correlations (for MZ and DZ twins) β λ SNP is 0,1, 2

Exercise 1 Fit the Factorial association model for 1 SNP per run (consider one of the 5 SNPs). 2 Fit the Factorial association model for all 5 SNPs simultaneously.

The factor model Use multivariate approaches to model all phenotypic data (all time points / all raters / all indicators) and do not force multivariate data into a single sum score. Advantage: increase in power Disadvantage: no standard GWAS software

Increase in statistical power Ferreira MA, Purcell SM. A multivariate test of association. A multivariate test of association. Bioinformatics. 2009, 1;25(1):132-3 (intercorrelations among phenotypes equal) Medland SE, Neale MC. An integrated phenomic approach to multivariate allelic association. Eur J Hum Genet (2):233-9 (factor models)An integrated phenomic approach to multivariate allelic association. van der Sluis S, Verhage M, Posthuma D, Dolan CV. Phenotypic complexity, measurement bias, and poor phenotypic resolution contribute to the missing heritability problem in genetic association studies. PLoS One ;5(11):e13929 (measurement invariance)Phenotypic complexity, measurement bias, and poor phenotypic resolution contribute to the missing heritability problem in genetic association studies. Minica CC, Boomsma DI, van der Sluis S, Dolan CV. Genetic association in multivariate phenotypic data: power in five models. Twin Res Hum Genet. 2010, 13(6): (also includes longitudinal simplex models)Genetic association in multivariate phenotypic data: power in five models.

MoFaMoFaMoFaMoFa iiii A = additive genes C = common environment E = unique environment P = latent phenotype Parameter in red = 1 Mo = mother rating Fa = mother rating Parameters to be estimated: Loading innovation Transmission A,C, E loadings