Graphical Diagnostics for Multiple QTL Investigation Brian S

Slides:



Advertisements
Similar presentations
The genetic dissection of complex traits
Advertisements

VISG Polyploids Project Rod Ball & Phillip Wilcox Scion; Sammie Yilin Jia Plant and Food Research Gail Timmerman-Vaughan Plant and Food Research Nihal.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
QTL Mapping R. M. Sundaram.
Inferring Causal Phenotype Networks Elias Chaibub Neto & Brian S. Yandell UW-Madison June 2010 QTL 2: NetworksSeattle SISG: Yandell ©
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Statistical association of genotype and phenotype.
Professor Brian S. Yandell joint faculty appointment across colleges: –50% Horticulture (CALS) –50% Statistics (Letters & Sciences) Biometry Program –MS.
QTL 2: OverviewSeattle SISG: Yandell © Seattle Summer Institute 2010 Advanced QTL Brian S. Yandell, UW-Madison
Bayesian MCMC QTL mapping in outbred mice Andrew Morris, Binnaz Yalcin, Jan Fullerton, Angela Meesaq, Rob Deacon, Nick Rawlins and Jonathan Flint Wellcome.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Complex Traits Most neurobehavioral traits are complex Multifactorial
October 2007Jax Workshop © Brian S. Yandell1 Bayesian Model Selection for Multiple QTL Brian S. Yandell University of Wisconsin-Madison
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
BayesNCSU QTL II: Yandell © Bayesian Interval Mapping 1.what is Bayes? Bayes theorem? Bayesian QTL mapping Markov chain sampling18-25.
Yandell © June Bayesian analysis of microarray traits Arabidopsis Microarray Workshop Brian S. Yandell University of Wisconsin-Madison
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
August 9, 2001www.stat.wisc.edu/~yandell/statgen1 Smooth Collaboration in Statistical Genomics Hong Lan 1, Yi Lin 2, Fei Zou 2, Samuel T. Nadler 1, Jonathan.
Yandell © 2003JSM Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits Brian S. Yandell University of Wisconsin-Madison
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
Yandell © 2003Wisconsin Symposium III1 QTLs & Microarrays Overview Brian S. Yandell University of Wisconsin-Madison
Why you should know about experimental crosses. To save you from embarrassment.
What is a QTL? Quantitative trait locus (loci) Region of chromosome that contributes to variation in a quantitative trait Generally used to study “complex.
Efficient calculation of empirical p- values for genome wide linkage through weighted mixtures Sarah E Medland, Eric J Schmitt, Bradley T Webb, Po-Hsiu.
13 October 2004Statistics: Yandell © Inferring Genetic Architecture of Complex Biological Processes Brian S. Yandell 12, Christina Kendziorski 13,
ModelNCSU QTL II: Yandell © Model Selection for Multiple QTL 1.reality of multiple QTL3-8 2.selecting a class of QTL models comparing QTL models16-24.
I. Statistical Methods for Genome-Enabled Prediction of Complex Traits OUTLINE THE CHALLENGES OF PREDICTING COMPLEX TRAITS ORDINARY LEAST SQUARES (OLS)
Bayesian Variable Selection in Semiparametric Regression Modeling with Applications to Genetic Mappping Fei Zou Department of Biostatistics University.
Plant Microarray Course
EQTLs.
Quantile-based Permutation Thresholds for QTL Hotspots
Quantile-based Permutation Thresholds for QTL Hotspots
Professor Brian S. Yandell
Identifying QTLs in experimental crosses
Genetical Genomics in the Mouse
upstream vs. ORF binding and gene expression?
Mapping variation in growth in response to glucose concentration
Model Selection for Multiple QTL
Multiple Traits & Microarrays
Bayesian Model Selection for Multiple QTL
Bayesian Inference for QTLs in Inbred Lines I
Gene mapping in mice Karl W Broman Department of Biostatistics
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison.
Bayesian Model Selection for Quantitative Trait Loci with Markov chain Monte Carlo in Experimental Crosses Brian S. Yandell University of Wisconsin-Madison.
Inferring Genetic Architecture of Complex Biological Processes Brian S
4. modern high throughput biology
Inferring Causal Phenotype Networks
Statistical issues in QTL mapping in mice
Mapping Quantitative Trait Loci
High Throughput Gene Mapping Brian S. Yandell
Bayesian Interval Mapping
Brian S. Yandell University of Wisconsin-Madison 26 February 2003
Bayesian Model Selection for Multiple QTL
Genome-wide Association Studies
Bayesian Interval Mapping
Inferring Causal Phenotype Networks Driven by Expression Gene Mapping
QTL Studies with Microarray Data
Seattle SISG: Yandell © 2007
Seattle SISG: Yandell © 2008
examples in detail days to flower for Brassica napus (plant) (n = 108)
Inferring Genetic Architecture of Complex Biological Processes Brian S
Seattle SISG: Yandell © 2006
Note on Outbred Studies
Brian S. Yandell University of Wisconsin-Madison
Seattle SISG: Yandell © 2009
Lecture 9: QTL Mapping II: Outbred Populations
Bayesian Interval Mapping
GWAS-eQTL signal colocalisation methods
eQTL Tools a collaboration in progress
A: QTL analysis of genotyping data on chromosome 12 revealed a locus linked to glucose levels in F2 mice at 6 months of age at marker D12Mit231 with a.
Presentation transcript:

Graphical Diagnostics for Multiple QTL Investigation Brian S Graphical Diagnostics for Multiple QTL Investigation Brian S. Yandell University of Wisconsin-Madison www.stat.wisc.edu/~yandell/statgen studying diabetes with microarrays taking a multiple QTL approach handling high throughput phenotypes designing for expensive phenotypes 6-9 June 2004 CTC: Yandell © 2004

glucose insulin 6-9 June 2004 CTC: Yandell © 2004 (courtesy AD Attie)

1. studying diabetes in an F2 segregating cross of inbred lines B6.ob x BTBR.ob  F1  F2 selected mice with ob/ob alleles at leptin gene (chr 6) measured and mapped body weight, insulin, glucose at various ages (Stoehr et al. 2000 Diabetes) sacrificed at 14 weeks, tissues preserved gene expression data Affymetrix microarrays on parental strains, F1 (Nadler et al. 2000 PNAS; Ntambi et al. 2002 PNAS) RT-PCR for a few mRNA on 108 F2 mice liver tissues (Lan et al. 2003 Diabetes; Lan et al. 2003 Genetics) Affymetrix microarrays on 60 F2 mice liver tissues design (Jin et al. 2004 Genetics tent. accept) analysis (work in prep.) 6-9 June 2004 CTC: Yandell © 2004

mRNA expression as phenotype: interval mapping for SCD1 is complicated 6-9 June 2004 CTC: Yandell © 2004

Pareto diagram of QTL effects major QTL on linkage map (modifiers) minor QTL major QTL 3 polygenes 2 4 5 1 6-9 June 2004 CTC: Yandell © 2004

2. taking a multiple QTL approach improve statistical power, precision increase number of QTL detected better estimates of loci: less bias, smaller intervals improve inference of complex genetic architecture patterns and individual elements of epistasis appropriate estimates of means, variances, covariances asymptotically unbiased, efficient assess relative contributions of different QTL improve estimates of genotypic values less bias (more accurate) and smaller variance (more precise) mean squared error = MSE = (bias)2 + variance 6-9 June 2004 CTC: Yandell © 2004

comparing QTL models balance model fit with model "complexity“ want best fit (maximum likelihood or posterior) without too complicated a model information criteria or Bayes factor quantifies the balance Bayes information criteria (BIC) for classical approach Bayes factors (BF) for Bayesian approach find “better” models avoid selection bias (see Broman 2001) QTL of modest effect only detected sometimes genotypic effects biased upwards when detected stochastic QTL detection avoid sharp in/out dichotomy average over better models 6-9 June 2004 CTC: Yandell © 2004

QTL Bayes factors BF = posterior odds / prior odds BF equivalent to BIC simple comparison: 1 vs 2 QTL same as LOD test general comparison of models want Bayes factor >> 1 m = number of QTL indexes model complexity genetic architecture also important 6-9 June 2004 CTC: Yandell © 2004

Bayesian model assessment: number of QTL for SCD1 6-9 June 2004 CTC: Yandell © 2004

Bayesian LOD and h2 for SCD1 (summaries from R/bim) 6-9 June 2004 CTC: Yandell © 2004

Bayesian model assessment genetic architecture: chromosome pattern 6-9 June 2004 CTC: Yandell © 2004

trans-acting QTL for SCD1 multiple QTL Bayesian model averaging additive QTL? dominant QTL? 6-9 June 2004 CTC: Yandell © 2004

2-D scan: assumes only 2 QTL (scantwo with HK method, from R/qtl) epistasis LOD peaks joint LOD peaks 6-9 June 2004 CTC: Yandell © 2004

sub-peaks can be easily overlooked 6-9 June 2004 CTC: Yandell © 2004

6-9 June 2004 CTC: Yandell © 2004

3. handling high throughput dilemma want to focus on gene expression network ideally capture functional group in a few dimensions allow for complicated genetic architecture may have multiple loci influencing expression quick interval mapping assessment may be misleading many genes with epistasis affect coordinated fashion? focus gene mapping using dimension reduction initial screening using EB arrays to 2500+ mRNA identify 85 functional groups from 1500+ mRNA model selection for groups with stronger PC signals 6-9 June 2004 CTC: Yandell © 2004

hierarchical model for expression phenotypes (EB arrays: Christina Kendziorski) mRNA phenotype models given genotypic mean GQ common prior on GQ across all mRNA (use empirical Bayes to estimate prior) 6-9 June 2004 CTC: Yandell © 2004

EE: GQQ= GQq f(Y|EE) DE: GQQ  GQq f(Y|DE) For every mRNA transcript, two possible patterns (DE, EE) no QTL present EE: GQQ= GQq f(Y|EE) QTL present DE: GQQ  GQq f(Y|DE) Empirical Bayes methods (EB arrays) make use all of the data to make mRNA-specific inferences. 6-9 June 2004 CTC: Yandell © 2004

hierarchical model across expression phenotypes (Christina Kendziorski) vector of mRNA phenotypes organized by QTL genotype Y = (Y1,…,Yn) = (YQQ, YQq, Yqq) Y ~ f( Y |  ) if no QTL present Y ~ f(YQQ|GQQ) f(YQq|GQq) f(Yqq|Gqq) if QTL present marginal for phenotype across possible genotypic means Y ~ f0(Y) =  f() f(Y|) d if no QTL present Y ~ f1(Y) = f0(YQQ) f0(YQq) f0(Yqq) if QTL present mixture across possible patterns of expression Y ~ p0 f0(Y) + p1 f1(Y) p1 = prior probability of QTL present (could allow more possibilities—gene action, multiple QTL) 6-9 June 2004 CTC: Yandell © 2004

PC across microarray functional groups Affy chips on 60 mice ~40,000 mRNA 2500+ mRNA show DE (via EB arrays with marker regression) 1500+ organized in 85 functional groups 2-35 mRNA / group which are interesting? examine PC1, PC2 circle size = # unique mRNA 6-9 June 2004 CTC: Yandell © 2004

PC for two correlated mRNA 6-9 June 2004 CTC: Yandell © 2004

focus on translation machinery (EIF) 6-9 June 2004 CTC: Yandell © 2004

6-9 June 2004 CTC: Yandell © 2004

blue bars at 1%, 5%; width proportional to group size how well does PC1 do? lod peaks for 2 QTL at best pair of chr data (red) vs. 500 permutations (boxplots) blue bars at 1%, 5%; width proportional to group size 6-9 June 2004 CTC: Yandell © 2004

PC and DA for 1500+ mRNA traits PC shows little relation to genotypes DA based on best fit with marker pair D4Mit17 and D15Mit63 6-9 June 2004 CTC: Yandell © 2004

2-marker regression for DA1 on chr 4 & 15 across 1500+ mRNA traits (20% missing genotypes) 6-9 June 2004 CTC: Yandell © 2004

DA for selected chromosomes (mask pairs below p-value = 10–8) –log10(p-value) 6-9 June 2004 CTC: Yandell © 2004

4. designing for expensive phenotypes (Jin et al. 2004) microarray analysis ~ $1000 per mouse could only afford to assay 60 of 108 in panel wanted to preserve power to detect QTL selective phenotyping identify set of key markers framework map across subset of genome or key regions identified in previous studies chr 2,4,5,9,16,19 for physiological traits in diabetes/obesity study genotype all individuals in panel at markers select subset for phenotyping based on genotype interval map with no bias 6-9 June 2004 CTC: Yandell © 2004

simulated LOD profiles with 3 QTL on 2 chr map position 6-9 June 2004 CTC: Yandell © 2004

sensitivity = pr( detect QTL | QTL is real ) comparison of different selection methods improved power over random sample sensitivity = pr( detect QTL | QTL is real ) up to 80% sensitivity of full panel best with few markers near QTL genome-wide selection better than random sample Number of chromosomes examined 6-9 June 2004 CTC: Yandell © 2004

is this relevant to large QTL studies? selectively phenotype 50-75% of F2 mapping panel may capture most effects 1:2:1 F2 allele ratio of genotypes A:H:B 1:0:1 best for additive effects (50%) 1:1:1 best for general effects (75%) with little loss of power and dramatic reduction in cost two-stage selective phenotyping? genotype & phenotype subset of 100-300 could selectively phenotype using whole genome QTL map to identify key genomic regions selectively phenotype subset using key regions 6-9 June 2004 CTC: Yandell © 2004

contact information & resources email: byandell@wisc.edu web: www.stat.wisc.edu/~yandell/statgen QTL & microarray resources references, software, people R/bim freely available download R from cran.r-project.org for your system (Mac, Windows, Linux) Packages... Install package(s) from CRAN... qtl Packages... Install package(s) from Bioconductor... bim thanks: students: Chunfang “Amy” Jin, Fei Zou, Pat Gaffney, Jaya Satagopan, Meng Chen (UW Statistics) faculty/staff: Alan Attie, Hong Lan (UW Biochemistry); Michael Newton, Christina Kendziorski, Jason Fine (UW Biostatistics); Gary Churchill, Hao Wu (Jackson Labs) USDA/CSREES, NIH/NIDDK 6-9 June 2004 CTC: Yandell © 2004