Yandell © June 20051 Bayesian analysis of microarray traits Arabidopsis Microarray Workshop Brian S. Yandell University of Wisconsin-Madison www.stat.wisc.edu/~yandell/statgen.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

The genetic dissection of complex traits
Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Introduction Materials and methods SUBJECTS : Balb/cJ and C57BL/6J inbred mouse strains, and inbred fruit fly strains number 11 and 70 from the recombinant.
Experimental crosses. Inbred Strain Cross Backcross.
12 Jan 2006Hort Retreat © Brian S Yandell1 Genetics, Microarrays & Evolution: Issues as a Statistician (sans formula) Brian S. Yandell Horticulture, Statistics.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
QTL Mapping R. M. Sundaram.
Inferring Causal Phenotype Networks Elias Chaibub Neto & Brian S. Yandell UW-Madison June 2010 QTL 2: NetworksSeattle SISG: Yandell ©
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Multiple Correlated Traits Pleiotropy vs. close linkage Analysis of covariance – Regress one trait on another before QTL search Classic GxE analysis Formal.
Quantitative Genetics
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques Min Wenwen
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Professor Brian S. Yandell joint faculty appointment across colleges: –50% Horticulture (CALS) –50% Statistics (Letters & Sciences) Biometry Program –MS.
QTL 2: OverviewSeattle SISG: Yandell © Seattle Summer Institute 2010 Advanced QTL Brian S. Yandell, UW-Madison
Human Chromosomes Male Xy X y Female XX X XX Xy Daughter Son.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
October 2005Jax Workshop © Brian S. Yandell1 Bayesian Model Selection for Multiple QTL Brian S. Yandell University of Wisconsin-Madison
Who was Mendel? Mendel – first to gather evidence of patterns by which parents transmit genes to offspring.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
BayesNCSU QTL II: Yandell © Bayesian Interval Mapping 1.what is Bayes? Bayes theorem? Bayesian QTL mapping Markov chain sampling18-25.
Yandell © 2003JSM Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits Brian S. Yandell University of Wisconsin-Madison
Pedagogical Objectives Bioinformatics/Neuroinformatics Unit Review of genetics Review/introduction of statistical analyses and concepts Introduce QTL.
Genetic correlations and associative networks for CNS transcript abundance and neurobehavioral phenotypes in a recombinant inbred mapping panel Elissa.
1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Final Remarks Genetical.
Yandell © 2003Wisconsin Symposium III1 QTLs & Microarrays Overview Brian S. Yandell University of Wisconsin-Madison
What is a QTL? Quantitative trait locus (loci) Region of chromosome that contributes to variation in a quantitative trait Generally used to study “complex.
13 October 2004Statistics: Yandell © Inferring Genetic Architecture of Complex Biological Processes Brian S. Yandell 12, Christina Kendziorski 13,
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Gene Mapping for Correlated Traits Brian S. Yandell University of Wisconsin-Madison Correlated TraitsUCLA Networks (c)
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
Systems Genetics Approach to the Study of Brain Iron Regulation Byron C. Jones Professor of Biobehavioral Health & Pharmacology The Pennsylvania State.
ModelNCSU QTL II: Yandell © Model Selection for Multiple QTL 1.reality of multiple QTL3-8 2.selecting a class of QTL models comparing QTL models16-24.
Bayesian Variable Selection in Semiparametric Regression Modeling with Applications to Genetic Mappping Fei Zou Department of Biostatistics University.
EQTLs.
Quantile-based Permutation Thresholds for QTL Hotspots
Professor Brian S. Yandell
upstream vs. ORF binding and gene expression?
Statistical Applications in Biology and Genetics
Model Selection for Multiple QTL
Multiple Traits & Microarrays
Bayesian Model Selection for Multiple QTL
Graphical Diagnostics for Multiple QTL Investigation Brian S
Gene mapping in mice Karl W Broman Department of Biostatistics
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison.
Inferring Genetic Architecture of Complex Biological Processes Brian S
4. modern high throughput biology
Inferring Causal Phenotype Networks
High Throughput Gene Mapping Brian S. Yandell
Brian S. Yandell University of Wisconsin-Madison 26 February 2003
Genome-wide Association Studies
Bayesian Interval Mapping
1. why study multiple traits together?
Inferring Causal Phenotype Networks Driven by Expression Gene Mapping
Seattle SISG: Yandell © 2008
examples in detail days to flower for Brassica napus (plant) (n = 108)
Inferring Genetic Architecture of Complex Biological Processes Brian S
Seattle SISG: Yandell © 2006
Note on Outbred Studies
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Brian S. Yandell University of Wisconsin-Madison
Seattle SISG: Yandell © 2009
Bayesian Interval Mapping
Presentation transcript:

Yandell © June Bayesian analysis of microarray traits Arabidopsis Microarray Workshop Brian S. Yandell University of Wisconsin-Madison

Yandell © June studying diabetes in an F2 segregating cross of inbred lines –B6.ob x BTBR.ob  F1  F2 –selected mice with ob/ob alleles at leptin gene (chr 6) –measured and mapped body weight, insulin, glucose at various ages (Stoehr et al Diabetes) –sacrificed at 14 weeks, tissues preserved gene expression data –Affymetrix microarrays on parental strains, F1 (Nadler et al PNAS; Ntambi et al PNAS) –RT-PCR for a few mRNA on 108 F2 mice liver tissues (Lan et al Diabetes; Lan et al Genetics) –Affymetrix microarrays on 60 F2 mice liver tissues design (Jin et al Genetics tent. accept) analysis (work in prep.)

Yandell © June Type 2 Diabetes Mellitus

Yandell © June Insulin Requirement from Unger & Orci FASEB J. (2001) 15,312 decompensation

Yandell © June glucoseinsulin (courtesy AD Attie)

Yandell © June why map gene expression as a quantitative trait? cis- or trans-action? –does gene control its own expression? –or is it influenced by one or more other genomic regions? –evidence for both modes (Brem et al Science) simultaneously measure all mRNA in a tissue –~5,000 mRNA active per cell on average –~30,000 genes in genome –use genetic recombination as natural experiment mechanics of gene expression mapping –measure gene expression in intercross (F2) population –map expression as quantitative trait (QTL) –adjust for multiple testing

Yandell © June LOD map for PDI: cis-regulation (Lan et al. 2003)

Yandell © June Multiple Interval Mapping (QTLCart) SCD1: multiple QTL plus epistasis!

Yandell © June Bayesian model assessment: number of QTL for SCD1

Yandell © June Bayesian LOD and h 2 for SCD1

Yandell © June Bayesian model assessment: chromosome QTL pattern for SCD1

Yandell © June trans-acting QTL for SCD1 (no epistasis yet: see Yi, Xu, Allison 2003) dominance?

Yandell © June D scan: assumes only 2 QTL! epistasis LOD peaks joint LOD peaks

Yandell © June sub-peaks can be easily overlooked!

Yandell © June epistatic model fit

Yandell © June Cockerham epistatic effects

Yandell © June our Bayesian QTL software R: –freely available statistical computing application R –library(bim) builds on Broman’s library(qtl) QTLCart: statgen.ncsu.edu/qtlcart –Bmapqtl incorporated into QTLCart (S Wang 2003) R/bim –initially designed by JM Satagopan (1996) –major revision and extension by PJ Gaffney (2001) whole genome, multivariate and long range updates speed improvements, pre-burnin –built as official R library (H Wu, Yandell, Gaffney, CF Jin 2003) R/bmqtl –collaboration with N Yi, H Wu, GA Churchill –initial working module: Winter 2005 –improved module and official release: Summer/Fall 2005 –major NIH grant (PI: Yi)

Yandell © June

Yandell © June modern high throughput biology measuring the molecular dogma of biology –DNA  RNA  protein  metabolites –measured one at a time only a few years ago massive array of measurements on whole systems (“omics”) –thousands measured per individual (experimental unit) –all (or most) components of system measured simultaneously whole genome of DNA: genes, promoters, etc. all expressed RNA in a tissue or cell all proteins all metabolites systems biology: focus on network interconnections –chains of behavior in ecological community –underlying biochemical pathways genetics as one experimental tool –perturb system by creating new experimental cross –each individual is a unique mosaic

Yandell © June reduce 30,000 traits to 300-3,000 heritable traits probability a trait is heritable pr(H|Y,Q) = pr(Y|Q,H) pr(H|Q) / pr(Y|Q)Bayes rule pr(Y|Q) = pr(Y|Q,H) pr(H|Q) + pr(Y|Q, not H) pr(not H|Q) phenotype averaged over genotypic mean  pr(Y|Q, not H) = f 0 (Y) =  f(Y|G ) pr(G) dG if not H pr(Y|Q, H) = f 1 (Y|Q) =  q f 0 (Y q ) if heritable Y q = {Y i | Q i =q} = trait values with genotype Q=q finding heritable traits (from Christina Kendziorski)

Yandell © June hierarchical model for expression phenotypes (EB arrays: Christina Kendziorski) mRNA phenotype models given genotypic mean G q common prior on G q across all mRNA (use empirical Bayes to estimate prior)

Yandell © June why study multiple traits together? avoid reductionist approach to biology –address physiological/biochemical mechanisms –Schmalhausen (1942); Falconer (1952) separate close linkage from pleiotropy –1 locus or 2 linked loci? identify epistatic interaction or canalization –influence of genetic background establish QTL x environment interactions decompose genetic correlation among traits increase power to detect QTL

Yandell © June expression meta-traits: pleiotropy reduce 3,000 heritable traits to 3 meta-traits(!) what are expression meta-traits? –pleiotropy: a few genes can affect many traits transcription factors, regulators –weighted averages: Z = YW principle components, discriminant analysis infer genetic architecture of meta-traits –model selection issues are subtle missing data, non-linear search what is the best criterion for model selection? –time consuming process heavy computation load for many traits subjective judgement on what is best

Yandell © June PC for two correlated mRNA

Yandell © June PC across microarray functional groups Affy chips on 60 mice ~40,000 mRNA mRNA show DE (via EB arrays with marker regression) organized in 85 functional groups 2-35 mRNA / group which are interesting? examine PC1, PC2 circle size = # unique mRNA

Yandell © June PC meta-traits by functional group focus on 2 interesting groups

Yandell © June red lines: peak for PC meta-trait black/blue: peaks for mRNA traits arrows: cis-action?

Yandell © June (portion of) chr 4 regionchr 15 region ?

Yandell © June genotypes from Chr 4/Chr 15 locus pair (circle= centroid) DA creates best separation by genotype DA meta-traits on mRNA traits

Yandell © June relating meta-traits to mRNA traits SCD trait log2 expression DA meta-trait standard units

Yandell © June building graphical models infer genetic architecture of meta-trait –E(Z | Q, M) =  q =  0 +  {q in M}  qk find mRNA traits correlated with meta-trait –Z  YW for modest number of traits Y extend meta-trait genetic architecture –M = genetic architecture for Y –expect subset of QTL to affect each mRNA –may be additional QTL for some mRNA

Yandell © June posterior for graphical models posterior for graph given multivariate trait & architecture pr(G | Y, Q, M) = pr(Y | Q, G) pr(G | M) / pr(Y | Q) –pr(G | M) = prior on valid graphs given architecture multivariate phenotype averaged over genotypic mean  pr(Y | Q, G) = f 1 (Y | Q, G) =  q f 0 (Yq | G) f 0 (Y q | G) =  f(Y q | , G) pr(  ) d  graphical model G implies correlation structure on Y genotype mean prior assumed independent across traits pr(  ) =  t pr(  t )

Yandell © June from graphical models to pathways build graphical models QTL  RNA1  RNA2 –class of possible models –best model = putative biochemical pathway parallel biochemical investigation –candidate genes in QTL regions –laboratory experiments on pathway components

Yandell © June graphical models (with Elias Chaibub) f 1 (Y | Q, G=g) = f 1 (Y 1 | Q) f 1 (Y 2 | Q, Y 1 ) R2 D2 P2 QTL R1 D1 P1 observable trans-action unobservable meta-trait QTL RNA DNA observable cis-action? protein

Yandell © June summary expression QTL are complicated –need to consider multiple interacting QTL coherent approach for high-throughput traits –identify heritable traits –dimension reduction to meta-traits –mapping genetic architecture –extension via graphical models to networks many open questions –model selection –computation efficiency –inference on graphical models