Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits
Brian S. Yandell, University of Wisconsin-Madison
www.stat.wisc.edu/~yandell/statgen
Yandell © 2003, JSM 2003


Dimension Reduction for Mapping mRNA Abundance as Quantitative Traits
Brian S. Yandell, University of Wisconsin-Madison
[Lan et al Genetics 164(4): August 2003]

what is the goal of QTL study?
uncover underlying biochemistry
–identify how networks function, break down
–find useful candidates for (medical) intervention
–epistasis may play key role
–statistical goal: maximize number of correctly identified QTL
basic science/evolution
–how is the genome organized?
–identify units of natural selection
–additive effects may be most important (Wright/Fisher debate)
–statistical goal: maximize number of correctly identified QTL
select “elite” individuals
–predict phenotype (breeding value) using suite of characteristics (phenotypes) translated into a few QTL
–statistical goal: minimize prediction error

what is a QTL?
QTL = quantitative trait locus (or loci)
–trait = phenotype = characteristic of interest
–quantitative = measured somehow
    glucose, insulin, gene expression level
    Mendelian genetics
    –allelic effect + environmental variation
–locus = location in genome affecting trait
    gene or collection of tightly linked genes
    some physical feature of genome

typical phenotype assumptions
normal "bell-shaped" environmental variation
genotypic value G_Q is composite of m QTL
genetic effects uncorrelated with environment
[figure: data histogram]
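A minimal simulation sketch of these assumptions: each phenotype is a mean plus a genotypic value summed over m QTL plus normal environmental noise drawn independently of genotype. The additive -1/0/1 genotype coding, the unlinked loci, the effect sizes, and the function name `simulate_f2_phenotypes` are illustrative assumptions, not taken from the slides.

```python
import random

def simulate_f2_phenotypes(n, effects, mu=0.0, env_sd=1.0, seed=0):
    """Simulate n F2 individuals with phenotype y = mu + G_Q + e.

    effects: hypothetical additive effect for each of the m QTL.
    Genotypes are coded -1, 0, 1 with F2 frequencies 1/4, 1/2, 1/4 and
    are drawn independently across loci (unlinked loci, an assumption).
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        q = [rng.choice([-1, 0, 0, 1]) for _ in effects]
        g = sum(a * qi for a, qi in zip(effects, q))  # genotypic value G_Q
        e = rng.gauss(0.0, env_sd)  # environmental deviation, independent of q
        data.append((q, mu + g + e))
    return data

# 200 individuals, three QTL with decreasing (made-up) effects
sample = simulate_f2_phenotypes(200, effects=[1.0, 0.5, 0.25])
```

A histogram of the simulated phenotypes would look roughly bell-shaped when no single effect dominates the environmental standard deviation.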

why worry about multiple QTL?
so many possible genetic architectures!
–number and positions of loci
–gene action: additive, dominance, epistasis
–how to efficiently search the model space?
how to select “best” or “better” model(s)?
–what criteria to use? where to draw the line?
–shades of gray: exploratory vs. confirmatory study
–how to balance false positives, false negatives?
what are the key “features” of model?
–means, variances & covariances, confidence regions
–marginal or conditional distributions

Pareto diagram of QTL effects
major QTL on linkage map
[figure labels: major QTL, minor QTL, polygenes (modifiers)]

advantages of multiple QTL approach
improve statistical power, precision
–increase number of QTL detected
–better estimates of loci: less bias, smaller intervals
improve inference of complex genetic architecture
–patterns and individual elements of epistasis
–appropriate estimates of means, variances, covariances
    asymptotically unbiased, efficient
–assess relative contributions of different QTL
improve estimates of genotypic values
–less bias (more accurate) and smaller variance (more precise)
–mean squared error = MSE = (bias)^2 + variance
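The identity MSE = (bias)^2 + variance can be checked numerically on any set of estimates, provided the variance uses divisor n. A small sketch with made-up estimates of a true value of 1.0 (the function name and numbers are hypothetical):

```python
def mse_decomposition(estimates, truth):
    """Return (bias, variance, mse) for a list of estimates of `truth`."""
    n = len(estimates)
    mean_est = sum(estimates) / n
    bias = mean_est - truth
    # variance around the mean of the estimates (divisor n, not n - 1)
    variance = sum((x - mean_est) ** 2 for x in estimates) / n
    # mean squared error around the true value
    mse = sum((x - truth) ** 2 for x in estimates) / n
    return bias, variance, mse

bias, variance, mse = mse_decomposition([1.2, 0.9, 1.4, 1.1], truth=1.0)
# the two sides agree up to floating-point rounding
assert abs(mse - (bias ** 2 + variance)) < 1e-12
```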

epistasis in parallel pathways (Gary Churchill)
Z keeps trait value low
Neither E1 nor E2 is rate limiting
Loss of function alleles are segregating from parent A at E1 and from parent B at E2
[pathway diagram labels: Z, X, Y, E1, E2]

epistasis in a serial pathway (Gary Churchill)
Z keeps trait value high
Neither E1 nor E2 is rate limiting
Loss of function alleles are segregating from parent B at E1 and from parent A at E2
[pathway diagram labels: Z, X, Y, E1, E2]

epistasis examples (Doebley Stec Gustus 1995; Zeng pers. comm.)
traits 1, 4, 9 (Fisher-Cockerham effects)
1: dom-dom interaction
4: add-add interaction
9: rec-rec interaction
[figure panels: traits 1, 4, 9]

why map gene expression as a quantitative trait?
cis- or trans-action?
–does gene control its own expression?
–evidence for both modes (Brem et al Science)
mechanics of gene expression mapping
–measure gene expression in intercross (F2) population
–map expression as quantitative trait (QTL technology)
–adjust for multiple testing via false discovery rate
research groups working on expression QTLs
–review by Cheung and Spielman (2002 Nat Gen Suppl)
–Kruglyak (Brem et al Science)
–Doerge et al. (Purdue); Jansen et al. (Wageningen)
–Williams et al. (U KY); Lusis et al. (UCLA)
–Dumas et al. (2000 J Hypertension)
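As a rough illustration of mapping one expression trait, the sketch below computes a single-marker LOD score by comparing a model with separate genotype-class means to a model with one common mean, assuming normal errors. This is plain marker regression, a simplification of the interval mapping behind the LOD maps in these slides; the function name `marker_lod` and the toy data are hypothetical.

```python
import math

def marker_lod(genotypes, trait):
    """LOD = (n/2) * log10(RSS0 / RSS1) for one marker.

    RSS0: residual sum of squares around the overall mean (no QTL).
    RSS1: residual sum of squares around genotype-class means.
    Assumes RSS1 > 0, i.e. some variation remains within classes.
    """
    n = len(trait)
    grand_mean = sum(trait) / n
    rss0 = sum((y - grand_mean) ** 2 for y in trait)
    groups = {}
    for g, y in zip(genotypes, trait):
        groups.setdefault(g, []).append(y)
    rss1 = 0.0
    for ys in groups.values():
        group_mean = sum(ys) / len(ys)
        rss1 += sum((y - group_mean) ** 2 for y in ys)
    return (n / 2) * math.log10(rss0 / rss1)

# toy F2 data: expression rises with the number of B alleles
lod = marker_lod(["AA", "AA", "AB", "AB", "BB", "BB"],
                 [1.0, 1.2, 2.0, 2.2, 3.0, 3.2])
```

Repeating this at every marker gives a crude genome scan; thresholds for declaring a QTL would still need a multiple-testing adjustment such as the false discovery rate mentioned on the slide.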

idea of mapping microarrays (Jansen Nap 2001)

goal: unravel biochemical pathways (Jansen Nap 2001)

central dogma via microarrays (Bochner 2003)

coordinated expression in mouse genome (Schadt et al Nature)
expression pleiotropy in yeast genome (Brem et al Science)

glucose, insulin (courtesy AD Attie)

Insulin Requirement (from Unger & Orci FASEB J. (2001) 15, 312)
[figure label: decompensation]

Type 2 Diabetes Mellitus (from Unger & Orci FASEB J. (2001) 15, 312)

studying diabetes in an F2
segregating cross of inbred lines
–B6.ob x BTBR.ob → F1 → F2
–selected mice with ob/ob alleles at leptin gene (chr 6)
–measured and mapped body weight, insulin, glucose at various ages (Stoehr et al Diabetes)
–sacrificed at 14 weeks, tissues preserved
gene expression data
–Affymetrix microarrays on parental strains, F1
    key tissues: adipose, liver, muscle, β-cells
    novel discoveries of differential expression (Nadler et al PNAS; Lan et al in review; Ntambi et al PNAS)
–RT-PCR on 108 F2 mice liver tissues
    15 genes, selected as important in diabetes pathways
    SCD1, PEPCK, ACO, FAS, GPAT, PPARgamma, PPARalpha, G6Pase, PDI,…

LOD map for PDI: cis-regulation
Lan et al. (2003 submitted)

Multiple Interval Mapping
SCD1: multiple QTL plus epistasis!

2-D scan: assumes only 2 QTL!
[figure labels: epistasis LOD, joint LOD]

trans-acting QTL for SCD1 (no epistasis yet: see Yi, Xu, Allison 2003)
[figure label: dominance?]

Bayesian model assessment: chromosome QTL pattern for SCD1

high throughput dilemma
want to focus on gene expression network
–hundreds or thousands of genes/proteins to monitor
–ideally capture networks in a few dimensions
multivariate summaries of multiple traits
–elicit biochemical pathways (Henderson et al. Hoeschele 2001; Ong Page 2002)
may have multiple controlling loci
–allow for complicated genetic architecture
–could affect many genes in coordinated fashion
–could show evidence of epistasis
–quick assessment via interval mapping may be misleading

why study multiple traits together?
environmental correlation
–non-genetic, controllable by design
–historical correlation (learned behavior)
–physiological correlation (same body)
genetic correlation
–pleiotropy
    one gene, many functions
    common biochemical pathway, splicing variants
–close linkage
    two tightly linked genes
    genotypes Q are collinear

high throughput: which genes are the key players?
one approach: clustering of expression
seeded by insulin, glucose
advantage: subset relevant to trait
disadvantage: still many genes to study

PC simply rotates & rescales to find major axes of variation
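To make the rotation concrete, here is a minimal pure-Python sketch that finds the first principal component of a small trait matrix by power iteration on the sample covariance matrix, then projects each individual onto it to obtain a PC1 score that could be treated as a trait. Power iteration is used only for brevity; the slides do not say how the components were computed, and the function name and toy data are hypothetical.

```python
import math

def first_principal_component(rows, iterations=200):
    """Return (direction, fraction_of_variance, scores) for PC1.

    rows: list of observations, each a list of p trait values.
    Assumes the traits vary and the start vector is not orthogonal
    to PC1 (otherwise the normalization below would divide by zero).
    """
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    # p x p sample covariance matrix
    cov = [[sum(c[j] * c[k] for c in centered) / (n - 1) for k in range(p)]
           for j in range(p)]
    v = [1.0 / (j + 1) for j in range(p)]  # arbitrary non-uniform start
    for _ in range(iterations):
        w = [sum(cov[j][k] * v[k] for k in range(p)) for j in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient gives the leading eigenvalue (variance along PC1)
    eigenvalue = sum(v[j] * sum(cov[j][k] * v[k] for k in range(p))
                     for j in range(p))
    total_variance = sum(cov[j][j] for j in range(p))
    scores = [sum(c[j] * v[j] for j in range(p)) for c in centered]
    return v, eigenvalue / total_variance, scores

# four individuals, two strongly correlated (made-up) traits
direction, fraction, scores = first_principal_component(
    [[1.0, 2.0], [2.0, 4.1], [3.0, 5.9], [4.0, 8.2]])
```

Traits on very different scales would usually be standardized first, so that the rescaling does not simply favor the trait with the largest variance.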

multivariate screen for gene expression mapping
principal components
[figure labels: PC1 (red) and SCD (black); axes: PC1 (42%), PC2 (22%)]

mapping first diabetes PC as a trait

pFDR for PC1 analysis
[figure labels: prior probability; fraction of posterior found in tails]

false detection rates and thresholds
multiple comparisons: test QTL across genome
–size = pr( LOD(λ) > threshold | no QTL at λ )
–threshold guards against a single false detection
    very conservative on genome-wide basis
–difficult to extend to multiple QTL
positive false discovery rate (Storey 2001)
–pFDR = pr( no QTL at λ | LOD(λ) > threshold )
–Bayesian posterior HPD region based on threshold
    Λ = { λ | LOD(λ) > threshold } ≈ { λ | pr( λ | Y,X,m ) large }
–extends naturally to multiple QTL

pFDR and QTL posterior
positive false discovery rate
–pFDR = pr( no QTL at λ | Y,X, λ in Λ )
–pFDR = pr(H=0)*size / [ pr(m=0)*size + pr(m>0)*power ], where H=0 means no QTL (m=0)
–power = posterior = pr( QTL in Λ | Y,X, m>0 )
–size = (length of Λ) / (length of genome)
extends to other model comparisons
–m = 1 vs. m = 2 or more QTL
–pattern = ch1,ch2,ch3 vs. pattern > 2*ch1,ch2,ch3
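The ratio form of pFDR, pr(m=0)*size divided by pr(m=0)*size + pr(m>0)*power, can be evaluated directly once the prior probability of no QTL, the size, and the power are given. A minimal sketch, assuming pr(m>0) = 1 - pr(m=0); the function name and the numbers in the usage line (a 20 cM region in a 2000 cM genome, prior 0.5, power 0.9) are hypothetical.

```python
def pfdr(prior_null, size, power):
    """pFDR = pr(m=0)*size / (pr(m=0)*size + pr(m>0)*power).

    prior_null: prior probability of no QTL, pr(m=0)
    size:       (length of region) / (length of genome)
    power:      posterior probability that a QTL lies in the region,
                given that at least one QTL exists
    """
    prior_alt = 1.0 - prior_null  # pr(m>0)
    return prior_null * size / (prior_null * size + prior_alt * power)

# hypothetical region: 20 cM out of a 2000 cM genome
value = pfdr(prior_null=0.5, size=20 / 2000, power=0.9)
```

A narrow region with high posterior mass gives a small pFDR; widening the region raises the size and, for fixed power, the pFDR.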