Lecture 9: QTL Mapping I:

Slides:



Advertisements
Similar presentations
Planning breeding programs for impact
Advertisements

Mapping genes with LOD score method
Gene by environment effects. Elevated Plus Maze (anxiety)
Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen.
Experimental crosses. Inbred Strain Cross Backcross.
Gene Frequency and LINKAGE Gregory Kovriga & Alex Ratt.
Tutorial #2 by Ma’ayan Fishelson. Crossing Over Sometimes in meiosis, homologous chromosomes exchange parts in a process called crossing-over. New combinations.
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Joint Linkage and Linkage Disequilibrium Mapping
QTL Mapping R. M. Sundaram.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Statistical association of genotype and phenotype.
Sessão Temática 2 Análise Bayesiana Utilizando a abordagem Bayesiana no mapeamento de QTL´s Roseli Aparecida Leandro ESALQ/USP 11 o SEAGRO / 50ª RBRAS.
Introduction to Linkage Analysis March Stages of Genetic Mapping Are there genes influencing this trait? Epidemiological studies Where are those.
1 QTL mapping in mice, cont. Lecture 11, Statistics 246 February 26, 2004.
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
Karl W Broman Department of Biostatistics Johns Hopkins University Gene mapping in model organisms.
BIO341 Meiotic mapping of whole genomes (methods for simultaneously evaluating linkage relationships among large numbers of loci)
Linkage and LOD score Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
QTL mapping in animals. It works QTL mapping in animals It works It’s cheap.
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Gene, Allele, Genotype, and Phenotype
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Mapping populations Controlled crosses between two parents –two alleles/locus, gene frequencies = 0.5 –gametic phase disequilibrium is due to linkage,
Non-Mendelian Genetics
Class 3 1. Construction of genetic maps 2. Single marker QTL analysis 3. QTL cartographer.
Lecture 5: Major Genes, Polygenes, and QTLs
The rise (and fall) of QTL mapping:
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Joint Linkage and Linkage Disequilibrium Mapping Key Reference Li, Q., and R. L. Wu, 2009 A multilocus model for constructing a linkage disequilibrium.
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
INTRODUCTION TO ASSOCIATION MAPPING
Composite Method for QTL Mapping Zeng (1993, 1994) Limitations of single marker analysis Limitations of interval mapping The test statistic on one interval.
Grouping loci Criteria Maximum two-point recombination fraction –Example -r ij ≤ 0.40 Minimum LOD score - Z ij –For n loci, there are n(n-1)/2 possible.
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
QTL Cartographer A Program Package for finding Quantitative Trait Loci C. J. Basten Z.-B. Zeng and B. S. Weir.
Association between genotype and phenotype
Mapping and cloning Human Genes. Finding a gene based on phenotype ’s of DNA markers mapped onto each chromosome – high density linkage map. 2.
Population structure at QTL d A B C D E Q F G H a b c d e q f g h The population content at a quantitative trait locus (backcross, RIL, DH). Can be deduced.
BayesNCSU QTL II: Yandell © Bayesian Interval Mapping 1.what is Bayes? Bayes theorem? Bayesian QTL mapping Markov chain sampling18-25.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
- Type of Study Composite Interval Mapping Program - Genetic Design.
Why you should know about experimental crosses. To save you from embarrassment.
Using Merlin in Rheumatoid Arthritis Analyses Wei V. Chen 05/05/2004.
Lecture 11: Linkage Analysis IV Date: 10/01/02  linkage grouping  locus ordering  confidence in locus ordering.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Identifying QTLs in experimental crosses
upstream vs. ORF binding and gene expression?
Gene mapping in mice Karl W Broman Department of Biostatistics
Lecture 7: QTL Mapping I:
Statistical issues in QTL mapping in mice
Mapping Quantitative Trait Loci
EpiQTL mapping of glucosinolate mean and CV
Lecture 10: QTL Mapping II: Outbred Populations
Lecture 9: QTL Mapping II: Outbred Populations
Linkage Analysis Problems
Composite Interval Mapping Program
Quantitative Trait Locus (QTL) Mapping
Presentation transcript:

Lecture 9: QTL Mapping I: Inbred Line Crosses

Experimental Design: Crosses P1 x P2 F1 F1 B1 Backcross design B2 F1 Backcross design Fk F2 F1 Advanced intercross Design (AIC, AICk) F1 x F1 F2 F2 design

Experimental Designs: Marker Analysis Single marker analysis Flanking marker analysis (interval mapping) Composite interval mapping Interval mapping plus additional markers Multipoint mapping Uses all markers on a chromosome simultaneously

Conditional Probabilities of QTL Genotypes The basic building block for all QTL methods is Pr(Qk | Mj) --- the probability of QTL genotype Qk given the marker genotype is Mj. P r ( Q k j M ) = Consider a QTL linked to a marker (recombination Fraction = c). Cross MMQQ x mmqq. In the F1, all gametes are MQ and mq In the F2, freq(MQ) = freq(mq) = (1-c)/2, freq(mQ) = freq(Mq) = c/2

Hence, Pr(MMQQ) = Pr(MQ)Pr(MQ) = (1-c)2/4 Pr(MMQq) = 2Pr(MQ)Pr(Mq) = 2c(1-c) /4 Pr(MMqq) = Pr(Mq)Pr(Mq) = c2 /4 Why the 2? MQ from father, Mq from mother, OR MQ from mother, Mq from father Since Pr(MM) = 1/4, the conditional probabilities become Pr(QQ | MM) = Pr(MMQQ)/Pr(MM) = (1-c)2 Pr(Qq | MM) = Pr(MMQq)/Pr(MM) = 2c(1-c) Pr(qq | MM) = Pr(MMqq)/Pr(MM) = c2

2 Marker loci Suppose the cross is M1M1QQM2M2 x m1m1qqm2m2 M1 Q M2 Genetic map c1 c2 c12 In F2, Pr(M1QM2) = (1-c1)(1-c2) No interference: c12 = c1 + c2 - 2c1c2 Complete interference: c12 = c1 + c2 Pr(M1Qm2) = (1-c1) c2 Pr(m1QM2) = (1-c1) c2 Likewise, Pr(M1M2) = 1-c12 = 1- c1 + c2 A little bookkeeping gives

P r ( Q j M 1 2 ) = ° c q -

Expected Marker Means ( π ° ) = 2 a 1 c The expected trait mean for marker genotype Mj is just π M j = N X k 1 Q P r ( ) For example, if QQ = 2a, Qa = a(1+k), qq = 0, then in the F2 of an MMQQ/mmqq cross, ( π M ° m ) = 2 a 1 c - • If the trait mean is significantly different for the genotypes at a marker locus, it is linked to a QTL • A small MM-mm difference could be (i) a tightly-linked QTL of small effect or (ii) loose linkage to a large QTL

( ) Hence, the use of single markers provides for detection of a QTL. However, single marker means does not allow separate estimation of a and c. Now consider using interval mapping (flanking markers) ° ' π M 1 2 m = a µ c + ∂ ( ) - This is essentially a for even modest linkage ( ∂ µ - c 1 = 2 ° π M m a ' ) Hence, a and c can be estimated from the mean values of flanking marker genotypes

Linear Models for QTL Detection The use of differences in the mean trait value for different marker genotypes to detect a QTL and estimate its effects is a use of linear models. One-way ANOVA. Effect of marker genotype i on trait value z i k = π + b e Value of trait in kth individual of marker genotype type i Detection: a QTL is linked to the marker if at least one of the bi is significantly different from zero Estimation (QTL effect and position): This requires relating the bi to the QTL effects and map position

z = π + a b d e Detecting epistasis i k One major advantage of linear models is their flexibility. To test for epistasis between two QTLs, used an ANOVA with an interaction term Effect from marker genotype at first marker set (can be > 1 loci) Interaction between marker genotypes i in 1st marker set and k in 2nd marker set Effect from marker genotype at second marker set z = π + a i b k d e • At least one of the ai significantly different from 0 ---- QTL linked to first marker set • At least one of the bk significantly different from 0 ---- QTL linked to second marker set • At least one of the dik significantly different from 0 ---- interactions between QTL in sets 1 and two

Maximum Likelihood Methods ML methods use the entire distribution of the data, not just the marker genotype means. More powerful that linear models, but not as flexible in extending solutions (new analysis required for each model) Basic likelihood function: Sum over the N possible linked QTL genotypes Distribution of trait value given QTL genotype is k is normal with mean mQk. (QTL effects enter here) ` ( z j M ) = N X k 1 ' ; π Q æ 2 P r Trait value given marker genotype is type j Probability of QTL genotype k given marker genotype j --- genetic map and linkage phase entire here This is a mixture model

ML methods combine both detection and estimation Of QTL effects/position. Test for a linked QTL given from the LR test Maximum of the likelihood under a no-linked QTL model L R = ° 2 l n m a x ` r ( z ) - Maximum of the full likelihood The LR score is often plotted by trying different locations for the QTL (I.e., values of c) and computing a LOD score for each L O D ( c ) = ° l o g 1 ∑ m a x ` r z ; ∏ R 2 n ' 4 : 6 - { }

A typical QTL map from a likelihood analysis

Interval Mapping with Marker Cofactors Now suppose we also add the two markers flanking the interval (i-1 and i+2) Consider interval mapping using the markers i and i+1. QTLs linked to these markers, but outside this interval, can contribute (falsely) to estimation of QTL position and effect CIM also (potentially) includes unlinked markers to account for QTL on other chromosomes. i i+1 i+2 i-1 Inclusion of markers i-1 and i+2 fully account for any linked QTLs to the left of i-1 and the right of i+2 Interval being mapped Interval mapping + marker cofactors is called Composite Interval Mapping (CIM) However, still do not account for QTLs in the areas X k 6 = i ; + 1 b x j CIM works by adding an additional term to the linear model ,

Power and Repeatability: The Beavis Effect QTLs with low power of detection tend to have their effects overestimated, often very dramatically As power of detection increases, the overestimation of detected QTLs becomes far less serious This is often called the Beavis Effect, after Bill Beavis who first noticed this in simulation studies