Lecture 7: QTL Mapping I:

Slides:



Advertisements
Similar presentations
Planning breeding programs for impact
Advertisements

Experimental crosses. Inbred Strain Cross Backcross.
Tutorial #2 by Ma’ayan Fishelson. Crossing Over Sometimes in meiosis, homologous chromosomes exchange parts in a process called crossing-over. New combinations.
Fig. 4-1 Chapter 4 overview. Genetic recombination: mixing of genes during gametogenesis that produces gametes with combinations of genes that are different.
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Joint Linkage and Linkage Disequilibrium Mapping
QTL Mapping R. M. Sundaram.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Statistical association of genotype and phenotype.
Lecture 9: QTL Mapping I:
Sessão Temática 2 Análise Bayesiana Utilizando a abordagem Bayesiana no mapeamento de QTL´s Roseli Aparecida Leandro ESALQ/USP 11 o SEAGRO / 50ª RBRAS.
1 QTL mapping in mice, cont. Lecture 11, Statistics 246 February 26, 2004.
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
BIO341 Meiotic mapping of whole genomes (methods for simultaneously evaluating linkage relationships among large numbers of loci)
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
QTL mapping in animals. It works QTL mapping in animals It works It’s cheap.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Mapping populations Controlled crosses between two parents –two alleles/locus, gene frequencies = 0.5 –gametic phase disequilibrium is due to linkage,
Class 3 1. Construction of genetic maps 2. Single marker QTL analysis 3. QTL cartographer.
The rise (and fall) of QTL mapping:
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Joint Linkage and Linkage Disequilibrium Mapping Key Reference Li, Q., and R. L. Wu, 2009 A multilocus model for constructing a linkage disequilibrium.
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
INTRODUCTION TO ASSOCIATION MAPPING
Composite Method for QTL Mapping Zeng (1993, 1994) Limitations of single marker analysis Limitations of interval mapping The test statistic on one interval.
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
QTL Cartographer A Program Package for finding Quantitative Trait Loci C. J. Basten Z.-B. Zeng and B. S. Weir.
Association between genotype and phenotype
Mapping and cloning Human Genes. Finding a gene based on phenotype ’s of DNA markers mapped onto each chromosome – high density linkage map. 2.
Population structure at QTL d A B C D E Q F G H a b c d e q f g h The population content at a quantitative trait locus (backcross, RIL, DH). Can be deduced.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
- Type of Study Composite Interval Mapping Program - Genetic Design.
Why you should know about experimental crosses. To save you from embarrassment.
Using Merlin in Rheumatoid Arthritis Analyses Wei V. Chen 05/05/2004.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Lecture 11: Linkage Analysis IV Date: 10/01/02  linkage grouping  locus ordering  confidence in locus ordering.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Identifying QTLs in experimental crosses
Complex Trait Genetics in Animal Models
upstream vs. ORF binding and gene expression?
New Methods for Analyzing Complex Traits
Relationship between quantitative trait inheritance and
Modern Synthesis concepts from Laboratory Genetics
Gene mapping in mice Karl W Broman Department of Biostatistics
Recombination (Crossing Over)
The breakpoint process on RI chromosomes
Mapping Quantitative Trait Loci
Statistical Methods for Quantitative Trait Loci (QTL) Mapping II
Lecture 5: Major Genes, Polygenes, and QTLs
Gene Linkage.
Error Checking for Linkage Analyses
EpiQTL mapping of glucosinolate mean and CV
Lecture 10: QTL Mapping II: Outbred Populations
Linkage analysis and genetic mapping
Lecture 9: QTL Mapping II: Outbred Populations
Linkage Analysis Problems
Evan G. Williams, Johan Auwerx  Cell 
Modern Synthesis concepts from Laboratory Genetics
A: QTL analysis of genotyping data on chromosome 12 revealed a locus linked to glucose levels in F2 mice at 6 months of age at marker D12Mit231 with a.
Composite Interval Mapping Program
Quantitative Trait Locus (QTL) Mapping
Presentation transcript:

Lecture 7: QTL Mapping I: Inbred Line Crosses

Experimental Design: Crosses P1 x P2 F1 F1 B1 Backcross design B2 F1 Backcross design Fk F2 F1 Advanced intercross Design (AIC, AICk) F1 x F1 F2 F2 design

Experimental Designs: Marker Analysis Single marker analysis Flanking marker analysis (interval mapping) Composite interval mapping Interval mapping plus additional markers Multipoint mapping Uses all markers on a chromosome simultaneously

Conditional Probabilities of QTL Genotypes The basic building block for all QTL methods is Pr(Qk | Mj) --- the probability of QTL genotype Qk given the marker genotype is Mj. P r ( Q k j M ) = Consider a QTL linked to a marker (recombination Fraction = c). Cross MMQQ x mmqq. In the F1, all gametes are MQ and mq In the F2, freq(MQ) = freq(mq) = (1-c)/2, freq(mQ) = freq(Mq) = c/2

Hence, Pr(MMQQ) = Pr(MQ)Pr(MQ) = (1-c)2/4 Pr(MMQq) = 2Pr(MQ)Pr(Mq) = 2c(1-c) /4 Pr(MMqq) = Pr(Mq)Pr(Mq) = c2 /4 Why the 2? MQ from father, Mq from mother, OR MQ from mother, Mq from father Since Pr(MM) = 1/4, the conditional probabilities become Pr(QQ | MM) = Pr(MMQQ)/Pr(MM) = (1-c)2 Pr(Qq | MM) = Pr(MMQq)/Pr(MM) = 2c(1-c) Pr(qq | MM) = Pr(MMqq)/Pr(MM) = c2

2 Marker loci Suppose the cross is M1M1QQM2M2 x m1m1qqm2m2 M1 Q M2 Genetic map c1 c2 c12 In F2, Pr(M1QM2) = (1-c1)(1-c2) No interference: c12 = c1 + c2 - 2c1c2 Complete interference: c12 = c1 + c2 Pr(M1Qm2) = (1-c1) c2 Pr(m1QM2) = (1-c1) c2 Likewise, Pr(M1M2) = 1-c12 = 1- c1 + c2 A little bookkeeping gives

- - P r ( Q j M 1 2 ) = ° c q - - - - -

Expected Marker Means ( π ° ) = 2 a 1 c The expected trait mean for marker genotype Mj is just π M j = N X k 1 Q P r ( ) For example, if QQ = 2a, Qa = a(1+k), qq = 0, then in the F2 of an MMQQ/mmqq cross, ( π M ° m ) = 2 a 1 c - • If the trait mean is significantly different for the genotypes at a marker locus, it is linked to a QTL • A small MM-mm difference could be (i) a tightly-linked QTL of small effect or (ii) loose linkage to a large QTL

( ) Hence, the use of single markers provides for detection of a QTL. However, single marker means does not allow separate estimation of a and c. Now consider using interval mapping (flanking markers) ° ' π M 1 2 m = a µ c + ∂ ( ) - This is essentially a for even modest linkage ( ∂ µ - c 1 = 2 ° π M m a ' ) Hence, a and c can be estimated from the mean values of flanking marker genotypes

Linear Models for QTL Detection The use of differences in the mean trait value for different marker genotypes to detect a QTL and estimate its effects is a use of linear models. One-way ANOVA. Effect of marker genotype i on trait value z i k = π + b e Value of trait in kth individual of marker genotype type i Detection: a QTL is linked to the marker if at least one of the bi is significantly different from zero Estimation (QTL effect and position): This requires relating the bi to the QTL effects and map position

z = π + a b d e Detecting epistasis i k One major advantage of linear models is their flexibility. To test for epistasis between two QTLs, used an ANOVA with an interaction term Effect from marker genotype at first marker set (can be > 1 loci) Interaction between marker genotypes i in 1st marker set and k in 2nd marker set Effect from marker genotype at second marker set z = π + a i b k d e • At least one of the ai significantly different from 0 ---- QTL linked to first marker set • At least one of the bk significantly different from 0 ---- QTL linked to second marker set • At least one of the dik significantly different from 0 ---- interactions between QTL in sets 1 and two

Maximum Likelihood Methods ML methods use the entire distribution of the data, not just the marker genotype means. More powerful that linear models, but not as flexible in extending solutions (new analysis required for each model) Basic likelihood function: Sum over the N possible linked QTL genotypes Distribution of trait value given QTL genotype is k is normal with mean mQk. (QTL effects enter here) ` ( z j M ) = N X k 1 ' ; π Q æ 2 P r Trait value given marker genotype is type j Probability of QTL genotype k given marker genotype j --- genetic map and linkage phase entire here This is a mixture model

ML methods combine both detection and estimation Of QTL effects/position. Test for a linked QTL given from the LR test Maximum of the likelihood under a no-linked QTL model L R = ° 2 l n m a x ` r ( z ) - Maximum of the full likelihood The LR score is often plotted by trying different locations for the QTL (I.e., values of c) and computing a LOD score for each L O D ( c ) = ° l o g 1 ∑ m a x ` r z ; ∏ R 2 n ' 4 : 6 - { }

A typical QTL map from a likelihood analysis

Interval Mapping with Marker Cofactors Consider interval mapping using the markers i and i+1. QTLs linked to these markers, but outside this interval, can contribute (falsely) to estimation of QTL position and effect Now suppose we also add the two markers flanking the interval (i-1 and i+2) CIM also (potentially) includes unlinked markers to account for QTL on other chromosomes. i i+1 i+2 i-1 Inclusion of markers i-1 and i+2 fully account for any linked QTLs to the left of i-1 and the right of i+2 Interval being mapped Interval mapping + marker cofactors is called Composite Interval Mapping (CIM) However, still do not account for QTLs in the areas X k 6 = i ; + 1 b x j CIM works by adding an additional term to the linear model ,

Power and Repeatability: The Beavis Effect QTLs with low power of detection tend to have their effects overestimated, often very dramatically As power of detection increases, the overestimation of detected QTLs becomes far less serious This is often called the Beavis Effect, after Bill Beavis who first noticed this in simulation studies