Mapping genes with LOD score method

Slides:



Advertisements
Similar presentations
MENDELIAN GENETICS.
Advertisements

Linkage and Genetic Mapping
Medical Genetics 1 Prof Duncan Shaw
Chapter 7 Hypothesis Testing
Planning breeding programs for impact
Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen.
Gene Frequency and LINKAGE Gregory Kovriga & Alex Ratt.
Tutorial #2 by Ma’ayan Fishelson. Crossing Over Sometimes in meiosis, homologous chromosomes exchange parts in a process called crossing-over. New combinations.
Instructor: Dr. Jihad Abdallah Linkage and Genetic Mapping
Linkage Aims: Must be able to outline what linkage is and how it is brought about. Should be able to explain the detection of linkage between genes using.
1 BBS- 6. INTRODUCTION METHODS OF HOMOZYGOSITY MAPPING HOMOZYGOSITY MAPPER GENETIC LINKAGE LOD SCORE METHOD 2.
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Linkage Genes linked on the same chromosome may segregate together.
Chi Square Analyses: Comparing Frequency Distributions.
. Learning – EM in ABO locus Tutorial #08 © Ydo Wexler & Dan Geiger.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
31 January, 2 February, 2005 Chapter 6 Genetic Recombination in Eukaryotes Linkage and genetic diversity.
1 How many genes? Mapping mouse traits, cont. Lecture 2B, Statistics 246 January 22, 2004.
How to find genetic determinants of naturally varying traits?
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
. Learning – EM in The ABO locus Tutorial #9 © Ilan Gronau.
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
Recombination Mapping
Genetic Recombination in Eukaryotes
Hypothesis Testing CJ 526. Probability Review Review P = number of times an even can occur/ P = number of times an even can occur/ Total number of possible.
BIO341 Meiotic mapping of whole genomes (methods for simultaneously evaluating linkage relationships among large numbers of loci)
Probability (cont.). Assigning Probabilities A probability is a value between 0 and 1 and is written either as a fraction or as a proportion. For the.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Non-Mendelian Genetics
Chapter 3 – Basic Principles of Heredity. Johann Gregor Mendel (1822 – 1884) Pisum sativum Rapid growth; lots of offspring Self fertilize with a single.
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory. How.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Genetic design. Testing Mendelian segregation Consider marker A with two alleles A and a BackcrossF 2 AaaaAAAaaa Observationn 1 n 0 n 2 n 1 n 0 Expected.
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Lecture 15: Linkage Analysis VII
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Sir Archibald E Garrod – alcaptonuria – black urine - (Madness of King George)
Mapping and cloning Human Genes. Finding a gene based on phenotype ’s of DNA markers mapped onto each chromosome – high density linkage map. 2.
Statistics for Political Science Levin and Fox Chapter Seven
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
1 Genetic Mapping Establishing relative positions of genes along chromosomes using recombination frequencies Enables location of important disease genes.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Did Mendel fake is data? Do a quick internet search and can you find opinions that support or reject this point of view. Does it matter? Should it matter?
Lecture 11: Linkage Analysis IV Date: 10/01/02  linkage grouping  locus ordering  confidence in locus ordering.
Linkage and Mapping Bonus #2 due now. The relationship between genes and traits is often complex Complexities include: Complex relationships between alleles.
PROBABILITY AND STATISTICS The laws of inheritance can be used to predict the outcomes of genetic crosses For example –Animal and plant breeders are concerned.
Mendel & Genetic Variation Chapter 14. What you need to know! The importance of crossing over, independent assortment, and random fertilization to increasing.
Chi Square Pg 302. Why Chi - Squared ▪Biologists and other scientists use relationships they have discovered in the lab to predict events that might happen.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
AP Biology Heredity PowerPoint presentation text copied directly from NJCTL with corrections made as needed. Graphics may have been substituted with a.
The Chi Square Test A statistical method used to determine goodness of fit Chi-square requires no assumptions about the shape of the population distribution.
Relationship between quantitative trait inheritance and
Recombination (Crossing Over)
Mendel’s Laws of Heredity
And Yet more Inheritance
Discrete Event Simulation - 4
Genetic Mapping Linked Genes.
Parametric Methods Berlin Chen, 2005 References:
Lecture 9: QTL Mapping II: Outbred Populations
Linkage Analysis Problems
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Completion and analysis of Punnett squares for dihybrid traits
Presentation transcript:

Mapping genes with LOD score method

LOD score method Aim: Determine , the recombinant fraction (fraction of gametes that are recombinant), using data from relatively small families. Reminder:  vary from 0 (2 genes completely linked) to 0.5 (2 genes are unlinked).

LOD score method (cont.) There are 4 basic steps in the process: Determine the expected frequencies of F2 phenotypes for every value of  from 0.01 to 0.5 Determine the “likelihood” (L) that the family data observed resulted from the given  value: the maximum likelihood is the best estimate of  for given data. Determine the Odds Ratio and logarithm of the odds ratio (lod score) by comparing the Likelihood for each value of  to the Likelihood for unlinked genes (=0.5) Add lod scores from different families to achieve an acceptably high lod score so a specific most likely  can be assigned.

LOD score method (cont.) Lets see how it works on two genes showing complete dominance: A B a b P: x A B a b F1: x aa bb aa B_ A_ bb A_ B_ F2:

LOD score method (cont.) Step 1: Calculate the expected frequency of offspring for values of  fro 0.01 to 0.5 Example: Lets calculate expected offspring number for =0.2: P(Ab)=P(aB)=0.1 ; P(AB)=P(ab)=0.4 2. AB/AB 0.16 Ab/AB 0.04 aB/AB AB/Ab Ab/Ab 0.01 aB/Ab AB/aB Ab/aB aB/aB AB/ab Ab/ab aB/ab F2 phenotype cell sums expected freq A_ B_ .16+.04+.04+.16+.04+.01+.04+.01+.16 0.66 A_ bb 0.01+0.04+0.04 0.09 aa B_ aa bb 0.16

LOD score method (cont.) Step 2: Estimate the observed family data in light of the expected distribution of offspring for each R value. This is done by determining likelihood (L) of the observed family for each value of R. The likelyhood is simply the probability of the observed family, as determined by the multinomial theorem (see http://mathworld.wolfram.com/MultinomialDistribution.html) Lets define our terms for the observed family: a = number of A_ B_ offspring b = number of A_ bb offspring c = number of aa B_ offspring d = number of aa bb offspring n = total offspring (a+b+c+d)

LOD score method (cont.) …and terms for the expected family proportions (obtained fro Step1 above): p = expected proportion of A_ B_ offspring q = expected proportion of A_ bb offspring r = expected proportion of aa B_ offspring s = expected proportion of aa bb offspring Then Likelihood will be calculated by the next formula:

LOD score method (cont.) Example: A family as in previous example has 5 children: 2 of A_ B_ phenotype, 1 with aa B_ and 2 with aa bb. What is the likelihood of this family, given =0.2? L=(5!/2!0!1!2!)(0.66)2(.09)0 (.09)1 (.16)2=0.0301

LOD score method (cont.) Steps 3 and 4: Combining data from several families. We want to be able to compare (and add) data from several different families, to get a good estimate of R. To do this, the L values must be standardized by calculating Odds Ratio (OR), which is the ratio of the L for each  value divided by the L for =0.5 . Then, the logarithm of Odds Ratio is taken; this is the lod score (Z). Lod scores from different families can be added (this is equivalent to multiplying the Odds Ratios, as in the AND rule for two events – family 1 and family 2 – both occurring). A total lod score for some  value of 3.0 is considered proof of linkage between two genes, which is not exactly right as will be explained futher…

Exclusion Mapping In linkage analysis the main goal is localizing disease genes relative to well-characterized marker loci (lod score > 3). However with any given marker, the probability of finding a positive test result is quite low as human genome is quite large and most randomly selected markers are not linked. However, negative results are also results and may be used for elimination of various chromosomal regions from consideration…

Exclusion Mapping (cont.) It’s important to remember that the likelihood ratio test is a test of hypothesis of no linkage, such that in the absence of a significant test result, you fail to reject H0, meaning that there is no significant evidence for linkage. However, this does not mean that you accept H0 and have proved by the failure to achieve a significant test result that there is no linkage. It’s quite another thing to prove the absence of linkage – a problem that can be statistically very complicated…

Exclusion Mapping (cont.) Morton has proposed (1955) that the test of linkage be treated as a sequential likelihood ratio test (LRT) of a simple hypothesis, = 1 . He proposed that the new families continue to be sampled until either the criterion Z(1)>3 is fulfilled, in which case the hypothesis of no linkage is rejected, or until Z(1)<-2, in which case you would reject the hypothesis of linkage. As long as -2<Z(1)<3, no conclusion may be made.

Exclusion Mapping (cont.) Chotai (1984) extended this concept to the general case such that the positive test is considered significant whenever Zmax>3; and the negative test is considered significant on { | Z () < -2}, and the disease gene may be excluded from this part of genome. The same criteria may be applied for both two-point and multipoint scores…

Model Errors and Exclusion Mapping It has bee shown that using incorrect model for the disease doesn’t in general lead to an increased false-positive rate (Clerget-Darpoux et al., 1986), as maximizing the lod score over models does (Weeks et al. 1990a). In other words, you are not more likely to obtain lod scores of 3 in the absence of linkage under the wrong model than using the correct one… If there is linkage, however, there is lower power to detect it when the model parameters are incorrectly specified…

Model Errors and Exclusion Mapping (cont.) Contrary to the lack of false-positives, the false-negative rate may be astronomical when an analysis is performed under incorrect model. It’s quite easy to design an example where disease gene will be mistakenly “excluded” from it’s region by Z()<-2 criterion: If there is a linkage in only 20% of families, then summing the lod scores across the families can easily lead us to spurious exclusions.

Model Errors and Exclusion Mapping (cont.) For this reason, doing a linkage analysis with a complex disease, for which the model is not accurately known, it’s not wise to use exclusion analysis because the exclusion results obtained apply only to that specific model. You can only say that this region may be excluded only if the analysis model is correct…