Lab 9: Linkage Disequilibrium. Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate.

Slides:



Advertisements
Similar presentations
Two-locus systems. Scheme of genotypes genotype Two-locus genotypes Multilocus genotypes genotype.
Advertisements

Lab 10: Mutation, Selection and Drift
Lab 10: Mutation, Selection and Drift. Goals 1.Effect of mutation on allele frequency. 2.Effect of mutation and selection on allele frequency. 3.Effect.
Lab 3 : Exact tests and Measuring Genetic Variation.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Lab 4: Inbreeding and Kinship. Inbreeding Causes departure from Hardy-Weinburg Equilibrium Reduces heterozygosity Changes genotype frequencies Does not.
METHODS FOR HAPLOTYPE RECONSTRUCTION
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
AN INTRODUCTION TO RECOMBINATION AND LINKAGE ANALYSIS Mary Sara McPeek Presented by: Yue Wang and Zheng Yin 11/25/2002.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Section 3 Characterizing Genetic Diversity: Single Loci Gene with 2 alleles designated “A” and “a”. Three genotypes: AA, Aa, aa Population of 100 individuals.
Joint Linkage and Linkage Disequilibrium Mapping
1) Linkage means A) Alleles at different loci are independent B) Alleles at different loci are physically close to each other and on the same chromosome.
Algorithms, games, and evolution Erick Chastain, Adi Livnat, Christos Papadimitriou, and Umesh Vazirani Nasim Mobasheri Spring 2015.
. Learning – EM in ABO locus Tutorial #08 © Ydo Wexler & Dan Geiger.
Study of Microevolution
1 How many genes? Mapping mouse traits, cont. Lecture 2B, Statistics 246 January 22, 2004.
Population Genetics What is population genetics?
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
Estimating recombination rates using three-site likelihoods Jeff Wall Program in Molecular and Computational Biology, USC.
Hardy Weinberg. Hardy Weinberg refers to Populations.
Lecture 2: Basic Population and Quantitative Genetics.
Genetic variation, detection, concepts, sources, and forces
Population Genetics Learning Objectives
Broad-Sense Heritability Index
Lab 12. Linkage Disequilibrium November 28, 2012.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
14 Population Genetics and Evolution. Population Genetics Population genetics involves the application of genetic principles to entire populations of.
PowerPoint Slides for Chapter 16: Variation and Population Genetics Section 16.2: How can population genetic information be used to predict evolution?
Non-Mendelian Genetics
Lab 11 :Test of Neutrality and Evidence for Selection.
Population assignment likelihoods in a phylogenetic and demographic model. Jody Hey Rutgers University.
Genetic Linkage. Two pops may have the same allele frequencies but different chromosome frequencies.
Gene Hunting: Linkage and Association
Evolution of Populations
Population Genetics  Population- a group of members of the same species living in a given area  Ex. –People in CR Metro Area –Oak trees at Rock Island.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
 Linked Genes Learning Objective DOT Point: predict the difference in inheritance patterns if two genes are linked Sunday, June 05,
Joint Linkage and Linkage Disequilibrium Mapping Key Reference Li, Q., and R. L. Wu, 2009 A multilocus model for constructing a linkage disequilibrium.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
INTRODUCTION TO ASSOCIATION MAPPING
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
Lab 7. Estimating Population Structure. Goals 1.Estimate and interpret statistics (AMOVA + Bayesian) that characterize population structure. 2.Demonstrate.
Lab 9: Linkage Disequilibrium. Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate.
Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Lab 11 :Test of Neutrality and Evidence for Selection
Lab 4: Inbreeding and Kinship. Inbreeding Reduces heterozygosity Does not change allele frequencies.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
OUTLINE 22 Forces that disrupt HW equilibrium
Genetic Linkage.
Measuring Evolutionary Change Over Time
Hardy-Weinberg Theorem
HARDY-WEINBERG and GENETIC EQUILIBRIUM
Genetic Linkage.
Recombination (Crossing Over)
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Diversity of Individuals and Evolution of Populations
The ‘V’ in the Tajima D equation is:
Basic concepts on population genetics
The Evolution of Populations
The Mechanisms of Evolution
Genetic Linkage.
Accuracy of Haplotype Frequency Estimation for Biallelic Loci, via the Expectation- Maximization Algorithm for Unphased Diploid Genotype Data  Daniele.
THE EVOLUTION OF POPULATIONS
Linkage Analysis Problems
Hardy-Weinberg Lab Data
Presentation transcript:

Lab 9: Linkage Disequilibrium

Goals 1.Estimation of LD in terms of D, D’ and r 2. 2.Determine effect of random and non-random mating on LD. 3.Estimate LD from diploid genotype data using EM-algorithm.

LD estimation in two-locus (A&B) and two- allele (1 & 2) model A1 A2 B1 B2B1 B2 p1 p2 q1 q2q1 q2 GameteObserved gametic frequency Expected gametic frequency under linkage equilibrium AlleleAllele frequency A1B1 x 11 p1q1p1q1 A1p1=x11+x12 A1B2 x 12 p1q2p1q2 A2p2= x21+x22 A2B1 x 21 p2q1p2q1 B1q1= x11+x21 A2B2 x 22 p2q2p2q2 B2q2= x12+x22

If D > 0, D max = min(p 1 q 2, p 2 q 1 ) If D < 0, D max = min(p 1 q 1, p 2 q 2 ). Different measures of LD

Allele history High drift or Selective sweep Time

LD Broken by recombination A1A1 B1B1 A2A2 B2B2 A1A1 B2B2 A2A2 B1B1 A1A1 B1B1 A1A1 B2B2 A1A1 B2B2 A1A1 B1B1

Closer proximity -> less recombination -> stronger LD

Decay of LD Recombination rate for self-fertilizing organisms:

a)Calculate D, D’, and r 2, and test the statistical significance of the gametic disequilibrium between the two loci. b)Because the linkage phase of each mother tree was known, Adams and Joly were able to estimate that the recombination rate between the two loci is c = i) What is the expected value of D in the next generation (i.e., in the offspring of the seeds that were included in the study)? ii) How many generations of random mating will it take for D to decay below 0.005? iii) What is the expected value of D in the next generation if: S = 0.1? S = 0.5? S = 0.9? c)Repeat the calculations from b) assuming c = 0.5 (i.e., assuming that the two loci are physically unlinked). d)Discuss the relative importance of rates of recombination and self-fertilization in determining the rate of decay of LD. Problem 1. In most conifers, gamete frequencies and the linkage phase of diploid genotypes can be determined directly because seeds contain relatively large amounts of haploid nutritional tissue (called endosperm or megagametophyte), which originates from the maternal gamete. As part of a study of the linkage relationship among allozyme loci in loblolly pine (Pinus taeda), Adams and Joly (1980) sampled 456 gametes at loci phosphoglucose isomerase 2 (PGI2, for simplicity, let this be locus A) and glutamate-oxaloacetate transaminase 1 (GOT1, let this be locus B) and observed the following numbers of gametes. GameteCount A1B1148 A1B278 A2B188 A2B2142 Total456

Problem 2. Compare rates of decay of r 2 with physical distance in sequences from the phytochrome B2 (PHYB2) gene in European aspen (Populus tremula) and the phytochrome C (PHYC) gene in Arabidopsis thaliana. a)Show scatter plots with trend lines illustrating the decay of r 2 with physical distance for each gene. b)How do the patterns of LD differ between these two species, and why? (BIOLOGICAL EXPLANATION) c)GRADUATE STUDENTS ONLY: Provide facts and citations supporting your biological explanation.

When we genotype, we often don’t know the actual haplotypes – Unphased haplotypes Can use a maximum likelihood method to obtain haplotype frequencies – Expectation Maximization (EM) Haplotypes through EM

1.Initialize – Guess the gamete frequencies 2.Expectation Step – Find expected frequencies of known phase genotypes given gamete frequencies 3.Maximization Step – Find expected frequencies of all unphased genotypes given gamete frequencies a.Use to make new gamete frequency estimates where n= # of unphased genotypes in the samples, n1, n2….n5, are the # of times each unphased genotype was observed in the sample, and P1, P2, …., P5 are the expected frequencies of the unphased genotypes in the sample.

Problem 3. File human_LD.arp contains data for humans from two populations (Han and Melanesian) genotyped for the same loci you have analyzed for departures from Hardy-Weinberg Equilibrium. The Han sample includes individuals from a broad geographic area in China, whereas the Melanesian sample only includes individuals from the Bougainville Island. Use Arlequin to test for significant linkage disequilibrium among the 10 loci in each of these populations. a)How do you interpret the difference in the number of linked loci in the two populations? (STATISTICAL AND BIOLOGICAL INTERPRETIONS) b)GRADUATE STUDENTS ONLY: How many pairs of loci are expected to show significant LD at α = 0.05 by chance (i.e., if there is no gametic disequilibrium among them in the population)? c)GRADUATE STUDENTS ONLY: Provide facts and citations supporting your biological interpretation of the results.

Han