Lecture 21: Tests for Departures from Neutrality November 9, 2012.


Last Time
- Introduction to neutral theory
- Molecular clock
- Expectations for allele frequency distributions under neutral theory

Today
- Sequence data and quantification of variation
  - Infinite sites model
  - Nucleotide diversity (π)
- Sequence-based tests of neutrality
  - Ewens-Watterson Test
  - Tajima's D
  - Hudson-Kreitman-Aguade
  - Synonymous versus nonsynonymous substitutions
  - McDonald-Kreitman

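Expected Heterozygosity with Mutation-Drift Equilibrium under the Infinite Alleles Model (IAM)
- At equilibrium: $f_e = \frac{1}{4N_e\mu + 1}$
- Remembering that $H = 1 - f$: $H_e = \frac{4N_e\mu}{4N_e\mu + 1}$
- Set $4N_e\mu = \theta$: $H_e = \frac{\theta}{\theta + 1}$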
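The relationship on this slide can be checked numerically. The sketch below evaluates $H = \theta/(\theta + 1)$ with $\theta = 4N_e\mu$; the function name and the parameter values are illustrative assumptions, not values from the lecture.

```python
def expected_heterozygosity(ne, mu):
    """Equilibrium heterozygosity under the infinite alleles model."""
    theta = 4 * ne * mu          # population mutation rate
    return theta / (theta + 1)   # H = 1 - f, with f = 1 / (theta + 1)


# Hypothetical parameter values, chosen only for illustration.
for ne in (1_000, 10_000, 100_000):
    print(ne, round(expected_heterozygosity(ne, mu=1e-5), 3))
```

Heterozygosity rises with effective population size for a fixed mutation rate, and saturates toward 1 as θ becomes large.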

Allele Frequency Distributions
- Neutral theory allows a prediction of the frequency distribution of alleles through the process of birth and demise of alleles through time
- Comparison of observed to expected distribution provides evidence of departure from the Infinite Alleles Model
- Depends on f, effective population size, and mutation rate
[Figure: allele frequency distributions. Black: predicted from neutral theory. White: observed (hypothetical). Hartl and Clark 2007]

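Ewens Sampling Formula
- Probability that the next sampled allele is new, given i alleles already sampled: $\frac{\theta}{\theta + i}$
- Probability of sampling a new allele on the first sample: $\frac{\theta}{\theta + 0} = 1$
- Probability of observing a new allele after sampling one allele: $\frac{\theta}{\theta + 1}$
- Probability of sampling a new allele on the third and fourth samples: $\frac{\theta}{\theta + 2}$ and $\frac{\theta}{\theta + 3}$
- Expected number of different alleles (k) in a sample of 2N alleles: $E(k) = \sum_{i=0}^{2N-1} \frac{\theta}{\theta + i}$
- Example: expected number of alleles in a sample of 4: $E(k) = 1 + \frac{\theta}{\theta + 1} + \frac{\theta}{\theta + 2} + \frac{\theta}{\theta + 3}$
- Population mutation rate, an index of variability of the population: $\theta = 4N_e\mu$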

Ewens Sampling Formula
- Predicts the number of different alleles that should be observed in a given sample size if neutrality prevails under the Infinite Alleles Model: $E(n) = 1 + \frac{\theta}{\theta + 1} + \frac{\theta}{\theta + 2} + \dots + \frac{\theta}{\theta + 2N - 1}$, where E(n) is the expected number of different alleles in a sample of N diploid individuals (written E(k) on the previous slide), and $\theta = 4N_e\mu$
  - Small θ: E(n) approaches 1
  - Large θ: E(n) approaches 2N
- θ can be estimated from the number of observed alleles for a given sample size
- Can also predict expected homozygosity ($f_e$) under this model
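The expected number of alleles and its two limits can be verified with a few lines of code. This is a minimal sketch: the function name is an assumption, and `n_copies` stands for the 2N gene copies in the sample.

```python
def expected_num_alleles(theta, n_copies):
    """Sum of theta / (theta + i) for i = 0 .. n_copies - 1."""
    return sum(theta / (theta + i) for i in range(n_copies))


n_copies = 4  # the sample of 4 used in the slide example
print(expected_num_alleles(1.0, n_copies))    # 1 + 1/2 + 1/3 + 1/4
print(expected_num_alleles(1e-6, n_copies))   # small theta: close to 1
print(expected_num_alleles(1e6, n_copies))    # large theta: close to n_copies
```

Because the sum increases monotonically with θ, an observed allele count can be turned into an estimate of θ by numerically solving for the θ at which the expected count matches the observed count.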

Ewens-Watterson Test
- Compares expected homozygosity under the neutral model ($f_e$) to expected homozygosity under Hardy-Weinberg equilibrium using observed allele frequencies ($f_{HW} = \sum_i p_i^2$)
  - Comparison of allele frequency distributions
- $f_e$ comes from infinite allele model simulations and can be found in tables for given sample sizes and observed allele numbers

Ewens-Watterson Test Example
- Drosophila pseudoobscura collected from winery
- Xanthine dehydrogenase alleles
- 15 alleles observed in 89 chromosomes
- $f_{HW}$ = [value not preserved in transcript]
- Generated $f_e$ by simulation: mean $f_e$ = [value not preserved in transcript]
- How would you interpret this result?
(Hartl and Clark 2007)
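One way to generate a simulated $f_e$ for a sample like this one (89 gene copies, 15 alleles) is sketched below. It is an illustration under stated assumptions, not the procedure used by Hartl and Clark: samples are drawn with the sequential "new allele with probability θ/(θ + i)" rule from the Ewens sampling formula, and only samples with exactly the observed number of alleles are kept. All function names, the number of replicates, and the seed are hypothetical choices.

```python
import random


def homozygosity(counts):
    """Sum of squared allele frequencies for a list of allele counts."""
    n = sum(counts)
    return sum((c / n) ** 2 for c in counts)


def expected_num_alleles(theta, n):
    return sum(theta / (theta + i) for i in range(n))


def theta_for_k(k, n, lo=1e-6, hi=1e6):
    """Bisection (on a log scale) for the theta at which E(k) equals k."""
    for _ in range(200):
        mid = (lo * hi) ** 0.5
        if expected_num_alleles(mid, n) < k:
            lo = mid
        else:
            hi = mid
    return (lo * hi) ** 0.5


def neutral_sample(n, theta, rng):
    """Allele counts for n gene copies under the infinite alleles model."""
    counts, labels = [], []
    for i in range(n):
        if rng.random() < theta / (theta + i):   # next copy is a new allele
            counts.append(1)
            labels.append(len(counts) - 1)
        else:                                    # copy a previously sampled gene
            allele = labels[rng.randrange(i)]
            counts[allele] += 1
            labels.append(allele)
    return counts


def simulated_fe(n, k, reps=500, seed=1):
    """Mean homozygosity over neutral samples that contain exactly k alleles."""
    rng = random.Random(seed)
    theta = theta_for_k(k, n)   # affects only how often a sample is accepted
    values = []
    while len(values) < reps:
        counts = neutral_sample(n, theta, rng)
        if len(counts) == k:
            values.append(homozygosity(counts))
    return sum(values) / reps


print(simulated_fe(n=89, k=15))
```

An observed $f_{HW}$ well above the simulated mean would indicate fewer intermediate-frequency alleles than neutrality predicts; a value well below it would indicate a more even distribution than expected.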

Most Loci Look Neutral According to Ewens-Watterson Test
[Figure: expected homozygosity $f_e$. Hartl and Clark 2007]

DNA Sequence Polymorphisms
- DNA sequence is the ultimate view of standing genetic variation: no hidden alleles
  - Is this really true?
  - What about back mutation?
- Signatures of past evolution are contained in DNA sequence
- Neutral theory presents a null model
- Departures due to:
  - Selection
  - Demographic events
    - Bottlenecks, founder effects
    - Population admixture

Sequence Alignment
- Necessary first step for comparing sequences within and between species
- Many different algorithms
  - Tradeoff of speed and accuracy

Quantifying Divergence of Sequences
- Nucleotide diversity (π) is the average number of pairwise differences between sequences: $\pi = \frac{N}{N-1} \sum_{i}\sum_{j} p_i p_j \pi_{ij}$
- N is the number of sequences in the sample, $p_i$ and $p_j$ are the frequencies of sequences i and j in the sample, and $\pi_{ij}$ is the proportion of sites that differ between sequences i and j

Sample Calculation of π
- Sequences A, B, C [alignment not preserved in transcript]
- A->B, 1 difference
- A->C, 1 difference
- B->C, 2 differences
- On average, there are [value not preserved in transcript] polymorphisms per kb between pairs of haplotypes in the population
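The pairwise counting in this sample calculation can be sketched in code. The three 10-base haplotypes below are hypothetical stand-ins for the lost alignment, built only so that the pairwise differences match the slide (A-B 1, A-C 1, B-C 2). When every sequence in the sample is counted once, averaging over distinct pairs gives the same value as the frequency-weighted sum with the N/(N-1) correction.

```python
from itertools import combinations


def pairwise_differences(a, b):
    """Number of aligned positions at which two sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def nucleotide_diversity(seqs):
    """Return (mean pairwise differences, mean per-site proportion)."""
    pairs = list(combinations(seqs, 2))
    mean_diff = sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)
    return mean_diff, mean_diff / len(seqs[0])


# Hypothetical aligned haplotypes (not the sequences from the slide).
haplotypes = {
    "A": "ACGTACGTAC",
    "B": "ACGTTCGTAC",
    "C": "ACGTACGTGC",
}

mean_diff, per_site = nucleotide_diversity(list(haplotypes.values()))
print(mean_diff, per_site)
```

Multiplying the per-site value by 1000 expresses it as differences per kb, the unit used on the slide.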

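Tajima's D Statistic
- Infinite Sites Model: each new mutation affects a new site in a sequence
- Expected number of polymorphic sites in all sequences: $E(S) = a_1 m \theta$, where m is the length of the sequence and θ is expressed per site, so that $\theta_S = \frac{S}{a_1 m}$
- $a_1 = \sum_{i=1}^{n-1} \frac{1}{i}$, where n is the number of sequences compared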

Sample Calculation of $\theta_S$
- Sequences A, B, C [alignment not preserved in transcript]
- Two polymorphic sites: S = 2
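A short sketch of this calculation follows. It counts polymorphic sites column by column and divides by $a_1 = \sum_{i=1}^{n-1} 1/i$; the haplotypes are hypothetical stand-ins for the lost alignment, chosen so that exactly two sites are polymorphic, and the function names are assumptions.

```python
def segregating_sites(seqs):
    """Number of alignment columns with more than one nucleotide state."""
    return sum(len(set(column)) > 1 for column in zip(*seqs))


def watterson_theta(seqs):
    """Return (theta_S per locus, theta_S per site)."""
    n = len(seqs)
    a1 = sum(1 / i for i in range(1, n))   # 1 + 1/2 + ... + 1/(n - 1)
    s = segregating_sites(seqs)
    return s / a1, s / (a1 * len(seqs[0]))


# Hypothetical aligned haplotypes (not the sequences from the slide).
haplotypes = ["ACGTACGTAC", "ACGTTCGTAC", "ACGTACGTGC"]

print(segregating_sites(haplotypes))
print(watterson_theta(haplotypes))
```

With three sequences, $a_1 = 1.5$, so two polymorphic sites give a per-locus estimate of 2 / 1.5.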

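Tajima's D Statistic
- Two different ways of estimating the same parameter: π and $\theta_S$ both estimate $\theta = 4N_e\mu$
- Deviation of these two indicates deviation from neutral expectations: $d = \pi - \theta_S$ and $D = \frac{d}{\sqrt{V(d)}}$, where V(d) is the variance of d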
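The statistic can be computed from three summary values: the number of sequences n, the number of polymorphic sites S, and the mean number of pairwise differences k (π on a per-locus scale). The sketch below uses the standard variance constants from Tajima (1989), which the slide does not spell out; the function name and the example values are hypothetical.

```python
import math


def tajimas_d(n, s, k):
    """Tajima's D from sample size n, segregating sites s, mean pairwise differences k."""
    if n < 4 or s == 0:
        # The variance term is zero for n < 4, and D is undefined without variation.
        raise ValueError("need n >= 4 sequences and at least one segregating site")
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n**2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    d = k - s / a1                           # pi minus theta_S, per locus
    variance = e1 * s + e2 * s * (s - 1)     # estimated V(d)
    return d / math.sqrt(variance)


# Hypothetical sample: 10 sequences, 16 polymorphic sites, mean of about 3.89 differences.
print(round(tajimas_d(n=10, s=16, k=3.888889), 3))   # about -1.45
```

A negative value like this one reflects an excess of rare variants: S is large relative to the average pairwise difference.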

Tajima's D Expectations
- D = 0: Neutrality
- D > 0
  - Balancing Selection: divergence of alleles (π) increases, OR
  - Bottleneck: S decreases
- D < 0
  - Purifying or Positive Selection: divergence of alleles decreases, OR
  - Population expansion: many low-frequency alleles cause low average divergence

Balancing Selection
- Should increase nucleotide diversity (π)
- Decreases polymorphic sites (S) initially
- D > 0
[Figure: genealogy under balancing selection, showing a 'balanced' mutation and neutral mutations. Slide adapted from Yoav Gilad]

Recent Bottleneck
- Rare alleles are lost
- Polymorphic sites (S) more severely affected than nucleotide diversity (π)
- D > 0
[Figure: comparison with the standard neutral model]

Positive Selection and Purifying Selection
- Should decrease both nucleotide diversity (π) and polymorphic sites (S) initially
- S recovers due to mutation
- π recovers slowly: insensitive to rare alleles
- D < 0
[Figure: selective sweep of an advantageous mutation, followed by recovery over time, with neutral mutations. Slide adapted from Yoav Gilad]

Rapid Population Growth
- Rapid population growth will also result in an excess of rare alleles even for neutral loci
- Standard neutral model: often two main haplotypes, some rare alleles
- After a rapid population size increase: most alleles are rare
  - Nucleotide diversity (π) depressed
  - Polymorphic sites (S) unchanged or even enhanced: $4N_e\mu$ is large
  - D < 0
[Figure: genealogies over time under the standard neutral model and under rapid population size increase. Slide adapted from Yoav Gilad]

How do we distinguish these two forms of divergence (selection vs demography)?

Hudson-Kreitman-Aguade Test
- Divergence between species should be of the same magnitude as variation within species
- Provides a correction factor for mutation rates at different sites
- Complex goodness-of-fit test
- Perform test for loci under selection and supposedly neutral loci

Hudson-Kreitman-Aguade (HKA) Test
- Polymorphism: variation within species
- Divergence: variation between species
- Polymorphism/divergence, Neutral Locus vs. Test Locus A: /20 ≈ 3/8 [first numerator not preserved in transcript]
(Slide adapted from Yoav Gilad)

Hudson-Kreitman-Aguade (HKA) Test
- Polymorphism/divergence, Neutral Locus vs. Test Locus B: /20 >> 3/19 [first numerator not preserved in transcript]
- Conclusion: polymorphism lower than expected in Test Locus B. Selective sweep?
(Slide adapted from Yoav Gilad)
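The intuition of comparing polymorphism-to-divergence ratios between two loci can be illustrated with a plain 2x2 chi-square test of homogeneity. This is a deliberate simplification and not the HKA goodness-of-fit test itself, which also models sample sizes and divergence time. The counts are hypothetical, the function name is an assumption, and with counts this small the chi-square approximation is rough.

```python
import math


def ratio_homogeneity_chi2(poly_a, div_a, poly_b, div_b):
    """Pearson chi-square (1 df) for a 2x2 table of polymorphism and divergence counts."""
    total = poly_a + div_a + poly_b + div_b
    numerator = total * (poly_a * div_b - div_a * poly_b) ** 2
    denominator = (
        (poly_a + div_a) * (poly_b + div_b) * (poly_a + poly_b) * (div_a + div_b)
    )
    chi2 = numerator / denominator
    p_value = math.erfc(math.sqrt(chi2 / 2))   # upper tail of chi-square with 1 df
    return chi2, p_value


# Hypothetical counts: a reference locus and a test locus with reduced polymorphism.
chi2, p_value = ratio_homogeneity_chi2(poly_a=12, div_a=24, poly_b=2, div_b=30)
print(round(chi2, 2), round(p_value, 4))
```

A small p-value says only that the two ratios differ; deciding whether selection or demography is responsible still requires comparison with loci assumed to be neutral.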

[Figure: teosinte, maize, and maize w/TBR mutation. Mauricio 2001; Nature Reviews Genetics 2, 376]

HKA Example: Teosinte Branched
- Lab exercise: test the Teosinte-Branched gene for a signature of purifying selection in maize compared to its teosinte relative
- Compare to patterns of polymorphism and diversity in the Alcohol Dehydrogenase gene