Lecture 20 : Tests of Neutrality

Slides:



Advertisements
Similar presentations
Alleles = A, a Genotypes = AA, Aa, aa
Advertisements

Chapter 23: The Evolution of Populations
Lecture 21: Introduction to Neutral Theory and Phylogenetics March 31, 2014.
Sampling distributions of alleles under models of neutral evolution.
Lecture 9: Introduction to Genetic Drift February 14, 2014.
Chapter 17 Population Genetics and Evolution, part 2 Jones and Bartlett Publishers © 2005.
Random fixation and loss of heterozygosity
Exam Thursday Covers material through Today’s lecture Practice problems and answers are posted Bring a calculator 5 questions, answer your favorite 4 Please.
Chapter 23: Population Genetics (Microevolution)
Change in frequency of the unbanded allele (q) as a function of q for island populations. Equilibrium points a)Strong selection for q, little migration.
Lecture 23: Introduction to Coalescence April 7, 2014.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Atelier INSERM – La Londe Les Maures – Mai 2004
Signatures of Selection
Molecular evolution:   how do we explain the patterns of variation observed in DNA sequences? how do we detect selection by comparing silent site substitutions.
Gene flow and genetic differentiation This population is composed of only 20 individuals In this population, selection favors the AA genotype by 10%
14 Molecular Evolution and Population Genetics
From population genetics to variation among species: Computing the rate of fixations.
1 Mendelian Genetics in Populations II Migration, Genetic Drift, and Nonrandom Mating.
Scott Williamson and Carlos Bustamante
Lamarck vs Darwin worksheet Bell Ringer
Standard error of estimate & Confidence interval.
Hidenki Innan and Yuseob Kim Pattern of Polymorphism After Strong Artificial Selection in a Domestication Event Hidenki Innan and Yuseob Kim A Summary.
Population Genetics Learning Objectives
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Lecture 21: Tests for Departures from Neutrality November 9, 2012.
Evolution of Populations
The Evolution of Populations Chapter 23 Biology – Campbell Reece.
Models of Molecular Evolution I Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.1 – 7.2.
Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012.
Lab 11 :Test of Neutrality and Evidence for Selection.
Lecture 3: population genetics I: mutation and recombination
Lecture 23: Causes and Consequences of Linkage Disequilibrium November 16, 2012.
Course outline HWE: What happens when Hardy- Weinberg assumptions are met Inheritance: Multiple alleles in a population; Transmission of alleles in a family.
Lecture 19: Mutation November 2, Last Time  Human origins  Human population structure  Signatures of selection in human populations  Neanderthals,
Lecture 21 Based on Chapter 21 Population Genetics Copyright © 2010 Pearson Education Inc.
Lecture 3: Allele Frequencies and Hardy-Weinberg Equilibrium August 27, 2012.
Lecture 5: Genetic Variation and Inbreeding August 31, 2015.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Coalescent Models for Genetic Demography
The plant of the day Bristlecone pine - Two species Pinus aristata (CO, NM, AZ), Pinus longaeva (UT, NV, CA) Thought to reach an age far greater than any.
1 Population Genetics Definitions of Important Terms Population: group of individuals of one species, living in a prescribed geographical area Subpopulation:
Selectionist view: allele substitution and polymorphism
Lecture 19 : Mutation, Selection, and Neutral Theory
Lecture 18: Mutation October 30, Last Time uExam to be returned Monday. Answer key is posted uEffects of population structure, admixture, selection,
Lecture 20: Introduction to Neutral Theory
NEW TOPIC: MOLECULAR EVOLUTION.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Lab 11 :Test of Neutrality and Evidence for Selection
The plant of the day Pinus longaevaPinus aristata.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
By Bryce Perry and Cecil Brown
Lecture 21: Introduction to Phylogenetics November 9, 2015.
(last 10 slides from Chapter 16) Chapter 17 Population Genetics and Evolution Jones and Bartlett Publishers © 2005.
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
HS-LS-3 Apply concepts of statistics and probability to support explanations that organisms with an advantageous heritable trait tend to increase in proportion.
Lecture 6 Genetic drift & Mutation Sonja Kujala
Topics How to track evolution – allele frequencies
Data analyses Course code: ZOO560 Week 3
Why study population genetic structure?
Signatures of Selection
The neutral theory of molecular evolution
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
The Neutral Theory M. Kimura, 1968
The ‘V’ in the Tajima D equation is:
Testing for Selective Neutrality
Evolution by Genetic Drift : Main Points (p. 231)
Evolution by Genetic Drift : Main Points (p. 231)
Presentation transcript:

Lecture 20 : Tests of Neutrality November 6, 2015

Last Time Mutation and selection Infinite alleles and stepwise mutation models Introduction to neutral theory

Today Sequence data and quantification of variation Infinite sites model Nucleotide diversity (π) Sequence-based tests of neutrality Ewens-Watterson Test Tajima’s D Hudson-Kreitman-Aguade Synonymous versus Nonsynonymous substitutions McDonald-Kreitman nov6_neutraltest

The main power of neutral theory is it provides a theoretical expectation for genetic variation in the absence of selection.

Equilibrium Heterozygosity under IAM Frequencies of individual alleles are constantly changing Balance between loss and gain is maintained 4Neμ>>1: mutation predominates, new mutants persist, H is high 4Neμ<<1: drift dominates: new mutants quickly eliminated, H is low

Effects of Population Size on Expected Heterozgyosity Under Infinite Alleles Model (μ=10-5) Rapid approach to equilibrium in small populations Higher heterozygosity with less drift

Fate of Alleles in Mutation-Drift Balance Generations from birth to fixation Time between fixation events Time to fixation of a new mutation is much longer than time to loss

Fate of Alleles in Mutation-Drift-Selection Balance Purifying Selection Which case will have the most alleles on average at any given time? Highest HE? What will this depend upon? Neutrality Balancing Selection/Overdominance

Number of Observations of Allele Assume you take a sample of 100 alleles from a large (but finite) population in mutation-drift equilibrium. What is the expected distribution of allele frequencies in your sample under neutrality and the Infinite Alleles Model? A. Number of Observations of Allele Number of Alleles 2 4 6 8 10 B. 2 4 6 8 10 C. 2 4 6 8 10

Allele Frequency Distributions Neutral theory allows a prediction of frequency distribution of alleles through process of birth and demise of alleles through time Comparison of observed to expected distribution provides evidence of departure from Infinite Alleles model Depends on f, effective population size, and mutation rate Hartl and Clark 2007 Black: Predicted from Neutral Theory White: Observed (hypothetical)

Ewens Sampling Formula Population mutation rate: index of variability of population: Probability the i-th sampled allele is new given i alleles already sampled: Probability of sampling a new allele on the first sample: Probability of observing a new allele after sampling one allele: Probability of sampling a new allele on the third and fourth samples: . Expected number of different alleles (k) in a sample of 2N alleles is: Example: Expected number of alleles in a sample of 4:

Ewens Sampling Formula Predicts number of different alleles that should be observed in a given sample size if neutrality prevails under Infinite Alleles Model Small , E(n) approaches 1 Large , E(n) approaches 2N  can be predicted from number of observed alleles for given sample size Can also predict expected homozygosity (fe) under this model where E(n) is the expected number of different alleles in a sample of N diploid individuals, and  = 4Ne.

Ewens-Watterson Test Compares expected homozygosity under the neutral model to expected homozygosity under Hardy-Weinberg equilibrium using observed allele frequencies Comparison of allele frequency distributions fe comes from infinite allele model simulations and can be found in tables for given sample sizes and observed allele numbers

Ewens-Watterson Test Example Drosophila pseudobscura collected from winery Xanthine dehydrogenase alleles 15 alleles observed in 89 chromosomes fHW = 0.366 Generated fe by simulation: mean 0.168 fe Hartl and Clark 2007 How would you interpret this result?

Most Loci Look Neutral According to Ewens-Watterson Test Expected Homozygosity fe Hartl and Clark 2007

DNA Sequence Polymorphisms DNA sequence is ultimate view of standing genetic variation: no hidden alleles Is this really true? What about back mutation? Signatures of past evolution are contained in DNA sequence Neutral theory presents null model Departures due to: Selection Demographic events Bottlenecks, founder effects Population admixture

Sequence Alignment Necessary first step for comparing sequences within and between species Many different algorithms Tradeoff of speed and accuracy

Quantifying Divergence of Sequences Nucleotide diversity (π) is average number of pairwise differences between sequences where N is number of sequences in sample, pi and pj are frequency of sequences i and j in the sample, and πij is the proportion of sites that differ between sequences i and j

Sample Calculation of π 5 10 15 20 25 30 35 A B C A->B, 1 difference A->C, 1 difference B->C, 2 differences On average, there are 18.67 polymorphisms per kb between pairs of haplotypes in the population

Tajima’s D Statistic Infinite Sites Model: each new mutation affects a new site in a sequence Expected number of polymorphic sites in all sequences: where m is length of sequence, and where n is number of different sequences compared

Sample Calculation of S 5 10 15 20 25 30 35 A B C Two polymorphic sites S=2

where V(d) is variance of d Tajima’s D Statistic Two different ways of estimating same parameter: Deviation of these two indicates deviation from neutral expectations where V(d) is variance of d

Tajima’s D Expectations D=0: Neutrality D>0 Balancing Selection: Divergence of alleles (π) increases OR Bottleneck: S decreases D<0 Purifying or Positive Selection: Divergence of alleles decreases Population expansion: Many low frequency alleles cause low average divergence

Slide adapted from Yoav Gilad Balancing Selection Balancing selection   ‘balanced’ mutation Neutral mutation Should increase nucleotide diversity () Decreases polymorphic sites (S) initially. D>0 Slide adapted from Yoav Gilad

Standard neutral model Recent Bottleneck Rare alleles are lost Polymorphic sites (S) more severely affected than nucleotide nucleotide diversity () D>0 Standard neutral model

Positive Selection and Purifying Selection sweep recovery  S  s Time Advantageous mutation Neutral mutation Should decrease both nucleotide diversity () and polymorphic sites (S) initially. S recovers due to mutation  recovers slowly: insensitive to rare alleles D<0 Slide adapted from Yoav Gilad

Standard neutral model Rapid Population Growth will also result in an excess of rare alleles even for neutral loci Standard neutral model Rapid population size increase Most alleles are rare Most alleles are rare Nucleotide diversity () depressed Polymorphic sites (S) unchanged or even enhanced : 4Neμ is large D<0 Time Often two main haplotypes, some rare alleles Slide adapted from Yoav Gilad

How do we distinguish these two forms of divergence (selection vs demography)?