Molecular Evolution and Population Genetics A few notes on population genetics of interest in phylogenetics Thomas Mailund.

Slides:



Advertisements
Similar presentations
The Five Factors of Evolution
Advertisements

Sampling distributions of alleles under models of neutral evolution.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
Population Genetics I. Evolution: process of change in allele
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
Islands in Africa: a study of structure in the source population for modern humans Rosalind Harding Depts of Statistics, Zoology & Anthropology, Oxford.
From population genetics to variation among species: Computing the rate of fixations.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
CSE 291: Advanced Topics in Computational Biology Vineet Bafna/Pavel Pevzner
Patterns of Natural Selection
Phylogenetic trees Sushmita Roy BMI/CS 576
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
The Evolution of Populations Individuals do not evolve, populations do…Darwin Lynn English High School Science Biology/Ms. Mezzetti POPULATION GENETICS.
How Populations Grow What is a population?  A population consists of all the individuals of a species that live together in one place at one time. What.
Mechanisms of Change The following four processes are the basic mechanisms by which evolution occurs. Its not just one mechanism that Evolution thrives.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
The Evolution of Populations Chapter 23 Biology – Campbell Reece.
Course outline HWE: What happens when Hardy- Weinberg assumptions are met Inheritance: Multiple alleles in a population; Transmission of alleles in a family.
Calculating branch lengths from distances. ABC A B C----- a b c.
Recombination, Mutation, Genetic Drift, Gene Flow Also evolution Also evolution.
Finite population. - N - number of individuals - N A and N a – numbers of alleles A and a in population Two different parameters: one locus and two allels.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Coalescent Models for Genetic Demography
Recombination, Mutation, Genetic Drift, Gene Flow Also evolution Also evolution.
Selectionist view: allele substitution and polymorphism
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny Ch. 7 & 8.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Restriction enzyme analysis The new(ish) population genetics Old view New view Allele frequency change looking forward in time; alleles either the same.
Fun fact: Humans are one of the least diverse species; fruit flies are 10x as diverse.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
Molecular Clocks and Evolution Bio 135 Summer 2009.
By Bryce Perry and Cecil Brown
BME 130 – Genomes Lecture 20 Population Genomics I.
Monkey Business Bioinformatics Research Center University of Aarhus Thomas Mailund Joint work with Asger Hobolth, Ole F. Christiansen and Mikkel H. Schierup.
Association mapping Fundamental Principles and a few methods Thomas Mailund Slides:
Lecture 6 Genetic drift & Mutation Sonja Kujala
Chapter 14 Genetic Algorithms.
Chapter 16 Section 1 Genes and Variation
Around the World Biology edition.
Evolutionary genomics can now be applied beyond ‘model’ organisms
An Algorithm for Computing the Gene Tree Probability under the Multispecies Coalescent and its Application in the Inference of Population Tree Yufeng Wu.
Polymorphism Polymorphism: when two or more alleles at a locus exist in a population at the same time. Nucleotide diversity: P = xixjpij considers.
Human Chimp How does DNA evolve? Nucleotide substitutions
COALESCENCE AND GENE GENEALOGIES
Evolution as genetic change
PROCESS OF EVOLUTION.
LECTURE 7: Microevolution Part 5 Genetic Drift
Genetic Variations with Populations
Agents for Evolutionary Change
Statistical Modeling of Ancestral Processes
Testing the Neutral Mutation Hypothesis
Molecular Clocks Rose Hoberman.
Type Topic in here! Created by Educational Technology Network
The coalescent with recombination (Chapter 5, Part 1)
There is a Great Diversity of Organisms
Biology I Chapters 16.
DNA and the Genome Key Area 7c Speciation.
11.1 Genetic Variation within Popln
There is a Great Diversity of Organisms on Planet Earth……….why?
III. Darwin’s Theory of Natural Selection
THE EVOLUTION OF POPULATIONS
The Evolution of Populations
First, let’s talk about the word THEORY…
Lecture 19 – Non-tree Based Methods
Presentation transcript:

Molecular Evolution and Population Genetics A few notes on population genetics of interest in phylogenetics Thomas Mailund

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events Speciation (birth)

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events Speciation

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events Extinction (death)

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges Mutation

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges There is no such process!

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges There is no such process! I dare you to observe a 'species'...

Species Evolution ● A common view: – Death/birth process of species ● Binary tree with inner nodes being speciation events – Mutations accumulate on edges There is no such process! An abstraction over a more complicated process...

Population Evolution ● Wright-Fisher model – Discrete, non-overlapping generations – Constant population size – Each individual in one generation is a random copy of an individual from the previous generation

Population Evolution ● Wright-Fisher model – Discrete, non-overlapping generations – Constant population size – Each individual in one generation is a random copy of an individual from the previous generation

Population Evolution ● Wright-Fisher model – Discrete, non-overlapping generations – Constant population size – Each individual in one generation is a random copy of an individual from the previous generation

Population Evolution ● Wright-Fisher model – Discrete, non-overlapping generations – Constant population size – Each individual in one generation is a random copy of an individual from the previous generation

Population Evolution ● Wright-Fisher model – Discrete, non-overlapping generations – Constant population size – Each individual in one generation is a random copy of an individual from the previous generation

Population Evolution Loss Mutation

Population Evolution Species view

Population Evolution Mutation Fixation

Population Evolution Species view

Population Evolution Mutation Polymorphism...

Population Evolution Species view ???

Population Evolution There is no such process! I dare you to observe a 'generation'...

Population Evolution There is no such process! This abstraction is good enough for now...

Unsupported Claims ● Species are populations on a larger scale ● Speciation events are population splits plus time

Unsupported Claims ● Species are populations on a larger scale ● Speciation events are population- splits plus time (no one really knows how speciation occurs...)

Population Evolution Species view ??? What 'type' is the species gene here?

Population Evolution Species view Individuals view

Population Evolution Species view Individuals view ???

Population Evolution Species view Individuals view 1/5 4/5

Dealing with polymorphism --A C A----G T---C---A T G A----G C---C---A A G G----G C---C---A A C A----G T---C---A T C A----G T---C---A T C A----T T---A---A A C A----G T---C---A A C A----G T---C---G T C A----T T---C---A A C A----G T---C---A A C G----T C---A---A A C A----G C---C---G---- Two populations/species with polymorphic sites Alleles polymorphic in both population

Dealing with polymorphism --A C A----G T---C---A T G A----G C---C---A A G G----G C---C---A A C A----G T---C---A T C A----G T---C---A T C A----T T---A---A A C A----G T---C---A A C A----G T---C---G T C A----T T---C---A A C A----G T---C---A A C G----T C---A---A A C A----G C---C---G---- Two populations/species with polymorphic sites Alleles fixed in one population

Dealing with polymorphism --A C A----G T---C---A T G A----G C---C---A A G G----G C---C---A A C A----G T---C---A T C A----G T---C---A T C A----T T---A---A A C A----G T---C---A A C A----G T---C---G T C A----T T---C---A A C A----G T---C---A A C G----T C---A---A A C A----G C---C---G---- Different allele-frequencies for different genes The difference is only in frequencies...

Distances Based on Frequencies ● Consider vectors of allele- frequencies – Here n alleles at a single gene Take values in range and can be normalized as ● Population distance just average over genes Roger's Distance

Distances Based on Frequencies ● Consider single (two-allele) gene – Generalizes to hyper-spheres for more alleles ● Distance as angle ● Population distance just average over genes Take values in rangeand can be normalized by Bhattacharyya's Distance

Distances Based on Frequencies ● Consider single (two-allele) gene – Generalizes to hyper-spheres for more alleles ● Distance as cord-length ● Population distance just average over genes Take values in rangeand can be normalized by Cavalli-Sforza & Edwards' Distance

Population vs Individual Trees Population view Individuals view

Population vs Individual Trees Population view Individuals view

Population vs Individual Trees Population view Individuals view

Population vs Individual Trees ● Only a problem if – Splitting time is very fast – Or population sizes are very large

Species vs Individual Trees ● Only a problem if – Splitting time is very fast – Or population sizes are very large ● Not really happening at the species level – At least hard to imagine how...

Species vs Gene Trees ● Only a problem if – Splitting time is very fast – Or population sizes are very large ● Not really happening at the species level – At least hard to imagine how... ● But it does happen between species when looking at some parts of the genome! – Recombination splits the genome in segments – Some segments can have different (gene) trees than the species tree

Recombination Recap ● Recombination events split the individual's genomes in segments ● Some coalesce sooner than others

New Observation ● Recombination events split the individual's genomes in segments ● Some coalesce sooner than others ● Some might not coalesce until after (back in time) the species have merged

Example: Great Apes ● From fossils and molecular data we know that the species tree is: ● But a large fraction of genes (10-20%) support: Human Chimps Gorilla Human Chimps Gorilla or Human ChimpsGorilla

Example: Great Apes Hobolt et al Human Chimps Gorilla Case A: Only Human and Chimp can coalesce here. Always consistent with species tree. Case B: All species can coalesce here. Only a third consistent with species tree.

Example: Great Apes Hobolt et al Human Chimps Gorilla Case A: Only Human and Chimp can coalesce here. Always consistent with species tree. Case B: All species can coalesce here. Only a third consistent with species tree. How often both H and C reach B depends on the H+C population size and the time between the H+C+B split and the H+C split.

Example: Great Apes Hobolt et al. 2006

coalHMM Hobolt et al. 2006

coalHMM ● Simulated data: Hobolt et al. 2006

coalHMM ● 100 kbp real data: Hobolt et al. 2006

coalHMM ● From fraction of segments in state B and length of segments before state changes we can infer: – Species population sizes (both current species and ancestral species over the time span considered) – Speciation times Hobolt et al. 2006

Take-Home Message ● When dealing with populations, or closely related species: – Polymorphism must be taken into account – Recombination must be taken into account