N=50 s=0.150 replicates s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among.

Slides:



Advertisements
Similar presentations
IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.
Advertisements

Evolution of genomes.
Quick Lesson on dN/dS Neutral Selection Codon Degeneracy Synonymous vs. Non-synonymous dN/dS ratios Why Selection? The Problem.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Atelier INSERM – La Londe Les Maures – Mai 2004
Molecular Evolution Revised 29/12/06
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Model Selection Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical.
14 Molecular Evolution and Population Genetics
MCB 5472 The Queue, Phylogenetic Reconstruction and Selection Peter Gogarten Office: BSP 404 phone: ,
The gradualist point of view (review) Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages.
MCB 371/372 positive selection 4/25/05 Peter Gogarten Office: BSP 404 phone: ,
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
1 Detecting selection using phylogeny. 2 Evaluation of prediction methods  Comparing our results to experimentally verified sites Positive (hit)Negative.
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
MCB 372 Phylogenetic reconstruction Peter Gogarten Office: BSP 404 phone: ,
From population genetics to variation among species: Computing the rate of fixations.
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
Ml mapping From: Olga Zhaxybayeva and J Peter Gogarten BMC Genomics 2002, 3:4 Olga Zhaxybayeva and J Peter Gogarten BMC Genomics 2002, 3:4.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Course overview Tuesday lecture –Those not presenting turn in short review of a paper using the method being discussed Thursday computer lab –Turn in short.
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
MCB 5472 Types of Selection Peter Gogarten Office: BSP 404 phone: ,
Neutral mutations Neither advantageous nor disadvantageous Invisible to selection (no selection) Frequency subject to ‘drift’ in the population Random.
Positive selection A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Trees – what might they mean? Calculating a tree is comparatively easy, figuring out what it might mean is much more difficult. If this is the probable.
Molecular Evolution, Part 2 Everything you didn’t want to know… and more! Everything you didn’t want to know… and more!
MCB 371/372 quartets positive selection 4/20/05 Peter Gogarten Office: BSP 404 phone: ,
Adaptive Molecular Evolution Nonsynonymous vs Synonymous.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Model Selection Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
MCB5472 Computer methods in molecular evolution Lecture 4/14/2014.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
Molecular phylogenetics
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
Selection versus drift The larger the population the longer it takes for an allele to become fixed. Note: Even though an allele conveys a strong selective.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
PHYLOGENETICS CONTINUED TESTS BY TUESDAY BECAUSE SOME PROBLEMS WITH SCANTRONS.
Identifying and Modeling Selection Pressure (a review of three papers) Rose Hoberman BioLM seminar Feb 9, 2004.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Selectionist view: allele substitution and polymorphism
What is positive selection?
NEW TOPIC: MOLECULAR EVOLUTION.
Bayes’ Theorem Reverend Thomas Bayes ( ) Posterior Probability represents the degree to which we believe a given model accurately describes the.
Molecular evolution Part I: The evolution of macromolecules.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Neutral mutations Neither advantageous nor disadvantageous Invisible to selection (no selection) Frequency subject to ‘drift’ in the population Mutation.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
Modelling evolution Gil McVean Department of Statistics TC A G.
Human survivorship Developed Developing Bob May (2007), TREE 22:
Phylogenetic reconstruction, probability mapping, types of selection. Peter Gogarten Office: BSP 404 phone: ,
BME 130 – Genomes Lecture 20 Population Genomics I.
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Phylip PHYLIP (the PHYLogeny Inference Package) is a package of programs for inferring phylogenies (evolutionary trees). PHYLIP is the most widely-distributed.
LBA ProtPars. LBA Prot Dist no Gamma and no alignment.
Evolution of gene function
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Distances.
What are the Patterns Of Nucleotide Substitution Within Coding and
Codon based alignments in Seaview
Testing the Neutral Mutation Hypothesis
Bayesian inference with MrBayes Molecular Phylogenetics – exercise
Phylogenetic reconstruction
Pedir alineamiento múltiple
DN/dS.
Characteristics of Neutral and Deleterious Protein-Coding Variation among Individuals and Populations  Wenqing Fu, Rachel M. Gittelman, Michael J. Bamshad,
Presentation transcript:

N=50 s=0.150 replicates

s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among yourselves) E.g.: N=10 6, s=0: average time to fixation: 4*10 6 generations s=0.01: average time to fixation: 2900 generations N=10 4, s=0: average time to fixation: generations s=0.01: average time to fixation: generations => substitution rate of mutation under positive selection is larger than the rate wite which neutral mutations are fixed.

Positive selection (s>0) A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive selection or Darwinian selection. NOTE : Fitness = ability to survive and reproduce Modified from from

Negative selection (s<0) A new allele (mutant) confers some decrease in the fitness of the organism Selection acts to remove this allele Also called purifying selection Modified from from

Neutral mutations Neither advantageous nor disadvantageous Invisible to selection (no selection) Frequency subject to ‘drift’ in the population Mutation rate = Substitution rate (independent of population size) In small populations, slightly deleterious or advantagous mutations behave as if neutral. (N eff *s<<1)

Degeneracy of 1 st vs 2 nd vs 3 rd position sites results in 25.5% synonymous changes and 74.5% non synonymous changes (Yang&Nielsen,1998). Genetic Code

Measuring Selection on Genes Null hypothesis = neutral evolution Under neutral evolution, synonymous changes should accumulate at a rate equal to mutation rate Under neutral evolution, amino acid substitutions should also accumulate at a rate equal to the mutation rate From: mentor.lscf.ucsb.edu/course/spring/eemb102/lecture/Lecture7.ppt

Testing for selection using dN/dS ratio dN/dS ratio ( aka Ka/Ks or ω (omega) ratio) where dN = number of non-synonymous substitutions / number of possible non-synonymous substitutions dS =number of synonymous substitutions / number of possible non- synonymous substitutions dN/dS >1 positive, Darwinian selection dN/dS =1 neutral evolution dN/dS <1 negative, purifying selection

dambe Three programs worked well for me to align nucleotide sequences based on the amino acid alignment, One is DAMBE (works well for windows). This is a handy program for a lot of things, including reading a lot of different formats, calculating phylogenies, it even runs codeml (from PAML) for you.DAMBE The procedure is not straight forward, but is well described on the help pages. After installing DAMBE go to HELP -> general HELP -> sequences -> align nucleotide sequences based on …-> If you follow the instructions to the letter, it works fine. DAMBE also calculates Ka and Ks distances from codon based aligned sequences. Alternatives are tranalign from the EMBOSS package, andtranalignEMBOSS Seaview (see below)

dambe (cont)

Codon based alignments in Seaview Load nucleotide sequences (no gaps in sequences, sequence starts with nucleotide corresponding to 1 st codon position) Select view as proteins

Codon based alignments in Seaview With the protein sequences displayed, align sequences Select view as nucleotides

PAML (codeml) the basic model

sites versus branches You can determine omega for the whole dataset; however, usually not all sites in a sequence are under selection all the time. PAML (and other programs) allow to either determine omega for each site over the whole tree,, or determine omega for each branch for the whole sequence,. It would be great to do both, i.e., conclude codon 176 in the vacuolar ATPases was under positive selection during the evolution of modern humans – alas, a single site does not provide much statistics ….

Sites model(s) work great have been shown to work great in few instances. The most celebrated case is the influenza virus HA gene. A talk by Walter Fitch (slides and sound) on the evolution of this molecule is here.here This article by Yang et al, 2000 gives more background on ml aproaches to measure omega. The dataset used by Yang et al is here: flu_data.paup. article by Yang et al, 2000 flu_data.paup

sites model in MrBayes begin mrbayes; set autoclose=yes; lset nst=2 rates=gamma nucmodel=codon omegavar=Ny98; mcmcp samplefreq=500 printfreq=500; mcmc ngen=500000; sump burnin=50; sumt burnin=50; end; The MrBayes block in a nexus file might look something like this:

plot LogL to determine which samples to ignore the same after rescaling the y-axis

for each codon calculate the the average probability enter formula copy paste formula plot row

To determine credibility interval for a parameter (here omega<1): Select values for the parameter, sampled after the burning. Copy paste to a new spreadsheet,

Sort values according to size, Discard top and bottom 2.5% Remainder gives 95% credibility interval.