IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.

Slides:



Advertisements
Similar presentations
Neutral Theory of Molecular Evolution most base substitutions are selectively neutral drift dominates evolution at the molecular level Under drift, rate.
Advertisements

1 Number of substitutions between two protein- coding genes Dan Graur.
Quick Lesson on dN/dS Neutral Selection Codon Degeneracy Synonymous vs. Non-synonymous dN/dS ratios Why Selection? The Problem.
R ATES OF P OINT M UTATION. The rate of mutation = the number of new sequence variants arising in a predefined target region per unit time. Target region.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Atelier INSERM – La Londe Les Maures – Mai 2004
Signatures of Selection
Pattern of similarity between Europeans and Neanderthals Green et al. Science 328, 710 (2010)
14 Molecular Evolution and Population Genetics
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
From population genetics to variation among species: Computing the rate of fixations.
The gradualist point of view Evolution occurs within populations where the fittest organisms have a selective advantage. Over time the advantages genes.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Positive selection A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive.
Genome Evolution. Amos Tanay 2009 Genome evolution Lecture 7: Selection in protein coding genes.
1 Functional prediction in proteins (purifying and positive selection)
Reseach Training Presentation By Yanhong Zhao Department of Evolutionary Functional Genomics, Uppsala University, Sweden Supervisor: Prof. Ulf Lagercrantz.
Adaptive Molecular Evolution Nonsynonymous vs Synonymous.
Maximum Likelihood Molecular Evolution. Maximum Likelihood The likelihood function is the simultaneous density of the observation, as a function of the.
What happens when we change DNA? MUTATIONS.  What do you think a mutation is?  What happens to you during a mutation? MUTATIONS.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Translating the Genetic Code Gene expression part 3.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Lecture 21: Tests for Departures from Neutrality November 9, 2012.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012.
Lab 11 :Test of Neutrality and Evidence for Selection.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Rates and Fitness Effects of Mutations Adam Eyre-Walker (University of Sussex)
Introduction to Bioinformatics.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Selectionist view: allele substitution and polymorphism
N=50 s=0.150 replicates s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among.
NEW TOPIC: MOLECULAR EVOLUTION.
Molecular evolution Part I: The evolution of macromolecules.
Genomics of Adaptation
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Evolution of individual genes in humans
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
Schematic of Eukaryotic Protein-Coding Locus
BME 130 – Genomes Lecture 20 Population Genomics I.
Human survivorship Developed Developing Bob May (2007), TREE 22:
LBA ProtPars. LBA Prot Dist no Gamma and no alignment.
Hudson Kreitman Aguadé 1987
Detection of the footprint of natural selection in the genome
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Evolution of gene function
Genetics and Evolutionary Biology
Causes of Variation in Substitution Rates
Signatures of Selection
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Fig. 1. (a) Relationship of codon adaptation index (CAI) and the number of substitutions per fourfold-degenerate site (dS) between D. melanogaster.
The Neutral Theory M. Kimura, 1968
Pipelines for Computational Analysis (Bioinformatics)
A Zero-Knowledge Based Introduction to Biology
Nucleotide subsitutions
Distances.
Molecular basis of evolution.
Schematic of Eukaryotic Protein-Coding Locus
What are the Patterns Of Nucleotide Substitution Within Coding and
Testing the Neutral Mutation Hypothesis
Molecular evolution: traditional tests of neutrality
Pedir alineamiento múltiple
TRANSLATE THIS SEGMENT OF mRNA INTO A PROTEIN
Presentation transcript:

IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection

What is positive selection?

Positive selection is selection on a particular trait - and the increased frequency of an allele in a population

Woolhouse et al, Nat. Genet Directional selection Balancing selection Population level Positive selection can drive the changes in frequencies of two alleles

Species B Species C Species A Diversifying positive selection Interspecific level Positive selection driving divergence

Why is it interesting to identify traits which have undergone or are under positive selection? Function Evolution Environment ……

How can we detect positive selection?

Changes in a protein sequence….

Come from changes in the nucleotide sequence

Quantifying non-synonymous variation - an estimate of positive selection Synonymous mutations: neutral mutations Non-synonymous mutations: non-neutral mutations

Rate of synonymous mutations Rate of non-synonymous mutations To measure positive selection:

Positive selection between species Neutral evolution Purifying selection Positive selection f.ex. effector genes f.ex. housekeeping genes f.ex. pseudogenes K s or d S K a or d N K A or d N : rate of non-synonymous divergence K S or d S : rate of synonymous divergence Evolution between species

Positive selection in a population Neutral evolution Purifying selection Positive selection f.ex. housekeeping genes f.ex. pseudogenes PS PS P N : rate of non-synonymous polymorphisms P S : rate of synonymous polymorphisms Evolution within species P N

Species A Species B Species C d N / d S P N / P s Estimates of non-synonymous and synonymous polymorphisms and substitutions provide insight into the evolutionary processes Analysing divergence and polymorphism: K A / K S ratios > 1 indicate positive selection K A / K S ratios < 1 indicate negative selection K A / K S ratios = 1 indicates neutral evolution K A and d N : rate of non-synonymous substitutions K S and d S : rate of synonymous substitutions P N : Amount of non-synonymous polymorphisms P S : Amount of synonymous polymorphisms Ka/Ks branch-specific estimate

Species A Species B Species C d N / d S P N / P s Estimates of non-synonymous and synonymous polymorphisms and substitutions provide insight into the evolutionary processes Contrasting divergence and polymorphism: Ka/Ks The branch specific d N / d S ratios are measures of adaptive evolution particular to one branch Ratios of P N / P S provide insight into the strength of purifying selection in the species Ratios of K A / K S provide insight into the amount of non- synonymous divergence

Nei and Gojobori, Counts of non-synonymous mutations for each gene ( N d ) - Counts of synonymous mutations for each gene ( S d ) - Counts of potential non-synonymous sites for each gene ( N ) - Counts of potential synonymous sites for each gene ( S ) Non-synonymous substitution rate: K A = N d / N Synonymous substitution rate: K S = S d / S Ratio K A /K S as an inidicator of evolutionary mode in each gene Basic analyses of the proportion of non- synonymous to synonymous divergence K A /K S

Calculate potential synonymous sites (S) for each codon A fourfold degenerate site counts as S = 1 (N = 0) A non-degenerate site counts as S = 0 (N = 1) A two fold degenerate site counts as S = 1/3 (N = 2/3) 1.Proline S = = 1 2.Phenylalanine S = /3 = 1/3 3.For Glycine S = = 1, for Alanine S = = Take the average: S=1 4.Leucine for UUA, S = 1/ /3 = 2/3 for CUA, S = 1/ = 4/3 Take the average of these: S = 1 for codon 4 5Phenylalanine for UUU, S = 1/3 for guanine, S = 1 Take average: S = 2/3 For whole sequence, S = 1 + 1/ /3 = 4 N = total number of sites: S = = 11 Counts of possible synonymous sites for each gene (S) Pro Phe Gly Leu Phe Seq 1CCC UUU GGG UUA UUU Seq 2CCC UUC GAG CUA GUA Pro Phe Ala Leu Val

Calculate S d and N d for each codon. 1. S d = 0,N d = 0 2. S d = 1,N d = 0 3. S d = 0,N d = 1 4. S d = 1,N d = 0 5. this could happen in two ways UUU --> GUU --> GUA N d = 1 S d = 1 Route 1: S d = 1, N d = 1 UUU --> UUA --> GUA N d = 1 N d = 1 Route 2: S d = 0, N d = 2 Take average of these two: S d = 0.5, N d = 1.5 Total S d = 2.5Total N d = 2.5 S d / S = 2.5/4 = 0.625N d / N = 2.5/11 = dN/dS = Pro Phe Gly Leu Phe Seq 1CCC UUU GGG UUA UUU Seq 2CCC UUC GAG CUA GUA Pro Phe Ala Leu Val Counts of synonymous changes

Positive selection between species Neutral evolution Purifying selection Positive selection f.ex. effector genes f.ex. housekeeping genes f.ex. pseudogenes ds ds d N

Species A Species B Species C P N / P s When positive selection is related to species divergence Contrasting divergence and polymorphism: Ka/Ks

McDonald Kreitman (MK) test to contrast within and between species variation

Repl: Nonsynonymous, Syn: Synonymous Fixed: Substitution, Poly: Polymorphisms Drosophila dataset alcohol dehydrogenase

The proportion of non-synonymous fixed differences between species much higher than the proportion of non-synonymous polymorphisms MK test contrasts within and between species synonymous and non-synonymous differences Contingency table can be tested by a G-test

Conclusion from MK-test: Adh locus in Drosophila has accumulated adaptive mutations (been under positive selection) when the Drosophila species diverged

One problem with the “counting methods” Sometimes the signal of selection is not very strong

Positive selection on one or few particular codons or in one particular branch  Evolutionary model to detect selection in particular codons or branches