1 Functional prediction in proteins (purifying and positive selection)

Slides:



Advertisements
Similar presentations
Evolution and proteins You can see the effects of evolution, not only in the whole organism, but also in its molecules - DNA and protein For a mutation.
Advertisements

IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.
Motivation “Nothing in biology makes sense except in the light of evolution” Christian Theodosius Dobzhansky.
Towards realistic codon models: among site variability and dependency of synonymous and nonsynonymous rates Itay Mayrose Adi Doron-Faigenboim Eran Bacharach.
Phylogenetics workshop: Protein sequence phylogeny week 2 Darren Soanes.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
14 Molecular Evolution and Population Genetics
HIV/AIDS as a Microcosm for the Study of Evolution.
1 Detecting selection using phylogeny. 2 Evaluation of prediction methods  Comparing our results to experimentally verified sites Positive (hit)Negative.
Multiple Sequence Alignment (MSA) and Phylogeny. One of the options to get multiple sequence Fasta file.
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Positive selection A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Adaptive Molecular Evolution Nonsynonymous vs Synonymous.
1 HW Clarifications Homology implies shared ancestry Partial sequence identity does not necessarily imply homology A high coverage of sequence identity.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
HIV Structure, Lifecycle, and Replication
Retrovirus Biology Immunology/HIV Michael Para, MD 1.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Evolution: Lamarck Evolution: Change over time Evolution: Change over time Lamarck Lamarck Use / disuse Use / disuse Theory of inheritance of ACQUIRED.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Generating Diversity: how genes and genomes evolve Erin “They call me Dr. Worm” Friedman 29 September 2005.
Evolution Chapters Evolution is both Factual and the basis of broader theory What does this mean? What are some factual examples of evolution?
The Molecular Clock? By: T. Michael Dodson. Hypothesis For any given macromolecule (a protein or DNA sequence) the rate of evolution is approximately.
PHYLOGENETICS CONTINUED TESTS BY TUESDAY BECAUSE SOME PROBLEMS WITH SCANTRONS.
Using Molecular Information to Investigate the Evolutionary Origin of the HIV Virus.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Introduction to Bioinformatics.
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Evolution is the unifying concept of biology. Two Central Themes of Biology Adaptation - How and in what ways do organisms function and become better.
Codon usage bias Ref: Chapter 9 Xuhua Xia dambe.bio.uottawa.ca.
Construction of Substitution Matrices
Calculating branch lengths from distances. ABC A B C----- a b c.
How Does Antiretroviral Therapy Affect HIV Mutation and Vice Versa? Arlin Toro Devin Iimoto Devin Iimoto.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Identifying and Modeling Selection Pressure (a review of three papers) Rose Hoberman BioLM seminar Feb 9, 2004.
Supporting Scientific Collaboration Online SCOPE Workshop at San Diego Supercomputer Center March 19-22, 2008.
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Selectionist view: allele substitution and polymorphism
N=50 s=0.150 replicates s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among.
NEW TOPIC: MOLECULAR EVOLUTION.
Chapter 22 Descent with Modification: A Darwinian View.
Objective: Chapter 23. Population geneticists measure polymorphisms in a population by determining the amount of heterozygosity at the gene and molecular.
Please feel free to chat amongst yourselves until we begin at the top of the hour. 1.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
BME 130 – Genomes Lecture 20 Population Genomics I.
Molecular Evolution. Study of how genes and proteins evolve and how are organisms related based on their DNA sequence Molecular evolution therefore is.
1 Prediction of functional/structural sites in a protein using conservation and hyper-variation (ConSeq, ConSurf, Selecton)
LBA ProtPars. LBA Prot Dist no Gamma and no alignment.
Evolution is the unifying concept of biology
Evolutionary genomics can now be applied beyond ‘model’ organisms
Detection of the footprint of natural selection in the genome
Evolution of gene function
Causes of Variation in Substitution Rates
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Linkage and Linkage Disequilibrium
The Neutral Theory M. Kimura, 1968
In-Text Art, Ch. 16, p. 316 (1).
Population Evolution Population evolution: overall change in a whole population of organisms Gene pool: all the available genes carried in a particular.
Distances.
Models of Sequence Evolution
What are the Patterns Of Nucleotide Substitution Within Coding and
Molecular Evolution.
LMU Department of Biology
Pedir alineamiento múltiple
Presentation transcript:

1 Functional prediction in proteins (purifying and positive selection)

2 1. Introduction: evolution & sequence analysis

3 Darwin – the theory of natural selection  Adaptive evolution: Favorable traits will become more frequent in the population

4 Adaptive evolution  When natural selection favors a single allele and therefore allele frequency continuously shifts in one direction

5 Kimura – the theory of neutral evolution  Neutral evolution: Most molecular changes have no effect on the phenotype (neutral) Selection operates to preserve a trait (no change)

6 Purifying Selection  Stabilizes a trait in a population: Small babies  more illness Large babies  more difficult birth…  Baby weight is stabilized round 3-4 Kg

7 Purifying selection (conservation) - the molecular level  Histone 3

8 Synonymous vs. non-synonymous substitutions Purifying selection: excess of synonymous substitutions relative to non-synonymous substitutions Synonymous substitution: GUU  GUC Non-synonymous substitution: GUU  GCU

9 Synonymous vs. non-synonymous substitutions  Histone 3 Non-syn. Syn.

10 Conservation as a means of predicting function Infer the rate of evolution at each site

11 Conservation as a means of predicting function Low rate of evolution  constraints on the site to prevent disruption of function/structure: active sites, protein-protein interactions, protein core etc HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE

12 Which site is more conserved? HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE

13 Use phylogenetic information HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE A G A A A G A A A A G G

14 ConSurf/ConSeq web servers: Prediction of conserved residues by estimating evolutionary rates at each site

15 Working process Input a protein with a known 3D structure (PDB ID or file provided by the user) Find homologous protein sequences (psi-blast) Perform multiple sequence alignment (removing doubles)Construct an evolutionary tree Project the results on the 3D structureCalculate the conservation score for each site

16 ConSurf example: potassium channel  An integral membrane protein with sequence similarity to all known K+ channels, particularly in the pore region.  PDB ID: 1bl8 chain A

17 ConSurf results

18  ConSeq performs the same analysis as ConSurf but presents the results on the sequence.  Predicts buried/exposed relation exposed & conserved  functionally important sites exposed & conserved  functionally important sites buried & conserved  structurally important sites buried & conserved  structurally important sites

19 2. Positive selection & drug resistance

20 Darwin – the theory of natural selection  Adaptive evolution: Favorable traits will become more frequent in the population

21 Adaptive evolution at the molecular level

22 Adaptive evolution at the molecular level Look for changes which confer an advantage

23 Naïve detection  Observe a multiple sequence alignment: variable regions = adaptive evolution??

24 Naïve detection  The problem – how do we know which sites are not under any selection pressure (“non-important” sites) and which are under adaptive evolution?

25 Solution – we look at the DNA synonymous non- synonymous

26 Solution – we look at the DNA Purifying selection Syn > Non-syn Adaptive evolution = Positive selection Non-syn > Syn Neutral selection Syn = Non-syn

27 Also known as… Ka/Ks (or dn/ds, or ω) ratio  Purifying selection: Ka < Ks (Ka/Ks <1)  Neutral selection: Ka = Ks (Ka/Ks = 1)  Positive selection: Ka > Ks (Ka/Ks >1) Non- synonymous substitution rate Synonymous substitution rate

28 Examples for positive selection  Proteins involved in the immune system  Proteins involved in host-pathogen interaction (‘arms-race’)  Proteins following gene duplication  Proteins involved in reproduction systems

29 Accumulation of substitutions (syn. or non-syn.) depends on the evolutionary time that elapsed since the divergence of the analyzed species. When distant species are analyzed saturation of syn. substitutions is often encountered Synonymous vs. non-synonymous substitutions

30 Selecton – a server for the detection of purifying and positive selection Stern et al., Nucleic Acids Res 35, W506 (2007).

31 Detecting drug resistance using Selecton

32 HIV: molecular evolution paradigm Rapidly evolving virus: 1.High mutation rate (low fidelity of reverse transcriptase) 2.High replication rate

33 Drug resistance No drug Drug Adaptive evolution (positive selection)

34 HIV Protease Protease is an essential enzyme for viral replication Drugs against Protease are always part of the “cocktail”

35 Ritonavir Inhibitor  Ritonavir (RTV) is a specific protease inhibitor (drug) C 37 H 48 N 6 O 5 S 2

36 Used Selecton to analyse HIV-1 protease gene sequences from patients that were treated with RTV only

37

38 Example: HIV Protease  Primary mutations  Secondary mutations   novel predictions (experimental validation)

39 Rate shifts and HIV sub-types

40 Rate shifts V Chimp V Rhesus A Squirrel K Rat M Mouse V Human

41 Rate shifts V V A K M V Low evolutionary rate High evolutionary rate

42 Rate shifts Specificity determinants:  Different phylogenetic groups V Chimp V Rhesus A Squirrel A Rat A Mouse V Human Gain of function?

43 Rate shifts Specificity determinants:  Following gene duplication V S. paradoxus V S. mikatae A S. cervisiae A S. paradoxus A S. mikatae V S. cervisiae Tropomyosin 1 Tropomyosin 2

44 Rate shifts in HIV subtypes

45 HIV subtypes

46 Which sites are responsible for the differences between the subtypes?  Detection of rate-shifts in all 9 subtypes

47 Significant rate shift in all HIV genes proportion # rate-shift sites 0.184Env Gag Nef Pol Rev Tat Vif 0.054Vpr Vpu

48 Gag Position12  Wild-type (E)  Site which contributes to Protease Inhibitor (Amprenavir) drug resistance (K) E E E K Q R K K

49 C C A G F D J KEQNR

50 Summary  Sequence analysis can provide valuable information about protein function  The basic signal: conservation:  Positive “Darwinian” selection:  Rate-shifts (specificity determinants)