Download presentation
Presentation is loading. Please wait.
1
1 Functional prediction in proteins (purifying and positive selection)
2
2 1. Introduction: evolution & sequence analysis
3
3 Darwin – the theory of natural selection Adaptive evolution: Favorable traits will become more frequent in the population
4
4 Adaptive evolution When natural selection favors a single allele and therefore allele frequency continuously shifts in one direction
5
5 Kimura – the theory of neutral evolution Neutral evolution: Most molecular changes have no effect on the phenotype (neutral) Selection operates to preserve a trait (no change)
6
6 Purifying Selection Stabilizes a trait in a population: Small babies more illness Large babies more difficult birth… Baby weight is stabilized round 3-4 Kg
7
7 Purifying selection (conservation) - the molecular level Histone 3
8
8 Synonymous vs. non-synonymous substitutions Purifying selection: excess of synonymous substitutions relative to non-synonymous substitutions Synonymous substitution: GUU GUC Non-synonymous substitution: GUU GCU
9
9 Synonymous vs. non-synonymous substitutions Histone 3 Non-syn. Syn.
10
10 Conservation as a means of predicting function Infer the rate of evolution at each site
11
11 Conservation as a means of predicting function Low rate of evolution constraints on the site to prevent disruption of function/structure: active sites, protein-protein interactions, protein core etc. 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE
12
12 Which site is more conserved? 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE
13
13 Use phylogenetic information 1234567 HumanDMAAHAM ChimpDEAAGGC CowDQAAWAP FishDLAACAL S. cerevisiae DDGAFAA S. pombe DDGALGE A G A A A G A A A A G G
14
14 ConSurf/ConSeq web servers: Prediction of conserved residues by estimating evolutionary rates at each site
15
15 Working process Input a protein with a known 3D structure (PDB ID or file provided by the user) Find homologous protein sequences (psi-blast) Perform multiple sequence alignment (removing doubles)Construct an evolutionary tree Project the results on the 3D structureCalculate the conservation score for each site
16
16 ConSurf example: potassium channel An integral membrane protein with sequence similarity to all known K+ channels, particularly in the pore region. PDB ID: 1bl8 chain A
17
17 ConSurf results
18
18 http://conseq.bioinfo.tau.ac.il/ ConSeq performs the same analysis as ConSurf but presents the results on the sequence. Predicts buried/exposed relation exposed & conserved functionally important sites exposed & conserved functionally important sites buried & conserved structurally important sites buried & conserved structurally important sites
19
19 2. Positive selection & drug resistance
20
20 Darwin – the theory of natural selection Adaptive evolution: Favorable traits will become more frequent in the population
21
21 Adaptive evolution at the molecular level
22
22 Adaptive evolution at the molecular level Look for changes which confer an advantage
23
23 Naïve detection Observe a multiple sequence alignment: variable regions = adaptive evolution??
24
24 Naïve detection The problem – how do we know which sites are not under any selection pressure (“non-important” sites) and which are under adaptive evolution?
25
25 Solution – we look at the DNA synonymous non- synonymous
26
26 Solution – we look at the DNA Purifying selection Syn > Non-syn Adaptive evolution = Positive selection Non-syn > Syn Neutral selection Syn = Non-syn
27
27 Also known as… Ka/Ks (or dn/ds, or ω) ratio Purifying selection: Ka < Ks (Ka/Ks <1) Neutral selection: Ka = Ks (Ka/Ks = 1) Positive selection: Ka > Ks (Ka/Ks >1) Non- synonymous substitution rate Synonymous substitution rate
28
28 Examples for positive selection Proteins involved in the immune system Proteins involved in host-pathogen interaction (‘arms-race’) Proteins following gene duplication Proteins involved in reproduction systems
29
29 Accumulation of substitutions (syn. or non-syn.) depends on the evolutionary time that elapsed since the divergence of the analyzed species. When distant species are analyzed saturation of syn. substitutions is often encountered Synonymous vs. non-synonymous substitutions
30
30 Selecton – a server for the detection of purifying and positive selection http://selecton.bioinfo.tau.ac.il Stern et al., Nucleic Acids Res 35, W506 (2007).
31
31 Detecting drug resistance using Selecton
32
32 HIV: molecular evolution paradigm Rapidly evolving virus: 1.High mutation rate (low fidelity of reverse transcriptase) 2.High replication rate
33
33 Drug resistance No drug Drug Adaptive evolution (positive selection)
34
34 HIV Protease Protease is an essential enzyme for viral replication Drugs against Protease are always part of the “cocktail”
35
35 Ritonavir Inhibitor Ritonavir (RTV) is a specific protease inhibitor (drug) C 37 H 48 N 6 O 5 S 2
36
36 Used Selecton to analyse HIV-1 protease gene sequences from patients that were treated with RTV only
37
37
38
38 Example: HIV Protease Primary mutations Secondary mutations novel predictions (experimental validation)
39
39 Rate shifts and HIV sub-types
40
40 Rate shifts V Chimp V Rhesus A Squirrel K Rat M Mouse V Human
41
41 Rate shifts V V A K M V Low evolutionary rate High evolutionary rate
42
42 Rate shifts Specificity determinants: Different phylogenetic groups V Chimp V Rhesus A Squirrel A Rat A Mouse V Human Gain of function?
43
43 Rate shifts Specificity determinants: Following gene duplication V S. paradoxus V S. mikatae A S. cervisiae A S. paradoxus A S. mikatae V S. cervisiae Tropomyosin 1 Tropomyosin 2
44
44 Rate shifts in HIV subtypes
45
45 HIV subtypes
46
46 Which sites are responsible for the differences between the subtypes? Detection of rate-shifts in all 9 subtypes
47
47 Significant rate shift in all HIV genes proportion # rate-shift sites 0.184Env 0.0420Gag 0.1721Nef 0.0333Pol 0.2529Rev 0.1513Tat 0.0713Vif 0.054Vpr 0.3529Vpu
48
48 Gag Position12 Wild-type (E) Site which contributes to Protease Inhibitor (Amprenavir) drug resistance (K) E E E K Q R K K
49
49 C C A G F D J KEQNR
50
50 Summary Sequence analysis can provide valuable information about protein function The basic signal: conservation: http://consurf.tau.ac.il Positive “Darwinian” selection: http://selecton.bioinfo.tau.ac.il http://selecton.bioinfo.tau.ac.il Rate-shifts (specificity determinants)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.