Download presentation
Presentation is loading. Please wait.
1
Probabilistic Roadmaps: A Tool for Computing Ensemble Properties of Molecular Motions Serkan Apaydin, Doug Brutlag 1 Carlos Guestrin, David Hsu 2 Jean-Claude Latombe, Chris Varma Computer Science Department Stanford University 1 Department of Biochemistry, Stanford University 2 Computer Science Department, University of North Carolina
2
Goal of our Research Develop efficient computational representations and algorithms to study molecular pathways for protein folding and ligand-protein binding Protein folding RECOMB ’02 Ligand-protein binding ECCB ‘02
3
Acknowledgements People: Leo Guibas Michael Levitt, Structural Biology Itay Lotan Vijay Pande, Chemistry Fabian Schwarzer Amit Singh Rohit Singh Funding: NSF-ITR ACI-0086013 Stanford’s Bio-X and Graduate Fellowship programs
4
Analogy with Robotics
5
Configuration Space Approximate the free space by random sampling Probabilistic Roadmaps
6
Probabilistic Roadmap free space [Kavraki, Svetska, Latombe,Overmars, 95]
7
Probabilistic Completeness The probability that a roadmap fails to correctly capture the connectivity of the free space goes to 0 exponentially in the number of milestones (~ running time). Random sampling is convenient incremental scheme for approximating the free space
8
Computed Examples
9
Biology Robotics Energy field, instead of joint control Continuous energy field, instead of binary free and in-collision spaces Multiple pathways, instead of single collision-free path Potentially many more degrees of freedom Relation to real world is more complex
10
Initial Work [Singh, Latombe, Brutlag, 99] Study of ligand-protein binding Probabilistic roadmaps with edges weighted by energetic plausibility Search of most plausible paths
11
Initial Work [Singh, Latombe, Brutlag, 99] Study of ligand-protein binding Probabilistic roadmaps with edges weighted by energetic plausibility Search of most plausible paths Study of energy profiles along such paths Catalytic Site energy
12
Initial Work [Singh, Latombe, Brutlag, 99] Study of ligand-protein binding Probabilistic roadmaps with edges weighted by energetic plausibility Search of most plausible paths Study of energy profiles along such paths Extensions to protein folding [Song and Amato, 01] [Apaydin et al., 01]
13
New Idea: Capture the stochastic nature of molecular motion by assigning probabilities to edges vivi vjvj P ij
14
Why is this a good idea? 1)We can approximate Monte Carlo simulation as closely as we wish 2)Unlike with MC simulation, we avoid the local-minima problem 3)We can consider all pathways in the roadmap at once to compute ensemble properties
15
Edge probabilities Follow Metropolis criteria: Self-transition probability: vjvj vivi P ij P ii
16
Stochastic simulation on roadmap and Monte Carlo simulation converge to same Boltzmann distribution S Stochastic Roadmap Simulation P ij
17
Problems with Monte Carlo Simulation Much time is wasted in local minima Each run generates a single pathway
18
Solution P ij Treat roadmap as a Markov chain and use the First-Step Analysis tool
19
Example #1: Probability of Folding p fold Unfolded set Folded set p fold 1- p fold “We stress that we do not suggest using p fold as a transition coordinate for practical purposes as it is very computationally intensive.” Du, Pande, Grosberg, Tanaka, and Shakhnovich “On the Transition Coordinate for Protein Folding” Journal of Chemical Physics (1998). HIV integrase [Du et al. ‘98]
20
P ii F: Folded setU: Unfolded set First-Step Analysis P ij i k j l m P ik P il P im Let f i = p fold (i) After one step: f i = P ii f i + P ij f j + P ik f k + P il f l + P im f m =1 One linear equation per node Solution gives p fold for all nodes No explicit simulation run All pathways are taken into account Sparse linear system
21
In Contrast … Computing p fold with MC simulation requires: Performing many MC simulation runs Counting the number of times F is attained first for every conformation of interest:
22
Computational Tests 1ROP (repressor of primer) 2 helices 6 DOF 1HDD (Engrailed homeodomain) 3 helices 12 DOF H-P energy model with steric clash exclusion [Sun et al., 95]
23
1ROP Correlation with MC Approach
24
1HDD
25
Computation Times (1ROP) Monte Carlo: 49 conformations Over 11 days of computer time Over 10 6 energy computations Roadmap: 5000 conformations 1 - 1.5 hours of computer time ~15,000 energy computations ~4 orders of magnitude speedup!
26
Example #2: Ligand-Protein Interaction Computation of escape time from funnels of attraction around potential binding sites (funnel = ball of 10A rmsd)
27
Computing Escape Time with Roadmap Funnel of Attraction i j k l m P ii P im P il P ik P ij i = 1 + P ii i + P ij j + P ik k + P il l + P im m (escape time is measured as number of steps of stochastic simulation) = 0
28
Similar Computation Through Simulation Similar Computation Through Simulation [Sept, Elcock and McCammon `99] 10K to 30K independent simulations
29
Applications 1)Distinguishing catalytic site: Given several potential binding sites, which one is the catalytic site?
30
Complexes Studied ligandprotein# random nodes # DOFs oxamate1ldm80007 Streptavidin1stp800011 Hydroxylamine4ts180009 COT1cjw800021 THK1aid800014 IPM1ao5800010 PTI3tpi800013
31
Distinction Based on Energy ProteinBound state Best potential binding site 1stp-15.1-14.6 4ts1-19.4-14.6 3tpi-25.2-16.0 1ldm-11.8-13.6 1cjw-11.7-18.0 1aid-11.2-22.2 1ao5-7.5-13.1 (kcal/mol) Able to distinguish catalytic site Not able
32
Distinction Based on Escape Time ProteinBound state Best potential binding site 1stp3.4E+91.1E+7 4ts13.8E+101.8E+6 3tpi1.3E+115.9E+5 1ldm8.1E+53.4E+6 1cjw5.4E+84.2E+6 1aid9.7E+51.6E+8 1ao56.6E+75.7E+6 (# steps) Able to distinguish catalytic site Not able
33
Applications 1)Distinguishing catalytic site 2)Computational mutagenesis C C O O O GLN-101 ARG-106 ASP-195 HIS-193 ASP-166 ARG-169 NADH + + + Loop Chemical environment of LDH-NADH-substrate complex (pyruvate) (catalyzes conversion of pyruvate to lactate in the presence of NADH CH 3 Some amino acids are deleted entirely, replaced by other amino acids, or sidechains altered
34
Binding of Pyruvate to LDH ASP-195 HIS-193 ASP-166 ARG-169 + + + THR-245 C C O O O CH 3 NADH GLN-101 ARG-106 Loop
35
Results C C O O O GLN-101 ARG-106 ASP-195 HIS-193 ASP-166 ARG-169 NADH + + + Loop CH 3 THR-245 Mutant Escape TimeChange Wildtype3.216E6N/A
36
Results C C O O O GLN-101 ALA-106 ASP-195 ALA-193 ASP-166 ARG-169 NADH + Loop CH 3 MutantEscape TimeChange Wildtype3.216E6N/A His193 Ala Arg106 Ala 4.126E2
37
Results MutantEscape TimeChange Wildtype3.216E6N/A His193 Ala Arg106 Ala 4.126E2 His193 Ala3.381E3 Arg106 Ala2.550E2 Asp195 Asn5.221E7 Gln101 Arg1.669E6No change Thr245 Gly4.607E5 C C O O O GLN-101 ARG-106 ASP-195 HIS-193 ASP-166 ARG-169 NADH + + + Loop CH 3 GLY-245
38
Conclusion Probabilistic roadmaps are a promising computational tool for studying ensemble properties of molecular pathways Current and future work: Better kinetic/energetic models Experimentally verifiable tests Non-uniform sampling strategies Encoding MD simulation
39
Stochastic simulation on a roadmap and MC simulation converge to the same distribution (Boltzman): For any set S, >0, >0, >0, there exists N such that a roadmap with N milestones has error bounded by: with probability at least 1- vsvs vgvg S Stochastic Roadmap Simulation
40
Energy Function [Sun et. al. ‘95] Based on pairwise sidechain centroid distances H-P model Amino acids classified as either hydrophobic or polar Hydrophobic residues contact rewarded Exclusion term to prevent steric clashes
41
Ligand-Protein Modeling DOF = 10 –3 coordinates to position root atom; –2 angles to specify first bond; –Angles for all remaining non-terminal atoms; –Bond angles are assumed constant; Protein assumed rigid [Singh, Latombe and Brutlag `99] x,y,z
42
Energy of Interaction EvEv R ij EcEc E v = 0.2[(R 0 /R ij ) 12 - 2(R 0 /R ij ) 6 ] E c = 332 Q i Q j /( R ij ) Energy = van der Waals interaction ( E v ) + electrostatic interaction ( E c )
43
Solvent Effects Is only valid for an infinite medium of uniform dielectric; Dielectric discontinuities result in induced surface charges; Solution: Poisson-Boltzman equation E c = 332 Q i Q j /( R ij ) Use Delphi [ Rocchia et al `01] Finite Difference solution is based on discretizing the workspace into a uniform grid. [ (r). (r)] - (r)k(r) 2 sinh([ (r)] + 4 r f (r)/kT = 0
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.