Flexible-Protein Docking Dr Jonathan Essex School of Chemistry University of Southampton.

Slides:



Advertisements
Similar presentations
Evaluating Free Energies of Binding using Amber: The MM-PBSA Approach.
Advertisements

Case Study: Dopamine D 3 Receptor Anthagonists Chapter 3 – Molecular Modeling 1.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Improving enrichment rates A practical solution to an impractical problem Noel O’Boyle Cambridge Crystallographic Data Centre
Computational Drug Design Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
How to approximate complex physical and thermodynamic interactions? Employ rigid or flexible structures for ligand and receptor (Side-chains or Back-bone.
Techniques for rare events: TA-MD & TA-MC Giovanni Ciccotti University College Dublin and Università “La Sapienza” di Roma In collaboration with: Simone.
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Docking of Protein Molecules
FLEX* - REVIEW.
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Molecular Docking S. Shahriar Arab. Overview of the lecture Introduction to molecular docking: Definition Types Some techniques Programs “Protein-Protein.
Algorithms and Software for Large-Scale Simulation of Reactive Systems _______________________________ Ananth Grama Coordinated Systems Lab Purdue University.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Conformational Sampling
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Investigating Protein Conformational Change on a Distributed Computing Cluster Christopher Woods Jeremy Frey Jonathan Essex University.
Chicago, July 22-23, 2002DARPA Simbiosys Review 1 Monte Carlo Particle Simulation of Ionic Channels Trudy van der Straaten Umberto Ravaioli Beckman Institute.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
The Geometry of Biomolecular Solvation 2. Electrostatics Patrice Koehl Computer Science and Genome Center
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
By Siavoush Dastmalchi Tabriz University of Medical Sciences Tabriz-Iran Modelling the Structures of G Protein-Coupled Receptors Aided by Three-Dimensional.
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
A two-state homology model of the hERG K + channel: application to ligand binding Ramkumar Rajamani, Brett Tongue, Jian Li, Charles H. Reynolds J & J PRD.
Altman et al. JACS 2008, Presented By Swati Jain.
Rational Drug Design : HIV Integrase. A process for drug design which bases the design of the drug upon the structure of its protein target. 1.Structural.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
1/20 Study of Highly Accurate and Fast Protein-Ligand Docking Method Based on Molecular Dynamics Reporter: Yu Lun Kuo
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Introduction. Zn 2+ homeostasis is regulated at the transcriptional level by the DNA-binding protein SmtB. Manipulation of Zn 2+ homeostasis could act.
R L R L L L R R L L R R L L water DOCKING SIMULATIONS.
Lecture 9: Theory of Non-Covalent Binding Equilibria Dr. Ronald M. Levy Statistical Thermodynamics.
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Structure/function studies of HIV proteins HIV gp120 V3 loop modelling using de novo approaches HIV protease-inhibitor binding energy prediction.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
1 of 21 SDA development -Description of sda Description of sda-5a - Sda for docking.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Simplified picture of the principles used for multiple copy simultaneous search (MCSS) and for computational combinatorial ligand design (CCLD). Simplified.
International Chemical Design & Discovery Course 2017
Virtual Screening.
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
Alexey Sulimov, Ekaterina Katkova, Vladimir Sulimov,
Large Time Scale Molecular Paths Using Least Action.
Volume 23, Issue 12, Pages (December 2015)
Complementarity of Structure Ensembles in Protein-Protein Binding
Geometry-Based Sampling of Conformational Transitions in Proteins
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Flexible-Protein Docking Dr Jonathan Essex School of Chemistry University of Southampton

Southampton

Programme Existing small-molecule docking –Typical approximations, and outcomes Evidence for receptor flexibility, and consequences Methods for accommodating protein flexibility in docking: –The ensemble approach –The induced fit approach

Existing small-molecule docking Taylor, R.D. et al. J. Comput. Aided Mol. Des. 16, (2002) Many docking algorithms (some 127 references in this 2002 review!) Most docking algorithms: –Rigid receptor hypothesis Limited receptor flexibility in, for example, GOLD – polar hydrogens

Existing small-molecule docking Most docking algorithms: –Range of ligand sampling methods Pattern matching, GA, MD, MC… –Treatment of intermolecular forces: Simplified scoring functions: empirical, knowledge-based and molecular mechanics Very simple treatment of solvation and entropy, or completely ignored!

Existing small-molecule docking And how well do they work? –Jones, G. et al. J. Mol. Biol. 267, (1997) –In re-docking studies, achieved a 71 % success rate This is probably typical of most of these methods So what’s missing?

The scoring function Existing functions inadequate –Too simplified, for reasons of computational expediency –Solvation and entropy often inadequately treated Possible solutions? –More physics

The rigid receptor hypothesis Murray, C.W. et al. J. Comput. Aided Mol. Des. 13, (1999) –Docking to thrombin, thermolysin, and neuraminidase –PRO_LEADS – Tabu search –In self docking, ligand conformation correctly identified as the lowest energy structure – 76 % –For cross-docking – 49 % successful –Some of the associated protein movements very small

The rigid receptor hypothesis Erickson, J.A. et al. J. Med. Chem. 47, (2004) –Docking of trypsin, thrombin and HIV1-p –Self-docking, docking to a single structure that is closest to the average, and docking to apo structures –Docking accuracy declines on docking to the average structure, and is very poor for docking to apo –Decline in accuracy correlated with degree of protein movement

The rigid receptor hypothesis Erickson, J.A. et al. J. Med. Chem. 47, (2004) proteinRMSD / Å cocomplexes RMSD / Å apo % self % average % apo trypsin thrombin HIV1-p

Models of Protein-Ligand Binding Goh, C.-S. et al. Curr. Opin. Struct. Biol. 14, (2004) Review of receptor flexibility for protein- protein interactions

Models of Protein-Ligand Binding This paper classifies protein-protein binding in terms of these models Induced fit assumed if there is no experimental evidence for a pre-existing equilibrium of multiple conformations Note that strictly this is an artificial distinction –Statistical mechanics – all states are accessible with a non-zero probability –For induced fit, probability of observing bound conformation without the ligand may be very small

Protein flexibility in drug design Teague, S.J. Nature Reviews 2, (2003) Effect of ligand binding on free energy

Protein flexibility in drug design Multiple conformations of a few residues –Acetylcholinesterase Phe330 flexible – acts as a swinging gate

Protein flexibility in drug design Movement of a large number of residues –Acetylcholinesterase (again!)

Protein flexibility in drug design Table 1 in Teague paper lists pharmaceutically relevant flexible targets (some 30 systems!) Consequences of protein flexibility for ligand design –One site, several ligand binding modes possible

Protein flexibility in drug design Consequences –Allosteric inhibition –Binding often remote from active site – NNRTIs Proteins in metabolism and transport –Promiscuous Bind many compounds, in many orientations E.g P450 cam substrates, camphor versus thiocamphor (two orientations, different to camphor!)

Experimental evidence for population shift Binding kinetics –Binding to low-population conformation should yield slow kinetics –  G barrier –Observed for p38 MAP kinase - mobile loop Rates of association vary between 8.5 x 10 5 and 4.3 x 10 7 M -1 s -1, depending on whether conformational change involved –Slow kinetics can make experimental comparison between assays difficult –Slow kinetics can improve ADME properties!

Nitrogen Regulatory Protein C (NtrC) plays a central role in the bacterial metabolism of nitrogen N-terminal receiver domain Central catalytic domain DNA binding domain Experimental evidence for population shift

Asp54 Phosphate Changing nitrogen levels promote the activity of NtrB kinase NtrB kinase phosphorylates NtrC at aspartate 54 in the receiver domain Protein conformational change

Asp54 Phosphate Phosphorylation promotes conformational change in the receiver domain Protein conformational change

NtrC – active and inactive conformations apparent P-NtrC – protein shifted towards activated conformation Volkman, B.F. et al. Science 291, (2001)

Summary Protein flexibility important in ligand design Two basic mechanisms –Selection of a binding conformation from a pre- existing ensemble – population shift –Induced fit – binding to a previously unknown conformation –Thermodynamically, these mechanisms are identical Evidence for population shift from binding kinetics, and protein NMR

Docking methods for incorporating receptor flexibility Ensemble docking –Docking to individual protein structures, or parts of protein structures – “ensemble docking” –Docking to a single average structure – “soft docking” Induced fit modelling Carlson, H.A. Curr. Opin. Chem. Biol. 6, (2002)

Ensemble docking Generate an ensemble of structures, and dock to them Experimentally derived structures –NMR or X-ray structures Computationally derived structures –Molecular dynamics –Simulated annealing –Normal mode propagation

FlexE Claussen, H. et al. J. Mol. Biol. 308, (2001) Extension of the FlexX algorithm: –Preferred conformations for ligands identified –Simplified scoring function adopted – based on hydrogen bonds, ionic interactions etc. –Break ligand into base fragments by severing acyclic single bonds

FlexE Extension of the FlexX algorithm: –Base fragments placed in active site by superposing interaction centres –Incrementally reconstruct ligand onto base fragments –Test each partial solution and continue with the best for further reconstruction

FlexE United protein description –Use a set of protein structures representing flexibility, mutations, or alternative protein models –Assumes that overall shape of the protein and active site is maintained across the series –FlexE selects the combination of partial protein structures that best suit the ligand –Flexibility given by FlexE is therefore defined by the protein input structures

FlexE United protein description –Similar parts of the protein structures are merged –Dissimilar parts of the protein are treated as separate alternatives

FlexE United protein description –Some combinations of the structural features are incompatible and not considered –As the ligand is constructed, the optimum protein structure is identified –Combination strategy for the protein may result in a structure not present in the original data set

FlexE Evaluation –10 proteins, 105 crystal structures –RMSD < 2.0 Å, within top ten solution, 67 % success –Cross-docking with FlexX gave 63 % –FlexE faster than cross-docking with FlexX Aldose reductase - very flexible active site –FlexE docking successful (3 ligands) –Using only one rigid protein structure would not have worked

Ensemble docking Advantages: –Well-defined computational problem –Computational cost generally scales linearly with number of structures (potential combinatorial explosion) –Can use either experimental information, or structures derived from computation Disadvantages: –What happens if the appropriate bound receptor conformation is not present in the ensemble?

Soft-Docking Knegtel, R.M.A. et al. J. Mol. Biol. 266, (1997) Build interaction grids within DOCK that incorporate the effect of more than one protein structure Effectively soften and average the different structures

Soft-Receptor Modelling Österberg, F. et al. Proteins 46, (2002) Similar approach applied to Autodock grids –Energy-weighted grid –Boltzmann-type weighting applied to reduce the influence of repulsive terms Combined grids performed very well – HIV protease

Soft-Receptor Modelling

Advantages –Low computational cost – use of single averaged protein model –Can use experimental or simulation derived structures Disadvantages –Cope with large-scale motion? –How reliable is this “averaged” representation? –Mutually exclusive binding regions could be simultaneously exploited –Active sites enlarged

Induced-Fit Docking Methods Allow protein conformational change at the same time as the docking proceeds Taking some of these algorithms, in no particular order…

Induced-Fit Docking Methods Molecular dynamics methods: –Mangoni, R. et al. Proteins 35, (1999) –Separate thermal baths used for protein and ligand to facilitate sampling Multicanonical molecular dynamics: –Nakajima, N. et al. Chem. Phys. Lett. 278, (1997) –Bias normal molecular dynamics to yield a flat energy distribution

Induced-Fit Docking Methods Monte Carlo methods –Apostolakis, J. et al. J. Comput. Chem. 19, (1998) Hybrid Monte Carlo and minimisation method. Poisson- Boltzmann continuum solvation used –ICM, Abagyan, R. et al. J. Comput. Chem. 15, (1997) Conventional MC, plus side-chain moves from a rotamer library Minimisation again required VS - J. Mol. Biol. 337, (2004)

Induced-Fit Docking Methods FDS Taylor, R. et al. J. Comput. Chem. 24, (2003) Flexible ligand/flexible protein docking –large side chain motions, rotamer library Solvation included “on the fly” –continuum solvation model – GB/SA Soft-core potential energy function –anneal the potential to improve sampling

Arabinose Binding Protein Rigid protein docking Low energy structures are essentially identical to the X- ray structure Dock starting from experimental result, does not return to it

Arabinose Binding Protein Flexible protein docking Experimental structure found A number of other structures are isoenergetic Cannot uniquely identify the experimental structure

Arabinose Binding Protein Flexible protein docking Most successful structure with experiment (transparent) Most successful structure, experiment, and isoenergetic mode

Monte Carlo Docking 15 complexes studied Rigid receptor –13/15 identified X-ray binding mode –8/15 were the unique, lowest energy structures –3/15 were part of a cluster of low-energy binding modes Flexible receptor –11/15 identified X-ray binding mode –3/15 were the unique, lowest energy structure –6/15 were part of a cluster of low-energy binding modes

FAB Fragment Two isoenergetic binding modes Closest seedIsoenergetic seed

Conclusion Rigid protein docking as successful as other methods, but much more expensive Flexible protein docking does find X-ray structures, but does not uniquely identify them –Refine scoring function? Using this methodology, need to consider a number of structures Further validation required

Summary Two main approaches for modelling receptor flexibility –Use of multiple structures (experimental or theoretical) either independently, or averaged in some way – ensemble approach –Allow the receptor to adopt conformations under the influence of the ligand – induced fit approach

Summary Ensemble is the more widely employed – less expensive, but limited somewhat by the composition of the ensemble Induced fit should overcome this disadvantage of ensemble methods Induced fit methods can have significant sampling problems –not computationally limited –search space large, and increasing as extra degrees of freedom added

Flexible protein docking – a case study Wei, B.Q. et al. J. Mol. Biol. 337, (2004) Use experimental structures Like FlexE, flexible regions move independently, and are able to recombine Modified version of DOCK used

Flexible protein docking – a case study Receptor decomposed into three parts –Green – rigid –Blue and red – two flexible parts Ligand scored against each component Best-fit protein conformation assembled from these components

Flexible protein docking – a case study Scoring function –Electrostatic (potential from PB), van der Waals –Solvation (scaled AMSOL result according to buried surface area) Large ligands favoured for large cavities –Penalty for forming the larger cavity introduced

Flexible protein docking – a case study In screening, enrichment improved compared to docking against individual conformations ACD screened against L99A M102Q mutant of T4L –18 compounds that were predicted to bind and change cavity conformation, tested –14 found to bind –X-ray structures obtained on 7

Flexible protein docking – a case study Predicted ligand geometries reproduced (< 0.7 Å) In five structures, part of observed cavity changes reproduced In two structures, receptor conformations not part of original data set, and therefore not reproduced!

Flexible protein docking – a case study New ligands found by flexible receptor docking Receptor conformational energy needs to be considered

Conclusion Rigid receptor approximation not universal Two main approaches to modelling receptor flexibility –Ensemble –Induced fit Further validation of these methods needed

Acknowledgements Flexible Docking –Richard Taylor, Phil Jewsbury, Astra Zeneca Practical –Donna Goreham, Sebastien Foucher