Computer simulations in drug design, and the GRID Dr Jonathan W Essex University of Southampton.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

The Quantum Chromodynamics Grid James Perry, Andrew Jackson, Matthew Egbert, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Design of a Grid Enabled Database System to Facilitate Reuse, Provenance Tracking and Automated Processing of Chemical Information Robert Gledhill University.
The GATE-LAB system Sorina Camarasu-Pop, Pierre Gueth, Tristan Glatard, Rafael Silva, David Sarrut VIP Workshop December 2012.
PhysChem Forum, 29 Nov 2006, Newhouse 1 Memories and the future: From experimental to in silico physical chemistry Han van de Waterbeemd AstraZeneca, DMPK.
The Data Lifecycle and the Curation of Laboratory Experimental Data Tony Hey Corporate VP for Technical Computing Microsoft Corporation.
BioSimGRID: A GRID Database of Biomolecular Simulations Mark S.P. Sansom
CH 328 Biomolecular Modelling Instructors: R. Woods, E. Fadda Schedule: Lectures (24) Wednesday / Thursday 9-10 am Dillon Theatre Computer labs (24) Monday.
BioCoRE and GEMS: Cyber Infrastructure for Cyber Chemistry Jesús A. Izaguirre Computer Science & Engineering University of Notre Dame with Kirby Vandivort.
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
QM/MM Study of Cytochrome P450 BM3 Catalysis Mechanism and Application in Drug Design.
Summary Molecular surfaces QM properties presented on surface Compound screening Pattern matching on surfaces Martin Swain Critical features Dave Whitley.
An Integrated Approach to Protein-Protein Docking
Extending a molecular docking tool to run simulations on clouds Damjan Temelkovski Dr. Tamas Kiss Dr. Gabor Terstyanszky University of Westminster.
Hadoop Team: Role of Hadoop in the IDEAL Project ●Jose Cadena ●Chengyuan Wen ●Mengsu Chen CS5604 Spring 2015 Instructor: Dr. Edward Fox.
Ana Damjanovic (JHU, NIH) JHU: Petar Maksimovic Bertrand Garcia-Moreno NIH: Tim Miller Bernard Brooks OSG: Torre Wenaus and team.
AHM 2005 e-MalariaUniversity of Southampton1 Jeremy Frey E-Malaria AHM 2005 Jeremy Frey School of Chemistry University of Southampton.
Centre for Earth Systems Engineering Research Infrastructure Transitions Research Consortium (ITRC) David Alderson & Stuart Barr What is the aim of ITRC?
Computational Science jsusciencesimulation Principles of Scientific Simulation Spring Semester 2005 Geoffrey Fox Community.
BioSimGRID and BioSimGRID ’lite’ - Towards a worldwide repository for biomolecular simulation Philip C Biggin
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Jeremy C. Smith, University of Heidelberg Introduction to Protein Simulations and Drug Design R P.
Using the WS-PGRADE Portal in the ProSim Project Protein Molecule Simulation on the Grid Tamas Kiss, Gabor Testyanszky, Noam.
ExTASY 0.1 Beta Testing 1 st April 2015
Comb-e-day e-models Dr Jonathan W Essex University of Southampton.
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
Biomolecular Modelling and Simulation Julia M Goodfellow, Birkbeck College, University of London.
Investigating Protein Conformational Change on a Distributed Computing Cluster Christopher Woods Jeremy Frey Jonathan Essex University.
G. Terstyanszky, T. Kukla, T. Kiss, S. Winter, J.: Centre for Parallel Computing School of Electronics and Computer Science, University of.
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
Parameter Sweep Workflows for Modelling Carbohydrate Recognition ProSim Project Tamas Kiss, Gabor Terstyanszky, Noam Weingarten.
Grid Computing at The Hartford Condor Week 2008 Robert Nordlund
Chem. 860 Molecular Simulations with Biophysical Applications Qiang Cui Department of Chemistry and Theoretical Chemistry Institute University of Wisconsin,
NGS Innovation Forum, Manchester4 th November 2008 Condor and the NGS John Kewley NGS Support Centre Manager.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
“Grids and eScience” Mark Hayes Technical Director - Cambridge eScience Centre GEFD Summer School 2003.
The UK eScience Grid (and other real Grids) Mark Hayes NIEeS Summer School 2003.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
CaliBayes and BASIS: e-Science applications for Systems Biology research Yuhui Chen Institute for Ageing and Health Centre for Integrated Systems Biology.
Applications & a Reality Check Mark Hayes. Applications on the UK Grid Ion diffusion through radiation damaged crystal structures (Mark Calleja, Mark.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
1/20 Study of Highly Accurate and Fast Protein-Ligand Docking Method Based on Molecular Dynamics Reporter: Yu Lun Kuo
Building the e-Minerals Minigrid Rik Tyer, Lisa Blanshard, Kerstin Kleese (Data Management Group) Rob Allan, Andrew Richards (Grid Technology Group)
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
The National Grid Service Mike Mineter.
INFSO-RI Enabling Grids for E-sciencE CHARON System Jan Kmuníček, Petr Kulhánek, Martin Petřek CESNET, Czech Republic.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Grid Computing Unit I Introduction. Information anytime anywhere!!! support computation across administrative domains Generally  virtualizing computing.
 The generated models are used in various coarse-grain and other molecular modelling studies.  Coarse-grain analysis includes: Gaussian Network Models.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
The National Grid Service Mike Mineter.
BioSimGrid PI:: Mark Sansom (Oxford) Southampton :: Jon Essex, Stuart Murdock Oxford :: Mark Sansom, Kaihsu Tai Birkbeck College.
1 Porting applications to the NGS, using the P-GRADE portal and GEMLCA Peter Kacsuk MTA SZTAKI Hungarian Academy of Sciences Centre for.
Protein structure prediction Computer-aided pharmaceutical design: Modeling receptor flexibility Applications to molecular simulation Work on this paper.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 22 February 2006.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Docking and Virtual Screening Using the BMI cluster
FESR Consorzio COMETA - Progetto PI2S2 Molecular Modelling Applications Laura Giurato Gruppo di Modellistica Molecolare (Prof.
Page 1 Molecular Modeling Service in Profacgen. Page 2 The three-dimensional structure of a protein provides essential information about its biological.
Integrating Scientific Tools and Web Portals
Centre for Computational Science, University College London
Grid Portal Services IeSE (the Integrated e-Science Environment)
Computer simulations in drug design, and the GRID
Virtual Screening.
Computational biochemistry: enzyme catalysis, protein dynamics and drug design Adrian Mulholland (Chemistry; lead, BristolBridge; Chair, UoB HPC Exec,
An Integrated Approach to Protein-Protein Docking
Presentation transcript:

Computer simulations in drug design, and the GRID Dr Jonathan W Essex University of Southampton

Computer simulations? Molecular dynamics –Solve F = ma for particles in system –Follow time evolution of system –Average system properties over time Monte Carlo –Perform random moves on system –Accept or reject move on the basis of change in energy –No time information

Drug action Administration –Oral, intravenous… –pKa, solubility Transport and permeation –Passage through membranes –Partition coefficient, permeability Protein binding –Binding affinity, specificity –Allosteric effects Metabolism –P450 enzymes –Catalysis Free energy simulations Structure generation QM/MM Digital filtering Parallel tempering Free energy simulations Modelling

Three main areas of work Protein-ligand binding –Structure prediction –Binding affinity prediction Membrane modelling and small molecule permeation –Bioavailability –Coarse-grained membrane models Protein conformational change –Induced-fit –Large-scale conformational change

Small molecule docking Flexible protein docking Most successful structure with experiment (transparent) Most successful structure, experiment, and isoenergetic mode

Replica-exchange free energy Free energy methods that are applied between exchanges are the same as normal Exchanges require little extra computational cost

Membrane permeation z1z2 z Z-constraint  Free Energies 

Drugs:  -blockers alprenololatenololpindolol not “exotic” chemical groups not too big similar structure, pK a and weight, but different lipophilicity and oral absorption much experimental data about Caco-2 permeabilities

Reversible digitally filtered MD Conformational change associated with low frequency vibrations Use digital filter applied to atomic velocities to amplify low frequency motion

RDFMD Effect of a quenching filter

Comb-e-Chem EPSRC Grid computing project Main objectives: –Equipment on the Grid – high-throughput small molecule crystallography –E-lab – pervasive computing, provenance –Structure modelling and property prediction Automatic calculations performed on deposition of a new structure to the database Distributed (Grid) computing

Distributed computing Heterogeneous computers – personal PCs Rather unstable Network often slow Suited for embarrassingly parallel calculations etc. –E-malaria (JISC) Poor-man’s resource –Cheap –Large amount of power available

Condor Condor project started in 1988 to harness the spare cycles of desktop computers Available as a free download from the University of Wisconsin-Madison Runs on Linux, Unix and Windows. Unix software can be compiled for Windows using Cygwin For more information see

Distributed computing and replica exchange Run multiple simulations is parallel Use different conditions in different simulations to enhance conformational change (e.g. temperature) 300 K320 K340 K360 K

300 K 320 K 340 K360 K380 K400 K

1 300 K 320 K 340 K360 K380 K400 K

1 300 K 320 K 340 K360 K380 K400 K

2 300 K 320 K 340 K360 K380 K400 K

3 300 K 320 K 340 K360 K380 K400 K

5 300 K 320 K 340 K360 K380 K400 K

5 300 K 320 K 340 K360 K380 K400 K Catch-up cluster

5 300 K 320 K 340 K360 K380 K400 K Catch-up cluster

Blue bars represent a node running an even iteration Green bars represent a node running an odd iteration Red spots show when the owner of the node is using it (interrupting our job) Yellow bars show when the job is moved over to the catchup cluster Activity of nodes over the last day

Catchup cluster was a large collection of non- dedicated nodes. Condor master server crashed! University network failed! Catchup cluster redesignated to six fast, dedicated nodes. One slow or interrupted iteration delays large parts of the simulation Activity of nodes since the start of the simulation

Current paradigm for biomolecular simulation Target selection: literature based; interesting protein/problem System preparation: highly interactive, slow, idiosyncratic Simulation: diversity of protocols Analysis: highly interactive, slow, idiosyncratic Dissemination: traditional – papers, talks, posters Archival: archive data… and then lose the tape!

Managing MD data: BioSimGRID u u Distributed database environment u Software tools for interrogation and data-mining u Generic analysis tools u Annotation of simulation data 1 st Level Metadata – describing the simulation data 2 nd Level Metadata – describing the results of generic analyses Simulation Data Analyse Data Distributed Raw Data Application Distributed Query York Nottingham Birmingham Oxford Bristol Southampton London

Comparative simulations Increase significance of results –Effect of force field –Simulation protocol Long simulations and multiple simulations Biology emerges from the comparisons –Very easy to over-interpret protein simulations –What’s noise, and what’s systematic?

Test Application: Comparison of Active Site Dynamics OMPLA – bacterial outer membrane lipase; GROMACS; Oxford AChE – neurotransmitter degradation at synapses; NWChem; UCSD (courtesy of Andrew McCammon) Both have catalytic triad at active site – compare conformational dynamics

BioSimGRID prototype Prototype: Summer 2003 (UK e-science All-Hands meeting) Database Access: DBI / PythonDB2 / PortalLib Apache/SSL SQL Editor Applications Server Analysis Tool HTML Generator AAA Module Other HTTP(S) DB2 Cluster Traj Query Tool Video/Img Generator Python Web Portal Environment Python Environment TCP/IP WEB TCP/IP

Revised structure Simulation Hybrid Storage database Flat file Analysis Toolkit RMSD RMSF Volume Distances Internal Angles Surface Result Files Trajectory Metadata User input 1. Submission of trajectory 2. Generation of Metadata 3. Data- on-demand Query 4. Analysis BioSimGrid Visualisation Tools 5. View Result Config. Files

Example of script use (distance matrix) FC = FrameCollection(‘2,5-8’) myDistanceMatrix = DistanceMatrix(FC) myDistanceMatrix.createPNG() myDistanceMatrix.createAGR() Script Calculates Distance Matrix User has requested result as PNG Grace project file was also produced

QM drug binding protein motions drug diffusion Future directions: Multiscale biomolecular simulations u Membrane bound enzymes – major drug targets (cf. ibruprofen, anti-depressants, endocannabinoids); gated access to active site coupled to membrane fluctuations u Complex multi-scale problem: QM/MM; ligand binding; membrane/protein fluctuations; diffusive motion of substrates/drugs in multiple phases u Need for integrated simulations on GRID-enabled HPC resources Bristol Southampton Oxford London

Computational challenges u Need to integrate HPC, cluster & database resources u Funding: bid to BBSRC under consideration… IntBioSim HPCx Linux cluster BioSimGRID database

Acknowledgements My group: Stephen Phillips Richard Taylor Daniele Bemporad Christopher Woods Robert Gledhill Stuart Murdock My funding: BBSRC, EPSRC, Royal Society, Celltech, AstraZeneca, Aventis, GlaxoSmithKline My collaborators: Mark Sansom, Adrian Mulholland, David Moss, Oliver Smart, Leo Caves, Charlie Laughton, Jeremy Frey, Peter Coveney, Hans Fangohr, Muan Hong Ng, Kaihsu Tai, Bing Wu, Steven Johnston, Mike King, Phil Jewsbury, Claude Luttmann, Colin Edge