Download presentation
Presentation is loading. Please wait.
Published byKylie Wiggs Modified over 9 years ago
1
BioSimGRID: A GRID Database of Biomolecular Simulations Mark S.P. Sansom http://indigo1.biop.ox.ac.uk mark@biop.ox.ac.uk
2
Overview u Introduction to biomolecular simulations u www.biosimgrid.org u Why? Case study – added value from comparisons u How? Progress towards a prototype of BioSimGRID u The future? Towards computational systems biology
3
MD Simulations: from Structure to Dynamics u Molecular simulations as a tool for protein structure analysis u MD – Newtonian simulation of molecular dynamics using an empirical forcefield u Why? - Proteins move X-ray structure: average structure at 100 K in crystal MD simulations: dynamics at 300 K in water (& membrane) u Challenge: to relate structural dynamics to biological function
4
Molecular Dynamics u Describe the forces on all atoms: bonded (bonds, angles, dihedrals) non-bonded (van der Waals, electrostatics) u Describe the initial atom positions: u Integrate: F = ma (a few million times…) u Result: positions and energies of all atoms during a few nanoseconds u Applications: liquids … peptides … proteins … membranes u Membrane + protein + water = ca. 50,000 atoms u Need for comparative analysis of simulations – GRID data and collaboration u Need for efficient parallelisation – clusters and/or HPC
5
Current Paradigm for MD Simulations u Target selection: literature based; interesting protein/problem u System preparation: highly interactive; slow; idiosyncratic u Simulation: diversity of protocols u Analysis: highly interactive; slow; idiosyncratic u Dissemination: traditional – papers, posters, talks u Archival: ‘archive’ data … and then mislay the tape!
6
Integrating Simulations and Structural Biology of Proteins Novel structure (RCSB) Sequence alignment Biomedically relevant homologue(s) Homology model(s) MD simulations Biomolecular simulation database Comparative analysis Evaluation/refinement of model Biological and pharmacological simulation & modelling e.g. drug discovery bacterial K channel mammalian K channel dynamics in membrane drug docking calculations Interaction site dynamics bioinformatics & structural biology BioSimGRID drug discovery
7
Comparative Simulations: Drug Receptors u Why? – increase significance of results u Sampling – long simulations and multiple simulations u Sampling via biology – exploiting evolution u Biology emerges from comparisons… u e.g. mammalian receptor vs. bacterial binding protein u Rat GluR2 EC fragment u Major receptor in mammalian brains – drug target u MD simulations with/without bound ligands u Analyse inter-domain motions glutamate S1 S2
8
GluR2 – Flexibility & Gating… u Flexibility depends on ligand occupancy & species u Gating mechanism – decrease in flexibility on channel activation u But … incomplete sampling u Need: longer simulations & comparative simulations empty Kainate Glutamate >> > “OFF” “ON” 01.01.50.5 1 2 3 4 time (ns) RMSD (Å) 0 empty +Kai +Glu 2.0
9
GlnBP – A Bacterial Binding Protein u GlnBP – bacterial 2-domain periplasmic binding protein u Similar fold to mammalian GluR2 u X-ray shows ligand binding induces domain closure u MD shows ligand binding reduces inter-domain motions - cf. GluR2 simulations + Gln empty Gln bound X-ray structures MD Simulation empty Gln bound
10
Main Initial Tasks u To establish a distributed database environment u To develop Grid/Web services using GT3/OGSA infrastructure u To develop software tools for interrogation and data-mining u To develop generic analysis tools u Annotation of simulation data with biological and structural data from other databases York Nottingham Birmingham Oxford RAL Southampton London collaborating groups
11
Oxford –database management system (Bing Wu) –(meta)data curatorship & integration (Kaihsu Tai) Southampton –application programming interface & data retrieval (Muan Hong Ng) –generic analysis tools (Stuart Murdock) Dividing up the Tasks
12
table trajectory: one entry for each trajectory table coordinate: {x, y, z} one entry for each atom in each residue in each frame in each trajectory table atom: one entry for each atom in each residue in each trajectory table residue: one entry for each residue in each trajectory table frame: one entry for each frame in each trajectory dictionary tables metadata tables Database Design: Simplified
13
Database Design: A More Complete Version
14
Simulation Metadata u Difficult to extract from published literature u This is a prototype: a needs analysis with users/depositors must be conducted u Annotation/links to other biological databases essential id molecules author depositors affiliations publications method src_stru ref_stru prog ver hardware num_of_proc timestep num_of_frame ens_type thermostat solvent forcefield ele_stat equ_prot hyd_atom unit_shape … metadata
15
Database Editor & SQL Query Capability
16
BioSimGRID Prototype Target date for prototype: July 2003
17
Deliverables to Date… Database schema Sample database (with test trajectories) Prototype shared between 2 sites Analysis tools – preliminary versions Interface to database for data retrieval Python hosting environment
18
Roadmap u Dec 2002 – project started u July 2003 – (internal) prototype u September 2003 – working prototype (All Hands meeting) u November 2003 – test ‘real world’ applications u December 2003 – multi-site prototype u 2004 – multi-site deposition of data u 2005 – open up to additional groups for deposition/testing
19
Future Directions u HTMD – simulations coupled to structural genomics Diamond light source u Computational system biology – virtual outer membrane HPCx u Multiscale biomolecular simulations – from QM/MM to meso- scale modelling GRID-enabled simulations u Combine all of these with BioSimGRID…
20
Structural Genomics & HTMD u Overall vision – simulation as an integral component of structural genomics u Needs capacity computation – GRID? u MD database (distributed) – BioSimGRID synchrotron MD database novel biology… compute GRID
21
Towards a Virtual Outer Membrane (vOM) u First step towards computational systems biology – a suitable system u Bacterial OMs – 5 or 6 proteins = 90% of protein content u Structures or good homology models of proteins are available u Complex lipid – outer leaflet is lipopolysaccharide (LPS) u Minimum system size ca. 2.5x10 6 atoms; simulation times ca. 50 ns cf. current FhuA – 80,000 atoms & 10 ns – need HPCx
22
Multiscale Biomolecular Simulations u Membrane bound enzymes – major drug targets (cf. ibruprofen, anti-depressants, endocannabinoids) u Complex multi-scale problem: QM/MM; ligand binding; membrane/protein fluctuations; diffusive motion of substrates/drugs in multiple phases u Need for GRID-based integrated simulations
23
Oxford Dr Phil Biggin Dr Carmen Domene Dr Alessandro Grottesi Dr Andrew Hung Dr Daniele Bemporad Dr Shozeb Haider Dr Kaihsu Tai Dr Bing Wu George Patargias Oliver Beckstein Yalini Pathy Pete Bond Jonathan Cuthbertson Sundeep Deol Jeff Campbell Loredana Vaccaro Jennifer Johnston Katherine Cox Robert d’Rozario John Holyoake Andrew Pang BBSRCDTI The Wellcome TrustGSK EC (TMR) OeSC (EPSRC & DTI) EPSRC OSC (JIF) MRC BioSimGRID Leo Caves (York) Simon Cox (Southampton) Jon Essex (Southampton) Paul Jeffreys (Oxford) Charles Laughton (Nottingham) David Moss (Birkbeck) Oliver Smart (Birmingham) Southampton Dr Stuart Murdock Dr Muan Hong Ng Dr Richard Maurer Dr Hans Fangohr Steve Johnston
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.