DockCrunch and Beyond... The future of receptor-based virtual screening Bohdan Waszkowycz, Tim Perkins & Jin Li Protherics Molecular Design Ltd Macclesfield,

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

1 Real World Chemistry Virtual discovery for the real world Joe Mernagh 19 May 2005.
Analysis of High-Throughput Screening Data C371 Fall 2004.
A Multiobjective Approach to Combinatorial Library Design Val Gillet University of Sheffield, UK.
Improving enrichment rates A practical solution to an impractical problem Noel O’Boyle Cambridge Crystallographic Data Centre
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
ABCD Flexsim-R: A new 3D descriptor for combinatorial library design and in-silico screening 2 nd Joint Sheffield Conference on Chemoinformatics: Computational.
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
Bioinformatics Vol. 21 no (Pages ) Reporter: Yu Lun Kuo (D )
Future CAMD Workloads and their Implications for Computer System Design IEEE 6th Annual Workshop on Workload Characterization.
Computational Biology: A Measurement Perspective Alden Dima Information Technology Laboratory
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
Prototyping. Horizontal Prototyping Description of Horizontal Prototyping A Horizontal, or User Interface, Prototype is a model of the outer shell of.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Luddite: An Information Theoretic Library Design Tool Jennifer L. Miller, Erin K. Bradley, and Steven L. Teig July 18, 2002.
Docking of Protein Molecules
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Pharmacophore-based Molecular Docking Bert E. Thomas, Diane Joseph- McCarthy, Juan C.Avarez.
RAPID: Randomized Pharmacophore Identification for Drug Design PW Finn, LE Kavraki, JC Latombe, R Motwani, C Shelton, S Venkatasubramanian, A Yao Presented.
System Engineering Instructor: Dr. Jerry Gao. System Engineering Jerry Gao, Ph.D. Jan System Engineering Hierarchy - System Modeling - Information.
Protein Structure and Drug Discovery Workshop To be held at Monash University, Mebourne, Australia October 3 rd to 4 th 2006 Molecular Visualization Learn.
Introduction to Computer Technology
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Combinatorial Chemistry and Library Design
INFO425: Systems Design INFORMATION X Finalizing Scope (functions/level of automation)  Finalizing scope in terms of functions and level of.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Predictive Runtime Code Scheduling for Heterogeneous Architectures 1.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
Cluster Reliability Project ISIS Vanderbilt University.
Data Mining Process A manifestation of best practices A systematic way to conduct DM projects Different groups has different versions Most common standard.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development 3.
Presented by: Reem Alshahrani. Outlines What is Virtualization Virtual environment components Advantages Security Challenges in virtualized environments.
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Whole Test Suite Generation. Abstract Not all bugs lead to program crashes, and not always is there a formal specification to check the correctness of.
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Function BIRN The ability to find a subject who may have participated in multiple experiments and had multiple assessments done is a critical component.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Docking and Virtual Screening Using the BMI cluster
Molecular Modeling in Drug Discovery: an Overview
TIDEA Target (and Lead) Independent Drug Enhancement Algorithm.
FESR Consorzio COMETA - Progetto PI2S2 Molecular Modelling Applications Laura Giurato Gruppo di Modellistica Molecolare (Prof.
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS
Ligand-Based Structural Hypotheses for Virtual Screening
Virtual Screening.
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
Reporter: Yu Lun Kuo (D )
Bohdan Waszkowycz, Tim Perkins & Jin Li
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Chapter 2: Building a System
Presentation transcript:

DockCrunch and Beyond... The future of receptor-based virtual screening Bohdan Waszkowycz, Tim Perkins & Jin Li Protherics Molecular Design Ltd Macclesfield, UK

Outline Structure-based virtual screening –an achievable (and possibly useful) tool for drug discovery –the DockCrunch validation study Protherics’ experience since DockCrunch –methods: making VS a routine task –analysis: getting the most from your data –the future (and beyond)

Virtual Screening computational screening targeted selection screen smaller focused libraries compound collections virtual libraries receptor structure molecular docking

Why Use Molecular Docking? Most detailed representation of binding site –overcomes simplifications of pharmacophores –identify both conservative and novel solutions –impetus for de novo design/optimisation Broad range of analyses applicable –diverse scoring/selection criteria Quality/throughput of available methods –good enough, despite technical limitations

DockCrunch Validation study for large-scale virtual screening –flexible ligand/rigid receptor docking –PRO_LEADS docking code using ChemScore scoring function –1.1M druglike ACD-SC compounds –dock versus oestrogen receptor (agonist and antagonist structures) –collaboration with SGI

Oestradiol:Oestrogen Receptor Complex

DockedEnergy Profiles Agonist ReceptorAntagonist receptor Achieve good separation in terms of predicted binding affinity

DockCrunch Results Demonstrated technical feasibility –1.1M cpds docked in 6 days/64 processor Origin –implemented automated pre- and post-processing Demonstrated potential for lead identification –successful discrimination of seeded known hits –activity for 21 out of 37 assayed compounds –ER binding affinities to 7nM Ki –novel non-steroidal chemistries

Since DockCrunch... VS established as a routine CAMD task: –2.2M structures docked in DockCrunch –1.5M docked versus in-house target –2.5M docked to date in external contracts –project 1: 0.25M Dec 2000 –project 2: 0.25M Jan 2001 –project 3: 1M Feb 2001 –project 4: 1M March-April 2001 –project 5: 0.5M to do in May... –diverse targets/databases/project objectives

Virtual Screening within Prometheus Database preparation e.g. salt removal, protonation Database pre-filtering select drug-like profile Receptor-ligand docking predict binding mode/affinity Analysis graphical browsing, subset selection Receptor structure Commercial databases Virtual databases

PRO_LEADS Docking Tabu search + extended ChemScore function –robust prediction of binding free energy –85% success rate achieved across diverse test set Pre-calculated grids for energies/neighbour lists –defines extent of binding site –automatically/graphically defined Selection of PRO_LEADS docking protocol –use standard protocol across all receptors –specific constraints or modified energy terms available if desired

Example of Grid Definition cAMP-dependent kinase (1YDS) contact surface coloured by lipophilicity

Docking Throughput Standard protocols take 1–5 mins/ligand –e.g. typical VS run at ~4 min for 3M tabu steps –250k cpds/week on 100 processor Linux cluster (VA Linux 750MHz PIII) PLUNDER script for parallelization –automatic processing of ligand batches –balances processor workload –works across heterogeneous architectures –supplies running time statistics –handles hardware failures

Data Analysis and Subset Selection Intrinsic problems of scoring functions: –cannot parameterize all critical interactions –try to take account of induced fit effects –calibrated only versus good binders –ignore co-operativity in binding When applied to random datasets: –predicted affinity typically normal distributed –overestimates binding affinity of random set  energy alone not ideal for subset selection

Achieving Better Selection Need to supplement scoring function –consensus scoring schemes Explore more fundamental descriptors of receptor:ligand complementarity –capture characteristics of diverse receptor types –assess deficiencies of existing scoring functions –use as simple filters or as pseudo energy terms

Enrichment Rates Effect of different selection criteria for ER set for recovery of seeded compounds

Requirements for Analysis Package VS generates huge data output –want to be able to browse through entire dataset Real-time navigation of large datasets –graphing property distributions –selections based on property filters –browsing of 3D models within selections –initiating additional property calculations –data transformations –writing subset/reports

PropertyViewer

Approach to Analysis 1. Preliminary exploration –browse property distributions –comparisons with known ligands 2. Initial elimination of poor structures –DockedEnergy, component energies –DE corrected for size/functionality –receptor:ligand steric complementarity –polar/lipophilic surface complementarity

Approach to Analysis 3. Further filtering  define focused subsets –tighter 2D property filters –clustering by 2D chemistry –presence of key 3D binding interactions –specific H-bonds, specific lipo contacts, pocket occupancy, volume overlap with reference ligand/fragment, etc –similarity/diversity of 3D binding mode –3D similarity descriptors –final ranking by DockedEnergy or hybrid energy/complementarity scoring function

DockedEnergy vs Size

Complementarity Space ER and FXa datasets

Addressing More Difficult Cases - COX2 Knowns show clustering in property space despite modest DockedEnergy

Improvements in Docking Function original docking function some misdocked knowns new docking function more consistent docking +ve shift in random energies

Comparison of filters in subset selection 87% pass 2D filters 37% pass energy filters 22% pass complementarity filters 1% 12% 22% 9% 2% 43% 0% Initial filtering to ~10% –energy filters –complementarity –2D properties Selection of final ~1% subset –3D structural features –preferred binding motifs –2D/3D diversity

Conclusions Established VS as a routine CAMD task –focused software development –achieved success in drug discovery projects VS is more than a black box –data mining is worthwhile –explore receptor-ligand complementarity to achieve good subset selection and point towards better scoring functions

Future Directions for VS Exploit expanding computing resource –improved docking/scoring functions –improved receptor representations Broader application of VS –evaluation of drugability of early targets –screening of very large virtual libraries –routine screening across protein families –DMPK issues

Acknowledgements Tim Perkins Martin Harrison Richard SykesCarol Baxter Richard HallChris Murray David FrenkelJin Li David Sheppard Thanks to: SGI, MSI, MDL, VA Linux