Christopher Reynolds Supervisor: Prof. Michael Sternberg Bioinformatics Department Division of Molecular Biosciences Imperial College London.

Slides:



Advertisements
Similar presentations
Estimating the False Discovery Rate in Multi-class Gene Expression Experiments using a Bayesian Mixture Model Alex Lewin 1, Philippe Broët 2 and Sylvia.
Advertisements

Predicting Kinase Binding Affinity Using Homology Models in CCORPS
Analysis of High-Throughput Screening Data C371 Fall 2004.
Miroslav Brumovský28th July 2011 Miroslav Brumovský 28 th July 2011 Methods using polarization for in silico fragment-based drug design.
Functional Site Prediction Selects Correct Protein Models Vijayalakshmi Chelliah Division of Mathematical Biology National Institute.
In silico small molecule discovery Sales Target gene Discover hit Hit to lead Optimise lead Clinical Target gene identified with a viable assay High throughput.
Improving enrichment rates A practical solution to an impractical problem Noel O’Boyle Cambridge Crystallographic Data Centre
Why multiple scoring functions can improve docking performance - Testing hypotheses for rescoring success Noel M. O’Boyle, John W. Liebeschuetz and Jason.
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
University of Wisconsin ISMB 2002 Department of Biostatistics Department of Computer Science Mining Three-dimensional Chemical Structure Data Sean McIlwain.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
A Study on Feature Selection for Toxicity Prediction*
Discovering Substructures in Chemical Toxicity Domain Masters Project Defense by Ravindra Nath Chittimoori Committee: DR. Lawrence B. Holder, DR. Diane.
Systems Biology Biological Sequence Analysis
Enhancing the C-48 STAT3 Inhibitor
Design of Small Molecule Drugs Targeted to RNA RNA Ontology Group May
Active Learning Strategies for Drug Screening 1. Introduction At the intersection of drug discovery and experimental design, active learning algorithms.
Predicting Highly Connected Proteins in PIN using QSAR Art Cherkasov Apr 14, 2011 UBC / VGH THE UNIVERSITY OF BRITISH COLUMBIA.
eHiTS Score Darryl Reid, Zsolt Zsoldos, Bashir S. Sadjad, Aniko Simon, The next stage in scoring function evolution: a new statistically.
GGAGATTCTGGGCCACTTTGGTTCCCCATGAGCCAAGACGGCACTTCTAATTTGCATTCCCTACCGGAGTCCCTGTCTGTAGCCAGCCTGGCTTTCAGCTGGTGCCCAAAGTGACAAATGTATCTGCAATGACAAAGGTAC CCTGGAAGGGCTCGCCCTCTGCGGAATTTCAGTTCATGCAGGCCTTGGTGCTTCCACATCTGTCCAAGGGCCTTTCAAATGTGACTTTTAACTCTGTGGATTGATTTGCCCGG
Pharmacophore and FTrees
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Asia’s Largest Global Software & Services Company Genomes to Drugs: A Bioinformatics Perspective Sharmila Mande Bioinformatics Division Advanced Technology.
Christopher Reynolds, Stephen Muggleton and Michael Sternberg Bioinformatics and Computing Departments Imperial College London.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Introduction In order for us to learn from the extensive prior literature we have collated information on molecules screened versus Mycobacterium tuberculosis.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Designing a Tri-Peptide based HIV-1 protease inhibitor Presented by, Sushil Kumar Singh IBAB,Bangalore Submitted to Dr. Indira Ghosh AstraZeneca India.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
1 In Silico Screening of Zinc (II) Enzyme Inhibitors Using ILP Tadasuke Ito, Shotaro Togami, Shin Aoki and Hayato Ohwada Department of Industrial Administration.
“Emergency discovery” of novel antimicrobials among known drugs in response to new and re-emerging infectious threats A. Cherkasov UBC / VGH Infectious.
1 John Mitchell; James McDonagh; Neetika Nath Rob Lowe; Richard Marchese Robinson.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
1 Cheminformatics David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Computer-aided drug discovery (CADD)/design methods have played a major role in the development of therapeutically important small molecules for several.
Catalyst TM What is Catalyst TM ? Structural databases Designing structural databases Generating conformational models Building multi-conformer databases.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Use of Machine Learning in Chemoinformatics
Molecular Cell Biology Logic and Approaches to Research Cooper.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
Molecular Modeling in Drug Discovery: an Overview
TIDEA Target (and Lead) Independent Drug Enhancement Algorithm.
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Nehad A. El Sayed, Amal A. H. Eissa, Reem K. Arafa and Ghada F. El Masry* Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Cairo University.
Natural products from plants
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Jump to first page Relational Data. Jump to first page Inductive Logic Programming (ILP) n Can use ILP to find a set of rules capturing a property that.
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
PINALOG Protein Interaction Network Alignment and its implication in function prediction and complex detection Hang Phan Prof. Michael J.E. Sternberg.
Christopher Reynolds Supervisor: Prof. Michael J.E. Sternberg
Computational Tools Seminar
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Ligand-Based Structural Hypotheses for Virtual Screening
Ingenuity Knowledge Base
Experimental confirmation of in vitro GPCR activity of compounds and their scaffolds screened from a chemical library. Experimental confirmation of in.
Virtual Screening.
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
ASSIGNMENT 10 Use the UP and DOWN tags as provided and query the LINCS. These are the KRAS-DEP (UP) and KRAS-IND (DOWN) gene signatures from Singh et al.
Cheminformatics Basics
Docking validation of Dud778-dUTPase cocrystal structure.
Perturbational Gene-Expression Signatures for Combinatorial Drug Discovery  Chen-Tsung Huang, Chiao-Hui Hsieh, Yun-Hsien Chung, Yen-Jen Oyang, Hsuan-Cheng.
Model of the change in receptor structure on engagement of the ligand IFN-γ. Model of the change in receptor structure on engagement of the ligand IFN-γ.
Presentation transcript:

Christopher Reynolds Supervisor: Prof. Michael Sternberg Bioinformatics Department Division of Molecular Biosciences Imperial College London

Investigational Novel Drug Discovery by Example. A proprietary technology developed by Equinox Pharma that uses a system developed from Inductive Logic Programming for drug discovery. This approach generates human-comprehensible weighted rules which describe what makes the molecules active. In a blind test, INDDEx™ had a hit rate of 30%, predicting around 30 active molecules, each capable of being the start of a new drug series. INDDEx™

Fragmentation of molecules into chemically relevant substructure Inductive Logic Programming generates QSAR rules Screens model against molecular database Novel hits Observed activity

Dataset

Fragmentation Molecules broken into chemically relevant fragments. Simplest fragmentation is to break the molecule into its component atoms. More complex fragmentations break the molecule into fragments relating to hydrophobicity and charge.

Deriving logical rules Create a series of hypotheses linking the distances of different structure fragments. For each hypothesis, find how good an indicator of activity it is. Hypotheses above a certain compression can be classed as rules.

Example ILP rules active(A):- positive(A, B), Nsp2(A, C), distance(A, B, C, 5.2, 0.5). active(A):- phenyl(A, B), phenyl(A, C), distance(A, B, C, 0.0, 0.5). Molecule is active if there is a positive charge centre and an sp 2 orbital nitrogen atom 5.2 ± 0.5 Å apart. Molecule is active if a phenyl ring is present.

Calculate correlation Deriving and quantifying the rules Inductive Logic Hypotheses + −+− Support Vector Machine

Screening Apply model to a database of molecules. (ZINC) Contains 11,274,443 molecules available to buy “off-the- shelf”. INDDEx™ pre-calculates descriptors to save time.

Testing Tested on publically available data Directory of Useful Decoys (DUD) Case study Finding molecules to inhibit the SIRT2 protein.

Testing methodology 40 protein targets Actives Decoys All Decoys95,171 Decoys

Enrichment curves % of ranked database % of known ligands retrieved Results for LASSO and DOCK from (Reid et al. 2008), and results for PharmaGist from (Dror et al. 2009)

Enrichment Factors Enrichment factor EF 1% EF 0.1%

Performance, similarity, and target set size Number of active ligands Mean similarity of dataset / Average of ROC area

Similarity versus performance Dataset mean similarity Enrichment Factor at 1% Drug-Like Molecules Pearson’s R = 0.71

Testing scaffold hopping AtomsBondsTotal NANA NBNB N AB N AB N A + N B - N AB

Testing scaffold hopping % of ranked database % of known ligands retrieved

Rule (all distances have a tolerance of 1 Ångström)Fit to training data Rule examples for PDGFrb

Case study: SIRT2 inhibition SIRT2 is NAD-dependent deacetylase sirtuin-2. 3 chains, each a domain. Inhibition can cause apoptosis in cancer cell lines (Li, Genes Cells, 2011).

Molecules found by in vitro tests to have some low activity against SIRT2

Predicted molecules docked against modelled SIRT2 protein structure using GOLD™

SIRT2 results Training data 8 molecules IC 50 activities between 1.5 µM and 78 µM 8 molecules with best consensus INDDEx and docking scores purchased and tested. All molecules were structurally distinct from training molecules. Two molecules had activity. One had IC 50 of 3.4 μM. Better than all but one of the training data molecules.

Summary INDDEx has been shown to be a powerful screening method whose strength lies in learning topological descriptors of multiple active compounds. INDDEx can achieve a good rate of scaffold hopping even when there are low numbers of active compounds to learn from. Potential new drug leads found for SIRT2 protein. Testing is continuing.

Imagery Wikimedia Commons iStockPhoto® Funding BBSRC Equinox Pharma All of you for listening. Acknowledgments Mike Sternberg Stephen Muggleton Ata Amini Suhail Islam SIRT2 drug design Paolo Di Fruscia Matt Fuchter Eric Lam Chemistry Development Kit

Questions?