a Virtual Compound Space

Slides:



Advertisements
Similar presentations
Solutions for Cheminformatics
Advertisements

Virtual Synthesis - Reactor
August 2010, ACS National meeting, Boston Representation of Markush structures from molecules towards patents Szabolcs Csepregi Solutions for Cheminformatics.
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia First presented at Applications of Cheminformatics.
Optimized Virtual Screening
Screen Ligand based virtual screening presented by … maintained by Miklós Vargyas Last update: 13 April 2010.
Version 5.3, February 2010 Scientific & technical presentation JChem Base.
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia January, 2007 Structural Search Using ChemAxon.
May, 2008 Presenting: Szabolcs Csepregi The ChemAxon Markush project overview and development discussion.
Scientific & technical presentation Fragmenter Nóra Máté Sept 2005.
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
JKlustor clustering chemical libraries presented by … maintained by Miklós Vargyas Last update: 25 March 2010.
Scientific & technical presentation Calculator Plugins January 2011.
Version 5.3, April 2010 The ChemAxon Markush project overview and development discussion.
Structural Search Using ChemAxon Tools
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
Whats new in JChem back-end and Markush storage, search and enumeration Szabolcs Csepregi Solutions for Cheminformatics.
Java Solutions for Cheminformatics June 2007 Company and product overview.
In Silico Synthesis György Pirok, Nóra Máté. Elements of the Virtual Synthesis Technology A language for describing chemical rules –Chemical Terms A library.
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
1 of of 18 Introduction 3 of 18 Motivation Virtual synthesis for the assessment of practical chemical hypothesis's. What is the set of all molecules.
UGM 2006 Miklós Vargyas Scientific Workshop Maximum Common Substructure.
The new JKlustor suite Miklós Vargyas Solutions for Cheminformatics.
Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia.
ChemAxon in 3D Gábor Imre, Adrián Kalászi and Miklós Vargyas Solutions for Cheminformatics.
1 Miklós Vargyas May, 2005 Compound Library Annotation.
UGM, June, 2007 Presenting: Szabolcs Csepregi JChem Base and Cartridge latest.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
1 Szabolcs Csepregi May, 2005 Structural Search Using ChemAxon Tools.
Chemaxon's chemo-informatics toolkit integration into the Affectis Data Management System Database Automated Data Integration - Example: IC50 Data generated.
CONFIDENTIAL INFORMATION KnowTox, a Franco- Hungarian collaborative project relative to toxicity.
Introduction to Metabolizer
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
The curve corresponding to the maximal or optimal QCI values was determined for each level of the clustering. This maximal curve was used as reference.
1 Real World Chemistry Virtual discovery for the real world Joe Mernagh 19 May 2005.
1 Miklós Vargyas, Judit Papp May, 2005 MarvinSpace – live demo.
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Java Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
Java Solutions for Cheminformatics March About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular.
Solutions for Cheminformatics
Analysis of High-Throughput Screening Data C371 Fall 2004.
Christopher Reynolds Supervisor: Prof. Michael Sternberg Bioinformatics Department Division of Molecular Biosciences Imperial College London.
Computational strategies and methods for building drug-like libraries Tim Mitchell, John Holland and John Woods Cambridge Discovery Chemistry & Oxford.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
2DS01 Statistics 2 for Chemical Engineering lecture 5.
Active Learning Strategies for Compound Screening Megon Walker 1 and Simon Kasif 1,2 1 Bioinformatics Program, Boston University 2 Department of Biomedical.
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Christopher Reynolds, Stephen Muggleton and Michael Sternberg Bioinformatics and Computing Departments Imperial College London.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Similarity Methods C371 Fall 2004.
EXPLORING CHEMICAL SPACE FOR DRUG DISCOVERY Daniel Svozil Laboratory of Informatics and Chemistry.
Daniel Brown. D9.1 Discuss the use of a compound library in drug design. Traditionally, a large collection of related compounds are synthesized individually.
Faculté de Chimie, ULP, Strasbourg, FRANCE
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Combinatorial Chemistry Advanced Medicinal Chemistry (Pharm 5219): Section A Ref.: An Introduction to Medicinal Chemistry, 3 rd ed. 2005, G.L.Patrick,
Combinatorial Chemistry. Synthesis of many structures (diversity) combinatorial technology, combinatorial library molecular diversity What is Combinatorial.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
Use of Machine Learning in Chemoinformatics
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
3D Fragment Consortium Dr Andy Morley Project Manager.
Molecular Modeling in Drug Discovery: an Overview
Natural products from plants
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Christopher Reynolds Supervisor: Prof. Michael J.E. Sternberg
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Statistics 2 for Chemical Engineering lecture 5
Synthetic Routes.
Presentation transcript:

a Virtual Compound Space Screening a Virtual Compound Space Szabolcs Csepregi Ferenc Csizmadia Szilárd Dóránt Nóra Máté György Pirok Zsuzsanna Szabó Jenő Varga Miklós Vargyas ChemAxon Ltd. Máramaros köz 3/a 1037 Budapest Hungary www.chemaxon.com

Drug research Finding or making a needle in the hay stack? virtual screening JChem Screen de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

Drug research Finding or making a needle in the hay stack? virtual screening JChem Screen de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

Virtual Screening Find something similar to a fistful of needles corporate database structures found known actives

Molecular similarity How to tackle it? Quantitative assessment of similarity/dissimilarity of structures need a numerically tractable form molecular descriptors, fingerprints, structural keys Sequences/vectors of bits, or numeric values that can be compared by distance functions, similarity metrics.

Virtual screening using fingerprints Multiple query structures 0100010100011101010000110000101000010011000010100000000100100000 0001101110011101111110100000100010000110110110000000100110100000 0100010100110100010000000010000000010010000000100100001000101000 0101110100110101010111111000010000011111100010000100001000101000 0001000100010100010100100000000000001010000010000100000100000000 0100010100010100000000000000101000010010000000000100000000000000 0101010101111100111110100000000000011010100011100100001100101000 0100010100011000010000011000000000010001000000110000000001100000 0000000100000000010000100000000000001010100000000100000100100000 0101110100110101010111111000010000011111100010000100001000101000 hits queries hypothesis fingerprint metric 0000000100001101000000101010000000000110000010000100001000001000 0100010110010010010110011010011100111101000000110000000110001000 0100010100011101010000110000101000010011000010100000000100100000 0001101110011101111110100000100010000110110110000000100110100000 0100010100110100010000000010000000010010000000100100001000101000 0100011100011101000100001011101100110110010010001101001100001000 0101110100110101010111111000010000011111100010000100001000101000 0100010100111101010000100010000000010010000010100100001000101000 0001000100010100010100100000000000001010000010000100000100000000 0100010100010011000000000000000000010100000010000000000000000000 0100010100010100000000000000101000010010000000000100000000000000 0101010101111100111110100000000000011010100011100100001100101000 0100010100011000010000011000000000010001000000110000000001100000 0000000100000000010000100000000000001010100000000100000100100000 0100010100010100000000100000000000010000000000000100001000011000 0001000100001100010010100000010100101011100010000100001000101000 0100011100010100010000100001001110010010000010001100000000101000 0101010100010100010100100000000000010010000010010100100100010000 targets target fingerprints

Optimized virtual screening Parameterized metrics asymmetry factor scaling factor asymmetry factor weights

How good is optimized virtual screening? β2-adrenoceptor antagonist

Is virtual screening a discovery tool? Scaffold hopping

Drug research Finding or making a needle in the hay stack? virtual screening JChem Screen de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

JChem AnalogMaker Workflow Lead Candidates

Fragmentation Examples Fragmentation rules Original molecule Generated fragments Fragment 1 amide 2 Amide Fragment 2 amide 1 ester 1 Ester Fragment 3 ester 2

Fragmentation RECAP rules 1 = amide 2 =ester 3 = amine 4 = urea 5 = ether 6 = olefin 7 = quaternary nirogen 8 = aromatic N carbon 9 = lactam N carbon 10 = aromatic carbon – aromatic carbon 11 = sulphonamide Xiao Qing Lewell, Duncan B. Judd, Stephen P. Watson, Michael M. Hann; RECAP – retrosynthetic combinatorial analysis procedure: a powerful new technique for identifying privileged molecular fragments with useful applications in combinatorial chemistry. J. Chem. Inf. Comput. Sci. 1998, 38, 511–522

JChem AnalogMaker General algorithm start create building block library generate pharmacophore hypothesis of active compounds create several starting compounds by random combination of some building blocks select parent structure convergence or end of optimization generate  variants of parent stop

Variant generation Example: TOPAS modifier G. Schneider et al, J. Comput.-Aided Mol. Design, 14(2000): 487-494 G. Schneider et al, Angew. Chem. Int. Ed., 39(2000): 4130-4133

Drug research Finding or making a needle in the hay stack? virtual screening JChem Screen de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

Drug research Finding or making a needle in the hay stack? virtual screening JChem Screen ? de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

random virtual synthesis Drug research Screening a virtual compound space virtual screening JChem Screen random virtual synthesis JChem Synthesizer de novo design JChem AnalogMaker advantages disadvantages fast hits are readily available for in vitro screening limited number of available compounds advantages disadvantages fast virtual molecules are likely to be synthetically available practically infinite virtual compound space structural novelty advantages disadvantages practically unlimited virtual compound space structural novelty synthetic accessibility of virtual hits is a problem

Screening a virtual compound space Smart reactions Generic (simple) the equation describes the transformation only few hundred generic reactions can form the basic armory of a preparative chemist Specific (complex) chemo-, recognizes reactive and inactive functional groups regio-, "knows" directing rules stereo-, inversion/retention Customizable to improve reaction model quality

Smart reactions Chemoselectivity REACTIVITY: !match(ratom(3), "[#6][N,O,S:1][N,O,S]", 1)

Smart reactions Chemoselectivity REACTIVITY: !match(ratom(3), "[#6][N,O,S:1][N,O,S]", 1) && !match(ratom(3), "[N,O,S:1][C,P,S]=[N,O,S]", 1)

Smart reactions Regioselectivity SELECTIVITY: -charge(ratom(1)) TOLERANCE: 0.0045

Smart reactions Regioselectivity SELECTIVITY: -charge(ratom(1)) TOLERANCE: 0.0045

Smart reaction library Example Baeyer-Villiger ketone oxidation SELECTIVITY: charge(ratom(2), "sigma")

Smart reaction library Baeyer-Villiger ketone oxidation Generic reaction

Smart reaction library Example Baeyer-Villiger ketone oxidation

Virtual compound space JChem Synthesizer Screen Hits Active set1 Workflow Virtual compound space Available chemicals Synthesizer Screen Hits Active setn Smart reaction library

JChem Synthesizer example Dopamine D2 actives Active sets were kindly provided by Aureus Pharma within a research collaboration between Aureus and ChemAxon.

JChem Synthesizer example Virtual hits similarity: 2D pharmacophore fingerprint, weighted Euclidean metric optimized for 20 random d2 actives

JChem Synthesizer example Best virtual hits 9.88 9.82 9.53 9.73

JChem Synthesizer example Synthesis path step 1 Knoevenagel-Doebner condensation

JChem Synthesizer example step 2 Baylis-Hillman vinyl alkylation

JChem Synthesizer example step 3 Lawesson thiacarbonylation

JChem Synthesizer example step 4 Dess-Martin alcohol oxidization

JChem Synthesizer example Software and performance data virtual reactions: 500-1000 reactions/s random synthesis: 10-20 structures/s pharmacophore fingerprint generation: 100 structure/s (includes pharmacophore point perception) metric optimization: 57 sec (13 parameterized metrics, 20 structures in training set, 50 spikes) virtual screening: 7500 structure/s pure Java client: P4 1.6GHz, RH Linux, java 1.4.2 database server: P4 2.4GHz, Windows XP, MySQL

Acknowledgements ChemAxon Jean-Michael Drancourt François Petitet Modest von Korff, Matthias Steger (Axovan is now part of Actelion.) Alex Allardyce ChemAxon

Contact Miklós Vargyas mvargyas@chemaxon.hu office: +36 1 453 2661 mobile: +36 70 381 3205