Solutions for Cheminformatics

Slides:



Advertisements
Similar presentations
February 2013 Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Advertisements

Solutions for Cheminformatics April 2010 Company and product overview.
Virtual Synthesis - Reactor
August 2010, ACS National meeting, Boston Representation of Markush structures from molecules towards patents Szabolcs Csepregi Solutions for Cheminformatics.
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia First presented at Applications of Cheminformatics.
Solutions for Cheminformatics September 2009 Company and product overview.
Version 5.3, February 2010 Scientific & technical presentation JChem Base.
Scientific & technical presentation JChem Cartridge for Oracle
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia January, 2007 Structural Search Using ChemAxon.
May, 2008 Presenting: Szabolcs Csepregi The ChemAxon Markush project overview and development discussion.
Scientific & technical presentation Fragmenter Nóra Máté Sept 2005.
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
JKlustor clustering chemical libraries presented by … maintained by Miklós Vargyas Last update: 25 March 2010.
Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
Java Solutions for Cheminformatics Feb 2008 Whats new for PP.
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Version 5.3, April 2010 The ChemAxon Markush project overview and development discussion.
Calculator Plugins József Szegezdi, Nóra Máté. ChemAxon Calculator Plugins ChemAxons plugin handling mechanism provides a framework for calculating various.
Structural Search Using ChemAxon Tools
Scientific & technical presentation Standardizer January 2008.
Chemical Naming Daniel Bonniot, PhD October 2008.
Nov 2008 Scientific & technical presentation JChem for Excel.
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
4 August 2009Copyright © 2009 – Kelaroo, Inc. Kelaroo & ChemAxon Robert D. Feinstein, PhD Vice President & CSO, Kelaroo, Inc.
Whats new in JChem back-end and Markush storage, search and enumeration Szabolcs Csepregi Solutions for Cheminformatics.
Java Solutions for Cheminformatics June 2007 Company and product overview.
In Silico Synthesis György Pirok, Nóra Máté. Elements of the Virtual Synthesis Technology A language for describing chemical rules –Chemical Terms A library.
Scientific & technical presentation Calculator Plugins József Szegezdi, Nóra Máté Sept 2005.
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
The new JKlustor suite Miklós Vargyas Solutions for Cheminformatics.
Integrating JChem and Marvin into the Integrity ® Drug Discovery and Development Portal Rosa Alentorn, Gerard Chiva and Ann Wescott ChemAxon UGM, 7-8 June.
Solutions for Cheminformatics
Prediction of Xenobiotic Metabolism and Major Metabolites György Pirok Solutions for Cheminformatics.
Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia.
Genostar 2009 Genostar Bioinformatics Solutions Connecting, completing and exploring biochemical and genomic data with Metabolic Pathway Builder ChemAxon's.
Welcome to San Diego!! Alex Drijver, CEO Solutions for Cheminformatics.
1 Miklós Vargyas May, 2005 Compound Library Annotation.
UGM, June, 2007 Presenting: Szabolcs Csepregi JChem Base and Cartridge latest.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
ChemAxon - Pipeline Pilot Integration
1 Szabolcs Csepregi May, 2005 Structural Search Using ChemAxon Tools.
19 May 2005Copyright © 2005 – Kelaroo, Inc. Kelaroo Applications & ChemAxon Components: Reagent Management Robert D. Feinstein, Ph.D. Kelaroo, Inc. –
Chemaxon's chemo-informatics toolkit integration into the Affectis Data Management System Database Automated Data Integration - Example: IC50 Data generated.
Introduction to Metabolizer
UGM, June, 2007 Szabolcs Csepregi Markush: Whats new, development discussions.
Java Solutions for Cheminformatics Structure based predictions – new plugins Zsolt Mohácsi, Nóra Máté, József Szegezdi, Ödön Farkas, Gábor Imre, Imre Jákli.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
June, 2007 Akos Papp Corporate Registration System - A future solution.
DeltaSofts ChemCart Next Generation Access to Research Data ChemAxon User Group Meeting Budapest, Hungary June 13-14, 2007.
June, 2007 David Spender*, Erika Biró What's new in Marvin and development discussion.
Solutions for Cheminformatics Marvin features and news Akos Papp.
1 Miklós Vargyas, Judit Papp May, 2005 MarvinSpace – live demo.
JChem Node extension for KNIME Workbench
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Java Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
Java Solutions for Cheminformatics March About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular.
Solutions for Cheminformatics
Analysis of High-Throughput Screening Data C371 Fall 2004.
September 2014, Version Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Molecular Descriptors
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
1 Cheminformatics David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
June 2016, Version Scientific & technical Presentation Pipeline Pilot Integration.
Pipeline pilot Components
Daylight and Discovery
Virtual Screening.
Presentation transcript:

Solutions for Cheminformatics Methods for tautomer enumeration, -searching and -duplicate filtering József Szegezdi, Zsolt Mohácsi, Tamás Csizmazia, Szilárd Dóránt, Ákos Papp, György Pirok, Szabolcs Csepregi, Ferenc Csizmadia August 2010, ACS National meeting, Boston Solutions for Cheminformatics

August 2010, ACS National meeting, Boston Contents Tautomerization plugin Methods for generating tautomers How tautomers are used in property predictions Tautomer duplicate search Generic tautomer generation Handling of stereochemistry Tautomer substructure search Custom tautomerization: Standardizer August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston ChemAxon Cheminformatics toolkits and applications HQ: Budapest, Hungary Founded: 1998 Main customers: pharma, biotech, publishing 3rd party applications and web sites. (e.g. Integrity, Reaxis, PDB ligand search, ELN-s, registration systems, etc) August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston ChemAxon Main products: Structure drawing & visualization (Marvin family) Chemical DB tools (JChem family) Property predictions (Calculator plugins) Drug discovery tools (Reactor, JKlustor, etc.) Development strategy: customer-driven August 2010, ACS National meeting, Boston

What is tautomerization? August 2010, ACS National meeting, Boston

Tautomerization Plugin August 2010, ACS National meeting, Boston

Tautomerization Plugin Can generate: Dominant tautomer distribution with estimated ratio: those likely to exist (e.g. at a given pH). Major tautomer: the most dominant one All tautomers: all theoretically possible Generic tautomer: used for duplicate tautomer searching Canonical tautomer: canonicalization based on empirical rules August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Algorithm Identify H donors and acceptors. Filter them depending on parameters and method. Consider bond paths between donors/acceptors. Process results of 2 & 3: Combinatorically enumerate, Rank results (dominant & canonical), Calculate distribution (dominant), Filter & select (canonical) Combine paths into regions (generic) August 2010, ACS National meeting, Boston

Enhancing property predictions ChemAxon property predictions have option to consider tautomers: Log D: whole dominant tautomer distribution is used, weighted by the ratio of isomers. pKa, major microspecies, logP: the single major tautomer is used for the calculation. August 2010, ACS National meeting, Boston

Tautomer searching methods in JChem August 2010, ACS National meeting, Boston

Table/index option „Duplicate search uses tautomers” – check box at table / index creation. It makes all duplicate searches „tautomer search” on that table Algorithm: Convert query to generic tautomer Tautomer hash code screening Graph matching of generic tautomers August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Generic tautomer Generic tautomer calculation is accessible in Marvin for understanding/checking behavior: August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Generic tautomer Example: Generic tautomer August 2010, ACS National meeting, Boston

Generic tautomer – algorithm H donor and acceptor atoms are identified Excluded parts are located. (e.g. Protected stereo, aromaticity, etc.) Tautomer regions are identified based on alternating single/double bonds, etc. between H donor and acceptor atoms. Assignment of tautomer regions Variable bonds are replaced by any bonds. Bonding electron count, number of D and T in the region is attached to the region as data. (Needed for graph matching.) August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Stereochemistry Theory: When spontaneous tautomerization happens in nature, the specific stereo configuration is lost easily in two steps of back and forth tautomerization. (E.g. racemization.) For this reason, in general we ignore stereochemistry in tautomer regions, but there are options to protect different types of stereochemistry. August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Stereochemistry Current practices in JChem (considering structure registration): Tetrahedral stereo centers (which are marked by wedge bonds) are excluded from tautomerization ( = they are never included in tautomer regions). Explanation: wedge stereo drawing is usually not accidental. This way the chemists express that a separate stereoisomer can be isolated. Double bond(E/Z) stereochemistry is allowed to tautomerize. Explanation: It is very easy to „accidentally” define E/Z stereochemistry. – Just by the atom coordinates. This way it is not easy to decide what the original intention was. August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Stereochemistry Examples: Generic tautomer August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Stereochemistry Example 2: As a result, each row below will be recognized as different compounds. Molecules in the same row are recognized as tautomers of each other: (“protect double bond stereo” option is off) (“protect all tetrahedral stereo centers” option is on) August 2010, ACS National meeting, Boston

Tautomer search option Query molecule is enumerated Query: non-tautomer tautomer search: search: August 2010, ACS National meeting, Boston

Standardization methods Canonical tautomer Custom transforms – examples: August 2010, ACS National meeting, Boston

Standardization method Custom transform example for ring-chain tautomerism: Mollica et al, J.Pharmaceutical Sciences, Vol 60, No 9, p1380, 1971 August 2010, ACS National meeting, Boston

Comparison of the methods Available search types Registration speed Tautomer search speed tautomer non-tautomer Duplicate Substructure Duplicate tautomer table option OK - * - (under dev.) Small overhead Fast Tautomer search option - As normal May be slow Canonical tautomer in standardization Custom standardization transforms * Available via tautomer search option (covered in row 2). August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Under development Further filters (e.g. customizable stable groups) Built-in ring-chain tautomerization in tautomer plugin Switchable tautomer search for tautomer table option in JChem August 2010, ACS National meeting, Boston

August 2010, ACS National meeting, Boston Summary Tautomer calculation plugin offers various methods Tautomers can improve property predictions JChem offers 4 ways for searching tautomers August 2010, ACS National meeting, Boston

Acknowledgements József Szegezdi – development of the tautomerization algorithms Zsolt Mohácsi – plugin framework, standardizer Szilárd Doránt, Tamás Csizmazia, Ali Baharev – tautomer search Ákos Papp, György Pirok, Ferenc Csizmadia, Szabolcs Csepregi – concepts

References [1] Martin, Y.C.: Let’s not forget tautomers J Comput Aided Mol Des (2009) 23:693–704, DOI 10.1007/s10822-009-9303-2 [2] Szegezdi, J.; Csizmadia, F: Tautomer generation. pKa based dominance conditions for generating dominant tautomers. American Chemical Society meeting, Aug 19-23rd, 2007 http://www.chemaxon.com/conf/Tautomer_generation_A4.pdf [3] Pirok, G. et al: Standardizer - Molecular Cosmetics for Chemoinformatics. Drug Discovery Technology, August 7-10th, 2006 http://www.chemaxon.com/conf/standardizer.pdf