Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia.

Slides:



Advertisements
Similar presentations
February 2013 Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Advertisements

Solutions for Cheminformatics
Solutions for Cheminformatics April 2010 Company and product overview.
Virtual Synthesis - Reactor
August 2010, ACS National meeting, Boston Representation of Markush structures from molecules towards patents Szabolcs Csepregi Solutions for Cheminformatics.
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia First presented at Applications of Cheminformatics.
Version 5.3, February 2010 Scientific & technical presentation JChem Base.
Scientific & technical presentation JChem Cartridge for Oracle
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia January, 2007 Structural Search Using ChemAxon.
May, 2008 Presenting: Szabolcs Csepregi The ChemAxon Markush project overview and development discussion.
Scientific & technical presentation Fragmenter Nóra Máté Sept 2005.
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
Scientific & technical presentation MarvinSketch and MarvinView
Java Solutions for Cheminformatics Feb 2008 Whats new for PP.
Version 5.3, April 2010 The ChemAxon Markush project overview and development discussion.
Calculator Plugins József Szegezdi, Nóra Máté. ChemAxon Calculator Plugins ChemAxons plugin handling mechanism provides a framework for calculating various.
UGM 2006 Miklós Vargyas Whats new in JKlustor. Overview An introduction to JKlustor –Brief history of the product –Main features –Usage examples –Performance.
Technical presentation Alternatives to MDL ® Cheshire.
Structural Search Using ChemAxon Tools
Scientific & technical presentation Standardizer January 2008.
Nov 2008 Scientific & technical presentation JChem for Excel.
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
Whats new in JChem back-end and Markush storage, search and enumeration Szabolcs Csepregi Solutions for Cheminformatics.
Java Solutions for Cheminformatics June 2007 Company and product overview.
In Silico Synthesis György Pirok, Nóra Máté. Elements of the Virtual Synthesis Technology A language for describing chemical rules –Chemical Terms A library.
Scientific & technical presentation Calculator Plugins József Szegezdi, Nóra Máté Sept 2005.
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Integrating JChem and Marvin into the Integrity ® Drug Discovery and Development Portal Rosa Alentorn, Gerard Chiva and Ann Wescott ChemAxon UGM, 7-8 June.
Whats new in Marvin? GUI features –atom sets, MarvinSpace integration, better customization –AWT is not used anymore File IO –new file types and seeking.
Java Solutions for Cheminformatics April 2006 JChem Cartridge For Oracle - Latest.
1 Miklós Vargyas May, 2005 Compound Library Annotation.
UGM, June, 2007 Presenting: Szabolcs Csepregi JChem Base and Cartridge latest.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
ChemAxon - Pipeline Pilot Integration
1 Szabolcs Csepregi May, 2005 Structural Search Using ChemAxon Tools.
Introduction to Metabolizer
UGM, June, 2007 Szabolcs Csepregi Markush: Whats new, development discussions.
Java Solutions for Cheminformatics Structure based predictions – new plugins Zsolt Mohácsi, Nóra Máté, József Szegezdi, Ödön Farkas, Gábor Imre, Imre Jákli.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
June, 2007 David Spender*, Erika Biró What's new in Marvin and development discussion.
ChemAxon for Developers Ferenc Csizmadia 2008 November – Last updated: 2010 April.
Solutions for Cheminformatics Marvin features and news Akos Papp.
JChem Node extension for KNIME Workbench
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Java Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
Java Solutions for Cheminformatics March About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular.
Solutions for Cheminformatics
EasySearch Technical Overview. Ever seen a website without a full text search? BUT – Search is expensive Financially Computationally – Search is complicated.
Discovering Computers Fundamentals, Third Edition CGS 1000 Introduction to Computers and Technology Fall 2006.
Information Retrieval in Practice
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
September 2014, Version Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Overview of Search Engines
Electronic Thesis and Dissertation Database Errors Ryan Mestre Luke Schmader Client: Zhiwu Xie Blacksburg March 3, 2014 Virginia Tech CS 4624.
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
What does WWW stand for? And following abbreviations? HTTP: Hyper Text Transfer Protocol HTML: Hyper Text Mark-up Language URL: Uniform Resource Locator.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Federated Database Set Up Greg Magsamen ITK478 SIA.
Z39 Server and Z39.50 Gateway. Z39 Configuration Z39.50 Server Bath Profile conformance has been added to the Z39 Server. Z39 server supports Structure.
Python and Chemical Informatics The Daylight and OpenEye toolkits, part II Presented by Andrew Dalke, Dalke Scientific Software for David Wild’s I590 course.
Web based spectrum databases and utilities László Dobos Tamás Budavári István Csabai MAGPOP kick-off meeting, January Cassis.
HELM 2.0 Toolkit Code Orientation. HELM 2.0 Package overview 2 HELM2NotationToolkit ChemistryToolkit ChemistryToolkitMarvinChemistryToolkitCDK HELMNotationParser.
June 2016, Version Scientific & technical Presentation Pipeline Pilot Integration.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Information Retrieval in Practice
Pipeline pilot Components
AI Discovery Template IBM Cloud Architecture Center
Presentation transcript:

Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia

Why standardize structures? Canonicalisation Uniformization of structures without changing the chemical content to recognize duplicates, functional groups (aromatization, mesomers, tautomers,... ) Beautification Making the structures visually more attractive ( dearomatization, cleaning coordinates, wedge orientation,... ) Modification Conversion of structures by modifying its original content as a preparation step for further chemoinformatics tasks (transformations, removing stereo, removing R-groups,...). often difficult to categorize the standardization actions

Canonicalisation making hydrogens explicit converting to canonical mesomer form transforming to user defined mesomer form Hydrogens aromatizing Kekülé rings Resonant structures converting to canonical tautomer form removing user defined fragments transforming to user defined tautomer form Tautomers expanding stoichiometry Other removing small fragments making hydrogens implicit setting the chiral flag

Mesomers

Tautomers oxo-enol, enamine-imine

Tautomers pyridone-pyridol

Fragment removal

Specific counterion removal

Solvent removal

Stoichiometry expansion expanding salt stoichiometry

Stoichiometry expansion expanding reaction stoichiometry

Beautification calculating 2D coordinates Hydrogens converting aromatic rings to Kekülé format Resonant structures making hydrogens implicit Cleaning reallocating wedge bonds contracting/expanding/ ungrouping abbreviated and multiple groups Groups template based cleaning 3D geometry optimization

Template-based Cleaning 2D-coordinate calculation of macrocycles or bridged systems

query Template-based Cleaning aligning search results to the query

client Canonicalization During Database Import Relational Database input structures canonicalization configurationoriginal structurescanonicalized structures server Standardizer JChem Base / Cartridge

client Sending Query to the Database Relational Database server query structure canonicalization configurationcanonicalized query query is compared to the canonicalized structures Standardizer JChem Base / Cartridge

Displaying Result Structures Relational Database original structures server client beautification configuration beautified structures Standardizer JChem Base / Cartridge

Modification custom transformations +

API and command line interface Standardizer st = new Standardizer(new File("standardize.xml")); st.standardize(mol); standardize input.sdf -c config.xml -o output.smiles

Live Demonstration

Applications: Virtual Synthesis

Applications: Structure Databases

How can ChemAxon Help Free for non commercial websites Free for academic teaching and research Academic Package Free Academic Package to be extended to cover academic networks – campus-wide roll out

Acknowledments Ferenc Csizmadia Nóra Máté István Cseh Szabó Attila Szilárd Dóránt Péter Kovács Szabolcs Csepregi