Presentation is loading. Please wait.

Presentation is loading. Please wait.

Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia.

Similar presentations


Presentation on theme: "Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia."— Presentation transcript:

1 Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia

2 Why standardize structures? Canonicalisation Uniformization of structures without changing the chemical content to recognize duplicates, functional groups (aromatization, mesomers, tautomers,... ) Beautification Making the structures visually more attractive ( dearomatization, cleaning coordinates, wedge orientation,... ) Modification Conversion of structures by modifying its original content as a preparation step for further chemoinformatics tasks (transformations, removing stereo, removing R-groups,...). often difficult to categorize the standardization actions

3 Canonicalisation making hydrogens explicit converting to canonical mesomer form transforming to user defined mesomer form Hydrogens aromatizing Kekülé rings Resonant structures converting to canonical tautomer form removing user defined fragments transforming to user defined tautomer form Tautomers expanding stoichiometry Other removing small fragments making hydrogens implicit setting the chiral flag

4 Mesomers

5 Tautomers oxo-enol, enamine-imine

6 Tautomers pyridone-pyridol

7 Fragment removal

8 Specific counterion removal

9 Solvent removal

10 Stoichiometry expansion expanding salt stoichiometry

11 Stoichiometry expansion expanding reaction stoichiometry

12 Beautification calculating 2D coordinates Hydrogens converting aromatic rings to Kekülé format Resonant structures making hydrogens implicit Cleaning reallocating wedge bonds contracting/expanding/ ungrouping abbreviated and multiple groups Groups template based cleaning 3D geometry optimization

13 Template-based Cleaning 2D-coordinate calculation of macrocycles or bridged systems

14 query Template-based Cleaning aligning search results to the query

15 client Canonicalization During Database Import Relational Database input structures canonicalization configurationoriginal structurescanonicalized structures server Standardizer JChem Base / Cartridge

16 client Sending Query to the Database Relational Database server query structure canonicalization configurationcanonicalized query query is compared to the canonicalized structures Standardizer JChem Base / Cartridge

17 Displaying Result Structures Relational Database original structures server client beautification configuration beautified structures Standardizer JChem Base / Cartridge

18 Modification custom transformations +

19 API and command line interface Standardizer st = new Standardizer(new File("standardize.xml")); st.standardize(mol); standardize input.sdf -c config.xml -o output.smiles

20 Live Demonstration

21 Applications: Virtual Synthesis

22 Applications: Structure Databases

23 How can ChemAxon Help Free for non commercial websites Free for academic teaching and research Academic Package Free Academic Package to be extended to cover academic networks – campus-wide roll out

24 Acknowledments Ferenc Csizmadia Nóra Máté István Cseh Szabó Attila Szilárd Dóránt Péter Kovács Szabolcs Csepregi


Download ppt "Standardizer Molecular Cosmetics for Chemoinformatics György Pirok Nóra Máte István Cseh Szilárd Dóránt Péter Kovács Szabolcs Csepregi Ferenc Csizmadia."

Similar presentations


Ads by Google