Version 5.3, February 2010 Scientific & technical presentation JChem Base.

Slides:



Advertisements
Similar presentations
February 2013 Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Advertisements

Solutions for Cheminformatics
Solutions for Cheminformatics April 2010 Company and product overview.
Virtual Synthesis - Reactor
August 2010, ACS National meeting, Boston Representation of Markush structures from molecules towards patents Szabolcs Csepregi Solutions for Cheminformatics.
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia First presented at Applications of Cheminformatics.
Solutions for Cheminformatics September 2009 Company and product overview.
Scientific & technical presentation JChem Cartridge for Oracle
1 Szabolcs Csepregi*, Szilárd Dóránt, Nóra Máté, Miklós Vargyas, Péter Kovács, György Pirok, Ferenc Csizmadia January, 2007 Structural Search Using ChemAxon.
May, 2008 Presenting: Szabolcs Csepregi The ChemAxon Markush project overview and development discussion.
Scientific & technical presentation Fragmenter Nóra Máté Sept 2005.
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
JKlustor clustering chemical libraries presented by … maintained by Miklós Vargyas Last update: 25 March 2010.
Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
Scientific & technical presentation MarvinSketch and MarvinView
Java Solutions for Cheminformatics Feb 2008 Whats new for PP.
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Version 5.3, April 2010 The ChemAxon Markush project overview and development discussion.
Structural Search Using ChemAxon Tools
Scientific & technical presentation Standardizer January 2008.
Chemical Naming Daniel Bonniot, PhD October 2008.
Nov 2008 Scientific & technical presentation JChem for Excel.
Szilárd Dóránt May 2006 Building on JChem Base. Contents Introduction Structural overview The Property Table JChem structure tables The log table Standardization.
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
Whats new in JChem back-end and Markush storage, search and enumeration Szabolcs Csepregi Solutions for Cheminformatics.
JChem Base chemical database
In Silico Synthesis György Pirok, Nóra Máté. Elements of the Virtual Synthesis Technology A language for describing chemical rules –Chemical Terms A library.
Solutions for Cheminformatics
Interfacing the JChem Suite outside of Java Jonathan Lee Solutions for Cheminformatics.
Welcome to San Diego!! Alex Drijver, CEO Solutions for Cheminformatics.
UGM, June, 2007 Presenting: Szabolcs Csepregi JChem Base and Cartridge latest.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
ChemAxon - Pipeline Pilot Integration
1 Szabolcs Csepregi May, 2005 Structural Search Using ChemAxon Tools.
Leveraging ChemAxon Cheminformatics in an Integrated Drug Discovery and Development Platform Zhenbin Li, Paul Starbard, Jim Gregory, Donald Chen, Paul.
UGM, June, 2007 Szabolcs Csepregi Markush: Whats new, development discussions.
Name to structure, Structure to name, chemicalize.org Daniel Bonniot Solutions for Cheminformatics.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
DeltaSofts ChemCart Next Generation Access to Research Data ChemAxon User Group Meeting Budapest, Hungary June 13-14, 2007.
June, 2007 David Spender*, Erika Biró What's new in Marvin and development discussion.
ChemAxon for Developers Ferenc Csizmadia 2008 November – Last updated: 2010 April.
Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,
Solutions for Cheminformatics Marvin features and news Akos Papp.
Name to structure, Structure to name, chemicalize.org Daniel Bonniot de Ruisselet Solutions for Cheminformatics.
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
Java Solutions for Cheminformatics March About Us Molecule Drawing and Visualization Structure Searching Cartridge Structure Standardization Molecular.
Solutions for Cheminformatics
Dr. Matthew Wright Product Director.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
1 Web-Enabled Decision Support Systems Access Introduction: Touring Access Prof. Name Position (123) University Name.
Oracle SQL Developer Data Modeler 3.0: Technical Overview March 2011.
Integrating Access with the Web and with Other Programs.
September 2014, Version Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Tutorial 11: Connecting to External Data
CSCI 6962: Server-side Design and Programming
1 InstantJChem: a flexible chemical database system G. Marcou, D. Horvath + Laboratoire d’infochimie, Université de Strasbourg, 1, rue Blaise Pascal,
ASP.NET Programming with C# and SQL Server First Edition
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
® Microsoft Office 2010 Access Tutorial 3 Maintaining and Querying a Database.
Copyright © 2006 Knovel Corporation Streamline Your Science and Engineering Research
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
EBI is an Outstation of the European Molecular Biology Laboratory. MSDchem and the chemistry of the wwPDB EMBO 22nd-26th September 2008 EMBL-EBI Hinxton.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
CHAPTER 7 LESSON C Creating Database Reports. Lesson C Objectives  Display image data in a report  Manually create queries and data links  Create summary.
June 2016, Version Scientific & technical Presentation Pipeline Pilot Integration.
Pipeline pilot Components
Tutorial 7 – Integrating Access With the Web and With Other Programs
Presentation transcript:

version 5.3, February 2010 Scientific & technical presentation JChem Base

Introduction to JChem Base High performance Java based tools for: storage, search and retrieval of chemical structures and associated data The components can be integrated into web-based or standalone applications in association with other ChemAxon tools

Structural overview Web browser Application Web application JChem Base API: Chemical logic Structure cache JDBC driver: Standard interface to the RDBMS RDBMS (e.g. Oracle, MySQL, etc.) : Storage and security

Compatibility and integration File formats: SMILES MDL molfile (v2000 and v3000) MDL SDF RXN RDF MRV IUPAC name, InChI Markush DARC CDX Integration: extensive API for Java.NET JChem Cartridge for Oracle Database engines: Oracle MySQL MS SQL Server PostgreSQL MS Access IBM DB2 Derby etc. Operating systems: Windows Linux Mac OS X Solaris etc.

JSP example application Features: Substructure, Superstructure, Full, Exact fragment, Similarity and Perfect search Molecular Descriptor similarity search with descriptor coloring Substructure hit alignment and coloring, inverse hit list Chemical Terms filter Import / Export Export of hits Insert / Modify / Delete structures AJAX in JChem Webservices

Structure search features See detailed information on structure search: Wide range of query atoms Query properties R-group queries Full SMARTS support Coordination compounds Link nodes Pseudo atoms, lone pairs Relative stereo Reaction search features Hit coloring, position variation Polymers

Search options Some selected structure search options: Stereo on/off Ignore charge/isotope/radical/ valence/polymers, etc. Vague bond matching options Chemical Terms filter Tautomer search Inverse hit list Maximum search time / number of hits Combine with non-structure conditions Ordering of results etc.

JChem Base 5.2.2, Intel Quad Q GHz, 8 GB RAM; Oracle Performance (1) Number of compounds Elapsed time Duplicates not checked Duplicates checked 10,00021 s26 s 100,0002 min 4 s2 min 34 s 200,0004 min 24 s5 min 13 s QueryNumber of hitsSearch time s s 6, s 146,2565,66 s Compound registration: Substructure search in PubChem (19.5 million compounds):

Performance (2) Similarity search: Tanimoto >0.9 JChem Base 5.2.2, Intel Quad Q GHz, 8 GB RAM; Oracle QueryNumber of hitsSearch time s s s

Markush structures Markush structure registration and search Markush features R-groups Atom lists, bond lists Position variation bond Link nodes and repeating units Homology variation (alkyl, aryl, etc.) Compatible Markush enumeration plugin

Administration with JChemManager User interface for creating tables import export deleting rows dropping tables Most functions are also available from command line.

Standardization Default standardization includes: – Hydrogen removal – Aromatization Custom standardization can be specified for each table by specifying an XML configuration file at table creation or in the Table Options dialog of JChem Manager (jcman)

Custom Standardization Example afterbefore Standardizer

The property table The property table stores information about JChem structure tables, including: Fingerprint parameters Custom standardization rules Other table options and information More than one property table can be used, each property table represents a particular JChem environment.

Table types Control allowed chemical structures and available operations Molecule Reaction Markush Query Any structure

The structure of JChem tables Column nameExplanation cd_idunique numeric identifier in the table cd_structure the imported structure in the original format, without modifications (except for the removal of data fields) cd_smiles; cd_smarts; cd_markush the standardized structure format dependig on the different table types, used by the search process cd_formulathe formula of the standardized structure cd_sortable_formulaformula representation for alphanumerical sorting cd_molweightthe molecular weight of the standardized structure cd_hash; cd_flags; cd_fp… fields used internally for structure searching cd_timestampthe date and time of the insertion of the row [user fields]custom data fields can be added by the user

Structural search in database Two stage method provides optimal performance: 1. Rapid pre-screening reduces the number of possible hit candidates Chemical Hashed Fingerprints are used for substructure and superstructure searches Hash code is used for duplicate filtering (usually during compound registration) 2. Graph search algorithm is used to determine the final hit list

Structure Cache Contains Fingerprints for screening and ChemAxon Extended SMILES for ABAS Instant access to the structures for the search process Reduced load on the database server Incremental update ensures minimum overhead after changes in the table Small memory footprint due to –SMILES compression –Optimized storage technique Approximately 100MB memory needed for 1 million typical drug-like structures (using default, 512 bit long fingerprints)

Future plans Graphical user interface for R-group decomposition Arbitrary table structure (Java and.NET API for JChem index) Maximum common substructure search type Additional layer: JChem Server (later also as grid) Compound registration system API

Summary ChemAxons JChem Base API provides sophisticated high performance tools for the developer to deal with chemical structures and associated data. Building on the JChem API is convenient, because: Our various tools integrate seamlessly Both high and low level API classes are available Responsive developer-to-developer support

Links JChem home page: Online tryout: API documentation: Brochure:

Visit other technical presentations MarvinSketch/View MarvinSpace Calculator Plugins JChem Base JChem Cartridge Standardizer Screen JKlustor Fragmenter Reactor