Name to structure, Structure to name, chemicalize.org Daniel Bonniot de Ruisselet Solutions for Cheminformatics.

Slides:



Advertisements
Similar presentations
February 2013 Szilárd Dóránt Scientific & technical Presentation Pipeline Pilot Integration.
Advertisements

Microfilm CAR, Film and Paper Scanners OCR Image Indexing VersaVIEW VersaIMAGE GOLD Twain Paper Scanners Automated Microfilm Readers TWAIN & ISIS Microfilm.
Don’t Type it! OCR it! How to use an online OCR..
End-to-end document capture, indexation, OCR to Microsoft SharePoint
Version 5.3, February 2010 Scientific & technical presentation JChem Base.
Scientific & technical presentation JChem Cartridge for Oracle
Scientific & technical presentation Calculator Plugins January 2011.
Instant JChem INFORMATICS MATTERS
Java Solutions for Cheminformatics Feb 2008 Whats new for PP.
Calculator Plugins József Szegezdi, Nóra Máté. ChemAxon Calculator Plugins ChemAxons plugin handling mechanism provides a framework for calculating various.
JChem Web Services Server Jonathan Lee Solutions for Cheminformatics Technical Product Presentation.
Chemical Naming Daniel Bonniot, PhD October 2008.
Nov 2008 Scientific & technical presentation JChem for Excel.
Pipeline Pilot Integration Szilard Dorant Solutions for Cheminformatics.
Solutions for Cheminformatics
Interfacing the JChem Suite outside of Java Jonathan Lee Solutions for Cheminformatics.
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
Name to structure, Structure to name, chemicalize.org Daniel Bonniot Solutions for Cheminformatics.
Java Solutions for Cheminformatics Structure based predictions – new plugins Zsolt Mohácsi, Nóra Máté, József Szegezdi, Ödön Farkas, Gábor Imre, Imre Jákli.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
Solutions for Cheminformatics Marvin features and news Akos Papp.
June, 2007 Petr Hamernik Extending Instant JChem 2.0 Architecture & API.
June, 2007 Daniel Bonniot IUPAC Naming. Available in Marvin since (April 2007) Present several ways to use it Evaluation Work in progress.
2008 Accelrys EUGM Pipelining ChemAxon Szilard Dorant Solutions for Cheminformatics.
Instant JChem 2009 US + EU Seminars Confidential. Copyright© 2009 ChemAxon Kft, Informatics Matters Ltd Instant JChem Instant JChem Seminar series Q
NIMAC 2.0: The Accessible Media Producer Portal NIMAC 2.0 for AMPs.
Visit the ccScan Website Scan, Import, and Automatically File documents to the Cloud SCAN, IMPORT, AND AUTOMATICALLY FILE DOCUMENTS TO SALESFORCE ® Introduction.
Multi-Media Handbook The software program that efficiently manages all types of information.
HalFILE 2.1 New Application Features Session I-a.
WEB SERVICES. FIRST AND FOREMOST - LINKS Tomcat AXIS2 -
Microsoft Word By: Phuong Nguyen.
1 How Do I Order From.decimal? Rev 05/04/09 This instructional training document may be updated at anytime. Please visit and check the.
Enhancing Spotfire with the Power of R
AIMSweb Progress Monitor Online User Training
Introduction Embedded Universal Tools and Online Features 2.
EndNote. What is EndNote:  EndNote is referencing software that enables you to create a database of references from your readings. Your database of references.
XHTML Basics.
Internet Research Techniques Graham Seibert Copyright 2006 This is a segment of the draft version of a large syllabus. I need your feedback to improve.
Client Lunch & Learn (12:15). Association for Information & Image Management Nov Research Scanner Utilization.
February 24, 2015 Allison Kidd, ATRC. Direct Services for CSU Students & Employees with Disabilities Ensure Equal Access to Technology & Electronic Information.
Introducing new web content management tools for Priority...
By Hrishikesh Gadre Session II Department of Mechanical Engineering Louisiana State University Engineering Equation Solver Tutorials.
Before class begins… Help us to assess this session and plan for future workshops Please complete the Advanced Refworks Pre-learning assessment at:
AN OVERVIEW OF MAC PDF TOOLS 1. PDF Tools for Mac PDF files can be used either in Windows, Unix or Apple’s Mac OS operating system commonly. It still.
Chad Kafka Technology Coach & Teacher Twitter / Buzz / Skype / Delicious: chadkafka In The Classroom.
Overview of Mini-Edit and other Tools Access DB Oracle DB You Need to Send Entries From Your Std To the Registry You Need to Get Back Updated Entries From.
SqlReports Dean Dahlvang PSUG-MO March About Dean Dean Dahlvang Director of Administrative Technology for the Proctor.
Inking. 2 Pen Basics You MUST keep your pen tethered at all times. If you lose the stylus, the replacement cost is $30. Buttons should face YOU in garage.
Max Planck Institute for Psycholinguistics Tool development report H. Brugman MPI Nijmegen.
The most powerful high-speed scanning, indexing and OCR solution on the market Supports many high speed scanners: Fujitsu, Canon, Kodak, Epson, Avision,
Confidential, I.R.I.S. © 2005, All rights reserved Discover… The most robust solution to structure, index, compress and convert all your documents into.
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
Enricher Converter Analyzer Parser & Renderer UNIVERSAL, FAST AND RELIABLE.
Managing References Using the free reference management tool Zotero.
Lesson 12: Creating a Manual and Using Mail Merge.
What is TrinDocs A fully integrated document management system enabling: Archiving Instant Retrieval Workflow & Routing OCR and Intelligent Form Recognition.
GUI Design With The Appx Client Presented By: Gary Rogers.
Department of Computer Science Internet Performance Measurements using Firefox Extensions Scot L. DeDeo Professor Craig Wills.
Comparison of different output options from Stata
Test Automation For Web-Based Applications Portnov Computer School Presenter: Ellie Skobel.
How to combine IRIS products Available APIs Examples of integrations Ole Andersen Senior Strategic Account Manager.
Capture and Storage of Tabular Data Leveraging Ephesoft and Alfresco W. Gary Cox Senior Consultant Blue Fish Development Group.
Text2PTO: Modernizing Patent Application Filing A Proposal for Submitting Text Applications to the USPTO.
XP New Perspectives on Creating Web Pages With Word Tutorial 1 1 Creating Web Pages With Word Tutorial 1.
TechKnowlogy Conference August 2, 2011 Using GoogleDocs for Collaboration.
Editing Editing – the process of updating a word processing document to: make changes correct errors make it visually appealing.
AHG Advanced Techniques for PDF Accessibility
The effort-saving, cost-cutting, low-overhead, cloud capture platform.
Affinity Consulting John A. Federico
Presentation transcript:

Name to structure, Structure to name, chemicalize.org Daniel Bonniot de Ruisselet Solutions for Cheminformatics

Motivations Why use chemical names? Easier than drawing Familiar Used in patents, articles,...

Overview We will talk about: Name generation (structure to name, s2n) Name import (name to structure, n2s) Name extraction (document to structure, d2s) Name correction (OCR-error fixing) Name highlighting (chemicalize.org)

Structure to name Usage: Plugin in MarvinView and MarvinSketch Label updated in real-time in MarvinSketch Batch: Save As: IUPAC Name in MarvinView Batch: command line (molconvert, cxcalc) Instant JChem Options: Strict IUPAC or traditional Timeout

Structure to name Already stable (focus on n2s) Comparison between and on NCI database (260K structures) Both over 99.9% named, 4% changed More fused names supported (60% to 66%): 5-methyl-6-azatricyclo[ ^{2,7}]tetradeca-1(10),2(7),3,5,8,11,13-heptaene 3-methylbenzo[f]quinoline Better support for ions (e.g. -olate) Stricter IUPAC numbering and priorities Overspecific E/Z labels removed

Name to structure Usage Edit/Import Name... in MarvinSketch Automatic format recognition Paste a name from the clipboard Open IUPAC Name file in MarvinView or MarvinSketch (.name extension) Batch from command line (molconvert)

Name to structure: evaluation Molecule->Name->Molecule on NCI 5.1.0: 90.0% names imported, 68.7% identical 5.2.2: 97.6% names imported, 94.1% identical 5.2.5: 97.8% names imported, 95.9% identical Pubchem data 5.1.0: 88.8% names imported, 94.0% identical 5.2.2: 98.3% names imported, 95.6% identical 5.2.5: 98.3% names imported, 96.1% identical Name to structure: +33% speed

Customization (new) Extend name-to-structure conversion using your in-house data Dictionary file Simple API: Database lookup Webservice lookup... Fully flexible

Document to structure (new) Goal Process documents containing text Recognize chemical names Convert them to structures Return locations, names and structures Formats Implemented: text, html, xml Planned: PDF, Doc,...

OCR error fixing (new) Scanned texts contain numerous recognition errors Examples: L (small L) instead of 1 I instead of l (Il, iL) Uses: By default in d2s Option in n2s (upcoming)

Chemicalize.org Adds structural information to existing public webpages Popup window with structure image Link to structure predictions (logP, pKa,...) Searchable structure->webpage index Could be installed natively on custom website, with custom features

Chemicalize.org

Recap Name-to-structure and structure-to-name available and improving fast Document-to-structure just released Extend using your in-house dictionaries and databases Try it, send feedback, we're listening!

Find out more Product descriptions & links Forum Presentations and posters Download