EBI is an Outstation of the European Molecular Biology Laboratory. Chemoinformatics and Metabolism Paula de Matos.


Chemoinformatics and Metabolism Group Research
Indexing, searching and dissemination of chemical information
Cheminformatics Algorithms and Toolkits
Natural Products and Metabolomics

Chemical Entities of Biological Interest (ChEBI)
A database containing a freely available, manually annotated dictionary of molecular entities focused on 'small' chemical compounds.
Provides a method to navigate the chemical space via an ontology.
ChEBI aims to provide a central, definitive reference of chemical nomenclature.

Dictionary Resource for Nomenclature

What does ChEBI cover?
Mostly small entities
Big entities too, like alumina, amylose, metaborate
Excludes proteins and nucleic acids


ChEBI Web Services
Programmatic access to a ChEBI entry
SOAP-based Java implementation
Clients currently available in Java and Perl
Four methods with which to access data: getLiteEntity, getCompleteEntity, getOntologyParents, getOntologyChildren
Documented at
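To make the SOAP access pattern concrete, here is a minimal sketch of how a client might assemble a request for one of the four methods, getCompleteEntity. It only builds the XML envelope and sends nothing. The namespace `urn:example:chebi-webservice` and the `chebiId` element name are assumptions made for illustration; the real values come from the service's WSDL, which the slide points to but does not reproduce. CHEBI:15377 (water) is used as a sample identifier.

```java
public class ChebiSoapSketch {
    // Hypothetical namespace: the real one is defined in the service's WSDL.
    static final String ASSUMED_NS = "urn:example:chebi-webservice";

    /** Escapes the characters that would break element content. */
    static String escape(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    /**
     * Builds a SOAP 1.1 envelope that calls getCompleteEntity.
     * The chebiId element name is an assumption, not taken from the slides.
     */
    static String buildGetCompleteEntity(String chebiId) {
        return "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
            + "<soapenv:Envelope"
            + " xmlns:soapenv=\"http://schemas.xmlsoap.org/soap/envelope/\""
            + " xmlns:ws=\"" + ASSUMED_NS + "\">"
            + "<soapenv:Body>"
            + "<ws:getCompleteEntity>"
            + "<ws:chebiId>" + escape(chebiId) + "</ws:chebiId>"
            + "</ws:getCompleteEntity>"
            + "</soapenv:Body>"
            + "</soapenv:Envelope>";
    }

    public static void main(String[] args) {
        // Print the request body a client would POST to the service endpoint.
        System.out.println(buildGetCompleteEntity("CHEBI:15377"));
    }
}
```

In practice the Java and Perl clients mentioned on the slide would hide this envelope behind generated stubs; the sketch only shows what travels over the wire under the stated assumptions.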

ChEBI Status

ChEBI further info
Mailing lists:
Submitting data

The Chemistry Development Kit (CDK): An Open Source Java-Library for Structural Chemo- and Bioinformatics
> Lines of Code, > 900 Classes, > 9000 Methods
Library Generation, Virtual Screening, Molecular Property Prediction, Visualization
(1) Steinbeck, C.; Hoppe, C.; Kuhn, S.; Guha, R.; Willighagen, E. L. Current Pharmaceutical Design 2006, 12,
(2) Steinbeck, C.; Han, Y. Q.; Kuhn, S.; Horlacher, O.; Luttmann, E.; Willighagen, E. Journal of Chemical Information and Computer Sciences 2003, 43,

The Chemistry Development Kit (CDK)
Input/Output: I/O (CML, MDL Molfile, SDF, PDB), SMILES, InChI
Visualization: Structure-Diagram-Layout (SDG), 2D Rendering, 3D Rendering
Modelling: 3D Model-Builder, Atom-Typing, Force-Field, Representation of Biomolecular Structures
Chemical Graphs: Isomorphism detection, Maximum-Common-Substructure Searches, SMARTS- and Substructure searches, Ring searches, Aromaticity detection
Deterministic Isomer generator, Stochastic Structure Generators via Simulated Annealing, Genetic Algorithms, Library Enumeration
Properties: Fingerprinting, > 70 QSAR-Descriptors, QSAR model building

Example: Structure Diagram Generation

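Example: Fingerprinting
Bitscreen coding for structural features (figure labels: COOH, heteroaryl, O-alkyl, NH2, alkyl)

    import java.util.BitSet;
    import org.openscience.cdk.fingerprint.Fingerprinter;
    import org.openscience.cdk.fingerprint.FingerprinterTool;
    import org.openscience.cdk.interfaces.IMolecule;
    import org.openscience.cdk.templates.MoleculeFactory;

    // Indole contains a pyrrole ring, so pyrrole's bits should all be set in indole's fingerprint.
    IMolecule superstructure = MoleculeFactory.makeIndole();
    IMolecule substructure = MoleculeFactory.makePyrrole();
    Fingerprinter fingerprinter = new Fingerprinter();
    BitSet superBS = fingerprinter.getFingerprint(superstructure);
    BitSet subBS = fingerprinter.getFingerprint(substructure);
    boolean isSubset = FingerprinterTool.isSubset(superBS, subBS);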
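The subset test at the end of the fingerprinting example can be illustrated without the CDK. The sketch below is a plain `java.util.BitSet` version of the idea, not the CDK's own code: a candidate can only contain the query as a substructure if every bit set in the query fingerprint is also set in the candidate fingerprint. The bit positions are made up for illustration, and the argument order mirrors the slide's `isSubset(superBS, subBS)` call.

```java
import java.util.BitSet;

public class FingerprintScreenSketch {
    /** Returns true when every bit set in query is also set in candidate. */
    static boolean isSubset(BitSet candidate, BitSet query) {
        BitSet missing = (BitSet) query.clone();
        // Clear every query bit the candidate also has; anything left is a
        // structural feature the query needs but the candidate lacks.
        missing.andNot(candidate);
        return missing.isEmpty();
    }

    /** Convenience helper: builds a BitSet with the given positions set. */
    static BitSet bits(int... positions) {
        BitSet set = new BitSet();
        for (int position : positions) {
            set.set(position);
        }
        return set;
    }

    public static void main(String[] args) {
        // Invented bit positions standing in for real fingerprints.
        BitSet indoleLike = bits(3, 17, 42, 101);
        BitSet pyrroleLike = bits(17, 42);

        System.out.println(isSubset(indoleLike, pyrroleLike)); // true
        System.out.println(isSubset(pyrroleLike, indoleLike)); // false
    }
}
```

A passing screen does not prove a substructure match, since different features can hash to the same bit; it only rules out candidates cheaply before a full graph comparison.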

CDK in numbers
registered developers on SourceForge (SF)
86 people subscribed to cdk-devel list
111 people subscribed to cdk-user list
,966 downloads since 2001
CDK article (2003) cited 68 times

CDK info
Project home page:
Mailing list:
Documentation

OrChem
Oracle chemistry plug-in using the Chemistry Development Kit (CDK), providing substructure and similarity searches for chemical graphs.
OrChem is suitable for Oracle 11g and onwards.
Not an Oracle data cartridge: it doesn't need Oracle's extensibility architecture because its Java components run as Java stored procedures inside the Oracle standard JVM (Aurora).

Problem
Chemical substructure or similarity searching is computationally expensive, especially on a large dataset.

OrChem database structure

Example OrChem Queries

Similarity search:

    select * from table(
      orchem_simsearch.search(
        'OC4=C(C(=C3OC(C)(COC=1C=CC(=CC=1)CC2C(=O)NC(=O)S2)CCC3=C4C)C)C', 'SMILES', 0.8, null, 'N'
      )
    );

Substructure search:

    select orchem_subsearch.search(molfile, 'MOL', 50, 'Y')
    from compounds
    where molregno = 12345;
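The similarity query passes 0.8 as a numeric argument, which reads like a similarity cutoff. The slides do not say how the score is computed, so the sketch below assumes the Tanimoto coefficient, the usual measure for comparing bit fingerprints: the number of bits two fingerprints share divided by the number of bits set in either. It is written against plain `java.util.BitSet` with invented bit positions and is not OrChem's implementation.

```java
import java.util.BitSet;

public class TanimotoSketch {
    /** Tanimoto coefficient: shared bits divided by bits set in either fingerprint. */
    static double tanimoto(BitSet a, BitSet b) {
        BitSet common = (BitSet) a.clone();
        common.and(b);
        int shared = common.cardinality();
        int union = a.cardinality() + b.cardinality() - shared;
        // Two empty fingerprints share nothing; report 0 rather than divide by zero.
        return union == 0 ? 0.0 : (double) shared / union;
    }

    /** Assumed meaning of the cutoff: keep candidates scoring at or above it. */
    static boolean passesCutoff(BitSet query, BitSet candidate, double cutoff) {
        return tanimoto(query, candidate) >= cutoff;
    }

    /** Convenience helper: builds a BitSet with the given positions set. */
    static BitSet bits(int... positions) {
        BitSet set = new BitSet();
        for (int position : positions) {
            set.set(position);
        }
        return set;
    }

    public static void main(String[] args) {
        BitSet query = bits(1, 2, 3, 4, 5);
        BitSet close = bits(1, 2, 3, 4, 5, 6); // 5 shared bits, 6 in the union
        BitSet far = bits(1, 2, 3, 4, 6);      // 4 shared bits, 6 in the union

        System.out.println(passesCutoff(query, close, 0.8)); // true
        System.out.println(passesCutoff(query, far, 0.8));   // false
    }
}
```

Under this assumption, raising the cutoff shrinks the candidate set, which is one reason a high threshold keeps similarity searches over millions of compounds tractable.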

Fingerprint distribution

Parallel vs. non-parallel
Performance of substructure search on 3.5 million compounds

Substructure benchmarking
Performance of substructure search on 3.5 million compounds

Similarity Benchmarking

OrChem info
Mailing list: