The Red Pill Roger Sayle, Geoff Skillman, Matthew Stahl Robert Tolbert OpenEye Scientific Software.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation JChem Cartridge for Oracle
Advertisements

Structural Search Using ChemAxon Tools
1 Szabolcs Csepregi May, 2005 Structural Search Using ChemAxon Tools.
1 György Pirok, Szilárd Dóránt May, 2005 What is Marvin and how to...
Solutions for Cheminformatics Marvin features and news Akos Papp.
1 caAdapter Jan 24, caAdapter The caAdapter is an open source tool that facilitates HL7 version 3 message building, parsing and validation based.
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
Software Connectors Software Architecture. Importance of Connectors Complex, distributed, multilingual, modern software system functionality and managing.
Didier Perroud Raynald Seydoux Frédéric Barras.  Abstract  Objectives  Modalities ◦ Project modalities ◦ CASE/CARE  Implementation ◦ VICI, Iphone,
Workflow Exchange and Archival: The KSW File and the Kepler Object Manager Shawn Bowers (For Chad Berkley & Matt Jones) University of California, Davis.
Software Connectors. Attach adapter to A Maintain multiple versions of A or B Make B multilingual Role and Challenge of Software Connectors Change A’s.
CS 497C – Introduction to UNIX Lecture 21: - The Shell Chin-Chih Chang
Establishing the overall structure of a software system
Mike Jackson EPCC OGSA-DAI Today Release 2.2 Principles and Architectures for Structured Data Integration: OGSA-DAI.
Software Architecture Patterns (2). what is architecture? (recap) o an overall blueprint/model describing the structures and properties of a "system"
Dr. Kalpakis CMSC 461, Database Management Systems Introduction.
November 2011 At A Glance GREAT is a flexible & highly portable set of mission operations analysis tools that increases the operational value of ground.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Messaging Technologies Group: Yuzhou Xia Yi Tan Jianxiao Zhai.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 11 Slide 1 Architectural Design.
CS451 Lecture 13: Architectural Design Chapter 10
Database Architecture Introduction to Databases. The Nature of Data Un-structured Semi-structured Structured.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
Architectural Design portions ©Ian Sommerville 1995 Establishing the overall structure of a software system.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
May 2009 ChemAxon - What’s New?. What’s new and hot? All products have seen enhancements in the past 12 months BUT WHAT’S REALLY HOT?
Standards for Digital Data Representation 1) The IUPAC/NIST Chemical Identifier 2) IUPAC Terminology NSF Workshop Constructing a Kinetics Database NIST,
Kinemage; Rasmol and Chime C371 Chemical Informatics.
Faculty Advisor – Dr. Suraj Kothari Client – Jon Mathews Team Members – Chaz Beck Marcus Rosenow Shaun Brockhoff Jason Lackore.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Scientific Applications of XML Arvind Hulgeri, Shantanu Godbole
Term 2, 2011 Week 1. CONTENTS Problem-solving methodology Programming and scripting languages – Programming languages Programming languages – Scripting.
Applications I: Partial Charges and Potential Energy Surface Scans Lecture CompChem 5 Chemistry 347 Hope College.
_______________________________________________________________CMAQ Libraries and Utilities ___________________________________________________Community.
Python and Chemical Informatics The Daylight and OpenEye toolkits, part II Presented by Andrew Dalke, Dalke Scientific Software for David Wild’s I590 course.
Interoperable sharing of groundwater data across international boarders is essential for the proper management of global water resources. However storage.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Mobile Data Management Ken Rimey : Personal Distributed Information Store (PDIS) 2005-: Too Many Boxes - end-user digital asset management for.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Develop Use Cases Evaluate Existing Models Develop/Extend Model Test ModelDocument 1. Commercial This use-case involves identifying the location and properties.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Operating Systems Lecture 10. Agenda for Today Review of previous lecture Input, output, and error redirection in UNIX/Linux FIFOs in UNIX/Linux Use of.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Introduction to Information and Computer Science
Faculty Advisor – Dr. Suraj Kothari Client – Jon Mathews Team Members – Chaz Beck Marcus Rosenow Shaun Brockhoff Jason Lackore.
Software Connectors Acknowledgement: slides mostly from Software Architecture: Foundations, Theory, and Practice; Richard N. Taylor, Nenad Medvidovic,
Copyright © Richard N. Taylor, Nenad Medvidovic, and Eric M. Dashofy. All rights reserved. Software Connectors in Practice Software Architecture.
Chapter – 8 Software Tools.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
APRIL 10, Meeting Agenda  Prototype 2 Goals  Robust Connections Demo  System Diagnostics Tool Demo  Final Prototype Risk Mitigation  Final.
APRIL 10, Meeting Agenda  Prototype 2 Goals  Robust Connections Demo  System Diagnostics Tool Demo  Final Prototype Risk Mitigation  Final.
Software Connectors. What is a Software Connector? 2 What is Connector? – Architectural element that models Interactions among components Rules that govern.
Faculty Advisor – Dr. Suraj Kothari Client – Jon Matthews Team Members – Chaz Beck Marcus Rosenow Shaun Brockhoff Jason Lackore Importance of SoftwareProject.
Molecular Dynamics Analysis Toolkit Karl Debiec and Nick Rego Chong Group Department of Chemistry August 30 th 2013.
System SOFTWARE.
Distributed Databases
Web Ontology Language for Service (OWL-S)
Object-Orientated Programming
Dr. Bhavani Thuraisingham The University of Texas at Dallas
Section 4.1 Inverse Functions.
Internal components of a computer.
Presentation transcript:

The Red Pill Roger Sayle, Geoff Skillman, Matthew Stahl Robert Tolbert OpenEye Scientific Software

Integration The process of computing an integral; the inverse of differentiation.

Integration The organization of the psychological or social traits and tendencies of a personality into a harmonious whole.

Data Integration Merge (data) into a [harmonious] whole  Chaining data generation  Extensible data storage

OEChem Programming toolkit Python/C++ API's Public API Precise handling of chemistry Multiple models of chemistry  Aromaticity  Atom types  Valence models  Query semantics

Perception Kekule form Aromaticity (Daylight, Tripos, Merck, MDL, OpenEye) Atom types Topological symmetry Stereochemistry (tetrahedral, cis/trans) Partial charges Biomonomers recognition Bond orders from coordinates

Aromaticity Models Yes No Yes No YesYes/NoNo YesNoN/A No Yes No OpenEyeDaylightMMFFTriposMDL

Data Integration Merge (data) into a [harmonious] whole  Chaining data generation  Extensible data storage

Chaining Data Generation Software ASoftware BData -Challenging in a heterogeneous software environment -Lossless data conversion -Feature perception

Extensible Data Storage Source Data

Question How often do people (mis)use SD files for attaching data to molecules?

Extensible Data Structures Python: atom.SetStringData(“Spam”,”Eggs”) atom.GetStringData(“Spam”) C++: class Foo {}; Foo foo; mol.SetData(“VeryNiceData”,foo); mol.GetData (“VeryNiceData”);

Chemical EXchange An interchange language to enable components to communicate Model similar to Unix pipes and single purpose commands CEX stream contains objects (molecule, message) Extensible named property/value pairs Each component in the CEX pipeline can read some objects and properties from the input stream and add new ones to the output stream

OEBinary V2 Extensible tag/data format Heirarchical Persistent objects (automatic for POD types) Dynamic data parsing Efficient storage of conformers Ideal for storage as BLOB Lossless data storage possible Definition publicly available

Conclusions I have no idea what 'data integration' really means OEChem maintains the integrity of chemical data Extensible persistent data structures likely facilitate data integration OEChem provides extensible persistent data structures OEChem likely facilitates data integration

Acknowledgments Geoff Skillman Bob Tolbert Roger Sayle AstraZeneca Pharmaceuticals Vertex Pharmaceuticals