Migrating to DayCart Introduction Thor database concepts. Data (Chemical Structure) hierarchy. Thor data model Daylight/Oracle cartridge data model.

Slides:



Advertisements
Similar presentations
Enhanced Data Description for End Users ScribeKey, LLC Brian Hebert, Solutions Architect
Advertisements

Database management system (DBMS)  a DBMS allows users and other software to store and retrieve data in a structured way  controls the organization,
CSCI3170 Introduction to Database Systems
C6 Databases.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Day 3 - Basics of MySQL What is MySQL What is MySQL How to make basic tables How to make basic tables Simple MySQL commands. Simple MySQL commands.
Let’s try Oracle. Accessing Oracle The Oracle system, like the SQL Server system, is client / server. For SQL Server, –the client is the Query Analyser.
Quick-and-dirty.  Commands end in a semi-colon ◦ If you forget, another prompt line shows up  Either continue the command or…  End it with a semi-colon.
Winter 2002Arthur Keller – CS 1808–1 Schedule Today: Jan. 29 (T) u Modifications, Schemas, Views. u Read Sections Assignment 3 due. Jan. 31 (TH)
CSE 190: Internet E-Commerce Lecture 10: Data Tier.
7/2/2015Murali Mani -- CS5421 Database Management Systems DB Application Development Project Statement + Introduction to Oracle.
Introduction To Databases IDIA 618 Fall 2014 Bridget M. Blodgett.
©HCCS & IBM® 2008 Stephen Linkin1 Database Management Systems On z/OS Stephen S. Linkin Houston Community College © HCCS and IBM 2008.
How a little code can help with support.. Chris Barba – Developer at Cimarex Energy Blog:
Winter 2006Keller Ullman Cushing8–1 Turning in Assignments Please turn in hard copy (use only in the direst of circumstances). I am not your secretary.
Daybase (DayCart) Introduction What is ‘Daybase’ ? Oracle Schema Objects. Oracle Datatypes. Simple Example. Demo.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
Chapter 2: SQL – The Basics Objectives: 1.The SQL execution environment 2.SELECT statement 3.SQL Developer & SQL*Plus.
Relational Databases Database Driven Applications Retrieving Data Changing Data Analysing Data What is a DBMS An application that holds the data manages.
Copyright © Curt Hill The Relational Model of Database Basic organization and terms.
What is a schema ? Schema is a collection of Database Objects. Schema Objects are logical structures created by users to contain, or reference, their data.
Oracle Dependencies Analyzer ODA Over time, in large companies we see many Legacy systems that work with several Databases, this.
Transforming TurboIMAGE Data for Eloquence, Oracle, and More By Bob Green, Robelle
CIS 210 Systems Analysis and Development Week 6 Part II Designing Databases,
ChemBank Building a Public Web Resource Using Daycart Erik Brauner Head of Chemical and Biological Computing Harvard Institute of Chemistry and Cell Biology.
SCUHolliday - coen 1788–1 Schedule Today u Modifications, Schemas, Views. u Read Sections (except and 6.6.6) Next u Constraints. u Read.
Python and Chemical Informatics The Daylight and OpenEye toolkits, part II Presented by Andrew Dalke, Dalke Scientific Software for David Wild’s I590 course.
Prepared By Prepared By : VINAY ALEXANDER ( विनय अलेक्सजेंड़र ) PGT(CS),KV JHAGRAKHAND.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 3rd Edition Copyright © 2009 John Wiley & Sons, Inc. All rights.
Oracle & SQL Introduction. Database Concepts Revision DB? DBMS? DB Application? Application Programs? DBS? Examples of DBS? Examples of DBMS? 2Oracle.
SQL Fundamentals  SQL: Structured Query Language is a simple and powerful language used to create, access, and manipulate data and structure in the database.
Prepared by The Smartpath Information Systems
MySQL More… 1. More on SQL In MySQL, the Information Schema is the “Catalog” in the SQL standard SQL has three components: Data definition Data manipulation.
Visual Programing SQL Overview Section 1.
D Copyright © Oracle Corporation, All rights reserved. Loading Data into a Database.
Gollis University Faculty of Computer Engineering Chapter three: Data Definition Instructor: Mukhtar M Ali “Hakaale” BCS.
Starting with Oracle SQL Plus. Today in the lab… Connect to SQL Plus – your schema. Set up two tables. Find the tables in the catalog. Insert four rows.
Oracle & SQL. Oracle Data Types Character Data Types: Char(2) Varchar (20) Clob: large character string as long as 4GB Bolb and bfile: large amount of.
CS320 Web and Internet Programming SQL and MySQL Chengyu Sun California State University, Los Angeles.
Database Design: Solving Problems Before they Start! Ed Pollack Database Administrator CommerceHub.
Understanding Core Database Concepts Lesson 1. Objectives.
Geog. 314 Working with tables.
Introduction To Oracle
CS320 Web and Internet Programming SQL and MySQL
Data Definition and Data Types
CSCI-100 Introduction to Computing
CS1222 Using Relational Databases and SQL
DATABASE MANAGEMENT SYSTEM
1 Demand of your DB is changing Presented By: Ashwani Kumar
Database.
CS1222 Using Relational Databases and SQL
Teaching slides Chapter 8.
Physical Database Design
Defining a Database Schema
Databases.
Intro to Relational Databases
Database Systems Instructor Name: Lecture-3.
Final Review Topics Chapter 4 SQL,
CS1222 Using Relational Databases and SQL
CS1222 Using Relational Databases and SQL
CS3220 Web and Internet Programming SQL and MySQL
The ultimate in data organization
CS3220 Web and Internet Programming SQL and MySQL
CS1222 Using Relational Databases and SQL
Turn on spool and save to file a.txt
Understanding Core Database Concepts
Database Instructor: Bei Kang.
eSeries Entities By Julie Ladner
CS1222 Using Relational Databases and SQL
02 | Mastering Your Data Graeme Malcolm | Data Technology Specialist, Content Master Pete Harris | Learning Product Planner, Microsoft.
Presentation transcript:

Migrating to DayCart

Introduction Thor database concepts. Data (Chemical Structure) hierarchy. Thor data model Daylight/Oracle cartridge data model. Other Considerations. What are the steps. Demo.

Thor Database Concepts Datatrees, Datatypes, Dataitems and Datafields. –These four concepts are closely related. Datatrees - Is the method for representing chemical information. Datatypes - A set of definition that indicate the meaning of a dataitems fields. Dataitems - Tag names a datatype, which in turn defines the meaning of each datafield. Datafields - A string of characters; the unit of data.

Data (Chemical Structure) hierarchy Data stored for each chemical entity are organized on a hierarchical basis with each entity expressed in terms of Parent, Version and Preparation(Oracle only). Parent is the basic structure which is free of salts, solvates and radio labels. Version is the translation of a Parent into an actual compound, i.e. a free base, salt, solvates, etc... Preparation being a discrete batch of that Version.

Data (Chemical Structure) hierarchy Parent Version 1Version 2Version n Prep Thor Database, with all molecule information

Daylight Thor data tree model SMILES CC(C)C(O)C1CCCCC1 FP Timestamp Graph CC(C)C(O)C1CCCCC1 DB_NO 873 PISM DB_NO 97 VISM ATOM_STER 2,1,1,4,1,7,1,10,1 SALT 1,10

Thor data tree in lexical form $SMI $DB_NO PISM ATOM_STER $DB_NO PISM ATOM_STER BOND_STER ISO $DB_NO VISM SALT.

Daylight /Oracle data model(1) Table:R_PARENT NameNull?Type DB_NONOT NULLNUMBER(7) SMILESNOT NULLVARCHAR(700) ATOM_STERVARCHAR(200) BOND_STERVARCHAR(200) ISOVARCHAR(200) MIXT_INFOVARCHAR(200) CSTVARCHAR(200) Table:R_VERSION NameNull?Type DB_NONOT NULLNUMBER(7) SMILESNOT NULLVARCHAR(700) PARENT_DB_NONOT NULLNUMBER(7) SALT_INFOVARCHAR(80)

Daylight/Oracle data model(2) Table:R_PARENT NameNull?Type DB_NONOT NULLNUMBER(7) SMILESNOT NULLVARCHAR(700) Table:R_PARENT_INFO NameNull?Type DB_NONOT NULLNUMBER(7) DATA_TYPENOT NULL NUMBER(2) DATA_INFONOT NULL VARCHAR(200) Table:R_VERSION NameNull?Type DB_NONOT NULLNUMBER(7) SMILESNOT NULLVARCHAR(700) PARENT_DB_NONOT NULLNUMBER(7) SALT_INFOVARCHAR(80)

Daylight/Oracle Cartridge model Prep_1 DB_NO Data Version_DB_NO Prep_2 Prep_3 Prep_2 Prep_1 Prep_2 Parent DB_NO Data SMILES Version_1 DB_NO Data Parent_DB_NO SMILES Data Version_2 DB_NO Parent_DB_NO SMILES DB_NO Data Parent_DB_NO SMILES Version_n Data

Other Considerations 2D coordinates or connection table. –2D coordinates as in Thor. –Connection table (Molfile, RXNfile, TGFfile … etc) Data conversion for input/search. –Nitro groups (charge separated, double bond or dont care). –Parent and salt molecules. Indexes for normal column data. Indexes for chemical structure data –i.e. ddexact, ddgraph, ddblob … etc.

Other Considerations SMILES column size (what to use?). –700 bytes or less, Oracle will allow you to put unique indexes and blob-based index. This may or may-not be important to you. –greater than 700 bytes you can use blob-based index.

What are the steps Design the data model. –Consider input, access and modification of your data. –Consider how chemical data is going to be searched. Design the database schema. Export data from Thor (or use SD files). Create PL/SQL program to load the data. If the data is formatted you may be able to use sqlldr command to load the tables. For data from SD files use mol2smi procedure.

DEMO

Thanks John Bradshaw Jack Delany Daylight team Michael Wainwright (Modgraph)