Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

OASIS OData Technical Committee. AGENDA Introduction OASIS OData Technical Committee OData Overview Work of the Technical Committee Q&A.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Chapter 10: Designing Databases
Data Modeling and Database Design Chapter 1: Database Systems: Architecture and Components.
Introduction to Databases
Management Information Systems, Sixth Edition
Oct 31, 2000Database Management -- Fall R. Larson Database Management: Introduction to Terms and Concepts University of California, Berkeley School.
File Systems and Databases
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
Confidential ODBC May 7, Features What is ODBC? Why Create an ODBC Driver for Rochade? How do we Expose Rochade as Relational Transformation.
VxOware Progress Report August How to create a new section? Configure section –Create metadata structure (template) –Create elements map for web.
IST Databases and DBMSs Todd S. Bacastow January 2005.
January, 23, 2006 Ilkay Altintas
Semantic Sensor/Device Description System EEEM042-Mobile Applications and Web Services Assignment- Spring Semester 2015 Prof. Klaus Moessner, Dr Payam.
Ricerca Distribuita Semantica Protocolli opensource per la condivisione di risorse online.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Week 1 Lecture MSCD 600 Database Architecture Samuel ConnSamuel Conn, Asst. Professor Suggestions for using the Lecture Slides.
Robert Sharpe, Tessella PRELIDA Workshop 2013 ENSURE Linked Data Registry.
DBMS Lecture 9  Object Database Management Group –12 Rules for an OODBMS –Components of the ODMG standard  OODBMS Object Model Schema  OO Data Model.
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
Austin code camp 2010 asp.net apps with azure table storage PRESENTED BY CHANDER SHEKHAR DHALL
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Concepts and Terminology Introduction to Database.
Project Overview Bibliographic merging, Endeca, and Web application.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
ABC Insurance Co. Paul Barry Steve Randolph Jing Zhou CSC8490 Database Systems & File Management Dr. Goelman Villanova University August 2, 2004.
C++ Implementation ( Version 1 – Text Interface ) Elimination of services of our system. Elimination of services of our system. General Flow of the program.
Data Management Console Synonym Editor
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
1 Design Issues in XML Databases Ref: Designing XML Databases by Mark Graves.
What’s new in Kentico CMS 5.0 Michal Neuwirth Product Manager Kentico Software.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Keywords Searching and Analysis System Member Student ID Role 刘亮 Liu Liang System Analyst 顾子俊 Gu Zijun Developer 杜菡菡 Du Hanhan
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
1 CS 430 Database Theory Winter 2005 Lecture 2: General Concepts.
Prepared By Prepared By : VINAY ALEXANDER ( विनय अलेक्सजेंड़र ) PGT(CS),KV JHAGRAKHAND.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
2) Database System Concepts and Architecture. Slide 2- 2 Outline Data Models and Their Categories Schemas, Instances, and States Three-Schema Architecture.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
SERIALIZED DATA STORAGE Within a Database James Devens (devensj)
AIRS Meeting GSFC, February 1, 2002 ECS Data Pool Gregory Leptoukh.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Fundamental of Database Systems
Databases and DBMSs Todd S. Bacastow January
Databases (CS507) CHAPTER 2.
Semantic Database Builder
Building Search Systems for Digital Library Collections
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
File Systems and Databases
Database Systems Instructor Name: Lecture-3.
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Database Design Hacettepe University
UNIT-I Introduction to Database Management Systems
Database management systems
Standardizing and industrializing a business process – the dissemination use case Alessio Cardacino - ESTP Course “Information standards.
Presentation transcript:

Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik

Glyco Vault Serves as a storage for files with experiment results, item descriptions, ontologies, etc. Data is heterogenous. Amount of data will be massively increasing. Hierarchical storage (as file system) is not enough – users need to add annotations. Users should be able to find interesting data (relatively) easy.

Interface functionality (basic) What we expect from Glyco Vault? Insert file to GV Get file from GV Delete specific file from GV Annotate with metadata Get file annotations Find file(s)

Sample file properties in GV Basic ID (unique) Name Date created User (owner) File type (InForma) Modification record … Specific (extended) Machine used for experiment Associated workflow Names of used samples Version number Other linked files …

File description (metadata) We cannot limit properties to currently known – schema must be flexible to accommodate future changes. File ID: GV-123 Name: ABC.txt Version: 9 Created: Owner: Will Machine: PCR Tissue: Parrot …

Ontological File System for GV File GV-123 ABC.txt Will PCR Parrot Version Machine ID Creation date Owner Name Tissue Binaries Ontology with File-centric view. File is connected to its properties and other Files

Protocol data in GlycoVault Prepare samples Final result Statistical Analysis Get raw data Run PCR machine Description of samples Well assignment Person preparing Date/time of experiment PCR configuration Machine description (use of resource) Raw results (binaries) Raw results (values with annotation) Intermediate analytic data Final values of fluorescence Assigned to specific genes

Ontologies for annotation UserDefined attributes ProPreO ProtocolSchema GlycO and others … CoreFileOnt Core descriptions for all stored files (id, name, user, date …) Descriptions of used protocols and produced data Processes Data provenance Machines, etc. Data-specific ontologies Attributes and relations defined by users (not captured by other ontologies)

Insert and annotate file Insert simple file with its type FileID insert(Object aBinaries, InForma aType) Manage annotations boolean annotate(FileID aFile, Annotation aMetadata) boolean removeAnnotation(FileID aFile, Annotation aMetadata) Annotations Relation  Resource (object) Relation  Literal (value) Use of PropreO ontolgy (in some part) Add standard, independent file metadata

Get file and its annotations Get raw data file from GV Object getFile(FileID aFile) return binaries associated with file stored in Glyco Vault Get file descriptions Annotation[] describe(FileID aFile) return all direct annotations of a file to display for user Get file in specific format using InForma transformation should be external to GV

Search for files Find file by annotations In simple form, describe constraints directly associated with file FileID[] find(AnnotationConstraint[] aConstraints) Annotation constraint includes relation, value object and/or filter on value SPARQL queries with constraints in file ontology for complex cases GUI query builder/wizard with file metadata

Search for files Search is essential part of system and most complicated one. Efficient (DB/KB performance) Easy for users (GUI, Web) Browsing methods should be also added for easer navigation Idea: Faceted type browsing for files, eg. Use of Leon’s tool for browsing files and their descriptions

Interface (initial proposal) FileID insert(Object aBinaries, InForma aType) boolean annotate(FileID aFile, Annotation aMetadata) boolean removeAnnotation(FileID aFile, Annotation aMetadata) Object getFile(FileID aFile) Annotation[] describe(FileID aFile) FileID[] find(AnnotationConstraint[] aConstraints) boolean delete(FileID aFile) Type: Annotation Type: AnnotationConstraint

File ontology access Access to file ontology is needed to construct proper annotation Use of defined resources, eg. PCR Machine Creation of resources Specific instances Classes and hierarchies Listing available types and/or formats for annotations Possible access: Choice from pre-populated list Result of ‘can query’ Suggest as you type (Lucene index) List of classes with attributes and individuals (LSDIS library)

Administrators File system uses ontology, so administrators must be able to modify existing ontology to handle new types of annotations Maybe creation of some classes and instances can be pushed to specific users? During development easiest is to use direct ontology access Direct insert/delete of triples Use of SPARQL Later  some GUI interface would be required

GlycoVault on Oracle and Jena SQL Oracle platform API Jena Data (ont) Data (bin) JDBC read only full JDBC (administrator) SPARQL Ont Management experiment tables D2R server

GlycoVault – hybrid access Access via JDBC and SPARQL Ontologies will have a pointer to SQL data, can be a query, how to get real values (?) Data will be duplicated between SQL tables and ontology. What should be granularity of ontology data? JDBC access is read only for consistency purposes Modification allowed only for administrators