ER 2006 - Tucson Schema Mediated Exchange of Temporal XML Data Curtis Dyreson – Washington State University Richard T. Snodgrass – University of Arizona.

Presentation transcript:

ER 2006, Tucson. Schema Mediated Exchange of Temporal XML Data. Curtis Dyreson – Washington State University; Richard T. Snodgrass – University of Arizona; Sabah Currim – University of Arizona; Faiz Currim – University of Iowa

Scenario: Genomic data from NCBI. The data collection is growing and changing. We want both the data and its provenance (who, what, when, …).

Obtaining Web Data (diagram labels: NCBI; Request D; Write D; Overwrite D)

Data Evolves: Download XML-formatted data (as of January 1): trypsin 4. Download again (as of March 6): trypsin 4, beta-cell receptor.

Refreshing the Data (using SDOs)
  (Diagram labels: NCBI; Request updates to D since time t; Change summary of D; Update D; Copy D; D old; XMLDiff)
  What about versions between D and D old?
  Did I download “valid” data?
  My DB is pretty big…

Did I Download the Right Data? Validate against the schema. (Diagram labels: XML Data; Schema; Namespace; Validating Parser; Valid)

Fragment of the Genomic Schema … …

Uses of an XML Schema: validation; XML editors; guiding query formulation; query optimization; providing a web service binding.

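A Temporal Data Collection
  Request updates to D since time t; the response is ΔD [t, now].
  Validate the “delta” with the temporal schema; the cost is the size of the change.
  Extend the history of D (temporal D, from t to now).
  The temporal schema states which elements vary over time.
  (Diagram labels: NCBI; D; ΔD [t, now]; Temporal D; Temporal Schema)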

Outline: Motivation; τXSchema; Architecture; Summary

Goals for a Temporal Schema
  Make it easy to create a schema for temporal data: identify which data is temporal.
  Upwards compatibility: minimal extensions of XML Schema; reuse off-the-shelf parsers/tools.
  Support valid and transaction time, data (element) versioning, and schema versioning.
  Logical/physical independence: flexible timestamp representation and location.

Persistent Elements: An item is an element that persists across snapshots. Each item has an item identifier (like a temporally invariant key). January snapshot … March snapshot …
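The idea of an item identifier can be sketched in a few lines of Python. This is only an illustration: the `genes`/`gene` element names, the `id` attribute used as the identifier, and the two snapshot strings are hypothetical, since the slide's actual XML was lost in transcription.

```python
import xml.etree.ElementTree as ET

# Hypothetical snapshots; element and attribute names are illustrative only.
JANUARY = "<genes><gene id='g1'><name>trypsin 4</name></gene></genes>"
MARCH = (
    "<genes><gene id='g1'><name>trypsin 4</name></gene>"
    "<gene id='g2'><name>beta-cell receptor</name></gene></genes>"
)


def items_by_id(snapshot_xml, tag="gene", key="id"):
    """Map each item identifier to its element within one snapshot."""
    root = ET.fromstring(snapshot_xml)
    return {elem.get(key): elem for elem in root.iter(tag)}


january = items_by_id(JANUARY)
march = items_by_id(MARCH)

# The same identifier in both snapshots denotes the same item.
persisting = sorted(january.keys() & march.keys())
new_items = sorted(march.keys() - january.keys())

print(persisting)  # ['g1']
print(new_items)   # ['g2']
```

Because the identifier does not change over time, elements in different snapshots can be paired up without comparing their content.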

Extend a Snapshot Schema
  Specify which elements are temporal.
  Temporal elements have item identifiers and simple constraints (state/event, existence/content-varying).
  <txs:transactionTime kind="state" contentVarying="true" existenceVarying="no gaps"/>
  … definition of gene from the snapshot schema omitted for space …

Versions: A version is a change in an item (DOM inequivalence). January snapshot … March snapshot …
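One way to approximate "DOM inequivalence" is to compare canonical serializations of an item in two snapshots. The sketch below uses Python's standard `xml.etree.ElementTree.canonicalize` (C14N 2.0, Python 3.8+) as a stand-in; it is an assumption that this matches the equivalence test the authors use, and the sample `gene` documents are hypothetical.

```python
import xml.etree.ElementTree as ET


def canonical(xml_text):
    """Canonical form of a document, ignoring insignificant whitespace."""
    return ET.canonicalize(xml_text, strip_text=True)


def is_new_version(old_xml, new_xml):
    """True if the item's content differs between two snapshots."""
    return canonical(old_xml) != canonical(new_xml)


# Hypothetical item content from two snapshots.
OLD = "<gene id='g1' organism='human'><name>trypsin 4</name></gene>"
SAME_CONTENT = '<gene organism="human" id="g1">\n  <name>trypsin 4</name>\n</gene>'
CHANGED = (
    "<gene id='g1' organism='human'><name>trypsin 4</name>"
    "<ontology>beta-cell receptor</ontology></gene>"
)

print(is_new_version(OLD, SAME_CONTENT))  # False: only formatting differs
print(is_new_version(OLD, CHANGED))       # True: a child element was added
```

Differences in attribute order, quoting, or indentation do not create a version here; only a change in the item's content does.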

Temporal Genomic Data trypsin 4 …next version of gene… …ontology item…

Validating Temporal Data
  Snapshot data is validated with a snapshot schema.
  Construct a representational schema (details in paper).
  Can also validate the “delta”.
  (Diagram labels: XML Data; Snapshot Schema; Namespace; Validating Parser; Valid; Construction Process; Representational Schema; Temporal Data; Not valid)

Property of a “Good” Construction: Every snapshot must conform to the snapshot schema. (Diagram labels: Temporal data; Temporal Schema; (Temporal) Validating Parser; Valid; Snapshot at time T; Snapshot Schema; Validating Parser)
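The property can be sketched as a check over timeslices. Everything below is a toy model, not the τXSchema representation: temporal data is a hypothetical dictionary from item identifier to timestamped versions, the dates (including the year) are invented, and `conforms` is a stand-in for validation against a snapshot schema.

```python
from datetime import date

FOREVER = date.max  # stand-in for "now" / still current

# Hypothetical temporal data: item id -> list of (begin, end, content),
# where each version is current during the half-open interval [begin, end).
TEMPORAL = {
    "g1": [
        (date(2006, 1, 1), date(2006, 3, 6), {"name": "trypsin 4"}),
        (date(2006, 3, 6), FOREVER,
         {"name": "trypsin 4", "ontology": "beta-cell receptor"}),
    ],
}


def snapshot_at(temporal, t):
    """Extract the snapshot that was current at time t."""
    snap = {}
    for item_id, versions in temporal.items():
        for begin, end, content in versions:
            if begin <= t < end:
                snap[item_id] = content
    return snap


def conforms(snapshot):
    """Toy stand-in for snapshot-schema validation: every gene needs a name."""
    return all(content.get("name") for content in snapshot.values())


def change_points(temporal):
    """Times at which some version begins; snapshots are constant in between."""
    return sorted({begin for versions in temporal.values()
                   for begin, _, _ in versions})


good = all(conforms(snapshot_at(TEMPORAL, t)) for t in change_points(TEMPORAL))
print(good)  # True
```

Checking only the change points is enough in this model because the extracted snapshot cannot differ between two consecutive change points.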

Related Work – Temporal XML
  Change detection and management
    Nguyen, Abiteboul, Cobena, Preda, SIGMOD 2001
    Xyleme’s Alerter, described in Data Engineering Bulletin, 2001
    Dyreson, Lin, Wang, WWW 2004
    Leonardi, Bhowmick, ER 2006
  Representing time-varying XML documents (versioning)
    Chawathe, Abiteboul, Widom, ICDE 1998
    Dyreson, Böhlen, Jensen, VLDB 1999
    Chien, Tsotras, Zaniolo, VLDB 2000
    Marian, Abiteboul, Cobena, Mignet, VLDB 2001
    Buneman, Khanna, Tajima, Tan, SIGMOD 2002, TODS 2004
    Rosado, Marquez, Gonzalez, ECDM 2006
    XML Versioning Use Cases (W3C)

Related Work – XML Schemas
  XML Schema languages
    Many, but XML Schema is backed by the W3C
  Incremental XML validation
    Bouchou & Halfeld-Ferrari, DBPL 2003
    Papakonstantinou & Vianu, ICDT 2003
    Barbosa, Mendelzon, Libkin, Mignet, Arenas, ICDE 2004
  Temporal XML schemas
    Currim, Currim, Dyreson, Snodgrass, EDBT 2004
    Dyreson, Snodgrass, Currim, Currim, Joshi, XSDM 2006

An Overarching Vision
  Aspect-oriented programming: cross-cutting concerns; augment behavior without changing the code.
  Example aspects: logging, garbage collection.
  (Diagram labels: Program.java; Aspect.java; Cut points; weaver; Aspect Enhanced.java; javac)

Aspects for Data? What are the cross-cutting concerns? A milieu of metadata: security, time, reliability. Time is an aspect.

Aspects in Schema Design
  Schema for aspect + schema for data.
  Our paper describes the “plumbing” for a temporal aspect.
  (Diagram labels: data (snapshot) schema; aspect schema; schema weaver; schema tapestry; conventional validating parser; aspect validator; validation; aspect + XML data; snapshot gluer; snapshot data; imports schema)

Our Contributions
  Temporal schema specification: what is time-varying; some simple constraints.
  Validate temporal data at ΔD [t, now] cost.
  Upwards compatible with XML Schema.
  Handle schema evolution (Dyreson et al., XSDM ’06).
  Suite of tools: reuse and extend existing tools.

τXSchema Project Tools (Beta)
  VALIDATOR: validates a temporal XML document against conventional and temporal constraints.
  SQUASH: generates a temporal document from a sequence of snapshot documents.
  UNSQUASH: extracts snapshot documents from a temporal document.
  RESQUASH: changes a document's representation to be consistent with a new physical annotation.
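The behavior described for SQUASH can be sketched on a toy model. This is not the tool's actual algorithm or document format: snapshots are hypothetical dictionaries from item identifier to content, times are plain integers, and `None` as an end time means the version is still current.

```python
def squash(snapshots):
    """Merge time-ordered snapshots into timestamped item versions.

    snapshots: list of (time, {item_id: content}), sorted by time.
    Content values must not be None in this toy model.
    Returns {item_id: [(begin, end, content), ...]} with half-open
    intervals [begin, end); end is None for versions still current.
    """
    temporal = {}
    open_versions = {}  # item_id -> (begin, content) for current versions
    for time, snap in snapshots:
        # Close every open version whose item vanished or whose content changed.
        for item_id in list(open_versions):
            begin, content = open_versions[item_id]
            if snap.get(item_id) != content:
                temporal.setdefault(item_id, []).append((begin, time, content))
                del open_versions[item_id]
        # Open a version for every item that is new or has just changed.
        for item_id, content in snap.items():
            if item_id not in open_versions:
                open_versions[item_id] = (time, content)
    # Versions that survive the last snapshot are still current.
    for item_id, (begin, content) in open_versions.items():
        temporal.setdefault(item_id, []).append((begin, None, content))
    return temporal


# Hypothetical snapshot sequence.
SNAPSHOTS = [
    (1, {"g1": "v1"}),
    (2, {"g1": "v1", "g2": "a"}),
    (3, {"g1": "v2"}),
]
result = squash(SNAPSHOTS)
print(result["g1"])  # [(1, 3, 'v1'), (3, None, 'v2')]
print(result["g2"])  # [(2, 3, 'a')]
```

Unchanged content across consecutive snapshots is coalesced into one version, so the temporal form grows with the amount of change rather than with the number of snapshots.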