XPERANTO: A Middleware for Publishing Object-Relational Data as XML Documents Michael Carey Daniela Florescu Zachary Ives Ying Lu Jayavel Shanmugasundaram.

Presentation transcript:

XPERANTO: A Middleware for Publishing Object-Relational Data as XML Documents Michael Carey Daniela Florescu Zachary Ives Ying Lu Jayavel Shanmugasundaram Eugene Shekita Subbu Subramanian

Outline Motivation XPERANTO Architecture Query Processing Conclusion

XML Example Addison-Wesley Timbuktu

What is the big deal about XML? Elegantly models complex, hierarchical/graph-structured data. Domain-specific tags (unlike HTML). Simple! As a result, it is fast emerging as the dominant standard for data exchange on the WWW.

Why Object-Relational Data? Most business data is stored in object-relational databases. This is unlikely to change in the near future –Scalability, Reliability, Performance, Tools. Hence the need for a flexible and efficient way to publish object-relational data as XML.

Example tables. Book (BookId, BookName, PubId, Price): 1843, Selfish Ev..; Ice Station.. Author (BookId, FirstName, LastName): 1843, Charles, Darwin; 1843, Richard, Dawkins; Alistar, McLean. Publisher (PubId, PubName, Address): 20, Addison-Wesley, Timbuktu; 39, Prentice-Hall, USA.

XML View for Partners Addison-Wesley Prentice-Hall

Allow Partners to Query View: Get books written by Darwin. WHERE Darwin ELEMENT_AS $b IN BooksXMLView CONSTRUCT $b

The Problem (diagram labels: O-R DBMS; The Internet; Queryable XML Views; XML Query; XML Results)

Outline Motivation XPERANTO Architecture Query Processing Conclusion

What is XPERANTO? Xml Publishing of Entities, Relationships ANd Typed Objects. Middleware providing queryable XML views over any object-relational database.

XPERANTO Guiding Principles: Future web application developers may not adore SQL like we do (unbelievable!!!). XPERANTO allows them to create and use XML views in pure XML terms –Default XML view of the ORDBMS –Can create more complex views using an XML query language

Example tables. Book (BookId, BookName, PubId, Price): 1843, Selfish Ev..; Ice Station.. Author (BookId, FirstName, LastName): 1843, Charles, Darwin; 1843, Richard, Dawkins; Alistar, McLean. Publisher (PubId, PubName, Address): 20, Addison-Wesley, Timbuktu; 39, Prentice-Hall, USA.

Default XML View 1843 Selfish Evolution Ice Station Zebra … … …
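The slide's markup did not survive extraction, so the following is only a minimal sketch of how a default XML view could be derived mechanically from tables. It assumes one element per table, a `row` element per tuple, and one sub-element per column. The `db` and `row` element names and the `default_xml_view` helper are hypothetical, not XPERANTO's actual output format.

```python
import xml.etree.ElementTree as ET

def default_xml_view(tables):
    """Map {table_name: (column_names, rows)} to an XML tree.

    Assumed layout (not taken from the slides): <db> holds one element
    per table, which holds one <row> per tuple, which holds one
    sub-element per column.
    """
    root = ET.Element("db")
    for table_name, (columns, rows) in tables.items():
        table_el = ET.SubElement(root, table_name)
        for row in rows:
            row_el = ET.SubElement(table_el, "row")
            for column, value in zip(columns, row):
                ET.SubElement(row_el, column).text = str(value)
    return root

# Sample rows borrowed from the slides' example data.
tables = {
    "book": (["bookid", "bookname"], [(1843, "Selfish Evolution")]),
    "publisher": (["pubid", "pubname"], [(20, "Addison-Wesley")]),
}
view_xml = ET.tostring(default_xml_view(tables), encoding="unicode")
print(view_xml)
```

Because the mapping depends only on table and column names, such a view needs no hand-written definition, which fits the "pure XML" goal stated earlier.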

Creating an XPERANTO View CREATE VIEW BooksXMLView As WHERE $bid $bname $pid $bprice IN DefaultXMLView CONSTRUCT {WHERE $pid $pname IN DefaultXMLView CONSTRUCT $pname } {WHERE $bid $fn $ln IN DefaultXMLView CONSTRUCT }

Query Processing in XPERANTO (diagram): a query over the XML view passes through the Query Parser, Query Rewrite (over XQGM), and SQL Translation; the resulting SQL query runs in the ORDBMS, and XML Tagging turns the relational result into the XML result.

Outline Motivation XPERANTO Architecture Query Processing –XQGM –View composition –SQL generation –Data/Meta-data querying Conclusion

XQGM: Xml Query Graph Model. General, language-neutral representation –Replacing XML-QL with Quilt should be easy (we think!). Extension of DB2’s QGM.

Why extend QGM? Many query language operations are the same (e.g., select, join, aggregation). Can support sophisticated SQL functionality (e.g., universal quantification). Borrow existing techniques from an industrial-strength system. Natural translation to SQL.

QGM Extensions: New types for XML elements and attributes. New functions to create and navigate them –CreateXMLElement (tag name, attribute list, sub-element list) –GetXMLSubElements (XML element)
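A minimal sketch of the two functions named on this slide, written as plain Python rather than as QGM operators. The `XMLElement` class, its field names, and the snake_case function names are assumptions made for illustration; only the argument lists (tag name, attribute list, sub-element list) come from the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union

@dataclass
class XMLElement:
    """Hypothetical value type for an XML element."""
    tag: str
    attributes: Dict[str, str] = field(default_factory=dict)
    children: List[Union["XMLElement", str]] = field(default_factory=list)

def create_xml_element(tag, attributes, sub_elements):
    """Counterpart of CreateXMLElement: build an element from its parts."""
    return XMLElement(tag, dict(attributes), list(sub_elements))

def get_xml_sub_elements(element):
    """Counterpart of GetXMLSubElements: return the element's children."""
    return list(element.children)

name = create_xml_element("name", {}, ["Addison-Wesley"])
publisher = create_xml_element("publisher", {"id": "20"}, [name])
subs = get_xml_sub_elements(publisher)
print(subs[0].tag)
```

Treating construction and navigation as ordinary functions over a new value type is what lets them sit next to select, join, and aggregation in the same query graph.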

Outline Motivation XPERANTO Architecture Query Processing –XQGM –View composition –SQL generation and Tagging –Data/Meta-data querying Conclusion

View Composition: Eliminate unnecessary XML element and attribute construction. Done using equivalences. Example: GetSubElements(CreateElement(tags, atts, subelems)) = subelems
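The equivalence on this slide can be sketched as a rewrite over a small expression tree. This is an illustration under assumptions, not XPERANTO's rewrite engine: the `CreateElement` and `GetSubElements` node classes and the `simplify` function are hypothetical, and variables are represented as plain strings.

```python
from dataclasses import dataclass
from typing import Any, Tuple

@dataclass(frozen=True)
class CreateElement:
    """Expression node: construct an element (hypothetical representation)."""
    tag: str
    attributes: Tuple[Any, ...]
    sub_elements: Tuple[Any, ...]

@dataclass(frozen=True)
class GetSubElements:
    """Expression node: navigate to the sub-elements of `source`."""
    source: Any

def simplify(expr):
    """Apply GetSubElements(CreateElement(t, a, s)) = s, bottom-up."""
    if isinstance(expr, GetSubElements):
        source = simplify(expr.source)
        if isinstance(source, CreateElement):
            # The element is built only to be taken apart again: skip it.
            return source.sub_elements
        return GetSubElements(source)
    if isinstance(expr, CreateElement):
        return CreateElement(
            expr.tag,
            expr.attributes,
            tuple(simplify(sub) for sub in expr.sub_elements),
        )
    return expr

# The view constructs a <book> element; the user query navigates into it.
view = CreateElement("book", (), ("$bname", "$pname"))
query = GetSubElements(view)
simplified = simplify(query)
print(simplified)
```

After the rewrite, the composed query refers directly to the underlying values, so the intermediate `book` element never has to be materialized.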

Outline Motivation XPERANTO Architecture Query Processing –XQGM –View composition –SQL generation and Tagging –Data/Meta-data querying Conclusion

SQL Generation and Tagging: Use the “Sorted Outer Union” approach –Shown to be efficient and stable [VLDB 2000]. Generate a single SQL query to get the content in document order. Tag in constant space, inside or outside the engine. All computation- and memory-intensive processing is done inside the ORDBMS.
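A minimal sketch of the idea, assuming a two-level publisher/book hierarchy and using SQLite as a stand-in for the ORDBMS. The schema, the `kind` discriminator column, the `tag_rows` helper, and the output element names are all assumptions for illustration; the sample rows loosely follow the slides, and the PubId assignments are assumed.

```python
import sqlite3
from xml.sax.saxutils import escape

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE publisher (pubid INTEGER, pubname TEXT);
    CREATE TABLE book (bookid INTEGER, bookname TEXT, pubid INTEGER);
    INSERT INTO publisher VALUES (20, 'Addison-Wesley'), (39, 'Prentice-Hall');
    INSERT INTO book VALUES (1843, 'Selfish Evolution', 20),
                            (6509, 'Ice Station Zebra', 39);
""")

# One branch per element type, padded with NULLs to a common schema, then
# sorted so that each publisher row is followed by its book rows.
SORTED_OUTER_UNION = """
    SELECT pubid, 0 AS kind, pubname, NULL AS bookid, NULL AS bookname
    FROM publisher
    UNION ALL
    SELECT pubid, 1, NULL, bookid, bookname
    FROM book
    ORDER BY 1, 2, 4
"""

def tag_rows(rows):
    """Yield XML fragments while holding only the current row and one flag.

    Assumes every book row refers to an existing publisher.
    """
    publisher_open = False
    for pubid, kind, pubname, bookid, bookname in rows:
        if kind == 0:
            if publisher_open:
                yield "</publisher>"
            yield f"<publisher><name>{escape(pubname)}</name>"
            publisher_open = True
        else:
            yield f"<book>{escape(bookname)}</book>"
    if publisher_open:
        yield "</publisher>"

document = "".join(tag_rows(conn.execute(SORTED_OUTER_UNION)))
print(document)
```

Because the rows arrive in document order, the tagger never buffers more than one row; the sorting and joining cost stays inside the database engine.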

Outline Motivation XPERANTO Architecture Query Processing –XQGM –View composition –SQL generation and Tagging –Data/Meta-data querying Conclusion

Example tables (one table per publisher). Addison-Wesley (BookId, BookName, Price, Discount): 1843, Selfish Ev.., Discount 10%. Prentice Hall (BookId, BookName, Price, Tax): 6509, Ice Station.., Price 9.00, Tax 5%. Author (BookId, FirstName, LastName): 1843, Charles, Darwin; 1843, Richard, Dawkins; Alistar, McLean.

XML View for Partners Addison-Wesley Prentice-Hall

Query over XML View: Send me books published by publishers A-D. WHERE $pn ELEMENT_AS $b, “A” < $pn, $pn < “D” IN BooksXMLView CONSTRUCT $b

Outline Motivation XPERANTO Architecture Query Processing –XQGM –View composition –SQL generation and Tagging –Data/Meta-data querying Conclusion

Conclusion: It is important to provide queryable XML views over object-relational DBMSs. XPERANTO is middleware providing this functionality –Pure XML philosophy (“Look Ma, no SQL!”) –Data/Meta-data query support –Harnesses the ORDBMS query engine

Future Work: Constructing recursive structures. Extensions for order. Updateable/insertable views.

Related Work: SilkRoute [Fernandez et al.] –Concurrent project –Middleware providing queryable XML views. Differences –Pure XML philosophy –Query data/meta-data –Pushing processing inside the relational engine –Object-relational structures

XPERANTO Architecture (diagram labels: Views; XML Schema; O-R Database; SQL Query Processor; Stored Tables; System Catalog; Metadata Services; View Services; Type & Table Services; Query Translation; XML-QL Parser; XQGM; Query Rewrite; SQL Translation; XML Schema Generator; Catalog Info; XML Tagger; Data Tuples; Table & Type Info; SQL Queries)