LegoDB XML-to-Relational Mapping using LegoDB Dustin Anderson CSC560 a way to map XML Schema structures to relational tables.

Slides:



Advertisements
Similar presentations
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Advertisements

IS698: Database Management Min Song IS NJIT. The Relational Data Model.
Chapter 3 : Relational Model
An Introduction to XML Based on the W3C XML Recommendations.
CSE 6331 © Leonidas Fegaras XML and Relational Databases 1 XML and Relational Databases Leonidas Fegaras.
Relational Databases for Querying XML Documents: Limitations & Opportunities VLDB`99 Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton,
1 Class Number – CS 304 Class Name - DBMS Instructor – Sanjay Madria Instructor – Sanjay Madria Lesson Title – EER Model –21th June.
Translating from ER to Relationship Schema Adapted from Juliana Freire.
Dimensional Modeling Business Intelligence Solutions.
Sets and Maps Chapter 9. Chapter 9: Sets and Maps2 Chapter Objectives To understand the Java Map and Set interfaces and how to use them To learn about.
Physical Database Monitoring and Tuning the Operational System.
XML To Relational Model. Key Index – Forward Traversal Backward Traversal.
Motivation for IDEF1X Simplicity Common Standard Useful when relational model is target Air Force 1985 or thereabouts.
1 From XML Schema to Relations: A Cost- Based Approach to XML Storage Presented by Xinwan Bian and Danyu Wu
The Query Compiler 16.1 Parsing and Preprocessing Meghna Jain(205) Dr. T. Y. Lin.
LegoDB Customizing Relational Storage for XML Documents Timothy Sutherland Sachin Patidar.
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 9 Relational Database Design by ER- and EER-to- Relational Mapping.
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Chapter 17 Methodology – Physical Database Design for Relational Databases Transparencies © Pearson Education Limited 1995, 2005.
Team Dosen UMN Physical DB Design Connolly Book Chapter 18.
XMLII XSchema XSchema XQuery XQuery. XML Schema XML Schema is a more sophisticated schema language which addresses the drawbacks of DTDs. Supports XML.
17 Apr 2002 XML Schema Andy Clark. What is it? A grammar definition language – Like DTDs but better Uses XML syntax – Defined by W3C Primary features.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model.
Data Access Patterns Some of the problems with data access from OO programs: 1.Data source and OO program use different data modelling concepts 2.Decoupling.
LegoDB 1 Data Binding Workshop, Avaya Labs, June 2003 LegoDB: Cost-based XML to Relational “Shredding” Jerome Simeon Bell Labs – Lucent Technologies joint.
Lecture 9 Methodology – Physical Database Design for Relational Databases.
CODD’s 12 RULES OF RELATIONAL DATABASE
EN : Adv. Storage and TP Systems Cost-Based Query Optimization.
Chapter 16 Methodology – Physical Database Design for Relational Databases.
ICOM 6005 – Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 – Relational Model.
Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Entity-Relationship Model Using High-Level Conceptual Data Models for Database Design Entity Types, Sets, Attributes and Keys Relationship Types, Sets,
CSc340 1b1 The Relational Model Chapter 2 Database Schema Keys Schema Diagrams Relational Query Languages Relational Operations.
Lecture A/18-849B/95-811A/19-729A Internet-Scale Sensor Systems: Design and Policy Lecture 24 – Part 2 XML Query Processing Phil Gibbons April.
Chapter 10 Designing the Files and Databases. SAD/CHAPTER 102 Learning Objectives Discuss the conversion from a logical data model to a physical database.
Methodology – Physical Database Design for Relational Databases.
Relational Theory and Design
Chapter 2: Intro to Relational Model. 2.2 Example of a Relation attributes (or columns) tuples (or rows)
DatabaseIM ISU1 Chapter 7 ER- and EER-to-Relational Mapping Fundamentals of Database Systems.
Computing & Information Sciences Kansas State University Friday, 20 Oct 2006CIS 560: Database System Concepts Lecture 24 of 42 Friday, 20 October 2006.
The Relational Model. 2 Relational Model Terminology u A relation is a table with columns and rows. –Only applies to logical structure of the database,
Copyright © 2009 Pearson Education, Inc. Publishing as Prentice Hall Chapter 9 Designing Databases 9.1.
Deriving Relation Keys from XML Keys by Qing Wang, Hongwei Wu, Jianchang Xiao, Aoying Zhou, Junmei Zhou Reviewed by Chris Ying Zhu, Cong Wang, Max Wang,
Experience with XML Schema Ashok Malhotra Schema Usage  Mapping XML Schema and XML documents controlled by the Schema to object classes and instances.
Jennifer Widom Relational Databases The Relational Model.
Chapter 3: Relational Databases
Lecture 15: Query Optimization. Very Big Picture Usually, there are many possible query execution plans. The optimizer is trying to chose a good one.
Sets and Maps Chapter 9. Chapter Objectives  To understand the Java Map and Set interfaces and how to use them  To learn about hash coding and its use.
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Welcome: To the fifth learning sequence “ Data Models “ Recap : In the previous learning sequence, we discussed The Database concepts. Present learning:
Lecture 5 Data Model Design Jeffery S. Horsburgh Hydroinformatics Fall 2012 This work was funded by National Science Foundation Grant EPS
Database Applications (15-415) DBMS Internals- Part VIII Lecture 19, March 29, 2016 Mohammad Hammoud.
Relational Algebra COMP3211 Advanced Databases Nicholas Gibbins
Database Applications (15-415) DBMS Internals- Part IX Lecture 20, March 31, 2016 Mohammad Hammoud.
CENG 351 File Structures and Data Management1 Relational Model Chapter 3.
XML Databases Presented By: Pardeep MT15042 Anurag Goel MT15006.
XML: Extensible Markup Language
Methodology – Physical Database Design for Relational Databases
Translation of ER-diagram into Relational Schema
Relational Databases The Relational Model.
Relational Databases The Relational Model.
Database Applications (15-415) DBMS Internals- Part IX Lecture 21, April 1, 2018 Mohammad Hammoud.
Tuning Queries from (E&N)
Lecture 30: Final Review Wednesday, December 6, 2000.
Lecture 30: Final Review Wednesday, December 10, 2003.
Presentation transcript:

LegoDB XML-to-Relational Mapping using LegoDB Dustin Anderson CSC560 a way to map XML Schema structures to relational tables

Rewrite as P-Schema p-schemaRewrite the XML Schema more simply, but syntactically equivalent : p-schema P-schemas contain data statistics which can be mapped into relations, and allow the XML-to-relational mappings to take place

Stratify Types stratifyUsing a grammar, stratify the types (this allows LegoDB to work with structured & semistructured data in the same way) TypeRepresentationStratification Physical Singleton elements, nested singleton elements, optional types Mapped to columns != null OptionalOptional nested elements Mapped to columns null Named Only named types to ensure regex do not contain nested elements Only used for foreign key generation & to keep track of child- parent relationships

Map P-Schemas map them to relationsNow that the physical types are stratified, map them to relations: 1. Create one relation for each element 2. For each relation, create a key that will store the identifiers of all parent elements 3. For each relation, create an attribute for each inlined subelement 4. If the element is optional, then the corresponding column can contain a null value

Generate New P-Schemas XML-to-XML transformationsLegoDB uses a subset of p-schemas that are derived from XML-to-XML transformations: 1. Inlining/outlining 2. Repetition merge/split 3. Wildcards rewriting 4. Union factorization/distribution 5. From union to options

Inlining: Transformation Example type TV = seasons [ Integer ], Description, Episode* Type Description = description [ String ] type TV = seasons [ Integer ], description [ String ], Episode*

Selecting the “Best” over all the transformationsThe LegoDB algorithm works by exploring the space of the “interesting” possible XML-to-relational mappings over all the transformations GREEDY algorithm that looks at each potential mapping’s: »given statistics (data type sizes, minimum/maximum values, number of distinct values) »query workloads (a set of sample queries and their “importance”)

GREEDY LegoDB XML-to-XML 1. Any possible XML-to-XML transformation (rewriting) is applied to the p-schema, resulting in a new p-schema

GREEDY LegoDB XML-to-relational new 2. XML-to-relational transformations are applied to the new p-schema and resulting relational schema (mapping), on which given queries are tested and the performance estimated Keep the “best” one every step. one every step.

ankYou! Thank You! M.C. Escher - Relativity Philip Bohannon, Juliana Freire, Prasan Roy, Jérôme Siméon: From XML Schema to Relations: A Cost-Based Approach to XML Storage. ICDE 2002: 64-75