09.05.2008 Mapping Data to Queries Martin Hentschel Systems Group, ETH Zurich.

Slides:



Advertisements
Similar presentations
Copyright 2001, ActiveState. XSLT and Scripting Languages or…XSLT: what is everyone so hot and bothered about?
Advertisements

Advanced XSLT. Branching in XSLT XSLT is functional programming –The program evaluates a function –The function transforms one structure into another.
DWebSpec Why dWebSpec? It could be asked,with all the web application alternatives.
Querying on the Web: XQuery, RDQL, SparQL Semantic Web - Spring 2006 Computer Engineering Department Sharif University of Technology.
XML: Extensible Markup Language
Twig 2 Stack: Bottom-up Processing of Generalized-Tree-Pattern Queries over XML Documents Songting Chen, Hua-Gang Li *, Junichi Tatemura Wang-Pin Hsiung,
Getting started with ML ML is a functional programming language. ML is statically typed: The types of literals, values, expressions and functions in a.
1 XML Data Management Course Outline and Organisation Werner Nutt.
1 CS 561 Presentation: Indexing and Querying XML Data for Regular Path Expressions A Paper by Quanzhong Li and Bongki Moon Presented by Ming Li.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Manish Bhide, Manoj K Agarwal IBM India Research Lab India {abmanish, Amir Bar-Or, Sriram Padmanabhan IBM Software Group, USA
Paper by: A. Balmin, T. Eliaz, J. Hornibrook, L. Lim, G. M. Lohman, D. Simmen, M. Wang, C. Zhang Slides and Presentation By: Justin Weaver.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
A Graphical Environment to Query XML Data with XQuery
1 Indexing and Querying XML Data for Regular Path Expressions A Paper by Quanzhong Li and Bongki Moon Presented by Amnon Shochot.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
17 Apr 2002 XML Stylesheets Andy Clark. What Is It? Extensible Stylesheet Language (XSL) Language for document transformation – Transformation (XSLT)
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
XML Query Language Changki Kim July MSE. 2 2 Contents  Introduction  XQuery Data Model  XQuery Expression Types  Implementations of XQuery.
Web service testing Group D5. What are Web Services? XML is the basis for Web services Web services are application components Web services communicate.
1 Distributed Monitoring of Peer-to-Peer Systems By Serge Abiteboul, Bogdan Marinoiu Docflow meeting, Bordeaux.
Scalable Architecture for the Cloud. What????  Command Query Responsibility Segregation  What is it?  What kinds of apps is it for?  What are the.
TDDD43 XML and RDF Slides based on slides by Lena Strömbäck and Fang Wei-Kleiner 1.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation An Introduction to XQuery.
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
DP&NM Lab. POSTECH, Korea - 1 -Interaction Translation Methods for XML/SNMP Gateway Interaction Translation Methods for XML/SNMP Gateway Using XML Technologies.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
1 XML Data Management Course Outline and Organisation Werner Nutt.
XML as a Boxwood Data Structure Feng Zhou, John MacCormick, Lidong Zhou, Nick Murphy, Chandu Thekkath 8/20/04.
OXygen XML Editor Support for eXist DB XQuery debugging. Stefan Vasile
1 HKU CSIS DB Seminar: HKU CSIS DB Seminar: Web Services Oriented Data Processing and Integration Speaker: Eric Lo.
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
DANIEL J. ABADI, ADAM MARCUS, SAMUEL R. MADDEN, AND KATE HOLLENBACH THE VLDB JOURNAL. SW-Store: a vertically partitioned DBMS for Semantic Web data.
HEPTOX 1 : Marrying XML and Heterogeneity in Your P2P Databases Angela Bonifati (Icar CNR, Italy), Elaine Q.Chang, Laks V.S.Lakshmanan, Terence Ho, Rachel.
Database Systems Part VII: XML Querying Software School of Hunan University
SPARQL Query Graph Model (How to improve query evaluation?) Ralf Heese and Olaf Hartig Humboldt-Universität zu Berlin.
7 Systems Analysis and Design in a Changing World, Fifth Edition.
Early Profile Pruning on XML-aware Publish- Subscribe Systems Mirella M. Moro, Petko Bakalov, Vassilis J. Tsotras University of California VLDB 2007 Presented.
Sept. 27, 2002 ISDB’02 Transforming XPath Queries for Bottom-Up Query Processing Yoshiharu Ishikawa Takaaki Nagai Hiroyuki Kitagawa University of Tsukuba.
XML Query: xQuery Reference: Xquery By Priscilla Walmsley, Published by O’Reilly.
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
Steps to integrate XML How does XML processing work? Simple uses of passive DOM objects Adding behaviour to information A converter and translator subsystem.
Scheduling of Transactions on XML Documents Author: Stijin Dekeyser Jan Hidders Reviewed by Jason Chen, Glenn, Steven, Christian.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Compilation of XSLT into Dataflow Graphs for Web Service Composition Peter Kelly Paul Coddington Andrew Wendelborn.
XML Stream Processing Yanlei Diao University of Massachusetts Amherst.
Microsoft Azure and ServiceNow: Extending IT Best Practices to the Microsoft Cloud to Give Enterprises Total Control of Their Infrastructure MICROSOFT.
Design of a Notification Engine for Grid Monitoring Events and Prototype Implementation Natascia De Bortoli INFNGRID Technical Board Bologna Feb.
Slice & dice the Web with XmlPL, The XML Processing Language A presentation for Boise Code Camp 2007 Joseph Coffland Cauldron Development LLC.
Business rules.
Tools Of Structured Analysis
Efficient Evaluation of XQuery over Streaming Data
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
6.001 SICP Variations on a Scheme
XACML and the Cloud.
High-Performance XML Filtering with YFilter
{ XML Technologies } BY: DR. M’HAMED MATAOUI
Optimized Rewriter Rules for Efficient Querying of JSON Data
OrientX: an Integrated, Schema-Based Native XML Database System
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Akshay Tomar Prateek Singh Lohchubh
Andrej Valjavec, dipl. org. Ljubljanske mlekarne d.d.
Querying XML XPath.
Towards an Internet-Scale XML Dissemination Service
Querying XML XPath.
Query Optimization.
The Ohio State University
Presentation transcript:

Mapping Data to Queries Martin Hentschel Systems Group, ETH Zurich

Martin Hentschel/Systems Group, ETH  “…, but the real advantage of XML is precisely that it allows you to go from Point A to destinations unknown.” -- Larry O’Brien, Microsoft 2

Martin Hentschel/Systems Group, ETH 3 Goals  Integrate data from various data feeds  Light-weight  Easy to use  Fast

Martin Hentschel/Systems Group, ETH 4 Goals  Integrate data from various data feeds  Light-weight Mapping rules  Easy to use Based on common language (XQuery)  Fast Implements research ideas (YFilter)

Martin Hentschel/Systems Group, ETH Targets  Health care  Electronic health records (Health Level 7)  Finance  Exchange of financial data (xBRL)  Web services  News feeds  Weather Every domain which uses several data sources 5

Martin Hentschel/Systems Group, ETH Example  Find the most powerful car 6 Ford 130 Ford 130 VW Golf 150 VW Golf 150

Martin Hentschel/Systems Group, ETH Example  Find the most powerful car 7 Ford 130 Ford 130 VW Golf 150 VW Golf 150 datenis-adb; autois-acar; psis-ahp; datenis-adb; autois-acar; psis-ahp;

Martin Hentschel/Systems Group, ETH Example  Find the most powerful car  Apply standard XQuery 8 Ford 130 Ford 130 VW Golf 150 VW Golf 150 datenis-adb; autois-acar; psis-ahp; datenis-adb; autois-acar; psis-ahp; let $max := max(//hp) for $car in //car where $car/hp = $max return $car let $max := max(//hp) for $car in //car where $car/hp = $max return $car

Martin Hentschel/Systems Group, ETH Example  Find the most powerful car  Apply standard XQuery 9 Ford 130 Ford 130 VW Golf 150 VW Golf 150 datenis-adb; autois-acar; psis-ahp; datenis-adb; autois-acar; psis-ahp; let $max := max(//hp) for $car in //car where $car/hp = $max return $car let $max := max(//hp) for $car in //car where $car/hp = $max return $car VW Golf 150 VW Golf 150  Result

Martin Hentschel/Systems Group, ETH Usage Scenarios  Continuous query processing 10 DSMS Querie s Rules Streaming Input Events Streaming Output Events

Martin Hentschel/Systems Group, ETH Usage Scenarios  Publish/subscribe systems 11 Rules PublishersSubscribers Enhanced Broker Enhanced Broker Data Subscriptions Data

Martin Hentschel/Systems Group, ETH Usage Scenarios  Data integration 12 Rules Source 1 Company‘s Data Store Data Source 2 Source x Homogeneous Data Handler Data Handler

Martin Hentschel/Systems Group, ETH The Is-A Rule  Map XML elements  Expresses a substitutability relationship  Like in object oriented design  Use the car wherever vehicles are expected  It follows //vehicle also returns car elements  Returned as car  Not transformed into vehicle  Consistent with OO-approach 13 car is-a vehicle;

Martin Hentschel/Systems Group, ETH The Is-A Rule  Map path expressions  XPath path expressions  Left hand side may include predicates 14 german/car is-a auto; auto is-a german/car; german/car is-a auto; auto is-a german/car; < 100] is-a slow/vehicle; < 100] is-a slow/vehicle;

Martin Hentschel/Systems Group, ETH The Is-A Rule  Specify contexts  Element names could be used differently in different contexts  Scope applicability of rules  Further refinement 15 car in is-a auto; car in is-a auto;

Martin Hentschel/Systems Group, ETH The Is-A Rule  Element construction  Map elements  Transform data, e.g. for  Integration of very diverse data 16 auto as $a is-a {$a/ps * 0.74} ; auto as $a is-a {$a/ps * 0.74} ; Ford 100 Ford 100 VW Golf 150 VW Golf 150

Martin Hentschel/Systems Group, ETH Implementation  Several possibilities  MDQ approach -Native approach, novel MDQ data model -Allows lazy execution  Query rewrite -E.g. //(car | auto | vehicle |...) -Does not scale  Data translation -Translate input data -Big overhead 17

Martin Hentschel/Systems Group, ETH MDQ Data Model  Classical XML tree model 18 Golf 150 Golf 150 auto psname „Golf“„150“ daten

Martin Hentschel/Systems Group, ETH MDQ Data Model  MDQ data model  Move names from nodes to edges 19 Golf 150 Golf 150 auto psname „Golf“„150“ daten

Martin Hentschel/Systems Group, ETH MDQ Data Model  Application of mapping rules 20 Golf 150 Golf 150 auto ps name „Golf“„150“ daten daten is-a db; auto is-a car; ps is-a hp; daten is-a db; auto is-a car; ps is-a hp; db car hp

Martin Hentschel/Systems Group, ETH Lazy Evaluation, YFilter  Built from left hand side of rules  Non-deterministic finite state machine  Main idea:  Evaluate XQuery program  Iterate through data model  Report to YFilter  Apply rules only when reaching an accepting state 21 R1: daten is-a db; R2: auto is-a car; R2: ps is-a hp; R1: daten is-a db; R2: auto is-a car; R2: ps is-a hp;  * daten auto ps R1 R2 R3

Martin Hentschel/Systems Group, ETH Experiment: Throughput  Complex query (multiple scans, joins)  QR: too many unions, DT: overhead of translation 22

Martin Hentschel/Systems Group, ETH Experiment: Throughput  Simple query  Less unions for QR, DT: still overhead of translation 23

Martin Hentschel/Systems Group, ETH Experiment: Throughput  1 input message, bundle of queries evaluated at once  QR: even more unions, DT: less overhead, only transforms input message once 24

Martin Hentschel/Systems Group, ETH Again: Advantages  Performance  Novel data model, lazy execution  Light-weight  Mappings rules are small units  Extensibility  Add more rules as new sources are adopted  Flexibility  Complex mappings through element constructors 25

Martin Hentschel/Systems Group, ETH The End  Visit our website, LIVE DEMO!   Write us, please!  26