1 Composable XML Integration Grammars Xibei Jia Laboratory for Foundations of Computer Science Supervisor: Wenfei Fan 20 May 2004.

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

Querying on the Web: XQuery, RDQL, SparQL Semantic Web - Spring 2006 Computer Engineering Department Sharif University of Technology.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
XML: Extensible Markup Language
Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
1 Information Preserving XML Schema Embedding Philip BohannonBell Laboratories Wenfei FanUniv of Edinburgh & Bell Labs Michael Flaster Bell Laboratories.
SOAP & Security IEEE Computer Society Utah Chapter Hilarie Orman - Purple Streak Development Tolga Acar - Novell, Inc. October 24, 2002.
CSE 6331 © Leonidas Fegaras XML and Relational Databases 1 XML and Relational Databases Leonidas Fegaras.
The Greedy Method1. 2 Outline and Reading The Greedy Method Technique (§5.1) Fractional Knapsack Problem (§5.1.1) Task Scheduling (§5.1.2) Minimum Spanning.
SilkRoute: A Framework for Publishing Relational Data in XML Mary Fernández, AT&T Labs - Research Dan Suciu, Univ. of Washington Yada Kadiyska, Univ. of.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
Manish Bhide, Manoj K Agarwal IBM India Research Lab India {abmanish, Amir Bar-Or, Sriram Padmanabhan IBM Software Group, USA
XML Views El Hazoui Ilias Supervised by: Dr. Haddouti Advanced XML data management.
The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Summary. Chapter 9 – Triggers Integrity constraints Enforcing IC with different techniques –Keys –Foreign keys –Attribute-based constraints –Schema-based.
14 1 Chapter 14 Database Connectivity and Web Development Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
CSE1301 Computer Programming: Lecture 21 Software Engineering.
XML –Query Languages, Extracting from Relational Databases ADVANCED DATABASES Khawaja Mohiuddin Assistant Professor Department of Computer Sciences Bahria.
Distributed Collaborations Using Network Mobile Agents Anand Tripathi, Tanvir Ahmed, Vineet Kakani and Shremattie Jaman Department of computer science.
Federated Search of Text Search Engines in Uncooperative Environments Luo Si Language Technology Institute School of Computer Science Carnegie Mellon University.
Mining Metamodels From Instance Models: The MARS System Faizan Javed Department of Computer & Information Sciences, University of Alabama at Birmingham.
Messaging Technologies Group: Yuzhou Xia Yi Tan Jianxiao Zhai.
By Justin Thompson. What is SOAP? Originally stood for Simple Object Access Protocol Created by vendors from Microsoft, Lotus, IBM, and others Protocol.
TECHNIQUES FOR OPTIMIZING THE QUERY PERFORMANCE OF DISTRIBUTED XML DATABASE - NAHID NEGAR.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
These materials are prepared only for the students enrolled in the course Distributed Software Development (DSD) at the Department of Computer.
Discovering E-Services Using UDDI in SELF-SERV Quan Z. Sheng, Boualem Benatallah, Rayan Stephan, Eileen Oi-Yan Mak, Yan Q. Zhu School of Computer Science.
Metadata Tools and Methods Chris Nelson Metanet Conference 2 April 2001.
And Di s t r i b u t e d Databases on the Web Nathaniel Ayewah CSE 8330 Presentation SMU.
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
Ohio State University Department of Computer Science and Engineering Automatic Data Virtualization - Supporting XML based abstractions on HDF5 Datasets.
Chapter 1 Introduction to Data Mining
Fundamentals of Database Chapter 7 Database Technologies.
NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on: Global Computing (GC) Proactive.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Graph Data Management Lab, School of Computer Science gdm.fudan.edu.cn Luyiqi Locus based alignment storage.
Using XML to present computer program Qingsong Yao Qingsong Yao Department of Computer Science Department of Computer Science York University York University.
The Semistructured-Data Model Programming Languages for XML Spring 2011 Instructor: Hassan Khosravi.
Cost Framework for a Heterogeneous Distributed Semi-structured Environment Tianxiao Liu (1)(2) Tuyet-Tram Dang-Ngoc (1) Dominique Laurent (1) DBMAN 2007.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. An Overview of XML Ellen Pearlman Eileen Mullin Programming the Web Using.
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
1 Approximate XML Query Answers Presenter: Hongyu Guo Authors: N. polyzotis, M. Garofalakis, Y. Ioannidis.
Dispatching Java agents to user for data extraction from third party web sites Alex Roque F.I.U. HPDRC.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Research Overview Gagan Agrawal Associate Professor.
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
COMP7330/7336 Advanced Parallel and Distributed Computing Task Partitioning Dynamic Mapping Dr. Xiao Qin Auburn University
Advanced Database Course Syllabus 1 Advanced Database System Lecturer : H.Ben Othmen.
Ping-Tsun Chang Intelligent Systems Laboratory NTU/CSIE Using Support Vector Machine for Integrating Catalogs.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Parallel Patterns.
SilkRoute: A Framework for Publishing Rational Data in XML
2/18/2019.
Convert (flatten) IATI XML file to CSV file(s) using XQUERY
Database Connectivity and Web Development
Query Optimization.
Decentralised Middleware and Workflow Enactment
Query Processing.
Implementation of a Functional Programming Language
Presentation transcript:

1 Composable XML Integration Grammars Xibei Jia Laboratory for Foundations of Computer Science Supervisor: Wenfei Fan 20 May 2004

2 Schema-directed XML integration extract relevant data from distributed, multiple XML repositories construct an XML view conform to a predefined schema XML3 XML1 XML view XML2 middleware schema data exchange

3 Composable XML Integration A lightweight language: XML Integration Grammar (XIG) XML XIG middleware XML XIG middleware XML XIG middleware XML XIG middleware XML XIG middleware XML query result www

4 Our middleware for composable schema-directed integration XML data sources remote XIG XQuery engine remote XIG call execution query scheduling XQuery optimizer query merging optimizer parsing XIG XML document cost statistics data call result

5 XIG: XML Integration Grammar XIG: a novel specification language –Is not another query/transformation language –Rely solely on DTDs and XQuery –Provide a systematic mechanism and design tool to ensure DTD-conformance for XQuery –Can be defined using some specific XQuery fragment that allows for more optimizations than full-fledged XQuery –Composable: local/remote XIGs can be treated as functions returning XML trees and can be embedded A systematic framework for schema-directed XML integration

6 Example: car sale db  dealers, promotion Syn(db) = {Syn(dealers)} {Syn(promotion)} promotion  sale * Syn(promotion) = Vsale(X) dealers  dealer * Inh(dealer) =for $Y in R/dl return $Y; Syn(dealers) = { Syn(dealer) } dealer  name, address, cars Inh(name) = let $p := Inh(dealer)/Uv let $u := Inh(dealer)/U let $v = $p:Vdealer($u) return $v/dealer/name; Syn(dealer) = { Syn(name) } { Syn(address) } { Syn(cars) }

7 Example: car sale car  car * Inh(car)  let $s := Vsale(X) for $c in Inh(cars)/car $c1 in $s/promotion/sale where $c/make = $c1/make and $c/model = $c1/model return $c Syn(cars) = { Syn(car) } car  make, model, price, inStock Syn(car) = Inh(car) name  PCDATA /* similarly for address */ Syn(name) = Inh(name)/text();

8 XIG-Evaluation Algorithms Optimization techniques: nontrivial optimization problems –recursion: capture recursive DTDs and recursive XIGs –query scheduling: a cost-based algorithm for scheduling local XML queries/XIGs and remote XIGs to maximize parallelism –query merging: merge certain queries processed at the same source into a larger query to reduce communication costs –interaction between query merging and query scheduling: have to be dealt with together, a greedy heuristic algorithm –query dependency: XIGs support top-down, bottom-up and sideway information passing (implicit)