Cost Framework for a Heterogeneous Distributed Semi-structured Environment Tianxiao Liu (1)(2) Tuyet-Tram Dang-Ngoc (1) Dominique Laurent (1) DBMAN 2007.

Presentation transcript:

Cost Framework for a Heterogeneous Distributed Semi-structured Environment
Tianxiao Liu (1)(2), Tuyet-Tram Dang-Ngoc (1), Dominique Laurent (1)
(1) ETIS Laboratory, University of Cergy-Pontoise, Cergy-Pontoise, France
(2) Xcalia S.A., Paris, France
DBMAN 2007, June 18th, 2007

Outline
- Motivation
- Cost models for heterogeneous data sources
- Contributions
  - Generic language for cost communication
  - Dynamic cost estimation framework
- Conclusion

Motivation
Cost-based query optimization
- Various execution plans for the same query
- Different costs for each plan (execution time, price, communication, etc.)
- Cost model used to estimate the cost of candidate plans
  - Cost formulas: source oriented or operation oriented
  - Statistics of data sources
Problems in a mediation context
- Data source autonomy: cost models not available
- Integration of various cost models at the mediator level
- Cost communication between components of the system
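The selection step on this slide can be sketched as follows. This is a minimal illustration under stated assumptions, not the XLive optimizer: the plan names, the statistics, and both cost formulas are hypothetical, and cost is reduced to estimated execution time.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical statistics about two sources; all numbers are invented.
STATS: Dict[str, float] = {
    "card_books": 10_000,     # items in a "books" source
    "card_reviews": 50_000,   # items in a "reviews" source
    "selectivity": 0.01,      # fraction of books kept by a filter
    "time_per_item": 0.002,   # assumed processing time per item or comparison
}


@dataclass
class Plan:
    """A candidate execution plan with an operation-oriented cost formula."""
    name: str
    cost_formula: Callable[[Dict[str, float]], float]


def join_then_filter(s: Dict[str, float]) -> float:
    # Assumed nested-loop join over both full inputs.
    return s["time_per_item"] * s["card_books"] * s["card_reviews"]


def filter_then_join(s: Dict[str, float]) -> float:
    # Scan books once, then join only the books that survive the filter.
    kept = s["card_books"] * s["selectivity"]
    return s["time_per_item"] * (s["card_books"] + kept * s["card_reviews"])


PLANS: List[Plan] = [
    Plan("join_then_filter", join_then_filter),
    Plan("filter_then_join", filter_then_join),
]


def choose_cheapest(plans: List[Plan], stats: Dict[str, float]) -> Plan:
    """Return the plan whose estimated cost is lowest."""
    return min(plans, key=lambda plan: plan.cost_formula(stats))


best = choose_cheapest(PLANS, STATS)
print(best.name, best.cost_formula(STATS))
```

Both plans answer the same query; only the estimates differ, which is why the quality of the formulas and statistics matters.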

Cost models for heterogeneous data sources
[Diagram: classification of cost models]
- Categories: cost models based on operation implementation; generic cost models; specific methods
- Source knowledge: known sources; heterogeneous autonomous sources
- Data source types: relational; object-oriented; semi-structured
- Operation [GP89] [ML86] [SA82]
- Sampling [ZL98]
- Calibration [DKS92]
- Adaptive [Zhu95]
- Operation [CD92] [BMG93] [DOA+94]
- Calibration [GST96]
- Access Path [GGT96]
- Flora [Flo96] [Gru96]
- Hybrid cost model [NGT98]
- Cost model by history [ACP96]
- Wrapper [HKWY97] [ROH99]
- Operation [AAN01] [MW99]
- XQuery Self-Learning [ZHJGML05]
- Arrow labels: Adapted, Refined, Extended, Applied

Background
XLive mediation system and its XQuery evaluation process
[Diagram: XLive architecture]
- Input and output: XQuery query; result (XML) response
- Data sources, each behind a wrapper: relational data source, XML data source, Web services
- Query representations: XQuery, canonized XQuery, Tree Graph View (TGV), annotated TGV, XAlgebra
- Processing steps: canonization, modeling, annotation, transformation, evaluation, cost-based optimization
- Optimization components: equivalent rules, search strategy, cost information
- Repositories and operators: Mediator Information Repository, Wrapper Information Repository, mediator operators, wrapper operators

Background
Tree Graph View (TGV)
[Figure: an example of XQuery and its TGV representation]

Generic cost model in a mediation context
Design a generic cost model...
- Source type: relational, semi-structured, web service...
- Specific methods
  - Calibration, history...
  - APIs implemented by the system
- Principle: as accurate as possible
...using cost formulas
- Equation systems
- Statistics also expressed as equations
- Constant values
Existing generic cost model (Disco)
- Object-oriented environment
- Predefined variables in the language
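One way to read "equation systems, statistics expressed as equations, constant values" is that every quantity is an entry in a single system, either a constant or a formula over other entries. The sketch below assumes exactly that. The variable names and numbers are hypothetical, and the system is assumed to be acyclic.

```python
from typing import Callable, Dict, Optional, Tuple, Union

# An entry is either a constant or (formula, names of the entries it depends on).
Equation = Union[float, Tuple[Callable[..., float], Tuple[str, ...]]]

# Hypothetical system for scanning one document; all values are invented.
SYSTEM: Dict[str, Equation] = {
    "page_size": 4096.0,        # constant value
    "time_per_page": 0.01,      # constant value
    "doc_bytes": 819200.0,      # statistic given directly
    # Statistic expressed as an equation over other entries.
    "pages": (lambda size, page: size / page, ("doc_bytes", "page_size")),
    # Cost formula built on top of the statistics.
    "scan_time": (lambda n, t: n * t, ("pages", "time_per_page")),
}


def solve(name: str, system: Dict[str, Equation],
          cache: Optional[Dict[str, float]] = None) -> float:
    """Resolve one entry by recursively resolving the entries it depends on."""
    cache = {} if cache is None else cache
    if name not in cache:
        equation = system[name]
        if isinstance(equation, tuple):
            formula, dependencies = equation
            arguments = [solve(dep, system, cache) for dep in dependencies]
            cache[name] = formula(*arguments)
        else:
            cache[name] = float(equation)
    return cache[name]


print(solve("scan_time", SYSTEM))
```

Because nothing is predefined, a wrapper could replace the constant `doc_bytes` with its own equation without touching the formula that uses it.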

Our proposal: Generic Language for Cost Communication (GLCC)
A language based on XML
- Cost formulas and equation systems expressed in MathML
A generic language
- No predefined variables
- Expresses different costs for various optimization objectives (time, price...)
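The slides do not show GLCC syntax, so the message below is only a guess at what an XML cost message carrying MathML might look like. The `<cost>` wrapper and its attributes are invented for illustration; only the `<math>` part follows Content MathML. The evaluator handles just `plus` and `times`, and the variable names `card` and `t_read` are hypothetical.

```python
import xml.etree.ElementTree as ET
from math import prod
from typing import Dict

MATHML_NS = "http://www.w3.org/1998/Math/MathML"

# Hypothetical cost message encoding: card * (t_read + 0.5)
GLCC_EXAMPLE = f"""
<cost name="scan_time" objective="time">
  <math xmlns="{MATHML_NS}">
    <apply><times/>
      <ci>card</ci>
      <apply><plus/><ci>t_read</ci><cn>0.5</cn></apply>
    </apply>
  </math>
</cost>
"""


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def evaluate(node: ET.Element, variables: Dict[str, float]) -> float:
    """Evaluate a small subset of Content MathML: cn, ci, apply/plus/times."""
    kind = local_name(node.tag)
    if kind == "cn":                      # numeric literal
        return float(node.text)
    if kind == "ci":                      # variable supplied by the caller
        return variables[node.text.strip()]
    if kind == "apply":                   # first child is the operator
        operator, *operands = list(node)
        values = [evaluate(operand, variables) for operand in operands]
        name = local_name(operator.tag)
        if name == "plus":
            return sum(values)
        if name == "times":
            return prod(values)
        raise ValueError(f"unsupported operator: {name}")
    raise ValueError(f"unsupported element: {kind}")


def cost_from_message(xml_text: str, variables: Dict[str, float]) -> float:
    """Parse a cost message and evaluate the formula inside its <math> element."""
    root = ET.fromstring(xml_text)
    math_element = root.find(f"{{{MATHML_NS}}}math")
    return evaluate(math_element[0], variables)


print(cost_from_message(GLCC_EXAMPLE, {"card": 100.0, "t_read": 1.5}))
```

Since every variable is bound by the caller, the same mechanism could carry a price formula instead of a time formula, in line with the "no predefined variables" point.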

Dynamic cost estimation framework
- Cooperation and communication between the different components of XLive
- Use execution results (response time) to improve the accuracy of cost models
- Cost communication performed in GLCC
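The slide does not say how execution results are fed back into the cost models. One simple possibility, shown here purely as an assumption, is to keep a correction factor per formula and move it toward the observed ratio of measured to estimated time. The class name and the smoothing weight are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CalibratedCost:
    """Scales a static estimate by a factor learned from past executions."""
    factor: float = 1.0          # 1.0 means "trust the static formula as is"
    learning_rate: float = 0.5   # assumed smoothing weight, between 0 and 1

    def estimate(self, static_estimate: float) -> float:
        """Return the static estimate corrected by what has been observed."""
        return static_estimate * self.factor

    def observe(self, static_estimate: float, measured_time: float) -> None:
        """Move the factor toward the ratio measured / estimated."""
        if static_estimate <= 0:
            return  # a ratio cannot be formed, so skip this observation
        ratio = measured_time / static_estimate
        self.factor += self.learning_rate * (ratio - self.factor)


model = CalibratedCost()
model.observe(static_estimate=10.0, measured_time=20.0)  # source ran 2x slower
print(model.estimate(10.0))
```

A smaller `learning_rate` reacts more slowly to a single unusual response time, while a larger one adapts faster to a source whose load has changed.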

Overall cost estimation on the mediator
TGV cost annotation
- One operation, or a group of operations, in a TGV is annotated with cost information
[Figure: annotated TGV]

Overall cost estimation on the mediator
Cost Annotation Tree (CAT)
- Breadth-first traversal of the CAT to associate an execution cost with each node
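The slide names only the traversal, not how costs are combined. The sketch below assumes that a node's execution cost is its own local cost plus the total cost of its children, so it collects the nodes breadth-first and then fills in the totals in reverse order, which guarantees that children are costed before their parent. Node names and local costs are hypothetical, and names are assumed to be unique.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class CatNode:
    """One node of a cost annotation tree (hypothetical structure)."""
    name: str
    local_cost: float
    children: List["CatNode"] = field(default_factory=list)


def breadth_first(root: CatNode) -> List[CatNode]:
    """Return the nodes level by level, starting from the root."""
    order: List[CatNode] = []
    queue = deque([root])
    while queue:
        node = queue.popleft()
        order.append(node)
        queue.extend(node.children)
    return order


def annotate_costs(root: CatNode) -> Dict[str, float]:
    """Associate a total execution cost with each node name."""
    totals: Dict[str, float] = {}
    # Reverse breadth-first order visits every child before its parent.
    for node in reversed(breadth_first(root)):
        child_total = sum(totals[child.name] for child in node.children)
        totals[node.name] = node.local_cost + child_total
    return totals


tree = CatNode("join", 1.0, [
    CatNode("scan_a", 2.0),
    CatNode("filter", 0.5, [CatNode("scan_b", 3.0)]),
])
print(annotate_costs(tree))
```

The root's total is the estimated cost of the whole plan under this additive assumption.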

Conclusion and future work
Contributions
- First cost-based query optimization framework for an XML-based mediation system
- Generic language
- Suitable for various search strategies
Future work
- Cost model validation: accuracy and performance
- Calibrating the cost of native XML data sources
- Search strategy

Thanks for your attention! Questions?