A Publish & Subscribe Architecture for Distributed Metadata Management Markus Keidl 1 Alexander Kreutz 1 Alfons Kemper 1 Donald Kossmann 2 1 Universität.

Slides:



Advertisements
Similar presentations
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Advertisements

DB glossary (focus on typical SQL RDBMS, not XQuery or SPARQL)
RDFTL: An Event-Condition- Action Language for RDF George Papamarkos Alexandra Poulovassilis Peter T. Wood School of Computer Science and Information Systems.
Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
Building Dynamic Market Places Using HyperQueries Christian Wiesner Peter Winklhofer Alfons Kemper Universität Passau.
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
1 gStore: Answering SPARQL Queries Via Subgraph Matching Presented by Guan Wang Kent State University October 24, 2011.
By Daniela Floresu Donald Kossmann
Outline SQL Server Optimizer  Enumeration architecture  Search space: flexibility/extensibility  Cost and statistics Automatic Physical Tuning  Database.
Selective Dissemination of Streaming XML By Hyun Jin Moon, Hetal Thakkar.
Xyleme A Dynamic Warehouse for XML Data of the Web.
A Robust System Architecture For Mining Semi-structured Data By Aby M Mathew CSE
1 RDF Aggregate Queries and Views Edward Hung, Yu Deng, V.S. Subrahmanian University of Maryland, College Park ICDE 2005, April 7, Tokyo, Japan.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
1 Awareness Services for Digital Libraries Arturo Crespo Hector Garcia-Molina Stanford University.
1 Personal Activity Coordinator (PAC) Xia Hong UC Berkeley ISRG retreat 1/11/2000.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
Automatic Data Ramon Lawrence University of Manitoba
INTEGRATION INTEGRATION Ramon Lawrence University of Iowa
Fuego Event Service: Towards Modularity in Event Routing Sasu Tarkoma Rutgers-Helsinki Workshop
Graph Algebra with Pattern Matching and Aggregation Support 1.
Indexing XML Data Stored in a Relational Database VLDB`2004 Shankar Pal, Istvan Cseri, Gideon Schaller, Oliver Seeliger, Leo Giakoumakis, Vasili Vasili.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
© Copyright 2000 M. Rodriguez-Martinez, All Rights Reserved Automatic Deployment of Application-Specific Metadata and Code in MOCHA Manuel Rodriguez-Martinez.
TECHNIQUES FOR OPTIMIZING THE QUERY PERFORMANCE OF DISTRIBUTED XML DATABASE - NAHID NEGAR.
1 Overview of Database Federation and IBM Garlic Project Presented by Xiaofen He.
Efficient Distribution-Based Event Filtering Annika Hinze, Sven Bittner Institute of Computer Science Freie Universität Berlin {hinze,
NiagaraCQ A Scalable Continuous Query System for Internet Databases Jianjun Chen, David J DeWitt, Feng Tian, Yuan Wang University of Wisconsin – Madison.
NiagaraCQ : A Scalable Continuous Query System for Internet Databases (modified slides available on course webpage) Jianjun Chen et al Computer Sciences.
Event-Condition-Action Rule Languages over Semistructured Data George Papamarkos.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
Hopkins Storage Systems Lab, Department of Computer Science A Workload-Driven Unit of Cache Replacement for Mid-Tier Database Caching Xiaodan Wang, Tanu.
Compiling Mappings to Bridge Applications and Databases Melnik, Adya and Research.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Graph Data Management Lab, School of Computer Science gdm.fudan.edu.cn XMLSnippet: A Coding Assistant for XML Configuration Snippet.
MIDDLEWARE SYSTEMS RESEARCH GROUP Denial of Service in Content-based Publish/Subscribe Systems M.A.Sc. Candidate: Alex Wun Thesis Supervisor: Hans-Arno.
1 Maintaining Semantics in the Design of Valid and Reversible SemiStructured Views Yabing Chen, Tok Wang Ling, Mong Li Lee Department of Computer Science.
Physical Database Design Chapter 6. Physical Design and implementation 1.Translate global logical data model for target DBMS  1.1Design base relations.
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
Angela Bonifati, “Active XQuery”, ICDE Active XQuery A. Bonifati, D. Braga, A. Campi, S. Ceri Politecnico di Milano (Italy)
1 Large-Scale Information Filtering Systems Fatma Ozcan May 9, 2000 University of Maryland, College Park.
Early Profile Pruning on XML-aware Publish- Subscribe Systems Mirella M. Moro, Petko Bakalov, Vassilis J. Tsotras University of California VLDB 2007 Presented.
Sven Bittner and Annika Hinze, 2 November 2005 Talk at the 13th International Conference on Cooperative Information Systems (CoopIS 2005) A Detailed Investigation.
Ranking objects based on relationships Computing Top-K over Aggregation Sigmod 2006 Kaushik Chakrabarti et al.
VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
Module 5: Implementing Merge Replication. Overview Understanding Merge Replication Architecture Implementing Conflict Resolution Planning and Deploying.
Sesame: An Architecture for Storing and Querying RDF Data and Schema Inf. Yasser Ganji Saffar When they were out of sight Ali Baba.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Semantic Caching Method Based on Linear Constraints Yoshiharu Ishikawa and Hiroyuki Kitagawa University of Tsukuba
Chapter 9: Web Services and Databases Title: NiagaraCQ: A Scalable Continuous Query System for Internet Databases Authors: Jianjun Chen, David J. DeWitt,
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
NiagaraCQ : A Scalable Continuous Query System for Internet Databases Jianjun Chen et al Computer Sciences Dept. University of Wisconsin-Madison SIGMOD.
Distributed Archives Interoperability Cynthia Y. Cheung NASA Goddard Space Flight Center IAU 2000 Commission 5 Manchester, UK August 12, 2000.
REED : Robust, Efficient Filtering and Event Detection in Sensor Network Daniel J. Abadi, Samuel Madden, Wolfgang Lindner Proceedings of the 31st VLDB.
Module 5: Managing Content. Overview Publishing Content Executing Reports Creating Cached Instances Creating Snapshots and Report History Creating Subscriptions.
Retele de senzori Curs 2 - 1st edition UNIVERSITATEA „ TRANSILVANIA ” DIN BRAŞOV FACULTATEA DE INGINERIE ELECTRICĂ ŞI ŞTIINŢA CALCULATOARELOR.
WP2: Data Management Gavin McCance University of Glasgow.
NiagaraCQ : A Scalable Continuous Query System for Internet Databases
Efficient Filtering of XML Documents with XPath Expressions
RE-Tree: An Efficient Index Structure for Regular Expressions
Query Optimization.
B. Stegmaier und R. Kuntschke TU München – Fakultät für Informatik
Data Stream Sharing Richard Kuntschke and Alfons Kemper
Presentation transcript:

A Publish & Subscribe Architecture for Distributed Metadata Management Markus Keidl 1 Alexander Kreutz 1 Alfons Kemper 1 Donald Kossmann 2 1 Universität Passau D TU München D München

ICDE Outline Motivation The MDV system The publish & subscribe algorithm Conclusion

ICDE Motivation Resource management in ObjectGlobe Requirements: Large number of clients  3-tier architecture Information close to the clients  caching Up-to-date information

ICDE RDF and RDF Schema RDF = Resource Description Framework W3C Recommendation Defines resources, properties, and values RDF Schema W3C Candidate Recommendation Defines schema of metadata, similar to class hierarchy

ICDE RDF Example: doc.rdf pirates.uni-passau.de 5874 <ServerInformation rdf:ID="info1" memory="92" cpu="600" />

ICDE The MDV System Metadata: RDF and RDF Schema 3-tier Architecture: MDPs, LMRs, and MDV Clients Caching on local tier Up-to-date metadata by using a publish & subscribe mechanism

ICDE Architecture Overview MDP LMR Publish/Subscribe Optimizer Backbone

ICDE Architecture: MDPs MDP = Metadata Provider Backbone of MDPs Sharing the same schema Full Replication of metadata Metadata: globally and publicly available Registration, update, deletion of metadata

ICDE Architecture Overview MDP LMR Publish/Subscribe Optimizer Backbone

ICDE Architecture: LMRs LMR = Local Metadata Repository Metadata Caching of global metadata  publish & subscribe Storing of local metadata  only locally accessible, for sensitive data Query processing Declarative language Cached and local metadata

ICDE Architecture Overview MDP LMR Publish/Subscribe Optimizer Backbone

ICDE Architecture: MDV Clients Pose queries to LMRs Browse metadata at MDPs and LMRs  determine metadata that should be cached Modify subscription rules of LMRs

ICDE Based on subscription rules: LMRs subscribe to metadata (at MDPs) MDPs determine which metadata to publish (to LMRs) Insertion, update, or deletion of metadata  Evaluation The Publish & Subscribe System MDP LMR Publish metadata Subscription rules

ICDE Subscription Rule Language Operators: =, !=,, >=, contains Example: search CycleProvider c register c where c.serverHost contains 'uni-passau.de' and c.serverInformation.memory > 64 Joins Input: document + complete database Publish: resources, not documents

ICDE References Problem: only subscription of CycleProvider resources What’s with ServerInformation resources? search CycleProvider c register c where c.serverHost contains 'uni-passau.de' and c.serverInformation.memory > 64 pirates.uni-passau.de 5874 <ServerInformation rdf:ID="info1" memory="92" cpu="600" />

ICDE References - Solution Augmentation of RDF schema  "user-defined" dangling references Strong references: transmitted together with referencing resource garbage collector deletes superfluous resources at LMR Weak references: never transmitted with referencing resource

ICDE Publish & Subscribe Algorithm Problem: huge set of subscription rule Solution: index on complete set of rules Goal: evaluation of a subset of all subscription rules Based on standard RDBMS technology

ICDE Basic Approach pirates.uni-passau.de 5874 <ServerInformation rdf:ID="info1" memory="92" cpu="600" /> search CycleProvider c register c where c.serverHost contains 'uni- passau.de' and c.serverInformation.memory > 64 RDF DocumentSubscription Rule Set of all subscription rules Decomposition

ICDE Advantages of the algorithm Based on standard RDBMS technology: robustness, scalability, and query abilities Usage of tables, SQL, indexes, optimizer, … Insertions, updates, and deletions Support of joins

ICDE The Filter Algorithm Decomposition of subscription rules Registration of new RDF document: Decomposition of the RDF document Execution of algorithm:  Rules that match new metadata + new metadata Rules  LMRs Notification of these LMRs

ICDE Details of the Algorithm Decomposition into atoms RDF documents Rules  triggering and join rules Evaluation: Determination of affected triggering rules Iterative evaluation of join rules  calculation of transitive closure

ICDE Decomposition of Documents <ServerInformation rdf:ID="info1" memory="92" cpu="600" /> pirates.uni-passau.de 5874

ICDE Table: FilterData ridclasspropertyvalue doc.rdf#host1CycleProviderrdf#subjectdoc.rdf#host1 CycleProviderserverHostpirates.uni-passau.de doc.rdf#host1CycleProviderserverPort5874 doc.rdf#host1CycleProviderserverInformationdoc.rdf#info1 ServerInformationrdf#subjectdoc.rdf#info1 ServerInformationmemory92 doc.rdf#info1ServerInformationcpu600

ICDE Details of the Algorithm Decomposition into atoms RDF documents Rules  triggering and join rules Evaluation: Determination of affected triggering rules Iterative evaluation of join rules  calculation of transitive closure

ICDE Normalization Path expressions are split up Search part contains all classes referenced by the rule Example: search CycleProvider c, ServerInformation s register c where c.serverHost contains 'uni-passau.de' and c.serverInformation = s and s.memory > 64

ICDE Rule Decomposition – Example search ServerInformation s register s where s.memory > 64RuleA search ServerInformation s register s where s.cpu > 500RuleB search CycleProvider c register c where c.serverHost contains 'uni-passau.de' RuleC search CycleProvider c, ServerInformation s register c where c.serverHost contains 'uni-passau.de' and c.serverInformation = s and s.memory > 64 and s.cpu > 500 search RuleA a, RuleB b register a where a = bRuleE search RuleE a, RuleC c register c where c.serverInformation = a RuleF

ICDE Dependency Tree RuleE RuleF ServerInformation RuleA s.memory > 64 ServerInformation RuleB s.cpu > 500 CycleProvider RuleC c.serverHost contains 'uni-passau.de' a = bc.serverInformation = a

ICDE Details of the Algorithm Decomposition into atoms RDF documents Rules  triggering and join rules Evaluation: Determination of affected triggering rules Iterative evaluation of join rules  calculation of transitive closure

ICDE Filter Algorithm - Example rule_idclasspropertyvalue RuleAServerInformationmemory64 RuleBServerInformationcpu500 FilterRulesGT rule_idclasspropertyvalue RuleCCycleProviderserverHostuni-passau.de FilterRulesCON oidclasspropertyvalue doc.rdf#host1CycleProviderrdf#subjectdoc.rdf#host1 CycleProviderserverHostpirates.uni-passau.de doc.rdf#host1CycleProviderserverPort5874 doc.rdf#host1CycleProviderserverInformationdoc.rdf#info1 ServerInformationrdf#subjectdoc.rdf#info1 ServerInformationmemory92 doc.rdf#info1ServerInformationcpu600 FilterData ResultObjects ridrule_id RuleBdoc.rdf#info1 RuleA doc.rdf#info1 RuleC doc.rdf#host1

ICDE Details of the Algorithm Decomposition into atoms RDF documents Rules  triggering and join rules Evaluation: Determination of affected triggering rules Iterative evaluation of join rules  calculation of transitive closure

ICDE Iterative Evaluation RuleE RuleF ServerInformation RuleA s.memory > 64 ServerInformation RuleB s.cpu > 500 CycleProvider RuleC c.serverHost contains ‘uni-passau.de’ a = bc.serverInformation = a RuleARuleBRuleC RuleE RuleF

ICDE Updates and Deletions Filter algorithm only works for new metadata Solution: execute algorithm 3 times

ICDE Related Work - 1 Metadata: Equal Time For Data on the Internet with WebSemantics [Mihaila, Raschid, Tomasic; EDBT '98] MOCHA: A Self-Extensible Database Middleware System for Distributed Data Sources [Rodriguez-Martinez, Roussopoulos; SIGMOD '00] Universal Description, Discovery, and Integration (UDDI) [Ariba, Inc., IBM, Microsoft; Publish/Subscribe: Efficient Matching for Web-Based Publish/Subscribe Systems [Pereira, Fabret, Llirbat, Shasha; CoopIS '00] Matching Events in a Content-Based Subscription System [Aguilera, Strom, Sturman, Astley, Chandra; PODC '99] The SIFT Information Dissemination System [Yan, Garcia-Molina; TODS '99] Efficient Filtering of XML Documents for Selective Dissemination of Information [Altinel, Franklin; VLDB '00]

ICDE Related Work - 2 Continuous Queries: NiagaraCQ: A Scalable Continuous Query System for Internet Databases [Chen, DeWitt, Tian, Wang; SIGMOD '00] Continual Queries for Internet Scale Event-Driven Information Delivery [Liu, Pu, Tang; IEEE TKDE '99] Materialized Views and Semantic Caching: Maintaining Views Incrementally [Gupta, Mumick, Subrahmanian; SIGMOD '93] Efficiently Updating Materialized Views [Blakeley, Larson, Tompa; SIGMOD '86] Semantic Data Caching and Replacement [Dar, Franklin, Jónsson, Srivastava, Tan; VLDB '96]

ICDE Conclusion The MDV System: MDPs, LMRs, and MDV Clients The Publish & Subscribe Algorithm: Decomposition of documents and rules Determination of affected triggering rules Iterative evaluation of join rules