Republishers in a Publish/Subscribe Architecture for Data Streams Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences, Heriot-Watt.

Slides:



Advertisements
Similar presentations
Oyster, Edinburgh, May 2006 AIFB OYSTER - Sharing and Re-using Ontologies in a Peer-to-Peer Community Raul Palma 2, Peter Haase 1 1) Institute AIFB, University.
Advertisements

European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies Grid.
21 Sep 2005LCG's R-GMA Applications R-GMA and LCG Steve Fisher & Antony Wilson.
Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
Peer-to-peer and agent-based computing P2P Algorithms.
WP3 Werner Nutt (Heriot-Watt University) R-GMA – Architecture and Query Mediation 24/4/2003.
Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Distributed Application Management Using PLuSH Jeannie Albrecht, Christopher Tuttle, Alex C. Snoeren, and Amin Vahdat UC San Diego CSE {jalbrecht, ctuttle,
WP3 R-GMA Revisited 23/7/2002 Werner Nutt / Heriot-Watt University.
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
WP6: Grid Authorization Service Review meeting in Berlin, March 8 th 2004 Marcin Adamski Michał Chmielewski Sergiusz Fonrobert Jarek Nabrzyski Tomasz Nowocień.
Chapter 2 The process Process, Methods, and Tools
1 Distributed Monitoring of Peer-to-Peer Systems By Serge Abiteboul, Bogdan Marinoiu Docflow meeting, Bordeaux.
GrIDS -- A Graph Based Intrusion Detection System For Large Networks Paper by S. Staniford-Chen et. al.
The McGraw-Hill Companies, Inc Information Technology & Management Thompson Cats-Baril Chapter 3 Content Management.
So, Jung-ki Distributed Computing System LAB School of Computer Science and Engineering Seoul National University Implementation of Package Management.
Republishing Mechanisms for R-GMA Benefits and Approaches. Talk by: Alasdair Gray Collaborators: Andy Cooke, Lisha Ma, and Werner Nutt Heriot-Watt University.
Submitted by: Shailendra Kumar Sharma 06EYTCS049.
Introduction on R-GMA Shi Jingyan Computing Center IHEP.
Agent Model for Interaction with Semantic Web Services Ivo Mihailovic.
ASG - Towards the Adaptive Semantic Services Enterprise Harald Meyer WWW Service Composition with Semantic Web Services
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
GRID IIII D UK Particle Physics GridPP Collaboration meeting - R.P.Middleton (RAL/PPD) 23-25th May Grid Monitoring Services Robin Middleton RAL/PPD24-May-01.
Professor Michael J. Losacco CIS 1110 – Using Computers Database Management Chapter 9.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
Sensor Data Management Due to the rapid development of internet, Web and cheaper hardware, sensor networks have become common. A sensor is a device which.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Marcelo R.N. Mendes. What is FINCoS? A Java-based set of tools for data generation, load submission, and performance measurement of event processing systems;
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Multi-synchronous Collaborative Semantic Wikis Charbel Rahhal, Hala Skaf-Molli, Pascal Molli, Stéphane Weiss Inria Nancy-Grand Est Nancy University, France.
An information and monitoring system for static and dynamic information about grid resources, applications, networks … RDBMS Servlet aware of API during.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
PROC-1 1. Software Development Process. PROC-2 A Process Software Development Process User’s Requirements Software System Unified Process: Component Based.
Software Life Cycle The software life cycle is the sequence of activities that occur during software development and maintenance.
Aggregate Queries in Peer-to-Peer OLAP Mauricio Minuto Espil Faculty of Engineering Universidad Católica Argentina Alejandro A. Vaisman Computer Science.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
WP3 RGMA Deployment Laurence Field / RAL Steve Fisher / RAL.
Website: Answering Continuous Queries Using Views Over Data Streams Alasdair J G Gray Werner.
WP3 Werner Nutt (Heriot-Watt University) R-GMA – DataGrid’s Monitoring System 1/7/2003.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
Computational Tools for Population Biology Tanya Berger-Wolf, Computer Science, UIC; Daniel Rubenstein, Ecology and Evolutionary Biology, Princeton; Jared.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
Marcelo R.N. Mendes. What is FINCoS? A set of tools for data generation, load submission, and performance measurement of CEP systems; Main Characteristics:
Project Management Overview U08784Software Project Management Rosemary Phillimore.
The impact of R-GMA (upon WP1 and WP4). EDG (Paris) 6 Mar James MagowanImpact of R-GMA Grid Monitoring Architecture (GMA) We use it not only for.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative Sensing and Acquisition Subsystem Arjuna Balasuriya Life Cycle Architecture Review La Jolla,
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
Parallelizing Functional Tests for Computer Systems Using Distributed Graph Exploration Alexey Demakov, Alexander Kamkin, and Alexander Sortov
Supporting Join Queries Talk by: Andy Cooke Collaborators: Alasdair Gray, Lisha Ma, and Werner Nutt Heriot-Watt University.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
APRIL 10, Meeting Agenda  Prototype 2 Goals  Robust Connections Demo  System Diagnostics Tool Demo  Final Prototype Risk Mitigation  Final.
APRIL 10, Meeting Agenda  Prototype 2 Goals  Robust Connections Demo  System Diagnostics Tool Demo  Final Prototype Risk Mitigation  Final.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
The Mediator: What Next? Talk by: Andy Cooke Collaborators: Alasdair Gray, Lisha Ma, and Werner Nutt Heriot-Watt University.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
SONATA: Scalable Streaming Analytics for Network Monitoring
Cluster Resource Management: A Scalable Approach
SDM workshop Strawman report History and Progress and Goal.
A Distribution Network
Basic Grid Projects – Condor (Part I)
RELATIONAL GRID MONITORING ARCHITECHTURE
Resource Allocation for Distributed Streaming Applications
Presentation transcript:

Republishers in a Publish/Subscribe Architecture for Data Streams Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh 6 th July 2005

A.J.G. Gray and W. NuttBNCOD222 Overview  Motivation  Publish/Subscribe Architecture  Query planning

6 th July 2005A.J.G. Gray and W. NuttBNCOD223 Motivation Scenario:  Streams generated by distributed sensors  Users are also distributed  Use data integration to match users to streams For example,  Grid monitoring for logging and bookkeeping  Sensor networks Grid Job progress Bookkeeping Monitoring data

6 th July 2005A.J.G. Gray and W. NuttBNCOD224 Data Streams as Relations  Sensor readings can be viewed as tuples conforming to a relational schema  Example: Network ThroughPut NTP(from,to,tool,psize,latency,timestamp) ('hw','ral','ping',32,11.1, :05:34)

6 th July 2005A.J.G. Gray and W. NuttBNCOD225 Publish/Subscribe Architecture  Local as View Approach Consumers pose a query over the schema to request streams Producers describe their stream using a view on the schema  Queries and views are selections over a single relation Producers Registry Data Streams Consumers

6 th July 2005A.J.G. Gray and W. NuttBNCOD226 Query Planning: Consumer Query S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' C: from = 'hw' Λ psize ≥ 1024 Problem: Approach does not scale to hundreds of producers and consumers.

6 th July 2005A.J.G. Gray and W. NuttBNCOD227 Republishers Provide Scalability S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' R 3 : TRUE R 1 : from = 'hw'R 2 : from = 'ral' C: from = 'hw' Λ psize ≥ 1024

6 th July 2005A.J.G. Gray and W. NuttBNCOD228 Plans Need to be Maintained  Queries are long lived  Set of publishers can change  Query plans should reflect changes  What happens when we add a republisher? remove a republisher?

6 th July 2005A.J.G. Gray and W. NuttBNCOD229 Adding a Republisher: 1 st Attempt S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' R 3 : TRUE R 1 : from = 'hw'R 2 : from = 'ral'R 4 : tool = 'ping' Problem: Republishers disconnected from Producers Cycle in data flow Relevant publishersMaximal relevant Replan R 3 Replan other queriesAdding a new publisher

6 th July 2005A.J.G. Gray and W. NuttBNCOD2210 Desirable Properties for a Hierarchy  Correctness: streams answer queries  Cycle freeness: loops can lead to duplicates  Uniqueness: hierarchy defined for a set of publishers  Local planning: Publishers and Consumers only need to communicate with the Registry

6 th July 2005A.J.G. Gray and W. NuttBNCOD2211 Adding a Republisher: 2 nd Attempt S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' R 3 : TRUE R 1 : from = 'hw'R 2 : from = 'ral'R 4 : tool = 'ping' C: from = 'hw' Λ psize ≥ 1024 Relevant publishers

6 th July 2005A.J.G. Gray and W. NuttBNCOD2212 Removing a Republisher S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' R 3 : TRUE R 1 : from = 'hw'R 2 : from = 'ral'R 4 : tool = 'ping' C: from = 'hw' Λ psize ≥ 1024

6 th July 2005A.J.G. Gray and W. NuttBNCOD2213 Conclusions  Republishers: Allow system to scale Complicate query answering problem  Republishers require special planning  We have developed algorithms that allows the system to adapt to changes in the set of publishers  Full details available in HW Technical Report sp?id=0031

6 th July 2005A.J.G. Gray and W. NuttBNCOD2214 Integrating Data Streams  Local as View Approach Consumers pose a query over the schema to request streams Producers describe their stream using a view on the schema  Queries and views are selections over a single relation

6 th July 2005A.J.G. Gray and W. NuttBNCOD2215 Example S 1 : from = 'hw' Λ tool = 'udp' S 2 : from = 'hw' Λ tool = 'ping' S 3 : from = 'ral' Λ tool = 'ping' S 4 : from = 'ral' Λ tool = 'udp' R 3 : TRUE R 1 : from = 'hw'R 2 : from = 'ral'R 4 : tool = 'ping' C: from = 'hw' Λ psize ≥ 1024