A. Kemper, R. Kuntschke, and B. Stegmaier

Slides:



Advertisements
Similar presentations
Lehrstuhl Informatik III: Datenbanksysteme Astrometric Matching - E-Science Workflow 1 Lehrstuhl Informatik III: 1 Datenbanksysteme 1 Fakultät für Informatik.
Advertisements

Lehrstuhl Informatik III: Datenbanksysteme Grid-based Data Stream Processing in e-Science 1 Richard Kuntschke 1, Tobias Scholl 1, Sebastian Huber 1, Alfons.
Supporting Cooperative Caching in Disruption Tolerant Networks
Jaringan Komputer Lanjut Packet Switching Network.
Schema-based Scheduling of Event Processors and Buffer Minimization for Queries on Structured Data Streams Bernhard Stegmaier (TU München) Joint work with.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hafeeda, Ahsan Habib et al. Presented By: Abhishek Gupta.
Rheeve: A Plug-n-Play Peer- to-Peer Computing Platform Wang-kee Poon and Jiannong Cao Department of Computing, The Hong Kong Polytechnic University ICDCSW.
The Cougar Approach to In-Network Query Processing in Sensor Networks By Yong Yao and Johannes Gehrke Cornell University Presented by Penelope Brooks.
CHEP03 - UCSD - March 24th-28th 2003 T. M. Steinbeck, V. Lindenstruth, H. Tilsner, for the Alice Collaboration Timm Morten Steinbeck, Computer Science.
Hermes: A Distributed Event- Based Middleware Architecture Peter Pietzuch and Jean Bacon 1st DEBS Workshop, Vienna,
P-Grid Presentation by Thierry Lopez P-Grid: A Self-organizing Structured P2P System Karl Aberer, Philippe Cudré-Mauroux, Anwitaman Datta, Zoran Despotovic,
1 04/18/2005 Flux Flux: An Adaptive Partitioning Operator for Continuous Query Systems M.A. Shah, J.M. Hellerstein, S. Chandrasekaran, M.J. Franklin UC.
Institute of Computer and Communication Network Engineering OFC/NFOEC, 6-10 March 2011, Los Angeles, CA Lessons Learned From Implementing a Path Computation.
26 Sep 2003 Transparent Adaptive Resource Management for Distributed Systems Department of Electrical Engineering and Computer Science Vanderbilt University,
Managing Service Metadata as Context The 2005 Istanbul International Computational Science & Engineering Conference (ICCSE2005) Mehmet S. Aktas
Cracow Grid Workshop, October 27 – 29, 2003 Institute of Computer Science AGH Design of Distributed Grid Workflow Composition System Marian Bubak, Tomasz.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Use Cases for High Bandwidth Query and Control of Core Networks Greg Bernstein, Grotto Networking Young Lee, Huawei draft-bernstein-alto-large-bandwidth-cases-00.txt.
Telegraph Status Joe Hellerstein. Overview Telegraph Design Goals, Current Status First Application: FFF (Deep Web) Budding Application: Traffic Sensor.
Chapter 2 Database Environment.
Sep Multiple Query Optimization for Wireless Sensor Networks Shili Xiang Hock Beng Lim Kian-Lee Tan (ICDE 2007) Presented by Shan Bai.
Reliable Web Service Execution and Deployment in Dynamic Environments * Markus Keidl, Stefan Seltzsam, and Alfons Kemper Universität Passau Passau,
Stream Data Operator Ordering  Query Optimization Query Index.
Mobile IP THE 12 TH MEETING. Mobile IP  Incorporation of mobile users in the network.  Cellular system (e.g., GSM) started with mobility in mind. 
P4P: Proactive Provider Assistance for P2P Haiyong Xie Yale University.
Introduction to Machine Learning, its potential usage in network area,
Databases and DBMSs Todd S. Bacastow January 2005.
GridOS: Operating System Services for Grid Architectures
Accelerating Peer-to-Peer Networks for Video Streaming
S. Sudarshan CS632 Course, Mar 2004 IIT Bombay
Efficient Evaluation of XQuery over Streaming Data
Lecture (2).
Presented by: Saurav Kumar Bengani
Klara Nahrstedt Spring 2009
Ad-hoc Networks.
UNIT-V Transport Layer protocols for Ad Hoc Wireless Networks
Location Information Services
Wireless Sensor Network Architectures
Adapting Applications and Platforms
University of Technology
#01 Client/Server Computing
Pervasive Data Access (PDA) Research Group
An overview of Data Streaming
Supporting Fault-Tolerance in Streaming Grid Applications
Chapter 2 Database Environment Pearson Education © 2009.
The Extensible Tool-chain for Evaluation of Architectural Models
StreamGlobe: P2P Stream Sharing
DISTRIBUTED CLUSTERING OF UBIQUITOUS DATA STREAMS
Streaming Sensor Data Fjord / Sensor Proxy Multiquery Eddy
Cognitive Radio Networks
Database Environment Transparencies
Chapter 20 Object-Oriented Analysis and Design
Towards an Internet-Scale XML Dissemination Service
Database Systems Instructor Name: Lecture-3.
Multithreaded Programming
Data-Centric Networking
The Anatomy and The Physiology of the Grid
Deterministic and Semantically Organized Network Topology
B. Stegmaier und R. Kuntschke TU München – Fakultät für Informatik
Resource Allocation for Distributed Streaming Applications
Data Stream Sharing Richard Kuntschke and Alfons Kemper
Quality-aware Middleware
TelegraphCQ: Continuous Dataflow Processing for an Uncertain World
EE 122: Lecture 22 (Overlay Networks)
Database System Architectures
SIENA: Wide-Area Event Notification Service
Kostas Kolomvatsos, Christos Anagnostopoulos
#01 Client/Server Computing
Presentation transcript:

StreamGlobe Adaptive Query Processing and Optimization in Streaming P2P Environments A. Kemper, R. Kuntschke, and B. Stegmaier TU München – Fakultät für Informatik Lehrstuhl III: Datenbanksysteme http://www-db.in.tum.de/research/projects/StreamGlobe I would like to welcome everybody to this talk. My name is Bernhard Stegmaier and I will present the Vision of the StreamGlobe DMS, which is currently developed at our group at the TU München.

Outline Motivation StreamGlobe Current and Future Research Conclusion The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

Exemplary Initial Situation WLAN Network Consists of peers Given or grown topology Data Sources Provide XML data stream Possibly infinite streams (e.g., sensor measurements) User requests Continuous queries Query language XQuery Registered at a peer B A Request a Requests a/ab erklären Request ab Request a 19/04/2019 StreamGlobe

General Traditional Approach Register requests Establish data transfer → Peers may connect arbitrarily Process / Execute requests Routing of streams → Map streams to network B A Request a Request ab Request a 19/04/2019 StreamGlobe

General Traditional Approach (ctd.) Drawbacks Transmission of useless data Redundant transmissions Multiple request evaluation  Network congestion and processing overhead B A 2 1 3 Request a 3 Request ab Request a 19/04/2019 StreamGlobe

Why StreamGlobe? Other Systems / previous work E.g. Cougar, TelegraphCQ, Multicast techniques: Focus on specific aspects (e.g., query optimization) Tailored to specific domains StreamGlobe Contribution is combination of techniques: In-network query processing combined with routing Constitutes a generic infrastructure Independent of domain Efficient data stream transformation and distribution 19/04/2019 StreamGlobe

Outline Motivation StreamGlobe Current and Future Research Conclusion The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

The StreamGlobe Approach Intelligent Routing Multicast routing techniques Data Stream Clustering Push query execution into network Multi-query optimization Reduce network traffic Avoid redundant transmissions Reduce processing cost B A ab a Request a Request a Request ab 19/04/2019 StreamGlobe

Basic Concepts P2P Network Topology Classification of peers No arbitrary communication → Communication via transfer paths No fixed P2P topology Classification of peers Thin-Peers Super-Peers Constitution of a super-peer backbone Hierarchical organization → Speaker-peer responsible for certain subnet 19/04/2019 StreamGlobe

StreamGlobe Peer Architecture Based upon Open Grid Services Architecture (OGSA) Integration similar to OGSA-DAI or OGSA-DQP Layers as grid-services Availability according to peer capabilities Message exchange via RPC and notifications Data stream transfer via direct TCP connections XQuery Subscriptions XML Data Streams register StreamGlobe Interface Optimization Management Metadata Überleitung: I will show some details of the different layers on the next few slides. Query Engine Globus Toolkit 19/04/2019 StreamGlobe

Optimization Goals Achievement Registration of arbitrary subscriptions at any peer Achieve good distribution of data streams Optimize evaluation of many subscriptions Achievement Pushing query execution into the network → (1) and (3) Multiquery optimization → (3) Early filtering of data streams resp. evaluation of subscriptions → (2) Data stream clustering → (2) Since multiquery optimization and data stream clustering are main aspects of StreamGlobe, I will now give a slightly more detailed view on what is done there. 19/04/2019 StreamGlobe

Multi-Query Optimization Performed by speaker-peer Analyze subscriptions and streams Common subqueries Re-usability of streams Based on properties of subscriptions / streams Computes Filters and queries Data stream clustering Execution locations Request a Request a Request ab Query a Filter a Filter b Query ab In this simple example, only queries representing the two request and two filters for each data stream are computed. But, in real life, much more queries will be generated. For example, query ab might get split into several smaller queries, which are re-used by other subscriptions. 19/04/2019 StreamGlobe

Query Execution Basic concepts Evaluation of subscriptions with FluX Streaming evaluation and push-based techniques Preclude unbounded buffering by requiring window constraints Extensibility by means of mobile code Evaluation of subscriptions with FluX Designed for streaming processing of XQuery Event-based extension to XQuery Usage of schema information for buffer minimization → Visit my talk at the VLDB: Tomorrow, Research Session 6: XML(II) 19/04/2019 StreamGlobe

Outline Motivation StreamGlobe Current and Future Research Conclusion The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

Current and Future Research Current Research Optimization techniques Extension of FluX Future Research Quality-of-Service management Explicit load balancing Load shedding techniques Construction of overlay network … 19/04/2019 StreamGlobe

Conclusion StreamGlobe Exploiting in-network query processing capabilities In combination with data stream clustering  Minimization of network traffic Query execution with FluX  Efficient and scalable execution of subscriptions Multi-query optimization  Parallelization and load balancing in the network 19/04/2019 StreamGlobe

Related Work Aberer, Cudré-Mauroux, Datta, Despotovic, Hauswirth, Punceva, Schmidt. “P-Grid: a self-organizing structured P2P system”. SIGMOD Record 32(3), 2003 Arasu, Babcock, Babu, Datar, Ito, Motwani, Nishizawa, Srivastava, Thomas, Varma, Widom. “STREAM: The Stanford Stream Data Manager”. Data Engineering Bulletin 26(1), 2003 Carney, Cetintemel, Cherniack, Convey, Lee, Seidman, Stonebraker, Tatbul, Zdonik. “Monitoring Streams – A New Class of Data Management Applications”. VLDB 2002 Chandrasekaran, Cooper, Deshpande, Franklin, Hellerstein, Hong, Krishnamurthy, Madden, Raman, Reiss, Shah. “TelegraphCQ: Continuous Dataflow Processing for an Uncertain World”. CIDR 2003 Cherniack, Balakrishnan, Balazinska, Carney, Cetintemel, Xing, Zdonik. “Scalable Distributed Stream Processing”. CIDR 2003 Krämer, Seeger. “PIPES – A Public Infrastructure for Processing and Exploring Streams”. SIGMOD 2004 Madden, Shah, Hellerstein, Raman. “Continuously Adaptive Continuous Queries over Streams”. SIGMOD 2002 Sellis. “Multiple-Query Optimization”. TODS 1988 Yang, Garcia-Molina. “Designing a Super-Peer Network”. ICDE 2003 Yao, Gehrke. “The Cougar Approach to In-Network Query Processing in Sensor Networks”. SIGMOD Record 31(3), 2002 19/04/2019 StreamGlobe