DISTRIBUTED EVENT AGGREGATION FOR CONTENT-BASED PUBLISH/SUBSCRIBE SYSTEMS Navneet Kumar Pandey 1 Stéphane Weiss 1 Roman Vitenberg 1 Kaiwen Zhang 2 Hans-Arno.

Slides:



Advertisements
Similar presentations
Low Overhead With Speed Aware Routing (LOWSAR) in VANETs By Kannikar Siriwong Na Ayutaya.
Advertisements

An Improved TCP for transaction communications on Sensor Networks Tao Yu Tsinghua University 2/8/
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Supporting Cooperative Caching in Disruption Tolerant Networks
Solving Manufacturing Equipment Monitoring Through Efficient Complex Event Processing Tilmann Rabl, Kaiwen Zhang, Mohammad Sadoghi, Navneet Kumar Pandey,
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Management of Uncertainty in Publish/Subscribe Systems Haifeng Liu Department of Computer Sceince University of Toronto.
1 Message Oriented Middleware and Hierarchical Routing Protocols Smita Singhaniya Sowmya Marianallur Dhanasekaran Madan Puthige.
GrooveSim: A Topography- Accurate Simulator for Geographic Routing in Vehicular Networks 簡緯民 P
CSLI 5350G - Pervasive and Mobile Computing Week 3 - Paper Presentation “RPB-MD: Providing robust message dissemination for vehicular ad hoc networks”
Improving TCP Performance over Mobile Ad Hoc Networks by Exploiting Cross- Layer Information Awareness Xin Yu Department Of Computer Science New York University,
1 Sensor Relocation in Mobile Sensor Networks Guiling Wang, Guohong Cao, Tom La Porta, and Wensheng Zhang Department of Computer Science & Engineering.
Small-Scale Peer-to-Peer Publish/Subscribe
Transactional Mobility in Distributed Content-Based Publish/Subscribe Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese.
Subscription Subsumption Evaluation for Content-Based Publish/Subscribe Systems Hojjat Jafarpour, Bijit Hore, Sharad Mehrotra, and Nalini Venkatasubramanian.
©NEC Laboratories America 1 Hui Zhang Samrat Ganguly Sudeept Bhatnagar Rauf Izmailov NEC Labs America Abhishek Sharma University of Southern California.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
1 On Handling QoS Traffic in Wireless Sensor Networks 吳勇慶.
Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan.
Rendezvous Points-Based Scalable Content Discovery with Load Balancing Jun Gao Peter Steenkiste Computer Science Department Carnegie Mellon University.
Study of Distance Vector Routing Protocols for Mobile Ad Hoc Networks Yi Lu, Weichao Wang, Bharat Bhargava CERIAS and Department of Computer Sciences Purdue.
Hermes: A Distributed Event- Based Middleware Architecture Peter Pietzuch and Jean Bacon 1st DEBS Workshop, Vienna,
Department of Computer Engineering Koc University, Istanbul, Turkey
Fuego Event Service: Towards Modularity in Event Routing Sasu Tarkoma Rutgers-Helsinki Workshop
Real-time Publish/subscribe ECE Expert Topic Lizhong Cao Milenko Petrovic March 6 th,2003.
Alex King Yeung Cheung and Hans-Arno Jacobsen University of Toronto June, 24 th 2010 ICDCS 2010 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
Effects of Routing Computations in Content-Based Routing Networks with Mobile Data Sources Vinod Muthusamy, Milenko Petrovic, Hans-Arno Jacobsen University.
An adaptive framework of multiple schemes for event and query distribution in wireless sensor networks Vincent Tam, Keng-Teck Ma, and King-Shan Lui IEEE.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG MADES - A Multi-Layered, Adaptive, Distributed Event Store Tilmann Rabl Mohammad Sadoghi Kaiwen Zhang Hans-Arno.
Sidewinder A Predictive Data Forwarding Protocol for Mobile Wireless Sensor Networks Matt Keally 1, Gang Zhou 1, Guoliang Xing 2 1 College of William and.
International CANOE Summer School on Events, Publish/Subscribe & Systems, Oslo, 2009 Break-out Sessions Organizer: Hans-Arno Jacobsen August 16 th – 21.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
MIDDLEWARE SYSTEMS RESEARCH GROUP Denial of Service in Content-based Publish/Subscribe Systems M.A.Sc. Candidate: Alex Wun Thesis Supervisor: Hans-Arno.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Navneet Kumar Pandey 1 Stéphane Weiss 1 Roman Vitenberg 1 Kaiwen Zhang 2 Hans-Arno Jacobsen 2 2 University of Toronto 1 University of Oslo Minimizing the.
Supporting Disconnected Operations in Publish/Subscribe Systems Vinod Muthusamy Joint work with Milenko Petrovic, Ioana Burcea, H.-Arno Jacobsen, Eyal.
Routing and Data Dissemination. Outline Motivation and Challenges Basic Idea of Three Routing and Data Dissemination schemes in Sensor Networks Some Thoughts.
Content-Based Routing in Mobile Ad Hoc Networks Milenko Petrovic, Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 18, 2005 MobiQuitous 2005.
Socially-aware pub-sub system for human networks Yaxiong Zhao Jie Wu Department of Computer and Information Sciences Temple University Philadelphia
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
MIDDLEWARE SYSTEMS RESEARCH GROUP Middleware A Policy Management Framework for Content-based Publish/Subscribe Middleware Hans-Arno Jacobsen Department.
ECO-DNS: Expected Consistency Optimization for DNS Chen Stephanos Matsumoto Adrian Perrig © 2013 Stephanos Matsumoto1.
Dynamic Load Balancing in Distributed Content-based Publish/Subscribe Alex K. Y. Cheung & Hans-Arno Jacobsen University of Toronto November 30 th, 2006.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Total Order in Content-based Publish/Subscribe Systems Joint work with: Vinod Muthusamy, Hans-Arno Jacobsen.
Distributed Automatic Service Composition in Large-Scale Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese Academy of.
PRoPHET+: An Adaptive PRoPHET- Based Routing Protocol for Opportunistic Network Ting-Kai Huang, Chia-Keng Lee and Ling-Jyh Chen.
MIDDLEWARE SYSTEMS RESEARCH GROUP Adaptive Content-based Routing In General Overlay Topologies Guoli Li, Vinod Muthusamy Hans-Arno Jacobsen Middleware.
APL: Autonomous Passive Localization for Wireless Sensors Deployed in Road Networks IEEE INFOCOM 2008, Phoenix, AZ, USA Jaehoon Jeong, Shuo Guo, Tian He.
BARD / April BARD: Bayesian-Assisted Resource Discovery Fred Stann (USC/ISI) Joint Work With John Heidemann (USC/ISI) April 9, 2004.
Minimal Broker Overlay Design for Content-Based Publish/Subscribe Systems Naweed Tajuddin Balasubramaneyam Maniymaran Hans-Arno Jacobsen University of.
ICDCS Beijing China Routing of XML and XPath Queries in Data Dissemination Networks Guoli Li, Shuang Hou Hans-Arno Jacobsen Middleware Systems Research.
VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
November 4, 2003Applied Research Laboratory, Washington University in St. Louis APOC 2003 Wuhan, China Cost Efficient Routing in Ad Hoc Mobile Wireless.
Distributed Automatic Service Composition in Large-Scale Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese Academy of.
Securing Broker-Less Publish/Subscribe Systems Using Identity-Based Encryption.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Distributed Ranked Data Dissemination in Social Networks Joint work with: Mo Sadoghi Vinod Muthusamy Hans-Arno.
Cognitive Information Service Basic Principles and Implementation of A Cognitive Inter-Node Protocol Optimization Scheme Dzmitry Kliazovich Fabrizio Granelli.
Congestion Avoidance with Incremental Filter Aggregation in Content-Based Routing Networks Mingwen Chen 1, Songlin Hu 1, Vinod Muthusamy 2, Hans-Arno Jacobsen.
Wireless sensor and actor networks: research challenges Ian. F. Akyildiz, Ismail H. Kasimoglu
Wireless Sensor Network Architectures
Navneet Kumar Pandey1 Stéphane Weiss1 Roman Vitenberg1
Foundations for Highly-Available Content-based Publish/Subscribe Overlays Young Yoon, Vinod Muthusamy and Hans-Arno Jacobsen.
Data-Centric Networking
Small-Scale Peer-to-Peer Publish/Subscribe
Presentation transcript:

DISTRIBUTED EVENT AGGREGATION FOR CONTENT-BASED PUBLISH/SUBSCRIBE SYSTEMS Navneet Kumar Pandey 1 Stéphane Weiss 1 Roman Vitenberg 1 Kaiwen Zhang 2 Hans-Arno Jacobsen 2 2 University of Toronto 1 University of Oslo

Motivation: Intelligent Transport System (ITS) Information providers: road sensors, crowdsourced mobile apps Information seekers: commuters, police, first responders, radio networks etc. 2 Aggregate subscriptions Count number of cars passing a street light per hour Average speed of cars on a road segment per day Non-aggregate subscriptions Accident reports Traffic violation reports

Aggregation in pub/sub 3 Pub/sub is well known for efficient content filtering and dissemination for distributed event sources and sinks. However, pub/sub does not support aggregation, which is required in emerging applications. Our primary objective is to retain the traditional pub/sub focus on low communication cost, while adding support for aggregation.

Contributions: aggregation in pub/sub 4 We propose a framework and baseline approaches for aggregation in content-based pub/sub systems (CBPS). We show how the relative performance of the baseline approaches varies with workload properties. We propose a per-broker distributed adaptive approach.

BIBI P[val,8] A[val, >,4] S[val, >,3] BpBp BqBq BSBS BIBI B Broker Subscription Delivery Tree (SDT) Advertisement-based pub/sub model 5

Comparison with stream processing 6 Aggregation in stream processingAggregation in pub/sub Requires global view of topologyTopology is not known to individual broker nodes Requires a priori knowledge of publication sources Publication sources and sinks are dynamic Needs control layerBrokers are loosely coupled Usually have a static query planSDTs are dynamic and determined by the pub/sub implementation Optimized for continuous data streams Publications come at an irregular rate

Proposed aggregation framework 7 Publication filtering procedure (PFP) Subscription: { RoadID = 101, speed > 10, op=‘avg’, Duration (ω) = 2 hour, shift size (δ) = 1 hour} NWR 3 NWR 1 NWR 2 subscription 1230Time Notification window ranges (NWR) Pub 1 Pub 2 Pub 3 A single publication can participate in several NWRs, even for the same subscription.

Proposed aggregation framework 8 Initial computation procedure (ICP) Publication filtering procedure (PFP) Outgoing messages: { avg(Pub 1, Pub 2, Pub 3 ), avg(Pub 2, Pub 3 ) } Outgoing messages: { avg(Pub 1, Pub 2 ), avg(Pub 2 ), Pub 3 } NWR 3 NWR 1 NWR 2 subscription 1230Time Notification window ranges (NWR) Pub 1 Pub 2 Pub 3 x Processing start time presents a trade-off between communication cost and end-to-end delay.

Proposed aggregation framework 9 Initial computation procedure (ICP) Publication filtering procedure (PFP) Recurrent processing procedure (RPP) BpBp BIBI BqBq Collection delay avg p avg q avg pq Collection delay is another parameter affecting the delay-communication trade-off.

Late aggregation approach 10 BpBp BqBq BsBs P[val,9] P[val,2] P[val,5] P[val,3] S min [val,>,2] P[Val min,3] Messages exchanged in Late aggregation: 6 X X X PFSICP RPP BSBS BIBI Late approach aggregates messages at subscriber-edge brokers.

Early aggregation approach 11 BABA BIBI P[val,9] P[val,2] P[val,5] P[val,3] S min [val,>,2] P[val min,9] P[val min,3] P[val min,3] P[val min,3] PFSICP RPP X X X X X X Messages exchanged in Early aggregation: 3 BpBp BqBq BSBS Messages exchanged in Late aggregation: 6 Early approach aggregates messages at publisher-edge brokers.

Early does not always outperform Late 12 BIBI P[val,9] P[val,2] P[val,5] P[val,3] S max [val,>,2] Late aggregation Messages exchanged: 6 S count [val,>,2] S min [val,>,2] P[val max,5] P[val min,3] P[val count,2] Early aggregation Messages exchanged: 9 12 BpBp BqBq BSBS P[val max,9] P[val min,9] P[val count,1] P[val max,9] P[val min,3] P[val count,3]

Comparison between Early and Late 13 Reducing the communication cost requires an adaptive solution Increasing parameterFavors Publication matching rateEarly Matching number of NWRsLate Overlap among aggregate subscriptionsLate Ratio between aggregate and regular subscriptionsEarly Several parameters affect the performance of our baselines:

Benefits of adaptive aggregation 14 BABA P[val,9] P[val,2] P[val,5] P[val,3] S[val,>,6] S min [val,>,2] P[val min,3] 14 BABA BABA P[val,9] P[val min,3] Late 6 BFBF Early 5 BpBp BqBq BIBI BSBS P[val min,9]

Benefits of adaptive aggregation 15 P[val,9] P[val,2] P[val,5] P[val,3] S[val,>,6] S min [val,>,2] P[val min,3] 15 BABA BABA P[val,9] P[val min,3] Late 6 BqBq Per-broker adaptation reduces communication cost Early 5 Adaptive 4 BpBp BqBq BSBS BIBI BIBI

Adaptation process (MAPE-K) 16 Matching publications within sampling period Changes in subscription set Compare the ratio between Pubs vs. NWRs Estimate the notification rate Choose the suitable mode Transition between aggregate and forward mode Start/stop aggregation at broker Monitor Analyze Plan Execute Information at a broker Registered subscriptions Current execution mode Knowledge

Experimental setup Implemented in Java over the PADRES framework Topology: 16 brokers – Combination of publisher-edge only, subscriber-edge only and mixed brokers Real life datasets: Traffic dataset from the ONE-ITS service 1 Yahoo! Finance Stock dataset Metrics: Number of messages exchanged Processing overhead End-to-end delay 17 B B B B B B B B B B B B B B B B B B B B B B B B B B B B B B B B 1

Results (Stock dataset) 18 Varying Publication/secondVarying number of subscriptions Decision becomes more accurate when available information is sufficient Adaptive aggregation performs close to the best among Early and Late for all settings. Early perform better at high pub rates whereas Late is better with large number of subscriptions.

Results (Traffic dataset) 19 Varying Publication/secondVarying number of subscriptions Per-Broker adaptation can cause individual brokers to make incorrect decisions

Processing overhead (Stock) 20 Predicate matching cost Aggregation-related overhead Adaptation overhead is dominating the aggregation overhead

Conclusions 21 We provide an aggregation framework for CBPS with baseline solutions. We demonstrate that neither baseline is dominant and depends upon workload parameters. We provide a generic adaptive aggregation framework. We experimentally demonstrate that our distributed adaptive solution performs close to the best baseline across all settings.

Thank you! For questions and comments Contact: 22

Motivation: stock market application 23 Information providers: stock exchanges Information seekers: brokers, buyers Non-aggregate subscriptions: Stock value updates Aggregate subscriptions: Stock market indicators (eg. MACD)

Aggregation semantics Window parameters – Window shift size (δ) – Duration (ω) Example – Sliding window: Moving average of the number of cars passing a street light per hour. – Tumbling window: Average speed of cars on a road segment. – Hoping window: Number of cars crossing during rush hour. 24 ω = 2 hour, δ = 1 hour, ω δ ω = δ = 2 hour, ω δ ω = 2 hour, δ = 24 hour, ω δ

Challenges of adaptive deployment Data flow is hard to predict: Irregular event rates at the publishers Dynamic number of subscriptions Coupled with dynamic content matching Brokers function autonomously Compatible solution: Congruent to Pub/Sub routing standards Minimum impact over QoS for regular publications 25

Other experiments End to end delay Sensitivity towards sampling period Sensitivity towards Collection delay 26 please refer our full paper.

Sensitivity analysis: Collection delay 27 Increasing collection time reduces the number of messages but delays the delivery of result.

Publication process flow 28 Timestamp publication if not Matched for aggregatio n Is broker aggregating ? Any regular subscriptio n matched? Enqueue for aggregation computation Send Tag as aggregated No Yes No

Aggregation Basics Notification window Ranges 29 PublicationMatching NWR 2 1 NWR 2 2 NWR 2 3 NWR 2 4 sub 2 NWR 3 1 NWR 3 2 NWR 3 3 sub 1 NWR 3 3 NWR 3 1 NWR 3 2 sub NWR 3 4 Time Sliding Window Tumbling Window Sampling Window

Motivation Pub/Sub is well known for efficient content filtering and dissemination for distributed event source and syncs. Content-based Pub/Sub does not supports time-based aggregation. 30

Pub/Sub systems :- a popular communication paradigm Researches in Pub/sub have traditionally focused on performance than extending functionality. 31 Business process[4] work- flow management[5] stock- market monitoring[3] social interaction[2] network monitoring and management[6] RSS filtering[1]

Event distribution systems such as ITS demand aggregation filters Moving average of the number of cars passing a street light per hour. Average speed of cars on a road segment. Number of cars crossing a highway during rush hour. 32

Scope of our solution Acyclic overlay Broker federated Pub/Sub Advertisement based forwarding model Time based aggregation 33