Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan.

Slides:



Advertisements
Similar presentations
SkipNet: A Scalable Overlay Network with Practical Locality Properties Nick Harvey, Mike Jones, Stefan Saroiu, Marvin Theimer, Alec Wolman Microsoft Research.
Advertisements

Colyseus: A Distributed Architecture for Online Multiplayer Games
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Peer to Peer and Distributed Hash Tables
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Some Open Problems in Publish/Subscribe Networking David S. Rosenblum Chief Technology Officer PreCache Inc.
“SUB-2-SUB” Self-Organizing Collaborative Content-based Pub/Sub Spyros Voulgaris, Etienne Rivière, Anne-Marie Kermarrec, Maarten van Steen.
Chord: A scalable peer-to- peer lookup service for Internet applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashock, Hari Balakrishnan.
Self-Organizing Hierarchical Routing for Scalable Ad Hoc Networking David B. Johnson Department of Computer Science Rice University Monarch.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Small-Scale Peer-to-Peer Publish/Subscribe
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #13: P2P and Sensor Networks Shivkumar Kalyanaraman:
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 One Torus to Rule Them All: Multi-dimensional Queries in P2P Systems Prasanna Ganesan Beverly Yang Hector Garcia-Molina Stanford University.
Subscription Subsumption Evaluation for Content-Based Publish/Subscribe Systems Hojjat Jafarpour, Bijit Hore, Sharad Mehrotra, and Nalini Venkatasubramanian.
Applications over P2P Structured Overlays Antonino Virgillito.
©NEC Laboratories America 1 Hui Zhang Samrat Ganguly Sudeept Bhatnagar Rauf Izmailov NEC Labs America Abhishek Sharma University of Southern California.
M ERCURY : A Scalable Publish-Subscribe System for Internet Games Ashwin R. Bharambe, Sanjay Rao & Srinivasan Seshan Carnegie Mellon University.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Internet Indirection Infrastructure Ion Stoica UC Berkeley.
Multi-dimensional Range Query in Sensor Networks Xin Li,Young Jim Kim, Ramesh Govindan (University of Southern California ) Wei Hong (Intel Research Lab.
Mercury: Scalable Routing for Range Queries Ashwin R. Bharambe Carnegie Mellon University With Mukesh Agrawal, Srinivasan Seshan.
1 AINA 2006 Wien, April th 2006 DiVES: A DISTRIBUTED SUPPORT FOR NETWORKED VIRTUAL ENVIRONMENTS The IEEE 20th International Conference on Advanced.
Rendezvous Points-Based Scalable Content Discovery with Load Balancing Jun Gao Peter Steenkiste Computer Science Department Carnegie Mellon University.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
SkipNet: A Scalable Overlay Network with Practical Locality Properties Nick Harvey, Mike Jones, Stefan Saroiu, Marvin Theimer, Alec Wolman Microsoft Research.
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek and Hari alakrishnan.
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
Object Naming & Content based Object Search 2/3/2003.
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
Mercury: Supporting Scalable Multi-Attribute Range Queries A. Bharambe, M. Agrawal, S. Seshan In Proceedings of the SIGCOMM’04, USA Παρουσίαση: Τζιοβάρα.
1 CS 194: Distributed Systems Distributed Hash Tables Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer.
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
Focus on Distributed Hash Tables Distributed hash tables (DHT) provide resource locating and routing in peer-to-peer networks –But, more than object locating.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
Chord & CFS Presenter: Gang ZhouNov. 11th, University of Virginia.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Data Distribution Dynamic Data Distribution. Outline Introductory Comments Dynamic (Value based) Data Distribution: HLA Data Distribution Management –Routing.
Content-Based Routing in Mobile Ad Hoc Networks Milenko Petrovic, Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 18, 2005 MobiQuitous 2005.
BitTorrent enabled Ad Hoc Group 1  Garvit Singh( )  Nitin Sharma( )  Aashna Goyal( )  Radhika Medury( )
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Kaleidoscope – Adding Colors to Kademlia Gil Einziger, Roy Friedman, Eyal Kibbar Computer Science, Technion 1.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Rendezvous Regions: A Scalable Architecture for Service Location and Data-Centric Storage in Large-Scale Wireless Sensor Networks Karim Seada, Ahmed Helmy.
MIDDLEWARE SYSTEMS RESEARCH GROUP Adaptive Content-based Routing In General Overlay Topologies Guoli Li, Vinod Muthusamy Hans-Arno Jacobsen Middleware.
M ERCURY : A Scalable Publish-Subscribe System for Internet Games Ashwin R. Bharambe To appear in NetGames’02.
DHT-based unicast for mobile ad hoc networks Thomas Zahn, Jochen Schiller Institute of Computer Science Freie Universitat Berlin 報告 : 羅世豪.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Lecture 12 Distributed Hash Tables CPE 401/601 Computer Network Systems slides are modified from Jennifer Rexford.
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
Evaluation of ad hoc routing over a channel switching MAC protocol Ethan Phelps-Goodman Lillie Kittredge.
D istributed Systems, N etwork Protocols & A pplications Srinivasan Seshan Computer Science Department Carnegie Mellon University.
Chapter 9: Web Services and Databases Title: NiagaraCQ: A Scalable Continuous Query System for Internet Databases Authors: Jianjun Chen, David J. DeWitt,
NCLAB 1 Supporting complex queries in a distributed manner without using DHT NodeWiz: Peer-to-Peer Resource Discovery for Grids Sujoy Basu, Sujata Banerjee,
Challenge: Peers on Wheels – A Road to New Traffic Information Systems Jedrzej Rybicki, Björn Scheuermann, Wolfgang Kiess Christian Lochert, Pezhman Fallahi,
Architecture and Algorithms for an IEEE 802
EE 122: Peer-to-Peer (P2P) Networks
A Case for Mutual Notification
Small-Scale Peer-to-Peer Publish/Subscribe
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Presentation transcript:

Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan

Carnegie Mellon University Outline Publish-subscribe systems Subscription Languages Routing Protocols M ERCURY architecture Preliminary evaluation Scalability Performance Future work

Carnegie Mellon University Publish-subscribe systems Challenges Subscription language - how to express “interests”? Routing mechanism - how is content routed and where is it matched? Matcher S P ? Publication Subscription Matched Publication

Carnegie Mellon University Subscription language How to express interests? Channels or Subjects All content attributes Operators? Exact matches Range queries Regular expressions Subject = MATH Name = ‘A’ & Age < 25 Price = 350$ 300$ ≤ Price ≤ 350$ Name ~= Ash[win]*B*

Carnegie Mellon University M ERCURY subscription language Subscription = list of (type, attribute, rel-op, value) Can implement boolean expressions (AND/ORs) ( int, price, LESS_THAN, 300 ) ( string, name, EQUALS, “Opensig” ) Publication = list of (type, attribute, value)

Carnegie Mellon University Example: Virtual reality User x ≥ 50 x ≤ 150 y ≥ 150 y ≤ 250 Interests x 100 y 200 Events (100,200) (150,150) (50,250) Arena Virtual World

Carnegie Mellon University Routing mechanism Centralized? Easy to make publications “meet” subscriptions Single point of failure – not robust! Distributed? Where are subscriptions stored? How do publications “meet” subscriptions? Broadcast-based solutions not scalable

Carnegie Mellon University Distributed routing - goals Scalability is a key goal Flooding anything is bad, bad, bad… System should not have hot-spot in terms of: Computational load – matching ) Subscriptions should be evenly distributed Number of packets routed or received ) Publications should be evenly distributed Yet – we should have low delivery delays!!

Carnegie Mellon University Previous work Systems like Scribe use DHTs for scalability Why can’t we ? Exact matches vs. Range queries! int x  1 int x  10 hash Subscription int x = 9 hash Publication ? How about generating 10 subscriptions ? Too many subscriptions Works for discrete-valued attributes only

Carnegie Mellon University Attribute Hubs Divide range of an attribute into bins Each node responsible for range of attribute values H price [240, 320) [80, 160) [160, 240) [0, 80) Attribute Hub Hub-nodes connected through a circular overlay Circle only for connectivity One hub per attribute Routing algo compare value in content to my range

Carnegie Mellon University Routing illustrated Send subscription to any one attribute hub Send publications to all attribute hubs [0, 80) [210, 320) 50 ≤ x ≤ ≤ y ≤ 250 x 100 y 200 HxHx [240, 320) [80, 160) [160, 240) HyHy [0, 105) [105, 210) Subscription Publication Rendezvous point

Carnegie Mellon University Efficient routing Reduce number of hops Each hub-node maintains “small” number of pointers to distant parts of the hub How to maintain these pointers? Send ACKs for publication receipts Various caching policies determine the structure of the pointer table e.g., LRU, Uniform-spacing, Exponential-spacing

Carnegie Mellon University Routing illustrated [0, 80) [240, 320) [80, 160) [160, 240) HxHx x 100 y 200 Publication [105, 210) HyHy [210, 320) [0, 105) ACK

Carnegie Mellon University Evaluation Workload Experimental setup Metrics

Carnegie Mellon University Workload One of our target apps  multi-player games Model Virtual world as square Subscriptions as rectangles around current positions

Carnegie Mellon University Experimental setup Player movements simulated using mobility models from ns-2 Two hubs – x and y co-ordinates Half the nodes in each hub Uniform partition of range

Carnegie Mellon University Metrics Scalability metric  load Number of publications routed by a node Averaged over time Performance metric  publication delivery delay Time between sending of a publication and its receipt by all subscribers Averaged over all subscribers of a publication Averaged over all publications

Carnegie Mellon University Results: scalability Ideal graph: delta function Observed variation: ±12%

Carnegie Mellon University Results: performance Without caching: linear scaling Caching reduces delays to near optimal Workload effects ? Cache size: log(n)

Carnegie Mellon University Conclusions Expressive subscription language Decentralized architecture Scalability Avoids flooding of subscriptions and publications – reduces network traffic Distributes publications and subscriptions throughout the network – prevents swamping

Carnegie Mellon University Future Work Load balancing Sensitive to data value distribution Adapt ranges dynamically according to the distribution Affects pointer management, caching, etc. x Pr(X=x)

Carnegie Mellon University Future Work Perform sensitivity analysis for different kinds of workloads Generic API for building applications on top of M ERCURY To be released soon Build a full-fledged distributed Quake-II