Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.

Slides:



Advertisements
Similar presentations
17 th International World Wide Web Conference 2008 Beijing, China XML Data Dissemination using Automata on top of Structured Overlay Networks Iris Miliaraki.
Advertisements

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Multicasting in Mobile Ad hoc Networks By XIE Jiawei.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Cooperative Overlay Networking for Streaming Media Content Feng Wang 1, Jiangchuan Liu 1, Kui Wu 2 1 School of Computing Science, Simon Fraser University.
SplitStream: High- Bandwidth Multicast in Cooperative Environments Monica Tudora.
MMCN 19 Jan 2005 Ooi Wei Tsang Peer-to-Peer Streaming.
Suphakit Awiphan, Takeshi Muto, Yu Wang, Zhou Su, Jiro Katto
Cobra: Content-based Filtering and Aggregation of Blogs and RSS Feeds Ian Rose 1, Rohan Murty 1, Peter Pietzuch 2, Jonathan Ledlie 1, Mema Roussopoulos.
Resilient Peer-to-Peer Streaming Paper by: Venkata N. Padmanabhan Helen J. Wang Philip A. Chou Discussion Leader: Manfred Georg Presented by: Christoph.
Computer Science 1 ShapeShifter: Scalable, Adaptive End-System Multicast John Byers, Jeffrey Considine, Nicholas Eskelinen, Stanislav Rost, Dmitriy Zavin.
1 Failure Recovery for Priority Progress Multicast Jung-Rung Han Supervisor: Charles Krasic.
SCAN: A Dynamic, Scalable, and Efficient Content Distribution Network Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy,
FeedTree: Sharing Web Micronews with Peer-to-Peer Event Notification D. Sandler, A. Mislove, A. Post, P. Druschel Presented by: Andrew Sutton.
Computer Science ROMA: Reliable Overlay Multicast with Loosely Coupled TCP Connections Gu-In Kwon and John Byers Computer Science Dept. Boston University.
Opportunities and Challenges of Peer-to-Peer Internet Video Broadcast J. Liu, S. G. Rao, B. Li and H. Zhang Proc. of The IEEE, 2008 Presented by: Yan Ding.
©NEC Laboratories America 1 Hui Zhang Samrat Ganguly Sudeept Bhatnagar Rauf Izmailov NEC Labs America Abhishek Sharma University of Southern California.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
CoolStreaming/DONet: A Data- driven Overlay Network for Peer- to-Peer Live Media Streaming INFOCOM 2005 Xinyan Zhang, Jiangchuan Liu, Bo Li, and Tak- Shing.
Scalable Adaptive Data Dissemination Under Heterogeneous Environment Yan Chen, John Kubiatowicz and Ben Zhao UC Berkeley.
Application Layer Multicast
1March -05 Jiangchuan Liu with Xinyan Zhang, Bo Li, and T.S.P.Yum Infocom 2005 CoolStreaming/DONet: A Data-Driven Overlay Network for Peer-to-Peer Live.
Understanding Mesh-based Peer-to-Peer Streaming Nazanin Magharei Reza Rejaie.
An Overlay Multicast Infrastructure for Live/Stored Video Streaming Visual Communication Laboratory Department of Computer Science National Tsing Hua University.
1 An Overlay Scheme for Streaming Media Distribution Using Minimum Spanning Tree Properties Journal of Internet Technology Volume 5(2004) No.4 Reporter.
1 Sonia Fahmy and Minseok Kwon Department of Computer Sciences Purdue University For slides, technical reports, and implementations, please see:
CS218 – Final Project A “Small-Scale” Application- Level Multicast Tree Protocol Jason Lee, Lih Chen & Prabash Nanayakkara Tutor: Li Lao.
1 Routing as a Service Karthik Lakshminarayanan (with Ion Stoica and Scott Shenker) Sahara/i3 retreat, January 2004.
Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.
Department of Computer Science & Engineering The Chinese University of Hong Kong Constructing Robust and Resilient Framework for Cooperative Video Streaming.
Distributed Publish/Subscribe Network Presented by: Yu-Ling Chang.
Communication Part IV Multicast Communication* *Referred to slides by Manhyung Han at Kyung Hee University and Hitesh Ballani at Cornell University.
© 2009 AT&T Intellectual Property. All rights reserved. Multimedia content growth: From IP networks to Medianets Cisco-IEEE ComSoc Webinar. Sept. 23, 2009.
Scalable Reliable Multicast Architecture Wenjun Zeng Computer Science Department University of Missouri-Columbia.
Peer-to-Peer Overlay Networks. Outline Overview of P2P overlay networks Applications of overlay networks Classification of overlay networks – Structured.
1 Distributed Monitoring of Peer-to-Peer Systems By Serge Abiteboul, Bogdan Marinoiu Docflow meeting, Bordeaux.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
Bullet: High Bandwidth Data Dissemination Using an Overlay Mesh.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
Research Interests Georgia Koloniari Computer Science Department University of Ioannina, Greece.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
Overlay Network Physical LayerR : router Overlay Layer N R R R R R N.
Resilient Peer-to-Peer Streaming Presented by: Yun Teng.
1 Resilient and Coherence Preserving Dissemination of Dynamic Data Using Cooperating Peers Shetal Shah, IIT Bombay Kirthi Ramamritham, IIT Bombay Prashant.
Aadil Zia Khan and Shahab Baqai LUMS School of Science and Engineering QoS Aware Path Selection in Content Centric Networks Fahad R. Dogar Carnegie Mellon.
Content-Based Routing in Mobile Ad Hoc Networks Milenko Petrovic, Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 18, 2005 MobiQuitous 2005.
Streaming over Subscription Overlay Networks Department of Computer Science Iowa State University.
2007/03/26OPLAB, NTUIM1 A Proactive Tree Recovery Mechanism for Resilient Overlay Network Networking, IEEE/ACM Transactions on Volume 15, Issue 1, Feb.
MIDDLEWARE SYSTEMS RESEARCH GROUP Adaptive Content-based Routing In General Overlay Topologies Guoli Li, Vinod Muthusamy Hans-Arno Jacobsen Middleware.
Application-Level Multicast Routing Michael Siegenthaler CS 614 – Cornell University November 2, 2006 A few slides are borrowed from Swati Agarwal, CS.
ICDCS Beijing China Routing of XML and XPath Queries in Data Dissemination Networks Guoli Li, Shuang Hou Hans-Arno Jacobsen Middleware Systems Research.
Analysis and algorithms of the construction of the minimum cost content-based publish/subscribe overlay Yaxiong Zhao and Jie Wu
Push Technology Humie Leung Annabelle Huo. Introduction Push technology is a set of technologies used to send information to a client without the client.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
On Reducing Mesh Delay for Peer- to-Peer Live Streaming Dongni Ren, Y.-T. Hillman Li, S.-H. Gary Chan Department of Computer Science and Engineering The.
Peer to Peer Network Design Discovery and Routing algorithms
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
1 FairOM: Enforcing Proportional Contributions among Peers in Internet-Scale Distributed Systems Yijun Lu †, Hong Jiang †, and Dan Feng * † University.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
1 Scalability and Accuracy in a Large-Scale Network Emulator Nov. 12, 2003 Byung-Gon Chun.
Architecture and Algorithms for an IEEE 802
Nuno Salta Supervisor: Manuel Ricardo Supervisor: Ricardo Morla
Towards an Internet-Scale XML Dissemination Service
Network Topologies Charles Warren.
Dynamic Replica Placement for Scalable Content Delivery
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems

P2P: A Delivery Infrastructure Overcast  Application-level multicasting  Build data distribution trees  Adapt to changing network conditions  Inner nodes heavily loaded SplitStream  Load-balancing across all peers  Split content into redundant streams  Redundancy offers resilience to failures

Our Focus Dynamic Application-level Multicast  Single source  Multiple receivers  High-volume data flow (“document streams”)  Dynamic: very large number of “groups”  IP multicast is bad Rigid to deploy Dynamic groups?  “Intelligent” trees on the fly?

Organization Motivation  Data filtering   Distributed YFilter Dynamic multicast  Unstructured overlay network  Metrics  Experiments Summary & future work

Data Filtering Pub-sub systems XML: the “wire format” for data  Web services  RDF Site Summary (RSS) data feeds News Stock ticks  Personalized content delivery Message brokers  Filtering  Transformation  Delivery

YFilter: A Data Filtering Engine Picture blatantly stolen from “Path Sharing and Predicate Evaluation for High-Performance XML Filtering”, Diao et al., TODS 2003

YFilter: Some Numbers Incoming document flow – per second Document sizes – 20KB Subscribers – Lots! Processing bottleneck  50ms per document with 100,000 simple XML path queries Dissemination bottleneck  Thousands of recepients per document – bandwidth needed ~ GbPS Solution: Distributed filtering

Content-Based Routing Embed filtering logic into the network  “XML routers” Overlay topologies (e.g. mesh)  Parent routers hold disjunction of child routers’ queries Streams filtered on the fly Problems  Low network economy – scalability?  Query aggregation challenges

Distributed Hierarchical Filtering Filter Core Clients Recurring theme: dynamic multicast

Peer-to-Peer Result Dissemination Source Clients

Application-Level Dynamic Multicast Each document has a different receiver list Exploit “peers” for dissemination Build trees on the fly  Pass documents wrapped with receiver identities  Each peer contributes a fanout Possibly high delivery delays  Heuristic: Try to minimize tree height Application-level approach: high traffic  Heuristic: Exploit geographical distribution of clients at source

Possible Evaluation Metrics Delivery delay Network economy Document loss Out-of-order delivery

Experimental Setup PlanetLab testbed  Over 200 nodes  1-10 clients per node Document Size:20KB Generation Rate: 1document/second Query Selectivity: 10% Filter Fanout: 2 Filter Host: planetlab1.lcs.mit.edu Client Fanout:  % - Modem  % - DSL  % - Cable

Result 1: Distribution of Delays

Result 2: Scalability

Result 3: Bandwidth Requirements

Exploiting Geographical Distribution of Clients

Result 4: With the optimization

Summary Current filtering engines – processing and bandwidth bottlenecks A possible scheme for distributed filtering  Recurring theme: highly dynamic multicast Application-level multicast  Peer-to-peer delivery  Trees construction on the fly PlanetLab is crazy

Future Work Reliable, dedicated delivery nodes Exploiting query similarity for discovering multicast groups