Rendezvous Points-Based Scalable Content Discovery with Load Balancing Jun Gao Peter Steenkiste Computer Science Department Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
P2PR-tree: An R-tree-based Spatial Index for P2P Environments ANIRBAN MONDAL YI LIFU MASARU KITSUREGAWA University of Tokyo.
Advertisements

Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Min Song 1, Yanxiao Zhao 1, Jun Wang 1, E. K. Park 2 1 Old Dominion University, USA 2 University of Missouri at Kansas City, USA IEEE ICC 2009 A High Throughput.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao, Christine Lv., Edith Cohen, Kai Li and Scott Shenker ICS 2002.
1 Sensor Relocation in Mobile Sensor Networks Guiling Wang, Guohong Cao, Tom La Porta, and Wensheng Zhang Department of Computer Science & Engineering.
Small-world Overlay P2P Network
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #13: P2P and Sensor Networks Shivkumar Kalyanaraman:
Distributed Scalable Content Discovery Based on Rendezvous Points Jun Gao Ph.D. Thesis Proposal Computer Science Department Carnegie Mellon University.
YAPPERS: A Peer-to-Peer Lookup Service over Arbitrary Topology Qixiang Sun Prasanna Ganesan Hector Garcia-Molina Stanford University.
JXTA P2P Platform Denny Chen Dai CMPT 771, Spring 08.
©NEC Laboratories America 1 Hui Zhang Samrat Ganguly Sudeept Bhatnagar Rauf Izmailov NEC Labs America Abhishek Sharma University of Southern California.
Technion –Israel Institute of Technology Software Systems Laboratory A Comparison of Peer-to-Peer systems by Gomon Dmitri and Kritsmer Ilya under Roi Melamed.
Pastry: Scalable, decentralized object location and routing for large-scale peer-to-peer systems Antony Rowstron and Peter Druschel Proc. of the 18th IFIP/ACM.
M ERCURY : A Scalable Publish-Subscribe System for Internet Games Ashwin R. Bharambe, Sanjay Rao & Srinivasan Seshan Carnegie Mellon University.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Internet Indirection Infrastructure Ion Stoica UC Berkeley.
Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan.
Mercury: Scalable Routing for Range Queries Ashwin R. Bharambe Carnegie Mellon University With Mukesh Agrawal, Srinivasan Seshan.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
SkipNet: A Scalable Overlay Network with Practical Locality Properties Nick Harvey, Mike Jones, Stefan Saroiu, Marvin Theimer, Alec Wolman Microsoft Research.
Search and Replication in Unstructured Peer-to-Peer Networks Pei Cao Cisco Systems, Inc. (Joint work with Christine Lv, Edith Cohen, Kai Li and Scott Shenker)
An Authentication Service Against Dishonest Users in Mobile Ad Hoc Networks Edith Ngai, Michael R. Lyu, and Roland T. Chin IEEE Aerospace Conference, Big.
Kyushu University Graduate School of Information Science and Electrical Engineering Department of Advanced Information Technology Supervisor: Professor.
A Distributed Search Service for Peer-to-Peer File Sharing in Mobile Application Presented by Tony Sung On Loy, MC Lab, CUHK IE 1 A Distributed Search.
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 CS 194: Distributed Systems Distributed Hash Tables Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer.
Focus on Distributed Hash Tables Distributed hash tables (DHT) provide resource locating and routing in peer-to-peer networks –But, more than object locating.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Information-Centric Networks07a-1 Week 7 / Paper 1 Internet Indirection Infrastructure –Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Full-Text Search in P2P Networks Christof Leng Databases and Distributed Systems Group TU Darmstadt.
SOS: Security Overlay Service Angelos D. Keromytis, Vishal Misra, Daniel Rubenstein- Columbia University ACM SIGCOMM 2002 CONFERENCE, PITTSBURGH PA, AUG.
Efficient Peer to Peer Keyword Searching Nathan Gray.
Network Computing Laboratory Scalable File Sharing System Using Distributed Hash Table Idea Proposal April 14, 2005 Presentation by Jaesun Han.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Implicit group messaging in peer-to-peer networks Daniel Cutting, 28th April 2006 Advanced Networks Research Group.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
Enabling Peer-to-Peer SDP in an Agent Environment University of Maryland Baltimore County USA.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Kaleidoscope – Adding Colors to Kademlia Gil Einziger, Roy Friedman, Eyal Kibbar Computer Science, Technion 1.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
Adaptive Location Management Model for Manet (ALM) Navid NIKAEIN Christian BONNET
DHT-based unicast for mobile ad hoc networks Thomas Zahn, Jochen Schiller Institute of Computer Science Freie Universitat Berlin 報告 : 羅世豪.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Location Directory Services Vivek Sharma 9/26/2001 CS851: Large Scale Deeply Embedded Networks.
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
Teknik Routing Pertemuan 10 Matakuliah: H0524/Jaringan Komputer Tahun: 2009.
Peer to Peer Network Design Discovery and Routing algorithms
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Design and implementation of an intentional naming system William Adjie-WinotoElliot Schwartz Hari BalakrishnanJeremy Lilley MIT Laboratory for Computer.
An Adaptive Zone-based Storage Architecture for Wireless Sensor Networks Thang Nam Le, Dong Xuan and *Wei Yu Department of Computer Science and Engineering,
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
NCLAB 1 Supporting complex queries in a distributed manner without using DHT NodeWiz: Peer-to-Peer Resource Discovery for Grids Sujoy Basu, Sujata Banerjee,
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Mobile Networks and Applications (January 2007) Presented by J.H. Su ( 蘇至浩 ) 2016/3/21 OPLab, IM, NTU 1 Joint Design of Routing and Medium Access Control.
ROUTING TECHNIQUES IN WIRELESS SENSOR NETWORKS: A SURVEY.
Magdalena Balazinska, Hari Balakrishnan, and David Karger
MIT – Laboratory for Computer Science
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Presentation transcript:

Rendezvous Points-Based Scalable Content Discovery with Load Balancing Jun Gao Peter Steenkiste Computer Science Department Carnegie Mellon University October 24th, 2002 NGC 2002, Boston, USA

Jun GaoCarnegie Mellon University2 Outline  Content Discovery System (CDS)  Existing solutions  CDS system design  Simulation evaluation  Conclusions

Jun GaoCarnegie Mellon University3 Content Discovery System (CDS)  Example: a highway monitoring service  Cameras and sensors monitor road and traffic status  Users issue flexible queries  CDS enables content discovery  Locate contents that match queries  Example services  Service discovery; P2P; pub/sub; sensor networks Snapshot from traffic.com

Jun GaoCarnegie Mellon University4 Comparison of Existing Solutions Searchability Centralized Distributed Registration flooding Query broadcasting Hash-based Graph-based Tree-based Scalability Robustness yes Hierarchical names Look-up yes?no yes no yes no Load balancing yes?no yes? no Design Goals Solutions

Jun GaoCarnegie Mellon University5 CDS Design  Attribute-value pair based naming scheme  Enable searchability  Peer-to-peer system architecture  Robust distributed system  Rendezvous Points-based content discovery  Improve scalability  Load Balancing Matrix  Dynamic balance load

Jun GaoCarnegie Mellon University6 Naming Scheme  Based on Attribute-Value pairs  CN: {a 1 =v 1, a 2 =v 2,..., a n =v n }  Not necessarily hierarchical  Attribute can be dynamic  Searchable via subset matching  Q  CN  Number of matches for a CN is large  2 n -1 Camera ID = 5562 Highway = I-279 Exit = 4 City = Pittsburgh Speed = 25mph Road condition = Icy Highway = I-279 Exit = 4 City = Pittsburgh Q1 Q2 CN1 City = Pittsburgh Speed = 25mph

Jun GaoCarnegie Mellon University7 Distributed Infrastructure  Hash-based overlay substrate  Routing, forwarding, management  Node ID  Hash function H(node)  Application layer publishes contents or issues queries  CDS layer determines where to register contents and send queries  Centralized and network-wide flooding are not scalable  Idea: use a small set of nodes as Rendezvous Points TCP/IP Hash-based Overlay CDS Application N1 N6 N4 N5 N3 N2 N9 N8 N7

Jun GaoCarnegie Mellon University8 RP-based Scheme  Hash each AV-pair to get a set of RPs  | RP | = n  RP node stores names that share the same pair  Maintain with soft state  Query is sent directly to an RP node  Use the least loaded RP  RP node fully resolves locally N3 N5 CN1 N4 N6 CN1: {a1=v1, a2=v2, a3=v3, a4=v4} RP2 CN2 CN2: {a1=v1, a2=v2, a5=v5, a6=v6} N2 N1 RP1 N i  H(a i =v i ) N7 N8 N9 ? Q:{a1=v1, a2=v2}

Jun GaoCarnegie Mellon University9 System Properties  Efficient registration and query  O(n) registration messages; n small  O(m) messages for query with probing  Hashing AV-pair individually ensures subset matching  Query may contain only 1 AV-pair  No inter-RP node communication for query resolution  Tradeoff between CPU and Bandwidth  Load is spread across nodes  Different names use different RP set

Jun GaoCarnegie Mellon University10 Load Concentration Problem  RP node may be overloaded  Some AV-pairs more popular than others  Speed=55mph vs. Speed=95mph  P2P keyword follows Zipf distribution  However, many nodes are underutilized  Intuition: use a set of nodes to share load caused by popular pairs  Challenge: accomplish load balancing in a distributed and self-adaptive fashion AV-pair rank #of names Example Zipf distribution

Jun GaoCarnegie Mellon University11 Load Balancing Matrix (LBM)  Organize nodes into a logical matrix  Each column holds a partition  Rows are replicas of each other  Node IDs are determined by: H(a1=v1, p, r)  N 1 (p,r)  Matrix expands itself to accommodate extra load  Increase P when registration load reaches threshold  Query load   R  1,12,1 3,1 1,22,2 3,2 1,32,3 3,3 CN1 CN2 CN3 CN4 CN3CN4 Q1 Q2 Q3 Q4 Q3 Q4 LBM for AV-pair: {a1=v1} R P

Jun GaoCarnegie Mellon University12 Registration and Query with LBM  Determine LBM size  Register with one random column of each matrix  Compute IDs locally 1,1 1,2 2,1 2,2 1,32,3 CN1:{a1v1, a2v2, a3v3} 0,0 P=?, R=? P=3, R=3 LBM1 LBM2 LBM3 3,1 3,2 3,3 Q:{a1v1, a2v2} Load is balanced within an LBM Registration  Query can be sent to the matrix of any AV-pair  Use LBM with min(P)  Sent to one random node in each column Query

Jun GaoCarnegie Mellon University13 System Properties with LBM  Registration and query cost for one pair increases  O(R) registration messages  O(P) query messages  Matrix size depends on current load  LBM must be kept small for efficiency  Query optimization helps, e.g., large P  small R  Matrix shrinking mechanism  E.g., May query a subset of the partitions  Load on each RP node is upper-bounded  Efficient processing  Underutilized nodes are recruited as LBM expands

Jun GaoCarnegie Mellon University14 Simulation Evaluation  Implement in an event-driven simulator  Each node monitors its registration and query load  Assume Chord-like underlying routing mechanism  Experiment setup  10,000 nodes in the CDS network  10,000 distinct AV-pairs (50 attributes, 200 values/attribute)  Use synthetic registration and query workload  Performance metric: success rate  System should maintain high success rate as load increases

Jun GaoCarnegie Mellon University15 Workload Rank of AV-pairs Number of names Number of queries Registration load Query load

Jun GaoCarnegie Mellon University16 Registration Success Rate Comparison Avg. registration arrival rate (1000 reg/sec) Registration success rate (%) Threshold = 50 reg/sec Poisson arrival P max = ,000 names 1 registration every 10 secs.

Jun GaoCarnegie Mellon University17 Query Success Rate Comparison Avg. query arrival rate (1000q/sec) Query success rate (%) 1 million users, 1 query every 10 secs. Threshold = 200 q/sec Poisson arrival R max = 10

Jun GaoCarnegie Mellon University18 Conclusions  Proposed a distributed and scalable design to the content discovery problem  RP-based approach addresses scalability  Avoid flooding  LBMs improve system throughput  Balance load  Distributed algorithms  Decisions are made locally