On Heterogeneous Overlay Construction and Random Node Selection in Unstructured P2P Networks Presenter: 游創文.

Slides:



Advertisements
Similar presentations
C. Mastroianni, D. Talia, O. Verta - A Super-Peer Model for Resource Discovery Services in Grids A Super-Peer Model for Building Resource Discovery Services.
Advertisements

Peer-to-Peer and Social Networks An overview of Gnutella.
A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
JetStream: Achieving Predictable Gossip Dissemination by Leveraging Social Network Principles Jay A. Patel 1, Indranil Gupta 1, and Noshir Contractor 2.
IEEE ICDCS, Toronto, Canada, June 2007 (LA-UR ) 1 Scale-Free Overlay Topologies with Hard Cutoffs for Unstructured Peer-to-Peer Networks Hasan Guclu.
VDR: Proactive element Conclusions VDR reaches 3.5% more nodes than VDR-R and 9% more nodes than our modified random walk routing strategy (RWR) VDR shows.
1 Routing Techniques in Wireless Sensor networks: A Survey.
Ranveer Chandra , Kenneth P. Birman Department of Computer Science
Symphony: Distributed Hashing in a Small World Gurmeet Singh Manku Mayank Bawa Prabhakar Raghavan Presented by Satpreet Singh.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Unstructured overlays: construction, optimization, applications Anne-Marie Kermarrec Joint work with Laurent Massoulié and Ayalvadi Ganesh.
Small-world Overlay P2P Network
P2p, Spring 05 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems March 29, 2005.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
1 Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes Yunfeng Lin, Ben Liang, Baochun Li INFOCOM 2007.
ZIGZAG A Peer-to-Peer Architecture for Media Streaming By Duc A. Tran, Kien A. Hua and Tai T. Do Appear on “Journal On Selected Areas in Communications,
Peer to Peer File Sharing Huseyin Ozgur TAN. What is Peer-to-Peer?  Every node is designed to(but may not by user choice) provide some service that helps.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:
Making Gnutella-like P2P Systems Scalable Presented by: Karthik Lakshminarayanan Yatin Chawathe, Sylvia Ratnasamy, Lee Breslau, Nick Lanham, and Scott.
Understanding Mesh-based Peer-to-Peer Streaming Nazanin Magharei Reza Rejaie.
The Small World Phenomenon: An Algorithmic Perspective by Anton Karatoun.
Spring Routing & Switching Umar Kalim Dept. of Communication Systems Engineering 06/04/2007.
Scalable and Distributed GPS free Positioning for Sensor Networks Rajagopal Iyengar and Biplab Sikdar Department of ECSE, Rensselaer Polytechnic Institute.
Wide-area cooperative storage with CFS
P2P Course, Structured systems 1 Introduction (26/10/05)
Geographic Routing Without Location Information A. Rao, C. Papadimitriou, S. Shenker, and I. Stoica In Proceedings of the 9th Annual international Conference.
Correctness of Gossip-Based Membership under Message Loss Maxim Gurevich, Idit Keidar Technion.
1 Virtual Direction Routing for Overlay Networks Bow-Nan Cheng Murat Yuksel Shivkumar Kalyanaraman.
Analysis of Simulation Results Andy Wang CIS Computer Systems Performance Analysis.
1 Napster & Gnutella An Overview. 2 About Napster Distributed application allowing users to search and exchange MP3 files. Written by Shawn Fanning in.
Communication (II) Chapter 4
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
IEEE P2P, Aachen, Germany, September Ad-hoc Limited Scale-Free Models for Unstructured Peer-to-Peer Networks Hasan Guclu
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
CCAN: Cache-based CAN Using the Small World Model Shanghai Jiaotong University Internet Computing R&D Center.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
ENERGY-EFFICIENT FORWARDING STRATEGIES FOR GEOGRAPHIC ROUTING in LOSSY WIRELESS SENSOR NETWORKS Presented by Prasad D. Karnik.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
GIA: Making Gnutella-like P2P Systems Scalable Yatin Chawathe Sylvia Ratnasamy, Scott Shenker, Nick Lanham, Lee Breslau (Several slides have been taken.
Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.
A Peer-to-Peer Approach to Resource Discovery in Grid Environments (in HPDC’02, by U of Chicago) Gisik Kwon Nov. 18, 2002.
Data Communications and Networking Chapter 11 Routing in Switched Networks References: Book Chapters 12.1, 12.3 Data and Computer Communications, 8th edition.
Computer Networks Dr. Jorge A. Cobb The Performance of Query Control Schemes for the Zone Routing Protocol.
SRL: A Bidirectional Abstraction for Unidirectional Ad Hoc Networks. Venugopalan Ramasubramanian Ranveer Chandra Daniel Mosse.
MobiQuitous 2007 Towards Scalable and Robust Service Discovery in Ubiquitous Computing Environments via Multi-hop Clustering Wei Gao.
Dual-Region Location Management for Mobile Ad Hoc Networks Yinan Li, Ing-ray Chen, Ding-chau Wang Presented by Youyou Cao.
Eclipse Attacks on Overlay Networks: Threats and Defenses By Atul Singh, et. al Presented by Samuel Petreski March 31, 2009.
Routing Networks and Protocols Prepared by: TGK First Prepared on: Last Modified on: Quality checked by: Copyright 2009 Asia Pacific Institute of Information.
A Framework for Reliable Routing in Mobile Ad Hoc Networks Zhenqiang Ye Srikanth V. Krishnamurthy Satish K. Tripathi.
CS 347Notes081 CS 347: Parallel and Distributed Data Management Notes 08: P2P Systems.
Ad Hoc On-Demand Distance Vector Routing (AODV) ietf
Spring Routing: Part I Section 4.2 Outline Algorithms Scalability.
Network Computing Laboratory Load Balancing and Stability Issues in Algorithms for Service Composition Bhaskaran Raman & Randy H.Katz U.C Berkeley INFOCOM.
1 FairOM: Enforcing Proportional Contributions among Peers in Internet-Scale Distributed Systems Yijun Lu †, Hong Jiang †, and Dan Feng * † University.
Distance Vector Routing
1 Roie Melamed, Technion AT&T Labs Araneola: A Scalable Reliable Multicast System for Dynamic Wide Area Environments Roie Melamed, Idit Keidar Technion.
1 “Hybrid Search Schemes for Unstructured Peer- to-Peer Networks” “Random Walks in Peer-to-Peer Networks” Christos Gkantsidis, Milena Mihail, Amin Saberi.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
William Stallings Data and Computer Communications
Stochastic Streams: Sample Complexity vs. Space Complexity
Pastry Scalable, decentralized object locations and routing for large p2p systems.
On Growth of Limited Scale-free Overlay Network Topologies
Data and Computer Communications
Javad Ghaderi, Tianxiong Ji and R. Srikant
Presentation transcript:

On Heterogeneous Overlay Construction and Random Node Selection in Unstructured P2P Networks Presenter: 游創文

Outline Motivation Relate work Algorithm for graph construction Selection walks on the built graph Experiments Conclusion

Motivation Unstructed P2P and overlay networks –Use random walks to build unstructured graphs and do node selection Requirements to build a good graph and do random selection –Load balance –Heterogeneity –Simplicity –Scalability Design heterogeneous graph building and random node selection algorithms –Practical to deploy and functional over a wide range of requirements

Unstructured P2P overlay network Organize peers in a random graph in flat or hierarchical manners (e.g. super-peers layer) –Random graph construction Two steps involved in function of P2P Systems –Query Phase random node selection on top of the graph –Delivery Phase

Related work Gia –Unstructured file sharing system that uses random walk –Give high capacity nodes higher degrees and more information to store –Not give control over degree or load SCAMP –Build graphs where the average node degree is proportional to the log of the number of nodes –Not give enough control over node degree or load

Related work Araneola –Build regular graphs that could potentially be used for random selection –Doesn’t discuss the case of heterogeneity –Assumption: the existing nodes contacted by newly joining nodes are uniformly picked –Run constant background protocol Law and Siu –A distributed mechanism to construct a regular random graphs –Not handle heterogeneity and is vulnerable to unexpected node departures. SelfLoops & Iterative scaling –Random walk methods not suitable to use for graph construction or to accommodate heterogeneity –Extend by this paper

Initial node discovery For any node that wants to join the graph –know at least one already existing member in the graph Light-weight approach –A small set (of about 10) of the most recently joined nodes

Algorithms for graph construction (requirements) Truly random walk –Selected uniformly randomly –High degree nodes  high probability to be selected –More severe for build walks (Compound itself as the network grows) Add bias to ensure high degree nodes don’t keep collecting more links Heterogeneous requirement –Higher capacity nodes have proportionally higher nodes degrees than lower capacity nodes

Algorithms for graph construction Node i in the graph to establish a fixed number of links Ki, called outlinks, with randomly select nodes in the graph –Lead to nodes obtaining roughly as many inlinks as outlinks A node never has the option of refusing a request to create an inlink –For simplicity –The bias tends to prevent the need for this Counteract the effects of early joiners obtaining more inlinks –Stronger bias –Actively manage each node’s indegrees Nodes with high indegrees to move an inlink to nodes with low indegrees

Taxonomy of Biased walks Biased-halting –Next hop at a node is picked uniformly from all links at the node. –Ended at each node with a random probability Inversely to the degree of the node –Average length could be fixed –Ex: SelfLoops, SCAMP Biased-forwarding –Selection of next hop in the walk Weighted against high degree nodes –# of hops is set at a fixed constant H –InlinkInvProb, TotalInvProb, Iterative Scaling

Taxonomy of Biased walks Tradeoff –Biased-forwarding  state exchange with their neighbors (their degree..) –biased-halting tend to unfairly load high- degree nodes (tends to be forwarded to high- degree nodes)

SelfLoops (SL) Emulate a graph with perfectly uniform node degrees –adding virtual links to oneself (self loops). Original work –not support heterogeneity or provide the needed bias for build walks Modification –For selection walks the virtual degree of each node α its outdegree –For build walks The virtual degree α (outdegree 2 /indegree)

Refresh of SelfLoops (SL) A node discards one of its links and chooses another Steady state –Net change in the expected indegree of i due to the refresh is zero loses an inlink = gains an inlink c*indeg(i) = c’ * (outdeg(i) 2 /indeg(i) ) indeg(i) = c’’ * outdeg(i) c’’ = 1 Much harder to estimate the virtual hop length to achieve a desired average real hop-length –Conservative option is to use a large enough value  results a larger average hop-length

Problems of biased-halting Any given walk can be quite short Hybrid approach –If the expected walk length was h hops –First h/2 hops, use on of the biased- forwarding –Later half, use selfloops –Call this as hybrid TotalInvProb-SelfLoops

Inverse-Probability walks Probability of forwarding a walk to a node –InlinkInvProb, IP α ( outdegree/ indegree), used for build walks –TotalInvProb, or TIP α ( outdegree/ total degree), used for selection walks

Iterative Scaling (IS) Biased-forwarding walk –Each node assigns outgoing and incoming weights to each of its links –Iterative computation across all links Derive the elements of a matrix when the row and column sums are known –The outgoing (incoming) weight of a link the node’s probability (perception of the probability) that it’s picked during a random walk from the other (the other end of the link)

Iterative Scaling (IS) Incoming weight assigned by a to link l = the outgoing weight assigned by b to link l –set Wt A IN (l) = Wt B out (l) Bring the system to a state where at every node both the incoming and outgoing weights add to 1 each. A sufficiently long random walk is equally likely to end at any node. Heterogeneity –Select Ideal probability that a node is selected α its outdegree –Build Ideal probability is α (outdegree 2 /indegree) Update at A –The incoming weight for each link A  B is scaled by the estimated probability of a walk reaching B before the normalization is performed.

N A WNAWNA

Some issues Exchanging neighbor information (IP or IS) –Each node send a message to all of its neighbors every time it experiences a link change –Piggyback the neighbor information on other message –Not good for high-degree graphs Graph refreshes –Periodically remove a outlink and replace it with another randomly –Overhead, graph changes, more complex

SwapLinks (SW) Inspire from Law and Siu Modify It to handle heterogeneity and robust to unexpected node departures Two kinds of walks –OnlyInLinks – each node chooses uniformly randomly among its inlinks only –OnlyOutLinks – each node chooses uniformly randomly among its outlinks only

SwapLinks (SW) Lost a outlink –Relace the outlink with a new neighbor O discovered with an OnlyInLinks build walk Lost an inlink –Check if its indegree < its outdegree If so, use OnlyOutLinks to discover a node I No need to exchange state with neighbors BI Neighbor of I BI

Selection walks How a graph is walked Four ways –TotalInverseProb (TIP), Iterative Scaling (IS), SelfLoop (SL), hybrid TIP-SL (Hyb-TIP-SL)

Experiments setup Static simulation –Fully added or deleted before the next node is added or deleted –No packet loss Two scenarios –Shrink (build then shrink to 25% of its original) –Churn (build then churn, 2N churn events, expected network size is N) Setup –N = 5000, build walk length of 10 hops –Except In the case of heterogeneity, a constant outdegree of 5 at every node 10M selection walks, the graph has M nodes at the time of selection –Look the distribution of the selected nodes and the selection load balance

Four graph-construction techniques (Homogeneous case) Graph with or without refreshes (exception SwapLink) Status update (IS and IP) –1-hop updates of any link change –piggybacking SCAMP, TrueRandom (each node forms 5 (expected) outlinks with distinct uniformly chosen nodes) Load balance under node addition –10 nodes are added and the load placed on previously existing nodes (Repeat 100 times) –Average load per node (AvgBLoad-Add ), Standard deviation of the load ( Dev(BLoad-Add) ) –focus on the load placed on already existing nodes –The same load on the network irrespective of the size of the graph

Four graph-construction techniques (Homogeneous case) Node departure –M/5 nodes randomly –Average load per node AvgBLoad-Kill and Dev(BLoad-Kill)

Result of four graph-construction techniques (Homogeneous case)

Graph Construction under heterogeneity Heterogeneous –Expected outdegree is also 5 Each of the N nodes is a default-degree node with probability 0.5 (outdegree of 5) A heterogeneous node with probability 0.5 (outdegree uniformly randomly from [2, 50]) –Churn or shrink is performed after all nodes have joined and formed all their outlinks. The modifications made –The walk prob indeed work as intened

Graph Construction under heterogeneity Average indegree and the build load grow linearly with the outdegree

Quality of random selection on homogeneous graphs Distribution of the selected nodes, distribution of load imposed by the select walks –Use Standard deviation of hits rate to represent Not employ piggybacking on the selection walks –# of selection walks is comparatively large –Start from a single node, log the number of hits each node receives 1. All nodes  same outdegree of 5 2. # of walks = 10 *current number of nodes in the graph

Quality of random selection on homogeneous graphs Selection load seen by a node –# of selection walks that pass through or end at the node The origins of the walks is distributed across the graph –Load distribution should be uniform

Selection with heterogeneity Quality of selection when nodes have different outdegrees Same setup as graph building under heterogeneity Running random selection walks All selection methos are function satisfactorily well Distribution of selection hits as a function of the outdegree

Scaling to larger sizes Measure # of hops it takes to obtain a random selection distribution whose standard diviation is within 5% of that of true random distribution Graphs are churned before the selections # of selection = 10 * the network size

Scaling to larger sizes Swaplink –builds good graphs even at large scale

Conclusion A mechanism for building random graphs and doing random selection (SwapLinks) –Simple –Scalable –Good control over heterogeneity –Enable the desired random selection by setting of only a single parameter (desired node degree) of each node Future work –Implement and test in a real setting –Compare with a random selection strategy that uses DHTs. –Consider misbehaving nodes –Establishing proximal neighbors (low latency or high BW)