Topology-Aware Overlay Construction and Server Selection Sylvia Ratnasamy Mark Handley Richard Karp Scott Shenker Infocom 2002.

Slides:



Advertisements
Similar presentations
Virtual Distance: A Generalized Metric for Overlay Tree Construction ISCC12 July Suat Mercan (Zirve University) & Murat Yuksel (University of Nevada,
Advertisements

CAN 1.Distributed Hash Tables a)DHT recap b)Uses c)Example – CAN.
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Peer to Peer and Distributed Hash Tables
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Topologically-Aware Overlay Construction and Server Selection Sylvia Ratnasamy, Mark Handly, Richard Karp and Scott Shenker Presented by Shreeram Sahasrabudhe.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Geometry of large networks (computer science perspective) Dmitri Krioukov (CAIDA/UCSD) AIM, November 2011.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Presented by Elisavet Kozyri. A distributed application architecture that partitions tasks or work loads between peers Main actions: Find the owner of.
A Scalable Content Addressable Network (CAN)
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network (CAN) ACIRI U.C.Berkeley Tahoe Networks.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
A Scalable Content Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, and Scott Shenker Presented by: Ilya Mirsky, Alex.
A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:
Overlay Networks EECS 122: Lecture 18 Department of Electrical Engineering and Computer Sciences University of California Berkeley.
A Scalable Content- Addressable Network Sections: 3.1 and 3.2 Καραγιάννης Αναστάσιος Α.Μ. 74.
1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.
1 An Overlay Scheme for Streaming Media Distribution Using Minimum Spanning Tree Properties Journal of Internet Technology Volume 5(2004) No.4 Reporter.
Topology-Aware Overlay Networks By Huseyin Ozgur TAN.
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network ACIRI U.C.Berkeley Tahoe Networks 1.
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
1 A scalable Content- Addressable Network Sylvia Rathnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker Pirammanayagam Manickavasagam.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
CONTENT ADDRESSABLE NETWORK Sylvia Ratsanamy, Mark Handley Paul Francis, Richard Karp Scott Shenker.
Network Aware Resource Allocation in Distributed Clouds.
Peer to Peer Research survey TingYang Chang. Intro. Of P2P Computers of the system was known as peers which sharing data files with each other. Build.
Applied Research Laboratory David E. Taylor A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker.
Sylvia Ratnasamy (UC Berkley Dissertation 2002) Paul Francis Mark Handley Richard Karp Scott Shenker A Scalable, Content Addressable Network Slides by.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.
Paper Group: 20 Overlay Networks 2 nd March, 2004 Above papers are original works of respective authors, referenced here for academic purposes only Chetan.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Content Addressable Networks CAN is a distributed infrastructure, that provides hash table-like functionality on Internet-like scales. Keys hashed into.
Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
P2P Group Meeting (ICS/FORTH) Monday, 28 March, 2005 A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp,
1 Distributed Hash Table CS780-3 Lecture Notes In courtesy of Heng Yin.
University “Ss. Cyril and Methodus” SKOPJE Cluster-based MDS Algorithm for Nodes Localization in Wireless Sensor Networks Ass. Biljana Stojkoska.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
A configuration method for structured P2P overlay network considering delay variations Tomoya KITANI (Shizuoka Univ. 、 Japan) Yoshitaka NAKAMURA (NAIST,
Two Peer-to-Peer Networking Approaches Ken Calvert Net Seminar, 23 October 2001 Note: Many slides “borrowed” from S. Ratnasamy’s Qualifying Exam talk.
A Simulation-Based Study of Overlay Routing Performance CS 268 Course Project Andrey Ermolinskiy, Hovig Bayandorian, Daniel Chen.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.
CSCI 599: Beyond Web Browsers Professor Shahram Ghandeharizadeh Computer Science Department Los Angeles, CA
Distance Vector Routing
Gang Wang, Shining Wu, Guodong Wang, Beixing Deng, Xing Li Tsinghua University Tsinghua Univ. Oct Experimental Study on Neighbor Selection Policy.
Plethora: A Locality Enhancing Peer-to-Peer Network Ronaldo Alves Ferreira Advisor: Ananth Grama Co-advisor: Suresh Jagannathan Department of Computer.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Lecture 13 – Network Mapping
Virtual Domain and Coordinate Routing in Wireless Sensor Networks
Zhichen Xu, Mallik Mahalingam, Magnus Karlsson
Turning Heterogeneity into an Advantage in Overlay Routing
A Scalable content-addressable network
CONTENT ADDRESSABLE NETWORK
A Scalable, Content-Addressable Network
A Scalable Content Addressable Network
A Scalable, Content-Addressable Network
Presentation transcript:

Topology-Aware Overlay Construction and Server Selection Sylvia Ratnasamy Mark Handley Richard Karp Scott Shenker Infocom 2002

Connections of a node

Introduction Problem: Inefficient routing in large-scale networks In large-scale overlay networks, each node is logically connected to a small subset of other participants. Due to the lack of effort to ensure that application-level connectivity is congruent with underlying IP-level network topology Basic Idea: Optimize routing paths in network Define a binning scheme whereby nodes partition themselves into bins Nodes that fall within a given bin are relatively close to one another in terms of network latency

Outline Introduction Distributed Binning Topologically-aware construction of overlay networks Topologically-aware server selection Conclusion

Extracting proximity information Measuments that can be used to derive topological information: traceroute: intended for network diagnostic purposes, too heavy-weight, excessive load on the network, disabled ICMP at some sites for security BGP routing table: not directly available for end users, requires privilege or third party service Network latency: often a direct indicator of network performance, light-weight, end-to-end measurement, non-intrusive manner s a b c t 2 sec 7 sec 5 sec

Distributed Binning Goal: Have a set of nodes independently partition themselves into disjoint bins Nodes within a single bin are relatively closer to one another than to nodes not in their bin Scheme: A well-known set of machines that act as landmarks on the Internet Form a distributed binning of nodes based-on their relative distances A node measures round-trip-time (RTT) to each landmark and orders landmarks in order of increasing RTT Every node has an associated ordering of landmarks(or bin)

Distributed Binning Scheme: (Cont.) After finding ordering, we calculate absolute values of each RTT in ordering as follows We divide the range of possible latency values into a number of levels. Convert RTT values into level number and obtain a level vector Example: Level ms Level ms Level 2 > 200ms Node As bin becomes l 2 l 3 l 1 :0 1 2 Topologically close nodes likely to have same ordering and belong to same bin l3l3 l1l1 l2l2 A 232 ms 117 ms 57 ms

Distributed Binning Distributed Binning Scheme

Performance of Distributed Binning Even though it is clearly scalable, does it do a reasonable job? Metric used: average inter-bin latency = average latency from a given node to all nodes not in its bin average intra-bin latency = average latency from a given node to all nodes in its bin A higher gain ratio indicates a higger reduction in latency, hence more desirable

Performance of Distributed Binning Datasets or test topologies: TS-10K and TS-1K: Transit-Stub topologies with and 1000 nodes respectively. 2-level hierarchy PLRG1 and PLRG2: Power-Law Random graph with 1166 and 1779 nodes Edge latencies assigned randomly NLANR: Distributed network of over 100 active monitors Systematically perform scheduled measurement between each other

Performance of Distributed Binning Other binning algorithms used in experiments: Random Binning: Each nodes selects a bin at random acts as a lower bound for the gain ratio Nearest Neighbor clustering: Each node is initially assigned to a cluster itself. At each iteration, two closest clusters are merged into a single cluster. The algorithm terminated when the required number of clusters is obtained _

Performance of Distributed Binning Experiments: Effect of number of landmarks (#level=1) Effect of number of levels (#landmarks=12)

Performance of Distributed Binning Experiments: Comparison of different binning techniques(#levels=1)

Topologically-aware construction of overlay networks Two types of overlay networks Structured: Nodes are interconnected in some well-defined manner(Application-level) Unstructured: Much less structured like Gnutella,Freenet Metric for evaluation:

Topologically-sensitive CAN construction Content-Addressable Network Scalable indexing system for large-scale decentralized storage applications on the Internet Built around a virtual multi-dimensional Cartesian coordinate space Entire coordinate space is dynamically partitioned among all the peers, i.e. every peer possesses its individual, distinct zone within the overall space A CAN peer maintains a routing table that holds the IP address and virtual coordinate zone of each of its neighbor coordinates

2D CAN Example x State of the system at time t Peer Resource Zone In this 2 dimensional space, a key is mapped to a point (x,y)

Routing in CAN y Peer Q(x,y) (x,y) d-dimensional space with n zones Routing path of length: Algorithm: Choose the neighbor nearest to the destination Q(x,y) Query/ Resource key

Contribution to CAN Construct CAN topologies that are congruent with underlying IP topology Scheme: With m landmarks, m! such ordering is possible For example, if m=2, then possible orderings are ab and ba We partion the coordinate space into m! equal sized portions, each corresponding to a single ordering Divide the space along first dimension into m portions Each portion is then sub-divided along the second dimension into m-1 portions Each of these are divided into m-2 portion and so on… When a node joins CAN at a random point, the node determines its associated bin based-on delay measurement According to its landmark ordering, it takes place in the correspanding portion of CAN

Gain in CAN using Distributed Binning Stretch for a 2D CAN; topology TS-1K;#levels=1Stretch for a 2D CAN; topology PLRG2;#levels=1

Topologically-aware construction of unstructured overlays Aims much less structured overlay such as Gnutella, Freenet Focusing on the following general problem in unstructured overlays: Optimal overlay is NP-hard, so used some heuristic called Short-Long Given a set of n nodes on the Internet, have each node picks any k neighbor nodes from this set so that the average routing latency on the resultant overlay is low

Topologically-aware construction of unstructured overlays Short-Long Heuristic A node picks its k neighbors by picking k/2 nodes closest to itself and then picks another k/2 nodes at random Well-connected pocket of closest nodes and inter-connections to far pockets with random picks BinShort-Long (Contribution) : A node picks k/2 neighbors at random from its bin and picks remaining k/2 at random Current Node Nearby Nodes Distant Nodes Other Nodes

Gain in Unstructured Overlay using Distributed Binning Unstructured overlays; TS-10K;#levels=1;#landmarks=12

Topology-aware server selection Replication of content over Internet gives rise to the problem of server selection Parameter: Server load and distance(in term of Network Latency) _ Replication Server Client

Topology-aware server selection Server selection process with distributed binning works as follows: If there exist one or more servers within same bin as client, then client is redirected to a random server from its own bin If no server exists within same bin as client, then an existing server whose bin is most similar to clients bin is selected at random Compared performance to 3 schemes: Random: Client selects server at random Hotz Metric: Uses RTT measure from a node to well known landmarks to estimate internode distance (Triangle inequality) Cartesian Distance: Calculates Euclidean distance using level vector of node and selects the server with minimum distance Measurement for evaluation:

Topology-aware server selection Comparison of different schemes under following conditions: 12 landmarks and 3 levels 1000 servers for TS-10K, 100 servers for TS-1K, PLRG1 and PLRG2 and 10 for NLANR

Topology-aware server selection-Node Perspective CDF of latency stretch for NLANR dataCDF of latency stretch for TS-10K data

Conclusion Described a simple,scalable,binning scheme that can be used to infer network proximity information Nature of the underlying network topology affects behavior of the scheme It is applied to the problem of topologically-aware overlay construction and server selection domains Three applications of distributed binning is given: Structured Overlay Unstructured Overlay Server selection A small number of landmarks yields significant improvements. Can be referred as network-level GPS system _

Happy end! Thank you for your patience!