T. S. Eugene Ng Mellon University1 Global Network Positioning: A New Approach to Network Distance Prediction Tze Sing Eugene.

Slides:



Advertisements
Similar presentations
T. S. Eugene Ng Mellon University1 Towards Global Network Positioning T. S. Eugene Ng and Hui Zhang Department of Computer.
Advertisements

Topology-Aware Overlay Construction and Server Selection Sylvia Ratnasamy Mark Handley Richard Karp Scott Shenker Infocom 2002.
A Network Positioning System for the Internet T. S. Eugene Ng and Hui Zhang USENIX 04 Presented By: Imranul Hoque 1.
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Peer to Peer and Distributed Hash Tables
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Ningning HuCarnegie Mellon University1 Optimizing Network Performance In Replicated Hosting Peter Steenkiste (CMU) with Ningning Hu (CMU), Oliver Spatscheck.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Topologically-Aware Overlay Construction and Server Selection Sylvia Ratnasamy, Mark Handly, Richard Karp and Scott Shenker Presented by Shreeram Sahasrabudhe.
Fabián E. Bustamante, 2007 Meridian: A lightweight network location service without virtual coordinates B. Wong, A. Slivkins and E. Gün Sirer SIGCOM 2005.
EL9331 Meridian: A Lightweight Network Location Service without Virtual Coordinates Bernard Wong, Aleksandrs Slivkins, Emin Gun Sirer SIGCOMM’05 ( Slides.
1 Turning Heterogeneity into an Advantage in Overlay Routing Gisik Kwon Dept. of Computer Science and Engineering Arizona State University Published in.
Computer Science ROMA: Reliable Overlay Multicast with Loosely Coupled TCP Connections Gu-In Kwon and John Byers Computer Science Dept. Boston University.
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
NetQuest: A Flexible Framework for Internet Measurement Lili Qiu Joint work with Mike Dahlin, Harrick Vin, and Yin Zhang UT Austin.
Measurement-Based Optimization Techniques for Bandwidth-Demanding Peer-to- Peer Systems T. S. Eugene Ng, Yang-hua Chu, Sanjay G. Rao, Kunwadee Sripanidkulchai.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
Internet Iso-bar: A Scalable Overlay Distance Monitoring System Yan Chen, Lili Qiu, Chris Overton and Randy H. Katz.
CMPE 150- Introduction to Computer Networks 1 CMPE 150 Fall 2005 Lecture 22 Introduction to Computer Networks.
Efficient Hop ID based Routing for Sparse Ad Hoc Networks Yao Zhao 1, Bo Li 2, Qian Zhang 2, Yan Chen 1, Wenwu Zhu 3 1 Lab for Internet & Security Technology,
Overlay Networks EECS 122: Lecture 18 Department of Electrical Engineering and Computer Sciences University of California Berkeley.
Predicting Communication Latency in the Internet Dragan Milic Universität Bern.
1 An Overlay Scheme for Streaming Media Distribution Using Minimum Spanning Tree Properties Journal of Internet Technology Volume 5(2004) No.4 Reporter.
Tesseract A 4D Network Control Plane
Abstract Shortest distance query is a fundamental operation in large-scale networks. Many existing methods in the literature take a landmark embedding.
Mario Čagalj supervised by prof. Jean-Pierre Hubaux (EPFL-DSC-ICA) and prof. Christian Enz (EPFL-DE-LEG, CSEM) Wireless Sensor Networks:
Delay Analysis of Large-scale Wireless Sensor Networks Jun Yin, Dominican University, River Forest, IL, USA, Yun Wang, Southern Illinois University Edwardsville,
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
On the Power of Off-line Data in Approximating Internet Distances Danny Raz Technion - Israel Institute.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
Phoenix: A Weight-Based Network Coordinate System Using Matrix Factorization Yang Chen Department of Computer Science Duke University
N ETWORKed M EDIA L AB. D EPT. OF I NFO. & C OMM., K-JIST DHT Based P2P (Peer-to-Peer) for Exploiting Network Proximity ChanMo Park
Phoenix: Towards an Accurate, Practical and Decentralized Network Coordinate System Yang Chen 1, Xiao Wang 1, Xiaoxiao Song 1, Eng Keong Lua 2, Cong Shi.
IDMaps: A Global Internet Host Distance Estimation Service P. Francis, S. Jamin, C. Jin, Y. Jin, D. Raz, Y. Shavitt, L. Zhang Presenter: Zhenying Liu.
Feb nd IPTPS Lighthouses for Scalable Distributed Location Marcelo Pias UCL Jon Crowcroft CL/Cambridge University Steve Wilbur UCL Tim Harris Cambridge.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
TOMA: A Viable Solution for Large- Scale Multicast Service Support Li Lao, Jun-Hong Cui, and Mario Gerla UCLA and University of Connecticut Networking.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
Communication Paradigm for Sensor Networks Sensor Networks Sensor Networks Directed Diffusion Directed Diffusion SPIN SPIN Ishan Banerjee
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
1 Vivaldi: A Decentralized Network Coordinate System Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris Presented by: Chen Qian.
L-24 Adaptive Applications 1. State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2.
Resilient Overlay Networks Robert Morris Frans Kaashoek and Hari Balakrishnan MIT LCS
WSP: A Network Coordinate based Web Service Positioning Framework for Response Time Prediction Jieming Zhu, Yu Kang, Zibin Zheng and Michael R. Lyu The.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Network Coordinates : Internet Distance Estimation Jieming ZHU
Network Computing Laboratory 1 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published.
Peer-to-Peer Result Dissemination in High-Volume Data Filtering Shariq Rizvi and Paul Burstein CS 294-4: Peer-to-Peer Systems.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
CoopNet: Cooperative Networking
Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
NetQuest: A Flexible Framework for Large-Scale Network Measurement Lili Qiu University of Texas at Austin Joint work with Han Hee Song.
Distance Vector Routing
Gang Wang, Shining Wu, Guodong Wang, Beixing Deng, Xing Li Tsinghua University Tsinghua Univ. Oct Experimental Study on Neighbor Selection Policy.
15-829A/18-849B/95-811A/19-729A Internet-Scale Sensor Systems: Design and Policy Review.
ALTO: A Multi Dimensional Peer Selection Problem IETF 73 Saumitra Das
Matchmaking for Online Games and Other Latency-Sensitive P2P Systems
Lecture 13 – Network Mapping
Vivaldi: A Decentralized Network Coordinate System
Analysis of Network Distance Prediction with Global Network Positioning Mathieu Rodrigue Department of Computer Science University of Hartford 17/09/2018.
Turning Heterogeneity into an Advantage in Overlay Routing
Early Measurements of a Cluster-based Architecture for P2P Systems
CMPE 252A : Computer Networks
A Scalable content-addressable network
CMPE 252A : Computer Networks
A Scalable Content Addressable Network
Presentation transcript:

T. S. Eugene Ng Mellon University1 Global Network Positioning: A New Approach to Network Distance Prediction Tze Sing Eugene Ng Department of Computer Science Carnegie Mellon University

T. S. Eugene Ng Mellon University2 New Challenges Large-scale distributed services and applications –Napster, Gnutella, End System Multicast, etc Large number of configuration choices K participants  O(K 2 ) e2e paths to consider Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley Stanford MIT CMU Berkeley CMU MIT Stanford Berkeley

T. S. Eugene Ng Mellon University3 Role of Network Distance Prediction On-demand network measurement can be highly accurate, but –Not scalable –Slow Network distance –Round-trip propagation and transmission delay –Relatively stable Network distance can be predicted accurately without on-demand measurement –Fast and scalable first-order performance optimization –Refine as needed

T. S. Eugene Ng Mellon University4 Applying Network Distance Napster, Gnutella –Use directly in peer-selection –Quickly weed out 95% of likely bad choices End System Multicast –Quickly build a good quality initial distribution tree –Refine with run-time measurements Key: network distance prediction mechanism must be scalable, accurate, and fast

T. S. Eugene Ng Mellon University5 State of the Art: IDMaps [Francis et al ‘99] A network distance prediction service Tracer HOPS Server A B 50msA/B

T. S. Eugene Ng Mellon University6 IDMaps Benefits Significantly reduce measurement traffic compared to (# end hosts) 2 measurements End hosts can be simplistic

T. S. Eugene Ng Mellon University7 Challenging Issues Scalability –Topology data widely disseminated to HOPS servers –Requires more HOPS servers to scale with more client queries Prediction speed/scalability –Communication overhead is O(K 2 ) for distances among K hosts Prediction accuracy –How accurate is the “Tracers/end hosts” topology model when the number of Tracers is small? Deployment –Tracers/HOPS servers are sophisticated; probing end hosts may be viewed as intrusive

T. S. Eugene Ng Mellon University8 Global Network Positioning (GNP) Model the Internet as a geometric space (e.g. 3-D Euclidean) Characterize the position of any end host with coordinates Use computed distances to predict actual distances Reduce distances to coordinates y (x 2,y 2,z 2 ) x z (x 1,y 1,z 1 ) (x 3,y 3,z 3 ) (x 4,y 4,z 4 )

T. S. Eugene Ng Mellon University9 Landmark Operations Compute Landmark coordinates by minimizing the overall discrepancy between measured distances and computed distances –Cast as a generic multi-dimensional global minimization problem y x Internet (x 2,y 2 ) (x 1,y 1 ) (x 3,y 3 ) L1L1 L2L2 L3L3 L1L1 L2L2 L3L3 Small number of distributed hosts called Landmarks measure inter-Landmark distances

T. S. Eugene Ng Mellon University10 Landmark Operations Landmark coordinates are disseminated to ordinary end hosts –A frame of reference –e.g. (2-D, (L 1,x 1,y 1 ), (L 2,x 2,y 2 ), (L 3,x 3,y 3 ))

T. S. Eugene Ng Mellon University11 Ordinary Host Operations Each ordinary host measures its distances to the Landmarks, Landmarks just reflect pings x Internet (x 2,y 2 ) (x 1,y 1 ) (x 3,y 3 ) (x 4,y 4 ) L1L1 L2L2 L3L3 y L2L2 L1L1 L3L3 Ordinary host computes its own coordinates relative to the Landmarks by minimizing the overall discrepancy between measured distances and computed distances –Cast as a generic multi-dimensional global minimization problem

T. S. Eugene Ng Mellon University12 GNP Advantages Over IDMaps High scalability and high speed –End host centric architecture, eliminates server bottleneck –Coordinates reduce O(K 2 ) communication overhead to O(K*D) –Coordinates easily exchanged, predictions are locally and quickly computable by end hosts Enable new applications –Structured nature of coordinates can be exploited Simple deployment –Landmarks are simple, non-intrusive (compatible with firewalls)

T. S. Eugene Ng Mellon University13 Evaluation Methodology 19 Probes we control –12 in North America, 5 in East Asia, 2 in Europe Select IP addresses called Targets we do not control Probes measure –Inter-Probe distances –Probe-to-Target distances –Each distance is the minimum RTT of 220 pings

T. S. Eugene Ng Mellon University14 Evaluation Methodology (Cont’d) Choose a subset of well-distributed Probes to be Landmarks, and use the rest for evaluation TTTTT P3P3 P1P1 P4P4 P2P2 T (x 1,y 1 ) (x 2, y 2 )

T. S. Eugene Ng Mellon University15 Computing Coordinates Multi-dimensional global minimization problem –Will discuss the objective function later Simplex Downhill algorithm [Nelder & Mead ’65] –Simple and robust, few iterations required f(x) x

T. S. Eugene Ng Mellon University16 Data Sets Global Set 19 Probes 869 Targets uniformly chosen from the IP address space –biased towards always-on and globally connected nodes 44 Countries –467 in USA, 127 in Europe, 84 in East Asia, 39 in Canada, …, 1 in Fiji, 65 unknown Abilene Set 10 Probes are on Abilene 127 Targets that are Abilene connected web servers

T. S. Eugene Ng Mellon University17 Performance Metrics Directional relative error –Symmetrically measure over and under predictions Relative error = abs(Directional relative error) Rank accuracy –% of correct prediction when choosing some number of shortest paths

T. S. Eugene Ng Mellon University18 GNP vs IDMaps (Global)

T. S. Eugene Ng Mellon University19 GNP vs IDMaps (Global)

T. S. Eugene Ng Mellon University20 Why the Difference? IDMaps tends to heavily over-predict short distances Consider (measured  50ms) –22% of all paths in evaluation –IDMaps on average over-predicts by 150 % –GNP on average over-predicts by 30% ???

T. S. Eugene Ng Mellon University21 GNP vs IDMaps (Global)

T. S. Eugene Ng Mellon University22 Abilene Data Set

T. S. Eugene Ng Mellon University23 GNP vs IDMaps (Abilene)

T. S. Eugene Ng Mellon University24 GNP vs IDMaps (Abilene)

T. S. Eugene Ng Mellon University25 GNP vs IDMaps (Abilene)

T. S. Eugene Ng Mellon University26 Basic Questions How to measure model error? How to select Landmarks? How does prediction accuracy change with the number of Landmarks? What is geometric model to use? How can we further improve GNP?

T. S. Eugene Ng Mellon University27 Measuring Model Error is measured distance is computed distance is an error measuring function

T. S. Eugene Ng Mellon University28 Error Function Squared error May not be good because one unit of error for short distances carry the same weight as one unit of error for long distances

T. S. Eugene Ng Mellon University29 More Error Functions Normalized error Logarithmic transformation

T. S. Eugene Ng Mellon University30 Comparing Error Functions 6 Landmarks15 Landmarks Squared Error Normalized Error Logarithmic Transformation

T. S. Eugene Ng Mellon University31 Selecting N Landmarks Intuition: Landmarks should be well separated Method 1: Clustering –start with 19 clusters, one probe per cluster –iteratively merge the two closest clusters until there are N clusters –choose the center of each cluster as the Landmarks Method 2: Find “N-Medians” –choose the combination of N Probes that minimizes the total distance from each not chosen Probe to its nearest chosen Probe Method 3: Maximum separation –choose the combination of N Probes that maximizes the total inter-Probe distances

T. S. Eugene Ng Mellon University32 K-Fold Validation Want more than just one set of N Landmarks to reduce noise Select N+1 Landmarks based on a criterion Eliminate one Landmark to get N Landmarks i.e., N+1 different sets of N Landmarks that are close to the selection criterion

T. S. Eugene Ng Mellon University33 Comparing Landmark Selection Criteria (6 Landmarks) ClusteringN-MediansMax sep. GNP IDMaps

T. S. Eugene Ng Mellon University34 Comparing Landmark Selection Criteria (9 Landmarks) ClusteringN-MediansMax sep. GNP IDMaps

T. S. Eugene Ng Mellon University35 Landmark Placement Sensitivity MaxMinMeanStd Dev GNP IDMaps

T. S. Eugene Ng Mellon University36 Number of Landmarks/Tracers

T. S. Eugene Ng Mellon University37 What Geometric Model to Use? Spherical surface, cylindrical surface –No better than 2-D Euclidean space Euclidean space of varying dimensions

T. S. Eugene Ng Mellon University38 Euclidean Dimensionality

T. S. Eugene Ng Mellon University39 Why Additional Dimensions Help? A A B C D A B C D A,BC,D ISP A B C D dimensional model A B C D 3-dimensional model 1 5

T. S. Eugene Ng Mellon University40 Reducing Measurement Overhead Hypothesis: End hosts do not need to measure distances to all Landmarks to compute accurate coordinates P4P4 (x, y) P2P2 P1P1 P3P3 P5P5 T P6P6

T. S. Eugene Ng Mellon University41 Reducing Measurement Overhead Hypothesis: End hosts do not need to measure distances to all Landmarks to compute accurate coordinates P4P4 (x’, y’) P2P2 P1P1 P3P3 P5P5 T P6P6

T. S. Eugene Ng Mellon University42 Using 9 of 15 Landmarks in 8 Dimensions

T. S. Eugene Ng Mellon University43 Using 9 of 15 Landmarks in 8 Dimensions

T. S. Eugene Ng Mellon University44 Triangular Inequality Violations

T. S. Eugene Ng Mellon University45 Removing Triangular Inequality Violations Remove Target (t) from data if –t in {a, b, c} –(a,c)/((a,b)+(b,c)) > threshold Try two thresholds –2.0; 647 of 869 Targets remain –1.5; 392 of 869 Targets remain –Note: at 1.1, only 19 of 869 Targets remain!!!

T. S. Eugene Ng Mellon University46 Removing Triangular Inequality Violations

T. S. Eugene Ng Mellon University47 Removing Triangular Inequality Violations

T. S. Eugene Ng Mellon University48 Removing Triangular Inequality Violations

T. S. Eugene Ng Mellon University49 Removing Triangular Inequality Violations

T. S. Eugene Ng Mellon University50 Why Not Use Geographical Distance?

T. S. Eugene Ng Mellon University51 Summary Network distance prediction is key to performance optimization in large-scale distributed systems GNP is scalable –End hosts carry out computations –O(K*D) communication overhead due to coordinates GNP is fast –Distance predictions are fast local computations GNP is accurate –Discover relative positions of end hosts

T. S. Eugene Ng Mellon University52 Future Work Understand the capabilities and limitations of GNP Can we learn about the underlying topology from GNP? Is GNP resilient to network topology changes? Can we reduce the number of measured paths while not affecting accuracy? Design better algorithms for Landmark selection Design more accurate models of the Internet Apply GNP to overlay network routing problems Apply GNP to geographic location problems