Internet Iso-bar: A Scalable Overlay Distance Monitoring System Yan Chen, Lili Qiu, Chris Overton and Randy H. Katz.


Motivations
Applications of end-to-end distance monitoring/estimation
– Overlay routing/location
– Peer-to-peer systems
– VPN management/provisioning
– Service redirection/placement
– Cache-infrastructure configuration
Requirements for an E2E distance monitoring system
– Scalable: a small amount of probing traffic and system load
– Accurate: capture congestion/failures + latency estimation
– Fast: small computation for real-time estimation
– Incrementally deployable
– Easy to use
Benefit applications
– Application-driven measurement
– Inference techniques for troubleshooting, root cause analysis
– Improve application performance and reliability

E2E Estimation/Monitoring Systems Comparison
Systems compared: GNP, Akamai, IDMaps, RON, Internet Iso-bar
Notation: N hosts, AP address prefixes, K landmarks, C clusters; N > AP ≫ C, C ≥ K
Dynamic monitoring
– GNP: static estimation
– Akamai, IDMaps, RON, Internet Iso-bar: yes
Scalability
– GNP: O(NK) probes, each landmark takes O(N)
– Akamai: O(F·AP) probes, F = number of CDN edge server farms
– IDMaps: clustering needs pair-wise distances between all pairs of APs, O(C^2 + AP) probes
– RON: O(N^2) probes
– Internet Iso-bar: O(C^2 + N) probes
Estimation accuracy
– GNP: accurate, but only symmetric distance
– Akamai: no existing comparison
– IDMaps: inaccurate, because of the triangle inequality & proximity-based clustering
– RON: exact measurements, hence most accurate
– Internet Iso-bar: similar accuracy to GNP
Monitors deployment
– GNP: end hosts
– Akamai: CDN edge servers
– IDMaps: transit AS's (hard to deploy)
– RON, Internet Iso-bar: end hosts

Problem Formulation
Given N end hosts, how to select a subset of them as monitors and build a scalable overlay distance monitoring service without knowing the underlying topology?
Distance info desired: report congestion/failure if it occurs, otherwise latency

E2E Congestion/Failures Analysis
Based on the National Laboratory for Applied Network Research (NLANR) AMP data set
– 104 sites in the US (including Alaska, Hawaii) & Australia; every host pings all other hosts every minute
– Sliding window of 10 samples, use minimum RTT as latency sample
– 105M measurements, 6/25/01 – 7/1/01
– Congestion/failures (uniformly denoted as congestion) defined as measurement “loss” or (latency > geo mean × geo stdev)
Congestion is not common: only 0.96% of samples
A few congested links dominate the E2E congestion
– Apart from congestion at the last mile, E2E congestion exhibits strong spatial correlation
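The congestion definition on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a lost probe is represented as `None`, a window in which every probe was lost is treated as a loss, the geometric standard deviation is taken as exp of the standard deviation of the log-latencies, and all function names are hypothetical rather than taken from the authors' tooling.

```python
import math

def latency_samples(rtts, window=10):
    """Minimum RTT over each sliding window; None marks a lost probe.

    Assumption: a window with no successful probe yields None (a loss).
    """
    samples = []
    for end in range(window, len(rtts) + 1):
        valid = [r for r in rtts[end - window:end] if r is not None]
        samples.append(min(valid) if valid else None)
    return samples

def geo_mean_and_stdev(values):
    """Geometric mean and geometric standard deviation of positive values."""
    logs = [math.log(v) for v in values]
    mu = sum(logs) / len(logs)
    var = sum((x - mu) ** 2 for x in logs) / len(logs)
    return math.exp(mu), math.exp(math.sqrt(var))

def is_congested(sample, geo_mean, geo_stdev):
    """Loss, or latency above geo mean x geo stdev, counts as congestion."""
    return sample is None or sample > geo_mean * geo_stdev
```

For example, with a geometric mean of 100 ms and a geometric standard deviation of 10, a 1001 ms sample would be flagged while a 999 ms sample would not.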

NLANR AMP Sites

Internet Iso-bar Procedures
1. Cluster hosts that perceive similar performance to a small set of sites (landmarks)
2. For each cluster, select a monitor for active and continuous probing
3. Estimate distance between any pair of hosts using inter- and intra-cluster distance

Internet Iso-bar (I): Host Clustering
Define correlation distance between each pair of hosts
– Existing work uses network proximity: cor_dist(i,j) = net_dist(i,j) (denoted p_ij)
– Iso-bar uses the network distance vector (k landmarks, for clustering only): netV_i = [p_i1, p_i2, …, p_ik]^T
    Euclidean distance based: cor_dist(i,j) = ||netV_i − netV_j||
    Cosine vector similarity based: cor_dist(i,j) derived from cos(netV_i, netV_j) = (netV_i · netV_j) / (||netV_i|| ||netV_j||)
Apply generic clustering methods
– Optimize the worst case: minimize the maximum radius of all clusters (limit_num_minRmax)
– Optimize the average case: minimize the sum of total host-monitor distance (limit_num_minDistSum)
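A small Python sketch may help make the vector-based correlation distance concrete. The two measures follow their standard definitions. The clustering routine is a generic farthest-point (greedy k-center) heuristic that targets the same "minimize the maximum radius" objective; it is a stand-in chosen for illustration, not the slides' limit_num_minRmax algorithm, and all function names are hypothetical.

```python
import math

def euclidean_dist(u, v):
    """Euclidean distance between two network distance vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def cosine_similarity(u, v):
    """Cosine of the angle between two network distance vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def greedy_k_center(vectors, num_clusters, dist=euclidean_dist):
    """Farthest-point heuristic (assumed stand-in, not the authors' method).

    Returns (centers, assignment): indices of the chosen cluster centers and,
    for each host, the index of the center it is assigned to.
    """
    centers = [0]  # arbitrary starting host
    while len(centers) < num_clusters:
        # Pick the host farthest from its nearest existing center.
        farthest = max(
            range(len(vectors)),
            key=lambda h: min(dist(vectors[h], vectors[c]) for c in centers),
        )
        centers.append(farthest)
    assignment = [
        min(centers, key=lambda c: dist(vectors[h], vectors[c]))
        for h in range(len(vectors))
    ]
    return centers, assignment

# Each row: one host's latencies (ms) to two hypothetical landmarks.
vectors = [[10, 80], [12, 82], [80, 10], [78, 12]]
centers, assignment = greedy_k_center(vectors, 2)
```

In this toy input, hosts 0 and 1 see nearly the same latencies to both landmarks, so they land in one cluster, and hosts 2 and 3 land in the other.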

Diagram of Internet Iso-bar: end hosts grouped into Cluster A, Cluster B, and Cluster C, with landmarks

Diagram of Internet Iso-bar: Clusters A, B, and C with end hosts, monitors, and landmarks; distance probes from each monitor to its hosts and distance probes among monitors

Internet Iso-bar (II): Distance Estimation
Intra-cluster estimation (hosts i and j share monitor m)
– If path(m, i) or path(m, j) is congested, report path(i, j) as congested
– Otherwise pDist(i, j) = (mDist(m, i) + mDist(m, j)) / 2
Inter-cluster estimation (host i has monitor m_i, host j has monitor m_j)
– If path(m_i, i), path(m_i, m_j) or path(m_j, j) is congested, report path(i, j) as congested
– Otherwise pDist(i, j) = mDist(m_i, m_j)
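The two estimation rules translate almost directly into code. The sketch below assumes hypothetical data structures: `monitor_of` maps each non-monitor host to its cluster monitor, `measured` maps an ordered (from, to) pair to the monitor-measured distance, and `congested` is the set of probed paths currently flagged as congested. A symmetric deployment would need both orderings of each monitor pair in `measured`.

```python
CONGESTED = "congested"

def estimate_distance(i, j, monitor_of, measured, congested):
    """Estimate the distance between non-monitor hosts i and j.

    Returns CONGESTED if any monitored path on the way is congested,
    otherwise the predicted latency.
    """
    m_i, m_j = monitor_of[i], monitor_of[j]
    same_cluster = m_i == m_j

    # Paths whose state decides whether path(i, j) is reported as congested.
    if same_cluster:
        paths = [(m_i, i), (m_i, j)]
    else:
        paths = [(m_i, i), (m_i, m_j), (m_j, j)]
    if any(p in congested for p in paths):
        return CONGESTED

    if same_cluster:
        # Intra-cluster: average of the two monitor-to-host distances.
        return (measured[(m_i, i)] + measured[(m_i, j)]) / 2
    # Inter-cluster: distance between the two monitors.
    return measured[(m_i, m_j)]

monitor_of = {"a": "m1", "b": "m1", "c": "m2"}
measured = {
    ("m1", "a"): 20.0,
    ("m1", "b"): 30.0,
    ("m2", "c"): 10.0,
    ("m1", "m2"): 70.0,
}
intra = estimate_distance("a", "b", monitor_of, measured, set())
inter = estimate_distance("a", "c", monitor_of, measured, set())
```

Note that the inter-cluster estimate ignores the host-to-monitor legs entirely, which is why the quality of the clustering (hosts close to their monitor in the distance-vector sense) matters for accuracy.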

Evaluation Methodology
Internet measurement data
– NLANR AMP data set
    Clustering with geometric mean of training date
    Estimation dates: 6/25/01 – 7/24/01, 12/06/01
– Keynote CDN measurement data
    63 agents covering all major ISPs in US, Europe, Asia & Australia
    2 targets (CDN re-directors) in Boston and Texas
    Measure TCP connection time (2/3 of handshake) from each agent to target every minute
    Training date: 10/21/2002
    Estimation dates: 10/21/2002 – 11/25/2002
Similar latency estimation results for both datasets; NLANR results are presented

Evaluation Methodology (II)
Estimation metrics
– Relative accuracy error for un-congested latency
– Stability
– For dynamic monitoring systems, amount of congestion captured and false positive ratio
Internet distance estimation techniques evaluated
– Omniscient: use g-mean data of (source, dest) on training date
– Global Network Positioning (GNP)
– Clustering with network distance vector (Iso-bar)
– Clustering with network proximity
15 clusters vs. 15 landmarks of GNP

Latency Prediction Accuracy & Stability
Training date: 06/25/01
Estimation dates: 06/25/ /06/01
Summary of the 90th percentile relative error for various distance estimation methods

Distance Estimation Results
Latency estimation when un-congested
– Omniscient is the most accurate, but unscalable
– GNP and Iso-bar come second
    Both have good accuracy and stability for distance estimation
    GNP is unscalable for online monitoring (static approach)
– Iso-bar outperforms proximity-based clustering by 50%
    90th percentile < 0.5; if 60ms latency, 45ms < prediction < 90ms
Congestion/failures estimation
– 6/25/01 – 7/01/01: on average 148K congested measurements per day
– Iso-bar captures 78% of them, with a 32% false positive ratio
– Only 3% of the monitoring overhead of RON
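As a rough sanity check on the overhead figure, the probe-count expressions from the comparison slide can be evaluated for the NLANR setup (104 hosts, 15 clusters). This is a back-of-the-envelope sketch: it treats the O(·) expressions as exact counts with constant factor 1, which is an assumption, and the slides do not state how the 3% figure was actually derived.

```python
def ron_probes(n_hosts):
    """Full-mesh probing, O(N^2), taken here with constant factor 1."""
    return n_hosts ** 2

def isobar_probes(n_hosts, n_clusters):
    """Monitor-to-monitor plus monitor-to-host probing, O(C^2 + N)."""
    return n_clusters ** 2 + n_hosts

ratio = isobar_probes(104, 15) / ron_probes(104)  # 329 / 10816
print(f"{ratio:.1%}")  # prints 3.0%
```

Under these assumptions the ratio comes out close to the 3% quoted on the slide.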

Conclusions
Propose Internet Iso-bar
    Cluster hosts based on network similarity
    Inter- and intra-cluster latency estimation w/ first-step heuristic for congestion/failure detection
Preliminary results promising
– High accuracy & stability for normal latency estimation
– Simple heuristic for congestion estimation captures 78% of congestion, with 32% false positives, and only 3% of the monitoring overhead of RON

Ongoing Work
Current focus switches from latency estimation to congestion/failures estimation
– Apply topology information, e.g. lossy link detection with network tomography
– Cluster and choose monitors based on the lossy links
Benefit applications
– Dynamic node join/leave for P2P systems
    Joining client pings landmark sites to get its distance vector, compares it with those of the monitors, and chooses the closest one to join
    Split/merge clusters
– Multi-path selection
More comprehensive evaluation
– Simulate with a large network
– Deploy on PlanetLab, and operate at a finer level
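The node-join step described above could look roughly like the following. It is a minimal sketch, assuming that "closest" means smallest Euclidean distance between landmark distance vectors (the slide does not name the measure); `choose_cluster` and the monitor names are hypothetical.

```python
import math

def choose_cluster(join_vector, monitor_vectors):
    """Return the monitor whose landmark distance vector is closest.

    join_vector: the joining client's latencies to the landmarks.
    monitor_vectors: dict mapping monitor name to its landmark vector.
    Assumption: closeness is Euclidean distance between the vectors.
    """
    def dist(u, v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

    return min(monitor_vectors,
               key=lambda m: dist(join_vector, monitor_vectors[m]))

monitors = {"m1": [10, 80], "m2": [80, 10]}
chosen = choose_cluster([15, 75], monitors)
```

A joining client therefore needs only K landmark probes to pick a cluster, rather than probing every existing host.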

Internet Iso-bar
Problem formulation: Given N end hosts, how to select a subset of them as monitors and build a scalable overlay distance monitoring service without knowing the underlying topology?
Distance info desired: report congestion/failure if it occurs, otherwise latency
Our approach:
1. Cluster hosts that perceive similar performance to a small set of sites (landmarks)
2. For each cluster, select a monitor for active and continuous probing
3. Estimate distance between any pair of hosts using inter- and intra-cluster distance
Performance evaluation
– Using real Internet measurement data
– Compared with other distance estimation services: GNP, RON
– Performance metrics: accuracy and stability

Internet Iso-bar (II): Distance Estimation
Congestion/failures analysis
– Congestion/failures (uniformly denoted as congestion) are not common
    Defined as measurement “loss” or (latency > geo mean × geo stdev)
    Only 0.96% out of 105M NLANR ping measurements over a week
– Suggests a few congested links dominate the E2E congestion
    Apart from congestion at the last mile, E2E congestion exhibits strong spatial correlation
Estimation algorithms
– Intra-cluster estimation (i and j use the same monitor m)
    If path(m, i) or path(m, j) is congested, report path(i, j) as congested
    Otherwise predictedDist(i, j) = (measuredDist(m, i) + measuredDist(m, j)) / 2
– Inter-cluster distance estimation
    If path(monitor_i, i), path(monitor_i, monitor_j) or path(monitor_j, j) is congested, report path(i, j) as congested
    Otherwise predictedDist(i, j) = measuredDist(monitor_i, monitor_j)
– Self-diagnostics of monitors, check for last-mile congestion