10-Year History of Internet Delay 1 April 24, 2010, DK Lee, Kenjiro Cho*, Gianluca Iannaccone**, Sue Moon CAIDA-WIDE-CASFI Joint Workshop.

Slides:



Advertisements
Similar presentations
Florin Dinu T. S. Eugene Ng Rice University Inferring a Network Congestion Map with Traffic Overhead 0 zero.
Advertisements

Measuring the Deployment of IPv6: Topology, Routing, and Performance Amogh Dhamdhere, Matthew Luckie, Bradley Huffaker, kc claffy (CAIDA / UC San Diego)
Estimating TCP Latency Approximately with Passive Measurements Sriharsha Gangam, Jaideep Chandrashekar, Ítalo Cunha, Jim Kurose.
By Hitesh Ballani, Paul Francis, Xinyang Zhang Slides by Benson Luk for CS 217B.
1 Internet Path Inflation Xenofontas Dimitropoulos.
Ten Years in the Evolution of the Internet Ecosystem
Measurement and Estimation of Network QoS among Peer Xbox Game Players Youngki Lee, KAIST Sharad Agarwal, Microsoft Research Chris Butcher, Bungie Studio.
University of Nevada, Reno Ten Years in the Evolution of the Internet Ecosystem Paper written by: Amogh Dhamdhere, Constantine Dovrolis School of Computer.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
Traffic Engineering With Traditional IP Routing Protocols
Server-based Inference of Internet Performance V. N. Padmanabhan, L. Qiu, and H. Wang.
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Informed Detour Selection Helps Reliability Boulat A. Bash.
On Power-Law Relationships of the Internet Topology CSCI 780, Fall 2005.
PAM A Measurement Study of Internet Delay Asymmetry Abhinav PathakPurdue University Himabindu PuchaPurdue University Ying ZhangUniversity of Michigan.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Yao Zhao 1, Yan Chen 1, David Bindel 2 Towards Unbiased End-to-End Diagnosis 1.Lab for Internet & Security Tech, Northwestern Univ 2.EECS department, UC.
Ningning HuCarnegie Mellon University1 A Measurement Study of Internet Bottlenecks Ningning Hu (CMU) Joint work with Li Erran Li (Bell Lab) Zhuoqing Morley.
End-to-End Issues. Route Diversity  Load balancing o Per packet splitting o Per flow splitting  Spill over  Route change o Failure o policy  Route.
Observations from Router-level Traces Lisa Amini IBM T. J. Watson Research Center Joint with Henning Schulzrinne, Aurel Lazar Columbia University.
Measuring ISP topologies with Rocketfuel Ratul Mahajan Neil Spring David Wetherall University of Washington ACM SIGCOMM 2002.
1 Network Topology Measurement Yang Chen CS 8803.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks Locations.
A Machine Learning-based Approach for Estimating Available Bandwidth Ling-Jyh Chen 1, Cheng-Fu Chou 2 and Bo-Chun Wang 2 1 Academia Sinica 2 National Taiwan.
Lecture 10 Overview. Border Gateway Protocol(BGP) De facto standard for Internet inter-AS routing allows subnet to advertise its existence to rest of.
On the Power of Off-line Data in Approximating Internet Distances Danny Raz Technion - Israel Institute.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.
Path Stitching: Internet-Wide Path and Delay Estimation from Existing Measurements DK Lee, Keon Jang, Changhyun Lee, Sue Moon, Gianluca Iannaccone* ASIAFI.
Quantifying the Causes of Path Inflation Neil Spring, Ratul Mahajan, and Thomas Anderson Presented by Luv Kohli COMP November 24, 2003.
1 Controlling IP Spoofing via Inter-Domain Packet Filters Zhenhai Duan Department of Computer Science Florida State University.
FIDEMO 2009, Nov. 18 A Step Towards a Planet-scale Measurements Retrieval Infrastructure In this work, we propose to design an end-to-end path and delay.
Computer Networks Zhenhai Duan Department of Computer Science 09/03/2015.
Issues with Inferring Internet Topological Attributes Lisa Amini ab, Anees Shaikh a, Henning Schulzrinne b a IBM T.J. Watson Research Center b Columbia.
On AS-Level Path Inference Jia Wang (AT&T Labs Research) Joint work with Z. Morley Mao (University of Michigan, Ann Arbor) Lili Qiu (University of Texas,
Reducing Transient Disconnectivity using Anomaly-Cognizant Forwarding Andrey Ermolinskiy, Scott Shenker University of California – Berkeley and ICSI.
Lecture 4: BGP Presentations Lab information H/W update.
Advanced Networking Lab. Given two IP addresses, the estimation algorithm for the path and latency between them is as follows: Step 1: Map IP addresses.
Aditya Akella The Performance Benefits of Multihoming Aditya Akella CMU With Bruce Maggs, Srini Seshan, Anees Shaikh and Ramesh Sitaraman.
EEC-484/584 Computer Networks Lecture 9 Wenbing Zhao (Part of the slides are based on Drs. Kurose & Ross ’ s slides for their Computer.
TDTS21: Advanced Networking Lecture 7: Internet topology Based on slides from P. Gill and D. Choffnes Revised 2015 by N. Carlsson.
1 Passive Network Tomography Using Bayesian Inference Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang Microsoft Research Internet Measurement.
1 Network Layer Lecture 13 Imran Ahmed University of Management & Technology.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
A Light-Weight Distributed Scheme for Detecting IP Prefix Hijacks in Real-Time Lusheng Ji†, Joint work with Changxi Zheng‡, Dan Pei†, Jia Wang†, Paul Francis‡
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
L-24 Adaptive Applications 1. State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
1 TBD: Trajectory-Based Data Forwarding for Light-Traffic Vehicular Networks IEEE ICDCS’09, Montreal, Quebec, Canada Jaehoon Jeong, Shuo Gu, Yu Gu, Tian.
CS 4396 Computer Networks Lab BGP. Inter-AS routing in the Internet: (BGP)
ICS 156: Networking Lab Magda El Zarki Professor, ICS UC, Irvine.
Stainov - DataComMET CS TC5353 THE NETWORK LAYER 5.2 ROUTING ALGORITHMS - adaptive Distance Vector Routing (Bellman-Ford, Ford-Fulkenson). It was used.
정하경 MMLAB Fundamentals of Internet Measurement: a Tutorial Nevil Brownlee, Chris Lossley, “Fundamentals of Internet Measurement: a Tutorial,” CMG journal.
Evaluation of a Novel Two-Step Server Selection Metric Presented by Karthik Lakshminarayanan
Has the Internet Delay Gotten Better or Worse? Universidad Carlos III de Madrid DK Lee, Keon Jang, Changhyun Lee, Gianluca Iannaccone, Kenjiro.
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
Placing Relay Nodes for Intra-Domain Path Diversity Meeyoung Cha Sue Moon Chong-Dae Park Aman Shaikh Proc. of IEEE INFOCOM 2006 Speaker 游鎮鴻.
Constructing Inter-Domain Packet Filters to Control IP Spoofing Based on BGP Updates Zhenhai Duan, Xin Yuan Department of Computer Science Florida State.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Topology Discovery Using an Address Prefix Stopping Rule Benoit Donnet Joint work with Timur Friedman Eunice 2005 – Madrid (Spain)
1 On the Impact of Route Monitor Selection Ying Zhang* Zheng Zhang # Z. Morley Mao* Y. Charlie Hu # Bruce M. Maggs ^ University of Michigan* Purdue University.
A CIDR Prefix Stopping Rule for Topology Discovery Benoit Donnet joint work with Timur Friedman Algotel 2005 – Presqu'Ile de Giens.
Lecture 13 – Network Mapping
Comparative Analysis of Internet Topology Data Sets
COMP 3270 Computer Networks
Detection of Routing Loops and Analysis of Its Causes
RESOLVING IP ALIASES USING DISTRIBUTED SYSTEMS
Measured Impact of Crooked Traceroute
Presentation transcript:

10-Year History of Internet Delay 1 April 24, 2010, DK Lee, Kenjiro Cho*, Gianluca Iannaccone**, Sue Moon CAIDA-WIDE-CASFI Joint Workshop April 24, 2010 Division of Computer Science, KAIST *IIJ Research Laboratory **Intel Research, Berkeley

For the Last Few Decades Many large-scale Internet measurements: – NLANR AMP, CAIDA’s Ark, DIMES, iPlane – UCSD network teloscope, RouteViews, RIPE RIS About the Internet evolution, we know that – Internet topology has been shrunken in terms of the average AS hop count (by network densification) – Dominant Internet traffic types have changed from web to peer-to-peer traffic April 24, 2010, 2

What do we know about the overall Internet delay performance?

What We Know About Internet Delay Transmission delay – Improved with faster link speed Propagation delay – Improved with new undersea cables Queuing and processing delay – Improved with faster devices Routing Issues – Loops or detours from VPNs, overlays – Delays can be Improved with new AS peering practices April 24, 2010, 4

Then, Has the Internet grown shorter in delay?

What are the basic rules that govern the long-term dynamics of the Internet delay?

Talk Outline Has the Internet delay gotten better or worse? Two main methodologies: – Path stitching – Random sampling of the Internet host pairs Data sets Preliminary results – Delay distributions from 2004 to 2009 April 24, 2010, 7

Reasons for No Authoritative Statement About the Internet-wide Delay “Random sampling” has not been feasible – No measurement system with access to every AS and subnet of the Internet – No rigorous method to address bias in Internet sampling Only a selective set of statistics has been possible – Stability, variation, and abrupt changes of delay as a path statistic have been well studied April 24, 2010, 8

Path Stitching for Random Sampling April 24, 2010, 9 Internet-wide path and round-trip delay estimation between any pair of Internet hosts by recycling existing data – Keep database of end-to-end measurement data segmented by the AS – Identifies relevant segments efficiently – Produces path and round-trip delay estimates, by stitching segments together

Path Segment Repository Indexing the path segments by the AS number :A: Intra-domain segments of A : :B: Intra-domain segments of B : A::B Inter-domain segments between A and B : :A: + A::B + :B: = Router-level paths from A to B : March 15, 2010, 10 ABC traceroute outputs: AS path: a1a1 a2a2 a3a3 a4a4 b1b1 b2b2 b3b3 c1c1 c2c2 c3c3 a1a1 a2a2 a3a3 a4a4 b1b1 b2b2 b3b3

Overview of Path Stitching Question: Answers: April 24, 2010, 11 a ?c A C B Step 1. IP-to-AS mapping Step 2. AS path inference :A: rtt A :C: rtt C :B: rtt B A::B rtt AB Step 3. Path stitching :A::B::C: B::C rtt BC Router-level paths and RTT from a to c ? Path = RTT = rtt A + rtt AB + rtt B + rtt BC + rtt C

Results of Path Stitching We evaluate the Internet-wide coverage and accuracy of the estimated results – More than 70% of pairs are covered by the algorithm – 80% of pairs have absolute errors less than 20msec – Median absolute error is less than 5msec Reference: DK Lee, Keon Jang, Changhyun Lee, Gianluca Iannaccone, Sue Moon, “Internet-wide Path and Delay Estimation from Existing Measurements”, IEEE INFOCOM 2010 Mini-conference April 24, 2010, 12

Survey Design: Select a Set of Host Pairs to Examine Random sampling design with size n – Internet consists of N unique pairs of /24 IP prefixes – Extract all routable /24 prefixes from BGP table – Randomly select n=10,000 pairs Fraction of responded pairs with path stitching – 67% in 2004 and 65% in 2009 April 24, 2010, 13

Sampling Errors for the Population Median – (1) Confidence Interval (CI) for the population median estimator: = q 0.5 ± In 2009/06, n = 10,000 = ± 4.9 msec In 2009/06, n = 100,000 = ± 1.3 msec April 24, 2010, 14

Sampling Errors for the Population Median – (2) April 24, 2010, 15 Sample size n=100,000 is very accurate

Sampling Errors for the Population Median – (3) April 24, 2010, 16 Results for the sample size n>=10,000 are almost identical

Data Sets: E2E Measurements + Routing Data End-to-end path and delay measurement – Traceroute measurements CAIDA Ark project (from 1998~) NLANR’s AMP project (from 1999~) Routing information – BGP routing tables University of Oregon, RouteViews (from 1997~) RIPE RIS (from 1999~) April 24, 2010, 17

Data Processing Oveview For each YYYY/MM, we process: April 24, 2010, 18 Queries Path and delay estimations for queries

We choose a set of host pairs in each year, and see the delay distribution

Delay distributions from 2004 to 2009 April 24, 2010, 20

Delay distributions from 2004 to 2009 April 24, 2010, 21

2004 vs Median delay: msec  msec April 24, 2010, 22

Median Delays Increase Constantly Delay distribution has gotten worse from 2004 to 2009, both at first/last mile and in the core IP/AS hop counts decreased end-to-end – IP hop counts: 14.8 (2004)  14.1 (2009) – AS hop counts: 3.77 (2004)  3.65 (2009) April 24, 2010, 23

But what if we choose the same set of host pairs?

Delay distributions from 2004 to 2009 (For the Same Pairs) Only 2432 pairs are constantly responded from 2004 to 2009 April 24, 2010, 25

Delay distributions from 2004 to 2009 (For the Same Pairs) April 24, 2010, 26

2004 vs (For the Same Pairs) Median delay: msec  msec April 24, 2010, 27

Median Delays improved (For the Same Pairs) Delay distributions for the same set of sample host pairs remain almost identical or slightly improved from 2004 to 2009 IP/AS hop counts decreased April 24, 2010, 28

Finding the corroborating Evidence for the observations IP address usage have expanded from 2004 to 2009 – /24 prefixes of those hosts in 2009 existed in 2004? – ASes of those hosts in 2009 existed in 2004? In sampled pairs in 2009, compared to 2004, 1729 ASes are disappeared, 2091 Ases are newly appeared. Network densification helps AS hop count to decrease. Does it also help IP hop count or delay to decrease? April 24, 2010, 29

Other Challenges Analyzing the delay distribution in 1999 – Skitter’s old-format does not have hop-by-hop delays – NLANR AMP dataset is too small – RouteViews have very restricted number of peers Effect of non-response – Where does the missing 35% come from? Effect of measurement errors April 24, 2010, 30

Conclusion We present the methodology for the Internet delay history reconstruction and analysis: – Path-stitching with existing measurements – Random sampling of the Internet host pairs Our approach is very feasible in showing insight about the overall Internet delay distribution April 24, 2010, 31

Thank You! Any Questions? We are looking for other traceroute outputs and BGP table snapshots archived before 2000 April 24, 2010, 32

Backup Slides “To get to the essence of things, one has to work long and hard” -- Vincent van Gogh

What If There Are Too few segments: Too many segments: April 24, 2010, 34 A::B ?B::C :A: :C: :B: ?...

When There Are Too Few or No Segments

We Employ Approximations (i) Missing AS »No solutions (other than collecting more measurements. ) (ii) Missing inter-domain segment »Search for reverse path segments. (i.e., if we cannot find A::B, use B::A instead) (iii) Path segments do not rendezvous at the same address (i.e., the segment cannot be stitched) »Identify nearby segments (on the same router, PoP, Prefix) April 24, 2010, 36 :A: :B: B::A X Z Y W A X::A::W = ?

When There Are Too Many Segments

We Apply Preference Rules Rank the list of candidate path segments – Eliminate candidates as many as possible while keeping the most accurate one. – Reflect the actual routing mechanism April 24, 2010, 38 Source AS Destination AS Intermediate ASes Rule #1, 2, 3 Rule # 2, 3

Rule #1: Proximity Preference to the path segments that closest to the queried source and destination address April 24, 2010, 39 Source AS a.b.c.1 a.b.c.2 a.b.1.1 d.b.1.2 x.y.z.1 x.y.z.2 Query: a.b.c.d --> x.y.z.w

Rule #2: Destianation-bound Preference to the segments from traceroutes with the same destination prefix April 24, 2010, 40 Query: a.b.c.d --> x.y.z.w Source AS Original traceroutes traceroutes to x.y.z.1 traceroutes to u.v.w.1

Rule #3: Most Recent Preference to the most recent path segment April 24, 2010, 41 Query: a.b.c.d --> x.y.z.w Source AS Original traceroutes traceroutes to x.y.z.1 YYYYMMDD-12:30:00 YYYYMMDD-10:30:00

Comparisons with iPlane – (1) April 24, 2010, 42 CDF of absolute errors for pl-easy pairs Errors <= 20ms for 90% of pl-easy pairs

Comparisons with iPlane – (1) CDF of absolute errors for pl-hard pairs April 24, 2010, 43 Very promising results: With accurate AS paths inference, errors <= 20ms for 80% of pl-hard pairs