Experimental Measurement of Delayed Convergence Abha Ahuja Internap/Merit Network, Inc. Craig Labovitz Microsoft Research/Merit Network, Inc. Farnam Jahanian,

Slides:



Advertisements
Similar presentations
Distance Vector Routing Protocols
Advertisements

Karlston D'Emanuele Distance Vector Routing Protocols Notes courtesy of Mr. Joe Cordina Password Removed
Chapter 4 NETWORK LAYER Computer Networks Summer 2007 Distributed Computing Group Dynamic PowerPoint Slides Only.
OSPF 1.
Introduction to IP Routing Geoff Huston. Routing How do packets get from A to B in the Internet? A B Internet.
Multihoming and Multi-path Routing
CSCI-1680 Network Layer: Intra-domain Routing Based partly on lecture notes by David Mazières, Phil Levis, John Jannotti Rodrigo Fonseca.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Distance Vector Routing Protocols Routing Protocols and Concepts –
IRTF-RR IRTF agenda Agenda issues (5 sec) Intro - why are we here (10 sec) - abha Goals of the group, etc (30 min)- sean Topics of Interest.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Link-State Routing Protocols Routing Protocols and Concepts – Chapter.
Cs/ee 143 Communication Networks Chapter 5 Routing Text: Walrand & Parakh, 2010 Steven Low CMS, EE, Caltech.
The Impact of Policy and Topology on Internet Routing Convergence NANOG 20 October 23, 2000 Abha Ahuja InterNap *In collaboration with.
Modeling Inter-Domain Routing Protocol Dynamics ISMA 2000 December 6, 2000 In collaboration with Abha, Ahuja, Roger Wattenhofer, Srinivasan Venkatachary,
Outline  Recap Network components & basic mechanisms Routing Flow control  Continue Basic queueing analysis Construct routing table.
COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
Internet Routing Instability Craig Labovitz, G. Robert Malan, Farham Jahanian University of Michigan Presented By Krishnanand M Kamath.
Part IV: BGP Routing Instability. March 8, BGP routing updates  Route updates at prefix level  No activity in “steady state”  Routing messages.
Advanced Networks 1. Delayed Internet Routing Convergence 2. The Impact of Internet Policy and Topology on Delayed Routing Convergence.
Delayed Internet Routing Convergence Craig Labovitz, Microsoft Research Abha Ahuja, University of Michigan Farnam Jahanian, University of Michigan Abhit.
1 Experimental Study of Internet Stability and Wide-Area Backbone Failure Craig Labovitz, Abha Ahuja Merit Network, Inc Presented by Changchun Zou.
Copyright 2008 Kenneth M. Chipps Ph.D. Cisco CCNA Exploration CCNA 2 Routing Protocols and Concepts Chapter 4 Distance Vector Routing Protocols.
Internet Routing Instability
Improved BGP convergence via Ghost Flushing Yehuda Afek Anat Bremler-Barr Shemer Schwarzd המרכז הבינתחומי הרצליה.
Chapter 5 IP Routing Routing Sending packets through network from one device to another What must routers know? – Destination address – Neighboring routers.
Scalability & Stability of the Internet Infrastructure Farnam Jahanian Department of EECS University of Michigan.
(c) Anirban Banerjee, Winter 2005, CS-240, 2/1/2005. The Impact of Internet Policy and Topology on Delayed Routing convergence C. Labovitz, A. Ahuja, R.
Internet Routing Instability Labovitz et al. Sigcomm 1997 Largely adopted from Ion Stoica’s slide at UCB.
BGP: Inter-Domain Routing Protocol Noah Treuhaft U.C. Berkeley.
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
INTRA- AND INTERDOMAIN ROUTING Routing inside an autonomous system is referred to as intradomain routing. Routing between autonomous systems is.
Routing Concepts Warren Toomey GCIT. Introduction Switches need to know the link address and location of every station. Doesn't scale well, e.g. to several.
Distance Vector Routing Protocols W.lilakiatsakun.
1 Computer Communication & Networks Lecture 22 Network Layer: Delivery, Forwarding, Routing (contd.)
Dynamic Routing Protocols  Function(s) of Dynamic Routing Protocols: – Dynamically share information between routers (Discover remote networks). – Automatically.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2000 Chapter 14 Routing Protocols RIP, OSPF, BGP.
Count to infinity problem
Unicast Routing Protocols  A routing protocol is a combination of rules and procedures that lets routers in the internet inform each other of changes.
Routing protocols Basic Routing Routing Information Protocol (RIP) Open Shortest Path First (OSPF)
1 3-Oct-15 Distance Vector Routing CCNA Exploration Semester 2 Chapter 4.
Network Layer4-1 Chapter 4: Network Layer r 4. 1 Introduction r 4.2 Virtual circuit and datagram networks r 4.3 What’s inside a router r 4.4 IP: Internet.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 11 Unicast Routing Protocols.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
1 © 2004, Cisco Systems, Inc. All rights reserved. CCNA 2 v3.1 Module 7 Distance Vector Routing Protocols.
Eliminating Packet Loss Caused by BGP Convergence Nate Kushman Srikanth Kandula, Dina Katabi, and Bruce Maggs.
Routing protocols. Static Routing Routes to destinations are set up manually Route may be up or down but static routes will remain in the routing tables.
An internet is a combination of networks connected by routers. When a datagram goes from a source to a destination, it will probably pass through many.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
1 Chapter 4: Internetworking (IP Routing) Dr. Rocky K. C. Chang 16 March 2004.
Inter-domain Routing Outline Border Gateway Protocol.
The Impact of Internet Policy and Topology on Delayed Routing Convergence.
Distance Vector and Link State Routing Pragyaditya Das.
Abha Ahuja InterNap Craig Labovitz Microsoft Research
(How the routers’ tables are filled in)
Distance-Vector Routing Protocols
CCNA 2 v3.1 Module 7 Distance Vector Routing Protocols
Intra-Domain Routing Jacob Strauss September 14, 2006.
Routing: Distance Vector Algorithm
CS 4700 / CS 5700 Network Fundamentals
Routing Protocols (RIP, OSPF, BGP).
COS 561: Advanced Computer Networks
CS 3700 Networks and Distributed Systems
CS 640: Introduction to Computer Networks
RFC 1058 & RFC 2453 Routing Information Protocol
COS 561: Advanced Computer Networks
CS 3700 Networks and Distributed Systems
COS 561: Advanced Computer Networks
Communication Networks
COMP/ELEC 429/556 Introduction to Computer Networks
COS 461: Computer Networks
Presentation transcript:

Experimental Measurement of Delayed Convergence Abha Ahuja Internap/Merit Network, Inc. Craig Labovitz Microsoft Research/Merit Network, Inc. Farnam Jahanian, Abhijit Bose University of Michigan Apricot

The Internet: Failure Analysis Mostly seems to work Something happens. Doesnt work. Time Mostly seems to work

Routing Protocol Convergence Unlike connection oriented PSTN (~30 ms), Internet does not have fail-over. Instead, each node recalculates on a hop-per-hop basis (i.e. no flooding of changes) Distance-vector algorithms (e.g. RIP, BGP) exhibit slower convergence than link state protocols During convergence –Latency, loss, out of order –Additional update messages (CPU processing)

Distance Vector (BF) Protocols Suffer from counting to infinity problem Solutions –Poison reverse –Split horizon –Path vectors A B C Example

Conventional Wisdom Restoral is not an issue in the IP world –Just reroute around in a few milliseconds or whatever BGP convergence takes only a few _____ Bad news travels fast –Fast withdraw propagation valid goal –Announcements slower because bundled BGP has great convergence properties –ASPath solved the convergence and counting to infinity problems All my customers are multi-homed, triple-homed –Convergence -- what, me worry?

More Conventional Wisdom Enough bandwidth will solve anything It will all be one big network one day soon anyways (Especially after yesterday)

Internet Failures Replication, round-robin DNS, etc. helps reliability of inter-domain content oriented services Inter-domain transaction oriented services (e.g. VoIP, EBay, database commits, etc.) still pose a challenge Important model how long does it take for the Internet to converge –After Failure –After Fail-Over –After Repair

BGP: Bad news With unconstrained policies (Griffin99, Varadhan96) –Divergence –Possible create mutually unsatisfiable policies –NP-complete to identify these policies in IRR –Happening today? With constrained policies (e.g. shortest path first) –Transient oscillations –BGP usually converges –It might just take a very long time…. This talk is about constrained policies

Some Observations How do we study convergence? –From BGP logs (e.g. debug ip bgp), difficult to determine causal relationships –Earlier work studied BGP pathologies and failures –Still lots of BGP duplicates and oscillations Failure/repair data (next slide) for default-free routes shows 30 minute curve –Examined long-lived default-free routes from 24 providers for a year –Restoral time for given provider after failure (i.e. route withdrawn)

How long until routes return? (From A Study of Internet Failures) What is happening here?

16 Month Study of Convergence Instrument the Internet –Inject routes into geographically and topologically diverse provider BGP peering sessions (Mae-West, Japan, Michigan, London) –Periodically fail and change these routes (i.e. send withdraws or new attributes) –Time events using ICMP echos and NTP synchronized BGP routeviews monitoring machines (also http gets) –Write lots of Perl scripts –Wait a sixteen months… (45,000 routing events)

Setup

How Many Announcements Does it Take For an AS to Withdraw a Route? 7/5 19:33:25Route R is withdrawn 7/5 19:34:15AS6543 announce R /5 19:35:00AS6543 announce R /5 19:35:37 AS6543 announce R /5 19:35:39 AS6543 announce R /5 19:35:39 AS6543 announce R /5 19:35:52 AS6543 announce R /5 19:36:00AS6543 announce R … 7/5 19:38:22AS6543 withdraw R Answer: Up to 19 (AS6543 chosen as an example – all ASes exhibit similar behavior) Abha made me change the AS numbers

Withdraw Convergence After a BGP route is withdrawn, barring other failures, how long does it take Internet routing tables to reach steady-state?

Withdraw Convergence AS1 AS2 AS3 AS4

Withdraw Convergence Probability distribution Providers exhibit different, but related convergence behaviors 80% of withdraws from all ISPs take more than a minute For ISP4, 20% withdraws took more than three minutes to converge

Fail-Overs and Repairs What are the relative convergence latencies for fail-overs and repairs? Does bad news (withdraws) travel faster?

Failures, Fail-overs and Repairs

Bad news does not travel fast… Repairs (Tup) exhibit similar convergence properties as long-short ASPath fail-over Failures (Tdown) and short-long fail-overs (e.g. primary to secondary path) also similar –Slower than Tup (e.g. a repair) –60% take longer than two minutes –Fail-over times degrade the greater the degree of multi- homing!

End2End Connectivity After a repair, how long before my site is reachable? –Modified ICMP pings and HTTP sent once a second –Source IP address block of pseudo-AS –100 randomly chosen web sites from parent cache logs

ICMP Response after Repairs

What is Happening? Non-deterministic ordering of BGP update messages leads to –Transient oscillations –Each change in FIB adds delay (CPU, BGP bundling timer) –At extreme, convergence triggers BGP dampening

BGP Bad News Given best current routing practices, inter- domain BGP convergence times degrade exponentially with increase in the degree of interconnectivity for a given route … and the degree of inter-connectivity (multi-homing, transit, etc) is increasing