Advanced Networks 1. Delayed Internet Routing Convergence 2. The Impact of Internet Policy and Topology on Delayed Routing Convergence.

Slides:



Advertisements
Similar presentations
Experimental Measurement of Delayed Convergence Abha Ahuja Internap/Merit Network, Inc. Craig Labovitz Microsoft Research/Merit Network, Inc. Farnam Jahanian,
Advertisements

The Impact of Policy and Topology on Internet Routing Convergence NANOG 20 October 23, 2000 Abha Ahuja InterNap *In collaboration with.
Modeling Inter-Domain Routing Protocol Dynamics ISMA 2000 December 6, 2000 In collaboration with Abha, Ahuja, Roger Wattenhofer, Srinivasan Venkatachary,
COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
Delayed Internet Routing Convergence due to Flap Dampening Z. Morley Mao Ramesh Govindan, Randy Katz, George Varghese
Internet Routing Instability Craig Labovitz, G. Robert Malan, Farham Jahanian University of Michigan Presented By Krishnanand M Kamath.
Part IV: BGP Routing Instability. March 8, BGP routing updates  Route updates at prefix level  No activity in “steady state”  Routing messages.
Delayed Internet Routing Convergence Craig Labovitz, Microsoft Research Abha Ahuja, University of Michigan Farnam Jahanian, University of Michigan Abhit.
Border Gateway Protocol Ankit Agarwal Dashang Trivedi Kirti Tiwari.
CSE534 – Fundamentals of Computer Networks Lecture 6-7: Inter Domain Routing (It’s all about the Money) Based on slides from D. Choffnes Northeastern U.
Network Layer: Internet-Wide Routing & BGP Dina Katabi & Sam Madden.
CS540/TE630 Computer Network Architecture Spring 2009 Tu/Th 10:30am-Noon Sue Moon.
© J. Liebeherr, All rights reserved 1 Border Gateway Protocol This lecture is largely based on a BGP tutorial by T. Griffin from AT&T Research.
Fundamentals of Computer Networks ECE 478/578 Lecture #18: Policy-Based Routing Instructor: Loukas Lazos Dept of Electrical and Computer Engineering University.
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
Improved BGP convergence via Ghost Flushing Yehuda Afek Anat Bremler-Barr Shemer Schwarzd המרכז הבינתחומי הרצליה.
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization –All routers are identical –Network is flat. Not true in Practice Hierarchical.
Lecture 14: Inter-domain Routing Stability CS 268 class March 8 th, 2004 (slides from Timothy Griffin’s tutorial and Craig Labovitz’s NANOG talk)
Routing Jinyang Li. Administravia Hand in PS1 to Hui Zhang before you leave! me project teamlist by Oct-1.
CS 164: Global Internet Slide Set In this set... More about subnets Classless Inter Domain Routing (CIDR) Border Gateway Protocol (BGP) Areas with.
1 An Experimental Analysis of BGP Convergence Time Timothy Griffin AT&T Research & Brian Premore Dartmouth College.
(c) Anirban Banerjee, Winter 2005, CS-240, 2/1/2005. The Impact of Internet Policy and Topology on Delayed Routing convergence C. Labovitz, A. Ahuja, R.
Internet Routing Instability Labovitz et al. Sigcomm 1997 Largely adopted from Ion Stoica’s slide at UCB.
BGP: Inter-Domain Routing Protocol Noah Treuhaft U.C. Berkeley.
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Computer Networking Lecture 10: Inter-Domain Routing
Internet Routing (COS 598A) Today: Multi-Homing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Feb 12, 2008CS573: Network Protocols and Standards1 Border Gateway Protocol (BGP) Network Protocols and Standards Winter
Computer Networks Layering and Routing Dina Katabi
Routing Algorithms (Ch5 of Computer Network by A. Tanenbaum)
Unicast Routing Protocols  A routing protocol is a combination of rules and procedures that lets routers in the internet inform each other of changes.
Information-Centric Networks04a-1 Week 4 / Paper 1 Open issues in Interdomain Routing: a survey –Marcelo Yannuzzi, Xavier Masip-Bruin, Olivier Bonaventure.
CS 3700 Networks and Distributed Systems Inter Domain Routing (It’s all about the Money) Revised 8/20/15.
Network Layer4-1 Chapter 4: Network Layer r 4. 1 Introduction r 4.2 Virtual circuit and datagram networks r 4.3 What’s inside a router r 4.4 IP: Internet.
Network Protocols: Design and Analysis Polly Huang EE NTU
Border Gateway Protocol (BGP) W.lilakiatsakun. BGP Basics (1) BGP is the protocol which is used to make core routing decisions on the Internet It involves.
More on Internet Routing A large portion of this lecture material comes from BGP tutorial given by Philip Smith from Cisco (ftp://ftp- eng.cisco.com/pfs/seminars/APRICOT2004.
Routing Convergence Dan Massey Colorado State University.
1 Quantifying Path Exploration in the Internet Ricardo Oliveira, Rafit Izhak-Ratzin, Lixia Zhang, UCLA Beichuan Zhang, UArizona Dan Pei, AT&T Labs -- Research.
TCOM 509 – Internet Protocols (TCP/IP) Lecture 06_a Routing Protocols: RIP, OSPF, BGP Instructor: Dr. Li-Chuan Chen Date: 10/06/2003 Based in part upon.
Routing Convergence Global Routing Internet Routing Convergence An Experimental Study of Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja,
Mike Freedman Fall 2012 COS 561: Advanced Computer Networks Traffic Engineering.
CS 640: Introduction to Computer Networks Aditya Akella Lecture 11 - Inter-Domain Routing - BGP (Border Gateway Protocol)
Spring 2000CS 4611 Routing Outline Algorithms Scalability.
The Impact of Internet Policy and Topology on Delayed Routing Convergence.
CS 640: Introduction to Computer Networks Aditya Akella Lecture 11 - Inter-Domain Routing - BGP (Border Gateway Protocol)
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization All routers are identical Network is flat. Not true in Practice Hierarchical.
CS 3700 Networks and Distributed Systems
CS 3700 Networks and Distributed Systems
Abha Ahuja InterNap Craig Labovitz Microsoft Research
Border Gateway Protocol
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization All routers are identical Network is flat. Not true in Practice Hierarchical.
COS 561: Advanced Computer Networks
Introduction to Internet Routing
Routing: Distance Vector Algorithm
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization All routers are identical Network is flat. Not true in Practice Hierarchical.
COS 561: Advanced Computer Networks
RFC 1058 & RFC 2453 Routing Information Protocol
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
BGP Policies Jennifer Rexford
Communication Networks
BGP Interactions Jennifer Rexford
COS 461: Computer Networks
COS 561: Advanced Computer Networks
BGP Instability Jennifer Rexford
Computer Networks Protocols
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization All routers are identical Network is flat. Not true in Practice Hierarchical.
Presentation transcript:

Advanced Networks 1. Delayed Internet Routing Convergence 2. The Impact of Internet Policy and Topology on Delayed Routing Convergence

The Problem How to Recover from Failure Quickly? Phone systems recover, failover, in milliseconds Internet takes an order of minutes Loss of Connectivity Packet Loss Latency

The Problem (cont) Failure over on the internet not very good Sluggish Backup systems Internet has to adjust to the failure Path must be restored to back up

The Questions Why does convergence take so long? What is the upper bound for convergence? What causes this delayed convergence? What can we do about it?

Theory Unexpected Interaction of: Protocol timers Router Implementation Policies (Safe/Unsafe)

Theory (cont) Distance vector algorithm has issues Lack of sufficient info to determine if next hop choice will cause loops

Convergence Accelerators Use of Path Vector Split Horizon Triggered updates Diffusion Timers

Policies Admins can implement unsafe policies Policies can cause route oscillations Routers default to Shortest Path Even if constrained upper-bound might be as high factorial

Point of Paper Measure the convergence behavior of BGP 4 Done for Bellman-Ford O(n 3 ) Convergence in BGP is NOT much better than RIP Give an upper and lower bounds to convergence

The Work Done 2 year study 250,000 routing fault injections 25 Internet providers End to End performance measurements

Terminology Tup: (New) Route Announcement Tdown: Route Withdrawal Tshort: Shorter Route Replaces Current Current Route is Withdrawn Implicitly Tlong: Shorter Route Replaced with longer one Represents a failure and failover Current Route is Withdrawn Implicitly

Latency

Latency (cont) Oscillation greater than 3 minutes 20% of Tlong 40% of Tdown Equivalence Latency Classes Tlong,Tdown Tshort,Tup

Latency per ISP

BGP Update Volume Average Message Per Event Type Tup: Route Announcement Tdown: Route Withdrawal Tshort: Shorter Route Replacement Tlong: Longer Route Replacement

Questions Why do Tlong and Tdown cause 2 times the amout of updates? Why do certain ISP produce more updates per event? Relationship between number of updates and convergence latency?

Questions (cont) What makes an ISP have a higher latency? Interesting Points ISP3: Japan’s National Backbone ISP5 Canadian ISP Latency NOT Dependant Geographic Distance or Network Distance (aka hop count)

Graph Analysis No relationship between day of the week and Latency! Independent of Network load and congestion

End to End Measurements Route Oscillation effects performance Drop Packets, Buffering of Packets Out of order delivery

Failover from end to end view Time after ICMP echo arrived after Tup Simulates a failover 80% of test sites began returning after 30 seconds 100% after one minute

BGP Convergence Model IBGP ignored Full Mesh Ignore ingress and egress filters Exclude MinRouteAdver Updates messages follow FIFO ordering

BGP Convergence Example Start: 0(*R, 1R, 2R) 1(0R, *R, 2R) 2(0R, 1R, *R) R Withdraws routes R -> 0 W R -> 1 W R -> 2 W

BGP Convergence Example 0(-, -, *2R) 1(-, -, *2R) 2(*01R, 10R, -) 0(-, *1R, 2R) 1(*0R, -, 2R) 2(*0R, 1R, -) 1 and 2 receive new announcement from 0 0 -> 1 01R (loop) 0 -> 2 01R 0(-, *1R, 2R) 1(-, -, *2R) 2(01R, *1R, -) 0 and 2 receive new announcement from 1 1 -> 0 10R (loop) 1 -> 2 10R

BGP Convergence Example 0 and 1 receive new announcement from 2 2 -> 0 20R 2 -> 1 20R 0(-, -, -) 1(-, -, *20R) 2(*01R, 10R, -) 0 and 2 receive new announcement from 1 1 -> 0 12R 1 -> 2 12R 0(-, *12R, -) 1(-, -, *20R) 2(*01R, -, -) … 48 steps later 0(-, -, -) 1(-, -, -) 2(-, -, -)

Upper Bound For n nodes there exist 0((n-1)!) distinct paths When a route is withdrawn, a new route is found of equal or increasing length Message count could be a bad as (n-1)O((n-1)!) until convergence Not really possible on the internet

Lower Bound Made possible by MinRouteAdver timers (n-1) Rounds to convergence

MinRouteAdver Minimum time between route advertisements Gives a AS time to pick a good route before announcing it In standard BGP, timer only applied to announcements Does Not apply to explicit withdrawls

Example Reloaded Instead of 48 rounds only took 13 rounds

Example Reloaded

Question Reloaded Why do Tup/Tshort converge quicker than Tdown/Tlong? Answer: Tup/Tshort are decreasing while Tdown/Tlong are increasing One a path is selected a longer one will not be picked While on Tdown/Tlong you pick the next best one until you are out of choices O(1) for Tup while O(n) for Tdown

Question Reloaded Why is there different latencies between the five ISPs? Answer: The topological factors, length and number of possible paths (peering relationships, policies and agreements) are the answer. Longer routes announced, longer latencies Longer routes the more MinRouteAdver rounds

Loop Detection Loop Detection done at receiver side If done, at sender you can get more out of MinRouteAdver round MinRouteAdver is good but causes a 30 second delay in end to end communication at best

Convergence Delay Due to Policies and Topology 2 nd study of convergence 20 unique advertisement between 200 pairs of ISPs, 6 months Measure the impact of Policies Measure the impact of Topology Analysis

Multi-home Networks One network, two ISPs Better connectivity + backup Failover = New route convergence Work done in this Paper Convergence Analysis of Tdown event

Work Done Fault injection announcements Logged table snapshot to disk Survey of backbone providers Routing and peering policies Used data to discuss impact on convergence

Policy How policy impacts number and length of ASPaths with a given route Limited inbound acceptance by all ISP

Inbound Filtering Example ISP D filters peering session with ISPG D only accept G’s backbone and customers routes ISP A filters peering session with D A only accept D’s backbone and customers routes ISP A will accepts G’s routes by chaining

Outbound Filters A will advertise routes with paths “D G” and “D” but not “C D G” Done by 13% of ISPs Combinations of ASPath and prefix filters create unintentional back-up transit paths

Topological Effect Interaction of MinRouteAdver timers MinRouteAdver is per peer not prefix MinRouteAdver interference delays convergence

Backup Path Selection

Convergence Latency

Convergence Latency (cont) ISP1 explored one backup path of length 2 ISP2 explored backup paths of length 2 and 3 ISP 3 explored backup paths of length 5

Convergence Latency (cont)