Internet Routing (COS 598A) Today: Hot-Potato Routing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.

Slides:



Advertisements
Similar presentations
Internet Routing (COS 598A) Today: Topology Size Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Advertisements

1 Interdomain Traffic Engineering with BGP By Behzad Akbari Spring 2011 These slides are based on the slides of Tim. G. Griffin (AT&T) and Shivkumar (RPI)
CS540/TE630 Computer Network Architecture Spring 2009 Tu/Th 10:30am-Noon Sue Moon.
© J. Liebeherr, All rights reserved 1 Border Gateway Protocol This lecture is largely based on a BGP tutorial by T. Griffin from AT&T Research.
Fundamentals of Computer Networks ECE 478/578 Lecture #18: Policy-Based Routing Instructor: Loukas Lazos Dept of Electrical and Computer Engineering University.
Dynamic Routing Scalable Infrastructure Workshop, AfNOG2008.
1 BGP Anomaly Detection in an ISP Jian Wu (U. Michigan) Z. Morley Mao (U. Michigan) Jennifer Rexford (Princeton) Jia Wang (AT&T Labs)
Courtesy: Nick McKeown, Stanford
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
TIE Breaking: Tunable Interdomain Egress Selection Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie with Tim Griffin.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
Traffic Engineering With Traditional IP Routing Protocols
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
Network Protocols Designed for Optimizability Jennifer Rexford Princeton University
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Wresting Control from BGP: Scalable Fine-grained Route Control UCSD / AT&T Research Usenix —June 22, 2007 Dan Pei, Tom Scholl, Aman Shaikh, Alex C. Snoeren,
Internet Routing (COS 598A) Today: Interdomain Traffic Engineering Jennifer Rexford Tuesdays/Thursdays.
1 Design and implementation of a Routing Control Platform Matthew Caesar, Donald Caldwell, Nick Feamster, Jennifer Rexford, Aman Shaikh, Jacobus van der.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Internet Routing (COS 598A) Today: Multi-Homing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Network Monitoring for Internet Traffic Engineering Jennifer Rexford AT&T Labs – Research Florham Park, NJ 07932
COS 420 Day 17. Agenda Finished Grading Individualized Projects Very large disparity in student grading No two students had same ranking for other students.
Backbone Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
Stable Internet Routing Without Global Coordination Jennifer Rexford AT&T Labs--Research Joint work with Lixin Gao.
Hot Potatoes Heat Up BGP Routing Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman Shaikh, and.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Wide-Area Traffic Management COS 597E: Software Defined Networking.
Computer Networks Layering and Routing Dina Katabi
EQ-BGP: an efficient inter- domain QoS routing protocol Andrzej Bęben Institute of Telecommunications Warsaw University of Technology,
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
CS 3700 Networks and Distributed Systems Inter Domain Routing (It’s all about the Money) Revised 8/20/15.
Routing protocols Basic Routing Routing Information Protocol (RIP) Open Shortest Path First (OSPF)
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks BGP.
Chapter 9. Implementing Scalability Features in Your Internetwork.
A Case Study in Understanding OSPFv2 and BGP4 Interactions Using Efficient Experiment Design David Bauer†, Murat Yuksel‡, Christopher Carothers† and Shivkumar.
Routing Fundamental W.lilakiatsakun. Review Routing Fundamental VLSM Static & Dynamic Routing Routing algorithm concept.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira (UCSD),
More on Internet Routing A large portion of this lecture material comes from BGP tutorial given by Philip Smith from Cisco (ftp://ftp- eng.cisco.com/pfs/seminars/APRICOT2004.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Routing and Routing Protocols
Evolving Toward a Self-Managing Network Jennifer Rexford Princeton University
Evolving Toward a Self-Managing Network Jennifer Rexford Princeton University
Mike Freedman Fall 2012 COS 561: Advanced Computer Networks Traffic Engineering.
1 Agenda for Today’s Lecture The rationale for BGP’s design –What is interdomain routing and why do we need it? –Why does BGP look the way it does? How.
1 Chapter 4: Internetworking (IP Routing) Dr. Rocky K. C. Chang 16 March 2004.
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
Michael Schapira, Princeton University Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks
Jian Wu (University of Michigan)
Border Gateway Protocol
COS 561: Advanced Computer Networks
BGP supplement Abhigyan Sharma.
Interdomain Traffic Engineering with BGP
Intra-Domain Routing Jacob Strauss September 14, 2006.
COS 561: Advanced Computer Networks
Internet Routing (COS 598A) Today: Topology Size
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
BGP Policies Jennifer Rexford
COS 561: Advanced Computer Networks
BGP Security Jennifer Rexford Fall 2018 (TTh 1:30-2:50 in Friend 006)
COS 561: Advanced Computer Networks
BGP Instability Jennifer Rexford
Computer Networks Protocols
Presentation transcript:

Internet Routing (COS 598A) Today: Hot-Potato Routing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm

Outline Hot-potato routing –Selecting closest egress from a set –Hot-potato routing changes Measuring hot-potato routing –BGP and IGP monitoring –Inferring causality Characterizing hot potatoes –Frequency and number of destinations –Convergence delays and forwarding loops Avoiding hot potatoes –Operational practices –New egress-selection techniques

Multiple Links Between Domains Client Web server Multiple links Middle of path

Hot-Potato Routing San Francisco Dallas New York Hot-potato routing = route to closest egress point when there is more than one route to destination ISP network 9 10 dest multiple egress points -All traffic from customer to peers -All traffic to customer prefixes with multiple connections

BGP Decision Process Highest local preference Lowest AS path length Lowest origin type Lowest MED (with same next hop AS) Lowest IGP cost to next hop Lowest router ID of BGP speaker “Equally good”

Motivations for Hot-Potato Routing Simple computation for the routers –IGP path costs are already computed –Easy to make a direct comparison Ensures consistent forwarding paths –Next router in path picks same egress point Reduces resource consumption –Get traffic out as early as possible –(But, what does IGP distance really mean???) dest

Hot-Potato Routing Change San Francisco Dallas New York ISP network dest failure - planned maintenance - traffic engineering 11 Routes to thousands of destinations switch egress points!!! Consequences:  Transient forwarding instability  Traffic shift  Interdomain routing changes 11

Why Care about Hot Potatoes? Understanding of Internet routing –Frequency of hot-potato routing changes –Influence on end-to-end performance Operational practices –Knowing when hot-potato changes happen –Avoiding unnecessary hot-potato changes –Analyzing externally-caused BGP updates Distributed root cause analysis –Each AS can tell what BGP updates it caused –Someone should know why each change happens

Measuring Hot Potatoes

Measuring Hot Potatoes is Hard Cannot collect data from all routers –OSPF: flooding gives complete view of topology –BGP: multi-hop sessions to several vantage points A single event may cause multiple messages –Group related routing messages in time Router implementation affects message timing –Analyze timing in the measurement data –Controlled experiments with router in lab Many BGP updates caused by external events –Classify BGP routing changes by possible causes

Measurement Infrastructure Measure both protocols –BGP and OSPF monitors Correlate the two streams –Match BGP updates with OSPF events Analyze the interaction X Y Z M ISP backbone OSPF messages BGP updates

Algorithm for Matching Classify BGP updates by possible OSPF causes Transform stream of OSPF messages into routing changes link failurerefreshweight change chg cost del chg cost Match BGP updates with OSPF events that happen close in time Stream of OSPF messages Stream of BGP updates time

Computing Cost Vectors Transform OSPF messages into path cost changes from a router’s perspective M X Y Z LSA weight change, LSA weight change, 10 X 5 Y 4 CHG Y, 7 X 5 Y 7 LSA delete DEL X Y 7 ADD X, 5 X 5 Y 7 OSPF routing changes: 2 1

Classifying BGP Updates Cannot have been caused by cost change –Destination just became (un)available in BGP –New BGP route through same egress point –New route better/worse than old (e.g., shorter) Can have been caused by cost change –New route is equally good as old route (perhaps X got closer, or Y got further away) X Y Z dst M

The Role of Time OSPF link-state advertisements –Multiple LSAs from a single physical event –Group into single cost vector change BGP update messages –Multiple BGP updates during convergence –Group into single BGP routing change Matching IGP to BGP –Avoid matching unrelated IGP and BGP changes –Match related changes that are close in time Characterize the measurement data to determine the right windows 10 sec 70 sec 180 sec

Characterizing Hot Potatoes

Frequency of Hot-Potato Changes router A router B

Variation Across Routers NY 10 9 SF A NY SF dest Small changes will make router A switch exit points to dst More robust to intradomain routing changes B Important factors: - Location: relative distance to egresses - Day: which events happen

Impact of an OSPF Change router A router B

BGP Reaction Time Transfer delay First BGP update All BGP updates BGP scan

Transferring Multiple Prefixes Cumulative Number of Hot-Potato Changes time BGP update – time LSA (seconds) 81 seconds delay

Data Plane Convergence R1R1 R2R2 dst E1E1 E2E2 Disastrous for interactive applications (VoIP, gaming, web) 2 – R 2 starts using E 1 to reach dst 1 – BGP decision process runs in R 2 R1R1 R2R2 dst E1E1 E2E2 3 – R 1 ’s BGP decision can take up to 60 seconds to run Packets to dst may be caught in a loop for 60 seconds! 2 – R 2 starts using E 1 to reach dst 1 – BGP decision process runs in R 2

BGP Updates Over Prefixes Cumulative %BGP updates % prefixes OSPF-triggered BGP updates affects ~50% of prefixes uniformly prefixes with only one exit point

Avoiding Hot Potatoes

Reducing the Impact of Hot Potatoes Vendors: better router implementation –Avoid timer-driven reaction to IGP changes –Move toward an event-drive BGP implementation Operators: avoid equal-distant exits Z 10 X Y Z X Y dst Small changes will make Z switch exit points to dst More robust to intra-domain routing changes

Reducing the Impact (Continued) Operators: new maintenance practices –Careful cost-in/cost-out of links –(But, is this problem over-constrained???) Z X Y dst 4 100

Is Hot-Potato Routing the Wrong Design? Too restrictive –Egress-selection mechanism dictates a policy Too disruptive –Small changes inside can lead to big disruptions Too convoluted –Intradomain metrics shouldn’t be so tightly coupled with BGP egress selection

Strawman Solution: Fixed Ranking Goal: no disruptions from internal changes –Each router has a fixed ranking of egresses –Select highest-ranked egress for each destination –Use tunnels from ingress to egress Disadvantage –Sometimes changing egresses would be useful –Harm from disruptions depends on application A B C D G E F A B dst

Egress Selection Mechanisms automatic adaptation robustness to internal changes hot-potato routing fixed ranking m(i,dst,e) = static rank(i,e) m(i,dst,e) = d(i,e), d is intradomain distance For each ingress, destination, egress:

TIE: Tunable Interdomain Egress Selection Flexible policies –Tuning  and  allows covering a wide-range of egress selection policies Simple computation –One multiplication and one addition –Information already available in routers Easy to optimize –Expressive for a management system to optimize m(i,dst,e) =  (i,dst,e) * d(i,e) +  (i,dst,e)

Using TIE Decouples egress selection from IGP paths –Egress selection is done by tuning  and  Requirements –Small change in router decision logic –Use of tunnels Configuring TIE –Network designers define high-level policy –Network management system translate policy into parameters

Example Policy: Minimizing Sensitivity Problem definition –Minimize sensitivity to equipment failures –No delay more than twice design time delay Simple change to routers –If distance is more than twice original distance Change to closest egress –Else Keep using old egress point Cannot change routers for all possible goals

Output of simulation phase At design time: m(C,dst,A) < m(C,dst,B) Minimizing Sensitivity with TIE A B C dst  (C,dst,A) +  (C,dst,A) < 10.  (C,dst,B) +  (C,dst,B) 11.  (C,dst,A) +  (C,dst,A) < 10.  (C,dst,B) +  (C,dst,B) 20.  (C,dst,A) +  (C,dst,A) > 10.  (C,dst,B) +  (C,dst,B) Optimization phase: solve integer programming

Evaluation of TIE on Real Networks Topology and egress sets –Abilene network (U.S. research network) –Set link weight with geographic distance Configuration of TIE –Considering single-link failures –Threshold of delay ratio: 2 –   [1,4] and 93% of  (i,dst,e)=1 –   {0,1,3251} and 90%  (i,dst,e)=0 Evaluation –Simulate single-node failures –Measure routing sensitivity and delay

Effectiveness of TIE Delay –Within the 2x target whenever possible (i.e., whenever hot-potato could do it) –Lower delay than the fixed-ranking scheme Sensitivity –Only a little more sensitive than a fixed ranking scheme –Much less sensitive than hot-potato routing

Conclusion Hot-potato routing –Simple, intuitive, distributed mechanism –But, large reaction to small changes Studying hot-potato routing –Measurement of hot-potato routing changes –Characterization of hot potatoes in the wild –Guidelines for vendors and operators Improving the routing architecture –Identify egress selection as its own problem –Decouple from the intradomain link weights

Next Time: Root-Cause Analysis Two papers –“Locating Internet Routing Instabilities” –“A Measurement Framework for Pin-Pointing Routing Changes” NANOG video –“Root Cause Analysis of Internet Routing Dynamics” Review just of the first paper –Summary, why accept, why reject, future work Think about your course project –One-page written proposal by Thursday March 24 –Final written report due Tuesday May 10