Understanding the Impact of Route Reflection in Internal BGP Ph.D. Final Defense presented by Jong Han (Jonathan) Park July 15 th, 2011 1.

Slides:



Advertisements
Similar presentations
Address-based Route Reflection Ruichuan Chen (MPI-SWS) Aman Shaikh (AT&T Labs - Research) Jia Wang (AT&T Labs - Research) Paul Francis (MPI-SWS) CoNEXT.
Advertisements

© 2009 Cisco Systems, Inc. All rights reserved. Cisco Public Presentation_ID 1 BGP Diverse Paths draft-ietf-grow-diverse-bgp-paths-dist-02 Keyur Patel.
Internet Routing (COS 598A) Today: Topology Size Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Part IV: BGP Routing Instability. March 8, BGP routing updates  Route updates at prefix level  No activity in “steady state”  Routing messages.
Advanced Networks 1. Delayed Internet Routing Convergence 2. The Impact of Internet Policy and Topology on Delayed Routing Convergence.
CS540/TE630 Computer Network Architecture Spring 2009 Tu/Th 10:30am-Noon Sue Moon.
1 Experimental Study of Internet Stability and Wide-Area Backbone Failure Craig Labovitz, Abha Ahuja Merit Network, Inc Presented by Changchun Zou.
© J. Liebeherr, All rights reserved 1 Border Gateway Protocol This lecture is largely based on a BGP tutorial by T. Griffin from AT&T Research.
1 BGP Anomaly Detection in an ISP Jian Wu (U. Michigan) Z. Morley Mao (U. Michigan) Jennifer Rexford (Princeton) Jia Wang (AT&T Labs)
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
How to Construct a Correct and Scalable iBGP Configuration Mythili Vutukuru Joint work with Paul Valiant, Swastik Kopparty and Hari Balakrishnan.
Swinog-3, 19 September 2001 Fabien Berger, BGP Oscillation …the Internet routing protocol is diverging! Fabien Berger CCIE#6143 IP-Plus.
A Comparative Study of Architectural Impact on BGP Next-hop Diversity 15 th IEEE Global Symposium, March 2012 Jong Han Park 1, Pei-chun Cheng 2, Shane.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v3.2—6-1 Scaling Service Provider Networks Introducing Route Reflectors.
1 Measurement of Highly Active Prefixes in BGP Ricardo V. Oliveira, Rafit Izhak-Ratzin, Beichuan Zhang, Lixia Zhang GLOBECOM’05.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
S ufficient C onditions to G uarantee P ath V isibility Akeel ur Rehman Faridee
Internet Routing Instability Labovitz et al. Sigcomm 1997 Largely adopted from Ion Stoica’s slide at UCB.
BGP: Inter-Domain Routing Protocol Noah Treuhaft U.C. Berkeley.
Slide -1- February, 2006 Interdomain Routing Gordon Wilfong Distinguished Member of Technical Staff Algorithms Research Department Mathematical and Algorithmic.
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Computer Networking Lecture 10: Inter-Domain Routing
Wresting Control from BGP: Scalable Fine-grained Route Control UCSD / AT&T Research Usenix —June 22, 2007 Dan Pei, Tom Scholl, Aman Shaikh, Alex C. Snoeren,
More on BGP Check out the links on politics: ICANN and net neutrality To read for next time Path selection big example Scaling of BGP.
1 Design and implementation of a Routing Control Platform Matthew Caesar, Donald Caldwell, Nick Feamster, Jennifer Rexford, Aman Shaikh, Jacobus van der.
Feb 12, 2008CS573: Network Protocols and Standards1 Border Gateway Protocol (BGP) Network Protocols and Standards Winter
Hot Potatoes Heat Up BGP Routing Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman Shaikh, and.
DARPA NMS PI Meeting November 14, 2002 Understanding BGP in Action Dan Massey USC/ISI.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.
I-4 routing scalability Taekyoung Kwon Some slides are from Geoff Huston, Michalis Faloutsos, Paul Barford, Jim Kurose, Paul Francis, and Jennifer Rexford.
Advertising Equal Cost Multi-Path Routes in BGP Manav Bhatia Samsung India Software Operations, Bangalore – India July 17, th IETF - Vienna draft-ecmp-routes-in-bgp-00.txt.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 2 Module 6 Routing and Routing Protocols.
1 Interdomain Routing (BGP) By Behzad Akbari Fall 2008 These slides are based on the slides of Ion Stoica (UCB) and Shivkumar (RPI)
CS 3700 Networks and Distributed Systems Inter Domain Routing (It’s all about the Money) Revised 8/20/15.
Scaling iBGP. BGP iBGP –Internal BGP –BGP peering between routers in same AS –Goal: get routes from a border router to another border router without losing.
1 GIRO: Geographically Informed Inter-domain Routing Ricardo Oliveira, Mohit Lad, Beichuan Zhang, Lixia Zhang.
Lecture 4: BGP Presentations Lab information H/W update.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira (UCSD),
David Wetherall Professor of Computer Science & Engineering Introduction to Computer Networks Hierarchical Routing (§5.2.6)
Border Gateway Protocol (BGP) W.lilakiatsakun. BGP Basics (1) BGP is the protocol which is used to make core routing decisions on the Internet It involves.
A Firewall for Routers: Protecting Against Routing Misbehavior1 June 26, A Firewall for Routers: Protecting Against Routing Misbehavior Jia Wang.
More on Internet Routing A large portion of this lecture material comes from BGP tutorial given by Philip Smith from Cisco (ftp://ftp- eng.cisco.com/pfs/seminars/APRICOT2004.
Routing Convergence Dan Massey Colorado State University.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
T. S. Eugene Ngeugeneng at cs.rice.edu Rice University1 COMP/ELEC 429/556 Introduction to Computer Networks Inter-domain routing Some slides used with.
On Understanding of Transient Interdomain Routing Failures Feng Wang, Lixin Gao, Jia Wang, and Jian Qiu Department of Electrical and Computer Engineering.
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
1 Quantifying Path Exploration in the Internet Ricardo Oliveira, Rafit Izhak-Ratzin, Lixia Zhang, UCLA Beichuan Zhang, UArizona Dan Pei, AT&T Labs -- Research.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
How to Construct a Correct and Scalable iBGP Configuration Mythili Vutukuru Joint work with Paul Valiant, Swastik Kopparty and Hari Balakrishnan.
By, Matt Guidry Yashas Shankar.  Analyze BGP beacons which are announced and withdrawn, usually within two hour intervals.  The withdraws have an effect.
CS 640: Introduction to Computer Networks Aditya Akella Lecture 11 - Inter-Domain Routing - BGP (Border Gateway Protocol)
© 2005 Cisco Systems, Inc. All rights reserved. BGP v3.2—6-1 Scaling Service Provider Networks Scaling IGP and BGP in Service Provider Networks.
BGP and ICMP. Exterior Gateway Protocol (EGP) Like RIP, but no metrics. Just if reachable. Rtr inside a domain collects reachability information and informs.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
Text BGP Basics. Document Name CONFIDENTIAL Border Gateway Protocol (BGP) Introduction to BGP BGP Neighbor Establishment Process BGP Message Types BGP.
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
1 Investigating occurrence of duplicate updates in BGP announcements Jong Han Park 1, Dan Jen 1, Mohit Lad 2, Shane Amante 3, Danny McPherson 4, Lixia.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v3.2—6-1 Scaling Service Provider Networks Introducing Confederations.
1 IGP Data Plane Convergence Benchmarking draft-ietf-bmwg-igp-dataplane-conv-app-00.txt draft-ietf-bmwg-igp-dataplane-conv-term-00.txt draft -ietf-bmwg-igp-dataplane-conv-meth-00.txt.
Doing Don’ts: Modifying BGP Attributes within an Autonomous System Luca Cittadini, Stefano Vissicchio, Giuseppe Di Battista Università degli Studi RomaTre.
Jian Wu (University of Michigan)
Border Gateway Protocol
Dynamic routing Routing Algorithm (Dijkstra / Bellman-Ford) – idealization All routers are identical Network is flat. Not true in Practice Hierarchical.
BGP supplement Abhigyan Sharma.
Lixin Gao ECE Dept. UMASS, Amherst
Scaling Service Provider Networks
BGP Instability Jennifer Rexford
Computer Networks Protocols
Presentation transcript:

Understanding the Impact of Route Reflection in Internal BGP Ph.D. Final Defense presented by Jong Han (Jonathan) Park July 15 th,

Research Overview 2 Internal Border Gateway Protocol and Route Reflection Understanding the Impact of BGP Route Reflection - Understanding BGP Next-hop Diversity (2 nd author, Global Internet Symposium 2011) - A Comparative Study of Architectural Impact on Next-hop Diversity (under submission to IMC’11) - Quantifying i-BGP Convergence inside large ISPs (under submission to IMC’11) BGP Route Reflection Protocol Diagnosis - Investigating Occurrence of Duplicate Updates in BGP Announcements (PAM’10, Best Paper) Others (listed as 2 nd author) on BGP Performance - Route Flap Damping with Assured Reachability (AINTEC’10) - Explaining Slow BGP Table Transfers: Implementing a TCP Delay Analyzer (under submission to IMC’11)

Motivation Route reflection was added to the routing architecture to fix a few critical problems Despite the wide adoption of RR, a systematic evaluation and analysis on the impact of route reflection is missing, which can be helpful in: – Understanding of the protocol performance and enhancements – More realistic simulations – Designing the future routing protocols This work is to fill in the void 3

Outline 4 Introduction to Internal BGP and Route Reflection Understanding BGP Path Diversity and the Impact of Route Reflection Understanding BGP Convergence inside Large ISPs

Introduction to full-mesh i-BGP Total number of sessions = N(N-1)/2 Number of additional sessions for an additional i-BGP router = N Total number of i-BGP routers in AS1 = 4 = N AS1 AS2 AS3 AS4 e-BGP i-BGP This router is no longer needed. Remove! 5

Full-mesh i-BGP does not scale City 1 City 2City 3 Large ISPs have hundreds or even more than a thousand routers internally Full mesh leads to a high cost in provisioning – Adding or removing a router requires reconfigurations of all other routers 6

Addressing the scalability problem of full-mesh i-BGP Two solutions are suggested in 1996 – AS confederations (RFC 1965) – Route reflection (RFC 1966) This work focuses on route reflection – Dominant solution – Main concerns shared with AS confederation Path diversity reduction Convergence delay 7

Route reflection solves scalability problem Total number of sessions = 4 Number of additional sessions for an additional i-BGP router = 1 Total number of i-BGP routers = 5 = N AS1 AS2 route reflector client 1 client 2 client 3 client 4 e-BGP i-BGP 8

Large ISP revisited with hierarchical RR Route reflection substantially reduces the total number of sessions Route reflection can be deployed hierarchically to reduce even more 9

Negative Impact of BGP route reflection Negative side effects – Routing performance Path diversity [Uhlig, Networking’06] Convergence Others – Robustness to failures – Internal update explosion [McPherson,APNIC talk, 2009] – Optimal route selection [Vutukuru, Infocom’06] – Routing correctness Data forwarding loop [Griffin, Sigcomm’02] Route oscillations [McPherson, Internet Draft, 2000] 10

Outline 11 Introduction to Internal BGP and Route Reflection Understanding BGP Path Diversity and the Impact of Route Reflection Understanding BGP Convergence inside Large ISPs

Definitions Next-hop POP and AS – Next-hop Point-of-Presence (i.e., city in which the next-hop router is located) and AS that the ISP uses to reach a given external destination BGP Next-hop Diversity – Number of distinct next-hops to reach a given external destination as used simultaneously inside a given ISP 12

Why do we care about path diversity? Higher path diversity – More flexibility in traffic engineering and load balancing – Higher availability Current IETF efforts to increase BGP diversity – Diverse-path, Add-path, and External-best 13

Path diversity reduction due to route reflection AS1 RR RTR2 RTR3 AS2, p p: NH = RTR1, ASPATH = AS2 p: NH = RTR4, ASPATH = AS2 OTHERS p: NH = RTR1, ASPATH = AS2 p: NH = RTR4, ASPATH = AS2 RTR1, RR RTR1 RTR4 ALL 14

Main questions to answer What degree of BGP next-hop diversity do existing ISPs have now? Does route reflection deployment reduce BGP next-hop diversity? 15

Data collection settings ISP FM : Tier-1 ISP with full-mesh i-BGP backbone routing infrastructure ISP RR : Tier-1 ISP with route reflection i-BGP backbone routing infrastructure i-BGP full-mesh ISP FM backbone sub-AS Sub AS AS 1 Sub AS Sub AS AS i ISP RR AS ii AS 11 AS 22 AS 2 Collector BGP routerNode type: confederation BGP 1 st level reflector2 nd level reflector3 rd level reflector Session type: i-BGP reflector to client i-BGP peer e-BGP peer 16

BGP next-hop diversity of the 2 ISPs ISP FM ISP RR Common observations – A small number of prefixes with a very high degree of next-hop diversity – Prefixes with very low degree (diversity=1) of next-hop diversity – A few large groups of prefixes with the same moderate degree of next-hop diversity – A significant number of prefixes (more than 90% and 65% respectively) have multiple next-hop POPs and ASes Overall, ISP RR has relatively lower next-hop diversity, compared to ISP FM 17

Inferring external connectivity 18 AS1 R2 R1 AS2, p R3 R4 AS3 In the absence of failures, the reachability through R2 is not visible If the current best path fails, the path through R2 will be explored

Inferred external connectivity vs. next-hop POPs The external connectivity is not the main reason for the difference ISP FM (during 1 st week of June 2010) 19 ISP RR (during 1 st week of June 2010)

Paths can be hidden due to path preference 7 BGP path attribute values used by a BGP router in BGP best path selection – First 4 are independent from the i-BGP topological location of the given router LOCAL_PREF AS_PATH length ORIGIN MED – The rest 3 attribute values change depending on the i-BGP topological location of the given router Prefer e-BGP over i-BGP IGP cost Router ID 20

Diversity reduction by the first 4 BGP path attributes The first 2 criteria of BGP path selection hides the majority of the path diversity – About 16% and 10% reduction for ISP FM and 34% and 7.6% reduction for ISP RR by (1) LOCAL_PREF and (2) AS_PATH length respectively 21 ISP FM (during 1 st week of June 2010) ISP RR (during 1 st week of June 2010)

Summary The overall next-hop diversity varies widely, depending on the topological location of origin AS for a given prefix The difference in the overall next-hop diversity is due to i-BGP topology-independent factors – More specifically, the first 2 BGP best selection criteria hides up to 42% Next-hop diversity reduction by ISP RR ’s hierarchical RR is less than 3.3% – Main reason. significant reduction by the i-BGP topology-independent factors already 22

Outline 23 Introduction to internal BGP and Route Reflection Understanding BGP Path Diversity and the Impact of Route Reflection Understanding BGP Convergence inside Large ISPs

Definitions Event – Change in routing information to reach a given external prefix Monitor – Router from which i-BGP data is collected within a given ISP i-BGP convergence – Convergence of all monitors inside a given ISP for a given event 24

Why do we care about i-BGP convergence? BGP suffers from slow convergence – May cause severe performance problems in data delivery [TON’01, Labovitz] [Infocom’01,Labovitz] [IMC’03,Mao] [Sigcomm’06,Wang] at inter-AS level – Virtually no measurement studies exist on BGP convergence inside an ISP 25

Increased convergence delay in i-BGP RR AS1 RTR 1 RR1 RTR 2 RR2 AS2, p Update path 1.RR2->RTR1 2.RR1->RTR1 3.RR2->RR1->RTR1 4.RR1->RR2->RTR1 5.Not reachable There is no path to prefix p! 1. Delay due to hierarchy - additional path distance - additional processing delays 2. Delay due to route reflector redundancy - increased # of control paths 26 RTR 3 RTR 4

Main questions to answer What does i-BGP convergence look like? What is the impact of route reflection convergence delay ? 27

Data collection settings ISP FM : the collector is a member of the i-BGP full-mesh ISP RR : the collector is a client of the 2 nd level route reflectors i-BGP full-mesh ISP FM backbone sub-AS Sub AS AS 1 Sub AS Sub AS AS i ISP RR AS ii AS 11 AS 22 AS 2 Collector 28 BGP routerNode type: confederation BGP 1 st level reflector2 nd level reflector3 rd level reflector Session type: i-BGP reflector to client i-BGP peer e-BGP peer

Inferring best path selection for peers in i-BGP full-mesh Q: Best path used by RTR3 to reach prefix p? A: Use geographical information of the routers to approximate IGP cost in the BGP best path selection 29 AS1 RTR1 Path 1 to prefix p RTR2 RTR3 Path 2 to prefix p Which path does RTR3 use? Collector SelectBestPath(Path1,Path2) 1.LOCAL_PREF 2.AS_PATH length 3.ORIGIN 4.MED 5.E-BGP over I-BGP 6.IGP cost to the path 7.Router ID (tie breaker)

High-level view of quantifying i-BGP convergence monitor n monitor 1 collector Event Identification (update clustering) event e T = 60 seconds path preference T T S S 30 METRICS 1. Duration(e) 2. NumUpdates(e) 3. NumPaths(e) Event Classification (Determine Type & Scale)

Event identification: time-based update clustering 31 X = 60 seconds ISP FM Inter-arrival times of beacon prefix updates during June 2010 (seconds) Fraction of updates (CCDF) Time Example of update arrivals for a given beacon prefix 7200 seconds

Event classification: adding type information 32 Event M Path DisturbancePath ChangeSame Path I dist I spath I equal I short I long I up I down Time p0p0 p1p1 pnpn p 0 = p n p 0 != p n p 0 = … = p n Updates generated from a monitor in an event [IMC’06 Oliveira] The last update from the previous event ISP FM 8.9% 3.0% 3.1% 35.8% 40.1% 0.3% 8.8% ISP RR 15.7% 4.9% 4.6% 29.7% 31.9% 0% 13.2%

Event classification: adding scale information Event Scale – Se = (# of POPs observed the event) / (total # of monitored POPs) Event Scale Types – Local Event: only one POP inside the ISP observes the event – AS-wide Event: all POPs inside the ISP observe the event – Others: non-local or non-AS-wide events 33

Identified events from ISP RR and ISP FM 34 The total number of events gradually increases as it fluctuates Most of events are either local or AS-wide in their scale Local events are observed in all POPs Number of Identified Events per Month Scale of Events During June 2010

Event characteristics 35 The majority of local events converge within 1 second – 97% and 72% for ISP RR and ISP FM respectively – Difference due to the different delays of the neighboring ASes AS-wide event duration differs between the two ISPs – Due to the delayed updates via different paths ISP FM ISP RR Local Events AS-wide Events

How Much Delay Does Route Reflection Add to the Overall i-BGP Convergence? 36

Case studies in ISP RR : estimating the additional delay caused by route reflection Additional delays due to route reflector redundancy – Identify the superfluous updates generated purely due to route reflector redundancy – What is the additional convergence time solely contributed by these updates? Additional delays due to hierarchy – Compare the direct and RR paths between all monitors in the backbone routing infrastructure inside ISP RR 37

Superfluous update example 38 ISP RR BR1 BR2 1.How many superfluous updates? 2.What is the additional delay caused by these updates?

Superfluous updates due to route reflector redundancy and its Impact on convergence The amount of superfluous updates is not significant in most cases – Convergence duration: 0.3%, 0.2%, 0.4% and 5.3% for I up, I short, I long and I down increase – Number of updates: 3%, 4%, 13%, and 40% increase for I up, I short, I long, and I down increase 39

Is there routing plane path stretch in the top 2-levels of route reflection inside ISP RR ? Measure the physical path length and latency for RR paths using traceroute and ping Repeat the measurement for direct paths and compare with RR paths 40 Distance Direct (AA,BB) = AB AA BB where r i is a router in the order detected by traceroute Distance RR (AA,BB) = Distance Direct (AA,B) + Distance Direct (B,BB)

Path distance and latency of direct and RR paths 41 In case of ISP RR, RR paths are shorter with less latency – i.e., the RRs are aligned well with the shortest physical paths

Summary Defined, quantified, and analyzed i-BGP convergence i-BGP routing events mostly are local or AS-wide in their scale – Local events: mostly lasts less than 1 second – AS-wide events: the duration is longer and mostly depends on external factors Our case study of ISP RR shows RR does increase the number of updates and convergence duration However, the amount is not significant – Additional 0.3%, 0.2%, 0.4%, and 5.3% increase in the duration of I up, I short, I long, and I down RR topology design can mitigate the additional delays 42

Thank you. 43

Backup Slides 44

Paths can be hidden due to path preference In BGP, a less preferred path is not announced by the border routers In this example, external connectivity: 3 POPs, next-hop diversity: 2 POPs 45 AS1 R2 R1 AS2, p R3 R4 AS3

Topology-independent diversity reduction in ISP FM LOCAL_PREF and AS_PATH length are the two main impacting attributes that hide paths – About 16% and 10% respectively 46

Topology-independent diversity reduction in ISP RR Significant reduction mostly due to the LOCAL_PREF value – About 34% and 7.6% by LOCAL_PREF and AS_PATH length respectively 47

Event characteristics 48 The majority of local events converge within 1 second – 97% and 72% for ISP RR and ISP FM respectively i-BGP convergence duration differs between the two ISPs – Due to the difference in connectivity and delayed updates via different paths ISP FM ISP RR Local Events AS-wide Events

Update reduction in full-mesh i-BGP 49 Setting – Data: NTT i-BGP data from – Apply different MRAI timers to the monitor-collector session and calculate the reduction for beacon prefixes Observation – Higher MRAI timer leads to update reduction, and the update reduction is not significant

Increased convergence time in full-mesh i-BGP Setting – Data: NTT i-BGP data from – Apply different MRAI timers to the monitor-collector session and calculate the convergence duration for beacon prefixes Observation – The increased convergence time is proportional to the MRAI timer used 50

Update reduction in i-BGP HRR Setting – Data: Level3 i-BGP data from – Apply different MRAI timers to the monitor-collector session and calculate the reduction for beacon prefixes Observation – Reduction MRAI timer with 1 second effective enough; the update propagation and the internal path exploration for a given external path is mostly under 1 second within the ISP 51

Increased convergence time in i-BGP HRR Setting – Data: Level3 i-BGP data from – Apply different MRAI timers to the monitor-collector session and calculate the convergence duration for beacon prefixes Observation – The increased convergence time is proportional to the MRAI timer used in Iup 52