1 Aman Shaikh UCSC SHS IMW - 2002 A Case-study of OSPF Behavior in a Large Enterprise Network Aman Shaikh, UCSC Chris Isett, Siemens Health Services Albert.

Slides:



Advertisements
Similar presentations
OSPF 1.
Advertisements

1 A. Sshaikh, A. Greenberg; Nov 01 UCSC Sigcomm IMW Experience in Black-box OSPF Measurement Aman Shaikh, UCSC Albert Greenberg, AT&T Labs-Research.
Introduction to OSPF.
Lonnie Decker Multiarea OSPF for CCNA Department Chair, Networking/Information Assurance Davenport University, Michigan August 2013 Elaine Horn Cisco Academy.
Designing OSPF Networks
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 6: Multiarea OSPF Scaling Networks.
1 Aman Shaikh: June 02 UCSC INFOCOM 2002 Avoiding Instability during Graceful Shutdown of OSPF Aman Shaikh, UCSC Joint work with Rohit Dube, Xebeo Communications.
CS 4700 / CS 5700 Network Fundamentals Lecture 9: Intra Domain Routing Revised 7/30/13.
Shadow Configurations: A Network Management Primitive Richard Alimi, Ye Wang, Y. Richard Yang Laboratory of Networked Systems Yale University.
CCNA 2 v3.1 Module 6.
1 Relates to Lab 4. This module covers link state routing and the Open Shortest Path First (OSPF) routing protocol. Dynamic Routing Protocols II OSPF.
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Routing and Routing Protocols
OSPF Monitor Architecture, Design and Deployment Experience
Objectives After completing this chapter you will be able to: Describe hierarchical routing in OSPF Describe the 3 protocols in OSPF, the Hello, Exchange.
1 Semester 2 Module 6 Routing and Routing Protocols YuDa college of business James Chen
1 Relates to Lab 4. This module covers link state routing and the Open Shortest Path First (OSPF) routing protocol. Dynamic Routing Protocols II OSPF.
Fundamentals of Networking Discovery 2, Chapter 6 Routing.
© 2006 Cisco Systems, Inc. All rights reserved. ICND v2.3—3-1 Determining IP Routes Introducing Link-State and Balanced Hybrid Routing.
Routing. A world without networks and routing  No connection between offices, people and applications  Worldwide chaos because of the lack of centralized.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 5: Adjust and Troubleshoot Single- Area OSPF Scaling Networks.
Open Shortest Path First (OSPF) -Sheela Anand -Kalyani Ravi -Saroja Gadde.
Routing and Routing Protocols Dynamic Routing Overview.
1 CS 4396 Computer Networks Lab Dynamic Routing Protocols - II OSPF.
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
Routing and Routing Protocols Routing Protocols Overview.
1 Introducing Routing 1. Dynamic routing - information is learned from other routers, and routing protocols adjust routes automatically. 2. Static routing.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 2 Module 6 Routing and Routing Protocols.
Collected By: Mehdi Daneshvar Supervisor: E.M.Kosari.
LAN Switching and WAN Networks Topic 6 - OSPF. What we have done so far! 18/09/2015Richard Hancock2  Looked at the basic switching concepts and configuration.
Instructor & Todd Lammle
© 1999, Cisco Systems, Inc OSPF Overview RFC 2328, 2178, 1583.
© 2002, Cisco Systems, Inc. All rights reserved. 1 Determining IP Routes.
Network Architecture and Design
Introduction to OSPF Nishal Goburdhan. Routing and Forwarding Routing is not the same as Forwarding Routing is the building of maps Each routing protocol.
Page 110/27/2015 A router ‘knows’ only of networks attached to it directly – unless you configure a static route or use routing protocols Routing protocols.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 3 v3.0 Module 2 Single-Area OSPF.
CCNA 3 Week 2 Link State Protocols OSPF. Copyright © 2005 University of Bolton Distance Vector vs Link State Distance Vector –Copies Routing Table to.
1 Module 4: Implementing OSPF. 2 Lessons OSPF OSPF Areas and Hierarchical Routing OSPF Operation OSPF Routing Tables Designing an OSPF Network.
1 of of 35 Single Area OSPF Concepts 3 of 35 OSPF Basics.
CCNA Guide to Cisco Networking Fundamentals Fourth Edition
Lecture 2 Agenda –Finish with OSPF, Areas, DR/BDR –Convergence, Cost –Fast Convergence –Tools to troubleshoot –Tools to measure convergence –Intro to implementation:
CCNA 2 Week 6 Routing Protocols. Copyright © 2005 University of Bolton Topics Static Routing Dynamic Routing Routing Protocols Overview.
Saeed Darvish Pazoki – MCSE, CCNA Abstracted From: Cisco Press – ICND 2 – 10 EIGRP 1.
Routing and Routing Protocols
Routing Networks and Protocols Prepared by: TGK First Prepared on: Last Modified on: Quality checked by: Copyright 2009 Asia Pacific Institute of Information.
1 OSPF in Multiple Areas. 2 2 Scalability Problems in Large OSPF Areas Scalability problems in large OSPF areas include Large routing tables Large routing.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 8: Single-Area OSPF Routing Protocols.
1 Version 3.1 Module 6 Routed & Routing Protocols.
Routing protocols. 1.Introduction A routing protocol is the communication used between routers. A routing protocol allows routers to share information.
CO5023 EIGRP. Features of EIGRP EIGRP is a highly advanced distanced vector routing protocol Uses Protocol dependent modules to route different packets.
© 2002, Cisco Systems, Inc. All rights reserved..
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 8: Single-Area OSPF Routing Protocols.
1 24-Feb-16 S Ward Abingdon and Witney College OSPF CCNA Exploration Semester 2 Chapter 11.
Cisco 2 - Routers Perrine modified by Brierley Page 13/21/2016 Chapter 4 Module 6 Routing & Routing Protocols.
Single Area OSPF Module 2, Review How routing information is maintained Link-state routers apply the Dijkstra shortest path first algorithm against.
Routing and Routing Protocols CCNA 2 v3 – Module 6.
CS 268: Lecture 9 Intra-domain Routing Protocols Ion Stoica Computer Science Division Department of Electrical Engineering and Computer Sciences University.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 8: Single-Area OSPF Routing & Switching.
Link State Routing protocol
OSPF (Open Shortest Path First)
ISIS Flooding Reduction in MSDC
© 2002, Cisco Systems, Inc. All rights reserved.
Chapter 9: Multiarea OSPF
CS 268: Lecture 8 Intra-domain Routing Protocols
Dynamic Routing and OSPF
Chapter 8: Single-Area OSPF
Chapter 9: Multiarea OSPF
Chapter 9: Multiarea OSPF
Presentation transcript:

1 Aman Shaikh UCSC SHS IMW A Case-study of OSPF Behavior in a Large Enterprise Network Aman Shaikh, UCSC Chris Isett, Siemens Health Services Albert Greenberg, AT&T Labs-Research Matthew Roughan, AT&T Labs-Research Joel Gottlieb, AT&T Labs-Research IMW – November 07, 2002

2 Aman Shaikh UCSC SHS IMW Why Study OSPF Behavior? Any meaningful performance assurance depends on routing stability –An internal network change (OSPF event) can have major impact on services, flows and customers Transients can degrade services significantly (e.g., VoIP) Expectations for IP network management are higher –Improve OSPF performance, particularly reliable and fast detection of topology change, without introducing instabilities –Changes are needed Parameter adjustment or more fundamental Realistic workload model for simulations are needed –Testing scalability, convergence, reliability However, the behavior and performance of OSPF in large ISPs and enterprise networks is not well understood

3 Aman Shaikh UCSC SHS IMW OSPF OSPF is a Link-state routing protocol –All routers in the domain come to a consistent view of the topology by exchange of Link State Advertisements (LSAs) Router describes its local connectivity (i.e., set of links) in an LSA –Set of LSAs (self-originated + received) at a router = topology Hierarchical routing –OSPF domain can be divided into areas –Hub-and-spoke topology with area 0 as hub and other non-zero areas as spokes

4 Aman Shaikh UCSC SHS IMW OSPF Performance OSPF processing impacts convergence, (in)stability –Load is increasing as networks grow Bulk of OSPF processing is due to LSAs –Sending/receiving LSAs –LSAs can trigger Route calculation (Dijkstra’s algorithm) Understanding dynamics of LSA traffic is key for a better understanding of OSPF

5 Aman Shaikh UCSC SHS IMW Methodology Categorize and baseline LSA traffic Detect, diagnose and act on anomalies Propose changes to improve performance

6 Aman Shaikh UCSC SHS IMW Categorizing LSA Traffic A router originates an LSA due to… –Change in network topology Example: link goes down or comes up Detection of anomalies and problems –Periodic soft-state refresh Recommended value of interval is 30 minutes Forms baseline LSA traffic LSAs are disseminated using reliable flooding –Includes change and refresh LSAs –Flooding leads to duplicate copies of LSAs being received at a router –Overhead: wastes resources Change LSAs Refresh LSAs Duplicate LSAs

7 Aman Shaikh UCSC SHS IMW Highlights of the Results Categorize, baseline and predict –Categories: Refresh, Change, Duplicate; External, Internal –Bulk of LSA traffic is due to refresh –Refresh LSA traffic is smooth: no evidence of refresh synchronization across network –Refresh LSA traffic is predictable from router configuration info Detect, diagnose and act –Almost all LSAs arise from persistent yet partial failure modes –Internal LSA spikes Indicate router hardware degradation Carry out preventive maintenance –External LSA spikes Indicate degradation in customer connectivity Call customer before customer calls you Propose Improvements –Simple configuration changes to reduce duplicate LSA traffic

8 Aman Shaikh UCSC SHS IMW Enterprise Network Case Study The network provides customers with connectivity to applications and databases residing in the data center OSPF network –15 areas, 500 routers This case study covers 8 areas, 250 routers One month: April 2002 –Link-layer = Ethernet-based LANs Customers are connected via leased lines –Customer routes are injected via EIGRP into OSPF The routes are propagated via external LSAs Quite reasonable for the enterprise network in question

9 Aman Shaikh UCSC SHS IMW Enterprise Network Topology Area 0Area BArea C Area A Servers Database Applications Customer OSPF Domain Customer B1B2 Monitor LAN1LAN2 Border rtrs Area A Area 0 External (EIGRP) Monitor is completely passive No adjacencies with any routers Receives LSAs on a multicast group

10 Aman Shaikh UCSC SHS IMW Area 4 Days Area 3 Days Area 2 Days Area 0 Days LSA Traffic in Different Areas Duplicate LSAs Change LSAs Refresh LSAs Artifact: 23 hr day (Apr 7) Genuine Anomaly

11 Aman Shaikh UCSC SHS IMW Baseline LSA Traffic: Refresh LSAs Refresh LSA traffic can be reliably predicted using information available in router configuration files –Important for workload modeling –See paper for details Area 2Area 3 Days

12 Aman Shaikh UCSC SHS IMW Refresh process is not synchronized No evidence of synchronization –Contrary to simulation-based study in [Basu01] Reasons –Changes in the topology help break synchronization –LSA refresh at one router is not coupled with LSA refresh at other routers –Drift in the refresh interval of different routers Negligible LSA clumping

13 Aman Shaikh UCSC SHS IMW Anomaly Detection: Change LSAs Internal to OSPF domain versus external –Change LSAs due to external events dominated –Not surprising due to large number of leased lines used to import customer routes into OSPF Customer volatility  network volatility Days

14 Aman Shaikh UCSC SHS IMW Root Causes of Change LSAs Persistent problem  flapping  numerous change LSAs –Internal LSA spikes  hardware router problems OSPF monitor identified a problem (not visible to SNMP-based network mgt tools) early and led to preventive maintenance –External LSA spikes  customer route volatility Overload of an external link to a customer between 8 pm – 4 am causes EIGRP session on that link to flap

15 Aman Shaikh UCSC SHS IMW Overhead: Duplicate LSAs Why do some areas witness substantial duplicate LSA traffic, while other areas do not witness any? –OSPF flooding over LANs leads to control plane asymmetries and to imbalances in duplicate LSA traffic Days

16 Aman Shaikh UCSC SHS IMW LSA Flooding over Broadcast LANs DR = Designated router, BDR = Backup Designated Router Who becomes DR and BDR depends on configuration Flooding on a LAN is a two-step process: 1.A router multicasts LSA to DR and BDR 2.DR or BDR multicasts LSA to other routers LSA appears only twice on LAN instead of n – 1 times DRBDR LAN

17 Aman Shaikh UCSC SHS IMW Control Plane Asymmetry Two LANs (LAN1 and LAN2) in each area Monitor is on LAN1 Routers B1 and B2 are connected to LAN1 and LAN2 LSAs originated on LAN2 can get duplicated depending on which routers have become DR and BDR on LAN1 –Leads to control plane asymmetry –Four cases

18 Aman Shaikh UCSC SHS IMW Four Cases B2 B1 DR LAN2 LAN1 LAN2 Case 4 (R, R’) LAN1 B1 (BDR) B2 DR Case 3 (R, B1) B1 (DR) B2 (BDR) LAN1 LAN2 Case 1 (B1, B2) B1 (DR) B2 Case 2 (B1, R) LAN1 LAN2

19 Aman Shaikh UCSC SHS IMW Eliminating Duplicate LSA Traffic Case1Case 2Case 3Case 4 Duplicate LSA traffic HighNoneHighNone Deterministic via configuration YesNo Yes Area 2 XX configuration change Area 3 XX configuration change

20 Aman Shaikh UCSC SHS IMW Summary Categorize and baseline LSA traffic –Refresh LSAs: constitute bulk of overall LSA traffic No evidence of synchronization between different routers Refresh LSA traffic predictable from configuration information Detect, diagnose and act on anomalies –Change LSAs: can indicate persistent yet partial failure modes Internal LSA spikes  hardware router problems  preventive router maintenance External LSA spikes  customer congestion problems  “preventive” customer care Propose changes to improve performance –Duplicate LSAs: can arise from control plane asymmetries Simple configuration changes can eliminate duplicate LSAs and improve performance

21 Aman Shaikh UCSC SHS IMW Future Work Study OSPF behavior in other commercial networks –ISPs, enterprise networks Longer term studies Combine with other data sources –BGP: interaction with OSPF –Traffic: impact of routing on forwarding Convergence Better monitoring and management tools Good simulation models –Combine with router-level measurements [Shaikh & Greenberg, IMW ‘01]

22 Aman Shaikh UCSC SHS IMW Backup

23 Aman Shaikh UCSC SHS IMW Questions OSPF is a Link-state routing protocol –All routers in the domain come to a consistent view of the topology by exchange of Link State Advertisements (LSAs) –Three categories of LSAs: refresh, change, duplicate Refresh –Is the refresh traffic predictable? Can it be baselined? –Is refresh traffic synchronized in real networks? Change –What is the nature of change LSA traffic, arising from internal and external sources? –What do the failure modes look like? –Is it possible to use this traffic to trigger preventive maintenance traffic (e.g., just as measurements of bit error rates triggers preventive maintenance of the data plane) Duplicate –Can duplicate LSAs be reduced? At what cost to reliability?

24 Aman Shaikh UCSC SHS IMW Router Model Route Processor (CPU) FIB Interface card Forwarding Switching Fabric Data packet Topology View SPF Calculation OSPF Process LSA LS Ack LSA Forwarding LSA Processing LSA Flooding SPF Calculation FIB Update