A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University

Slides:



Advertisements
Similar presentations
Multihoming and Multi-path Routing
Advertisements

Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
Routing Basics.
Border Gateway Protocol Ankit Agarwal Dashang Trivedi Kirti Tiwari.
Fundamentals of Computer Networks ECE 478/578 Lecture #18: Policy-Based Routing Instructor: Loukas Lazos Dept of Electrical and Computer Engineering University.
Dynamic Routing Scalable Infrastructure Workshop, AfNOG2008.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 Routing Working at a Small-to-Medium Business or ISP – Chapter 6.
1 Route Control Platform – IEEE CCW 2004 Route Control Platform Making an AS look and act like one router Aman Shaikh AT&T Labs - Research IEEE CCW 2004.
Traffic Engineering With Traditional IP Routing Protocols
1 Route Control Platform Making the Network Act Like One Big Router Jennifer Rexford Princeton University
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
A Routing Control Platform for Managing IP Networks Jennifer Rexford Computer Science Department Princeton University
New Routing Architectures Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
MIRED: Managing IP Routing is Extremely Difficult Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Incrementally Deployable Security for Interdomain Routing (TTA-4, Type-I) Jennifer Rexford, Princeton University Joan Feigenbaum, Yale University July.
1 Route Control Platform – IEEE CCW 2004 Route Control Platform Making an AS look and act like a router Aman Shaikh AT&T Labs - Research IEEE CCW 2004.
Wresting Control from BGP: Scalable Fine-grained Route Control UCSD / AT&T Research Usenix —June 22, 2007 Dan Pei, Tom Scholl, Aman Shaikh, Alex C. Snoeren,
1 Design and implementation of a Routing Control Platform Matthew Caesar, Donald Caldwell, Nick Feamster, Jennifer Rexford, Aman Shaikh, Jacobus van der.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Internet Routing (COS 598A) Today: Telling Routers What to Do Jennifer Rexford Tuesdays/Thursdays.
Network Monitoring for Internet Traffic Engineering Jennifer Rexford AT&T Labs – Research Florham Park, NJ 07932
Routing and Routing Protocols
14 – Inter/Intra-AS Routing
1 Interdomain Routing Policy Reading: Sections plus optional reading COS 461: Computer Networks Spring 2008 (MW 1:30-2:50 in COS 105) Jennifer Rexford.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Backbone Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
© 2009 Cisco Systems, Inc. All rights reserved. ROUTE v1.0—6-1 Connecting an Enterprise Network to an ISP Network Considering the Advantages of Using BGP.
1 Network-wide Decision Making: Toward a Wafer-thin Control Plane Jennifer Rexford, Albert Greenberg, Gisli Hjalmtysson ATT Labs Research David A. Maltz,
Internet Routing (COS 598A) Today: Router Software Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Central.
1 ECE453 – Introduction to Computer Networks Lecture 10 – Network Layer (Routing II)
Dr. John P. Abraham Professor University of Texas Pan American Internet Routing and Routing Protocols.
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
I-4 routing scalability Taekyoung Kwon Some slides are from Geoff Huston, Michalis Faloutsos, Paul Barford, Jim Kurose, Paul Francis, and Jennifer Rexford.
1 Computer Communication & Networks Lecture 22 Network Layer: Delivery, Forwarding, Routing (contd.)
Routing and Routing Protocols Routing Protocols Overview.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
CS 3700 Networks and Distributed Systems Inter Domain Routing (It’s all about the Money) Revised 8/20/15.
Routing protocols Basic Routing Routing Information Protocol (RIP) Open Shortest Path First (OSPF)
Using Measurement Data to Construct a Network-Wide View Jennifer Rexford AT&T Labs—Research Florham Park, NJ
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks BGP.
Border Gateway Protocol (BGP) W.lilakiatsakun. BGP Basics (1) BGP is the protocol which is used to make core routing decisions on the Internet It involves.
A Firewall for Routers: Protecting Against Routing Misbehavior1 June 26, A Firewall for Routers: Protecting Against Routing Misbehavior Jia Wang.
More on Internet Routing A large portion of this lecture material comes from BGP tutorial given by Philip Smith from Cisco (ftp://ftp- eng.cisco.com/pfs/seminars/APRICOT2004.
T. S. Eugene Ngeugeneng at cs.rice.edu Rice University1 COMP/ELEC 429/556 Introduction to Computer Networks Inter-domain routing Some slides used with.
CS 4396 Computer Networks Lab BGP. Inter-AS routing in the Internet: (BGP)
Evolving Toward a Self-Managing Network Jennifer Rexford Princeton University
ICS 156: Networking Lab Magda El Zarki Professor, ICS UC, Irvine.
Evolving Toward a Self-Managing Network Jennifer Rexford Princeton University
Routing in the Inernet Outcomes: –What are routing protocols used for Intra-ASs Routing in the Internet? –The Working Principle of RIP and OSPF –What is.
1 Agenda for Today’s Lecture The rationale for BGP’s design –What is interdomain routing and why do we need it? –Why does BGP look the way it does? How.
1 Chapter 4: Internetworking (IP Routing) Dr. Rocky K. C. Chang 16 March 2004.
Michael Schapira, Princeton University Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks
Inter-domain Routing Outline Border Gateway Protocol.
Separating Routing From Routers Jennifer Rexford Princeton University
Separating Routing From Routers Jennifer Rexford Princeton University
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 Routing Working at a Small-to-Medium Business or ISP – Chapter 6.
1 CS716 Advanced Computer Networks By Dr. Amir Qayyum.
Working at a Small-to-Medium Business or ISP – Chapter 6
BGP 1. BGP Overview 2. Multihoming 3. Configuring BGP.
Jian Wu (University of Michigan)
COS 561: Advanced Computer Networks
BGP supplement Abhigyan Sharma.
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
Working at a Small-to-Medium Business or ISP – Chapter 6
COS 461: Computer Networks
BGP Instability Jennifer Rexford
Computer Networks Protocols
Presentation transcript:

A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University

Outline Revisiting the control plane –Complexity of today’s control plane –Principles for a redesign Routing Control Platform –Deployability –Scalability –Reliability Example applications –DDoS blackholing, planned maintenance, and customized egress selection Conclusions and future work

Internet Architecture Smart hosts, and a dumb network Network delivers packets to hosts Services implemented on hosts Keep most state at the edges Edge Network IP But, how should we partition function vertically?

Today: Inside a Single Network Data Plane Packet handling by routers Forwarding, filtering, queuing Management Plane Figure out what is happening in network Decide how to change it Shell scripts Traffic Engin. Databases Planning tools OSPF SNMPnetflowmodems Configs OSPF BGP Link metrics OSPF BGP OSPF BGP Control Plane Multiple routing processes on each router Each router with different configuration program Many control knobs: link weights, access lists, policy FIB Routing policies Packet filters

No State in the Network? Yeah, Right… Dynamic state –Routing tables –Forwarding tables Configuration state –Access control lists –Link weights –Routing policies Hard-wired state –Default values of timers –Path-computation algorithms Lots of state, updated in a distributed, uncoordinated way

How Did We Get in This Mess? Initial IP architecture –Bundled packet handling and control logic –Distributed the functions across routers –Didn’t fully anticipate the need for management Rapid growth in features –Sudden popularity and growth of the Internet –Increasing demands for new functionality –Incremental extensions to protocols & routers Challenges of distributed algorithms –Some tasks are hard to do in a distributed fashion

What Does the Network Operator Want? Network-wide views –Network topology (e.g., routers, links) –Mapping to lower-level equipment –Traffic matrix Network-level objectives –Load balancing –Survivability –Reachability –Security Direct control –Explicit configuration of data-plane mechanisms

What Architecture Would Achieve This? Management plane  Decision plane –Responsible for all decision logic and state –Operates on network-wide view and objectives –Directly controls the behavior of the data plane Control plane  Discovery plane –Responsible for providing the network-wide view –Topology discovery, traffic measurement, etc. Data plane –Queues, filters, and forwards data packets –Accepts direct instruction from the decision plane

Advantages of the New Approach Lower management complexity –Complete, network-wide view –Direct control over the routers –Single specification of policies and objectives Simpler routers –Much less control-plane software –Much less configuration state Enabling innovation –New algorithms for selecting paths within an AS –New approaches to inter-AS routing

Example: Improving ISP Routing Border router Internal router 1.Provide internal reachability (IGP) 2.Learn routes to external destinations (eBGP) 3.Distribute externally learned routes internally (iBGP) 4.Select closest egress (IGP)

Is the New Architecture Feasible? Deployability: any way from here to there? –Must be compatible with today’s routers –Must provide incentives for deployment Speed: can it run fast enough? –Must respond quickly to network events –Needs to be as fast as a router Reliability: avoid single point of failure? –Must be replicated to tolerate failure –Replicas must behave consistently

Deployability: Don’t Change the Message Format Border Gateway Protocol –Interdomain routing protocol for the Internet –Widely implemented and used in networks Three main aspects of BGP –Protocol: standard messages sent between routers –Decision logic: multi-step route selection process –Policy: configuration options that influence routing The key point is –Although decision logic and policies are complex… –… the protocol and message format are simple Idea: use BGP messages to tell each router how to forward

Phase 1: Flexible Path Selection in One AS iBGP eBGP Before: conventional use of BGP in backbone network iBGP eBGP After: RCP learns routes and sends answers to routers RCP

Phase 2: AS-Wide Path Selection and Export iBGP eBGP Before: RCP gets “best” iBGP routes (and IGP feed) After: RCP gets all eBGP routes from neighbors iBGP eBGP RCP

Phase 3: Direct Communication Between RCPs Before: RCP gets all eBGP routes from neighbors iBGP eBGP After: ASes exchange routes via RCP RCP AS 3 AS 2 AS 1 iBGP Physical peering Inter-AS Protocol RCP

RCP Architecture Route Control Server (RCS) BGP Engine IGP Viewer Routing Control Platform (RCP) Available BGP routes BGP updates … Selected BGP routes BGP updates … Path cost matrix IGP link-state advertisements …

Challenges and Contributions Reliability –Problem: single point of failure –Contribution: simple replication of RCP components Consistency –Problem: inconsistent decisions by replicas –Contribution: consistency without inter-replica protocol Scalability –Problem: storing all routes increases cpu/memory usage –Contribution: can support large ISP in one computer  Building this system is feasible

Consistency: One RCP, One Partition Solution: Assign all routers along the shortest IGP path the same exit router –Ensures forwarding loops don’t arise RCP 1 BA “Use egress B” “Use egress A”

Consistency: One RCP, Many Partitions Solution: Only use state from router’s partition in assigning its routes –Ensures next hop is reachable Partition 1Partition 2 RCP 1

Consistency: Many RCPs, Many Partitions Solution: RCPs receive same IGP/BGP state from each partition they can reach –IGP provides complete visibility and connectivity –RCS only acts on partition if it has complete state for it Partition 1Partition 2Partition 3 RCP 2RCP 1  No consistency protocol needed to guarantee consistency in steady state

RCS Scalability Eliminate redundancy –Store only a single copy of each BGP route Accelerate lookup –Quickly find routers whose routes changed Avoid recomputation –Compute routes once for groups of routers –Don’t recompute if relative ranking of egress routers unchanged

RIB-Out shadow tables  Prefixes BGP updates (to routers) (points to currently used route for each router) rtr1rtr2rtr3 (stores copies of routes) BGP routes   Prefixes BGP updates (from egress routers) Global route table eg1 eg2 eg3 eg1 eg2 eg3 IGP updates (points to routes that use each egress) Egress lists rtr1 rtr2 Scalability: RCS Data Structures

Example of Egress List Operation A BC C A B D D’s egress list 4 3 7

Example of Egress List Operation A BC D 2 C A B D’s egress list

Example of Egress List Operation A BC D 5 5 C A B D’s egress list 4 3 7

Example of Egress List Operation A BC D 1 1 C A B D’s egress list 4 3 7

Scalability: Standard Computing Platform Implementation platform –3.2 GHz Pentium-4 –8 GB memory –Linux kernel Workload –Routing/topology changes in AT&T’s network RCP performance –Memory usage: less than 2GB –Speed, BGP changes: less than 40 msec –Speed, topology changes: seconds System is able to keep up…

Application: DDoS Blackholing Blackholing of denial-of-service attacks –Preconfigure a “null” route on each router –Identify address of victim (from DoS system) –RCP assigns a null route for the destination iBGP RCP Victim “Use null route for /32” attack (detected by traffic analysis)

Application: Maintenance Dry-Out Dry-out of traffic before maintenance –Plan to take a router temporarily out of service –RCP assigns routes via new egress in advance iBGP RCP Router r about to undergo maintenance before d r s “Use route via s for d” after

Application: Customized Egress Selection Customer-controlled selection of egress points –Customer with two data centers and many sites –Customer wants to control the load balancing –RCP customization, not simply closest egress iBGP RCP d r s “Use route via r for d” “Use route via s for d” Site #1 Site #2

Conclusion Managing IP networks is too hard –IP architecture not designed for management –Complex, distributed operation of routers Reducing complexity in the key –Network-wide views/objectives and direct control –Removing control logic and state from the routers New architecture is feasible –RCP is deployable, scalable, and reliable –RCP solves important operations problems

Future Work Optimization –Real-time adaptation and offline planning –Designing the boundary to support optimization Security –Identifying unstable and suspicious BGP routes –Incrementally deploying a more secure protocol Policy –High-level specification of routing policies –Quantifying reductions in configuration complexity