COPE: Traffic Engineering in Dynamic Networks Hao Wang, Haiyong Xie, Lili Qiu, Yang Richard Yang, Yin Zhang, Albert Greenberg Yale University UT Austin.

Slides:



Advertisements
Similar presentations
Responsive Yet Stable Traffic Engineering Srikanth Kandula Dina Katabi, Bruce Davie, and Anna Charny.
Advertisements

1 Praveen K. Muthuswamy Electrical Computer and Systems Engineering Rensselaer Polytechnic Institute In collaboration with Koushik Kar, Aparna Gupta (RPI)
Abhigyan, Aditya Mishra, Vikas Kumar, Arun Venkataramani University of Massachusetts Amherst 1.
Traffic Engineering with Forward Fault Correction (FFC)
1 Traffic Engineering (TE). 2 Network Congestion Causes of congestion –Lack of network resources –Uneven distribution of traffic caused by current dynamic.
1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
SIGCOMM 2003 Making Intra-Domain Routing Robust to Changing and Uncertain Traffic Demands: Understanding Fundamental Tradeoffs David Applegate Edith Cohen.
Network Architecture for Joint Failure Recovery and Traffic Engineering Martin Suchara in collaboration with: D. Xu, R. Doverspike, D. Johnson and J. Rexford.
TIE Breaking: Tunable Interdomain Egress Selection Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie with Tim Griffin.
Traffic Engineering With Traditional IP Routing Protocols
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
NetQuest: A Flexible Framework for Internet Measurement Lili Qiu Joint work with Mike Dahlin, Harrick Vin, and Yin Zhang UT Austin.
Multiple constraints QoS Routing Given: - a (real time) connection request with specified QoS requirements (e.g., Bdw, Delay, Jitter, packet loss, path.
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
1IMIC, 8/30/99 Constraint-Based Unicast and Multicast: Practical Issues Bala Rajagopalan NEC C&C Research Labs Princeton, NJ
Optimizing Cost and Performance for Multihoming ACM SIGCOMM 2004 Lili Qiu Microsoft Research Joint Work with D. K. Goldenberg, H. Xie,
1 Deriving Traffic Demands for Operational IP Networks: Methodology and Experience Anja Feldmann*, Albert Greenberg, Carsten Lund, Nick Reingold, Jennifer.
On the Stability of Rational, Heterogeneous Interdomain Route Selection Hao Wang Yale University Joint work with Haiyong Xie, Y. Richard Yang, Avi Silberschatz,
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
More Traffic Engineering TE with MPLS TE in the inter-domain.
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
S. Suri, M, Waldvogel, P. Warkhede CS University of Washington Profile-Based Routing: A New Framework for MPLS Traffic Engineering.
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Wide-Area Traffic Management COS 597E: Software Defined Networking.
AGG-NANOG IP Network Traffic Engineering Albert Greenberg Internet and Networking Systems Research Lab AT&T Labs - Research; Florham Park, NJ See.
Routing Games for Traffic Engineering F. Larroca and J.L. Rougier IEEE International Conference on Communications (ICC 2009) Dresden, Germany, June
MATE: MPLS Adaptive Traffic Engineering Anwar Elwalid, et. al. IEEE INFOCOM 2001.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Identifying and Using Energy Critical Paths Nedeljko Vasić with Dejan Novaković, Satyam Shekhar, Prateek Bhurat, Marco Canini, and Dejan Kostić EPFL, Switzerland.
1 Meeyoung Cha (KAIST) Sue Moon (KAIST) Chong-Dae Park (KAIST) Aman Shaikh (AT&T Labs – Research) IEEE INFOCOM 2005 Poster Session Positioning Relay Nodes.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jennifer Rexford Princeton University With Jiayue He, Rui Zhang-Shen, Ying Li,
1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.
Lecture 15. IGP and MPLS D. Moltchanov, TUT, Spring 2008 D. Moltchanov, TUT, Spring 2015.
June 10, 2003STOC 2003 Optimal Oblivious Routing in Polynomial Time Harald Räcke Paderborn Edith Cohen AT&T Labs-Research Yossi Azar Amos Fiat Haim Kaplan.
Efficient and Robust Traffic Engineering in a Dynamic Environment Ph.D. Dissertation Defense Hao Wang Y. Richard Yang Joan Feigenbaum Jennifer Rexford.
Quantifying the Causes of Path Inflation Neil Spring, Ratul Mahajan, and Thomas Anderson Presented by Luv Kohli COMP November 24, 2003.
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
A Fair and Dynamic Load Balancing Mechanism F. Larroca and J.L. Rougier International Workshop on Traffic Management and Traffic Engineering for the Future.
Spatio-Temporal Compressive Sensing Yin Zhang The University of Texas at Austin Joint work with Matthew Roughan.
1 REIN: Reliability as an Interdomain Service Jia Wang with Hao Wang, Yang Richard Yang, Paul H. Liu, Alexandre Gerber, Albert Greenberg Yale University.
On Understanding of Transient Interdomain Routing Failures Feng Wang, Lixin Gao, Jia Wang, and Jian Qiu Department of Electrical and Computer Engineering.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
April 4th, 2002George Wai Wong1 Deriving IP Traffic Demands for an ISP Backbone Network Prepared for EECE565 – Data Communications.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jiayue He, Rui Zhang-Shen, Ying Li, Cheng-Yen Lee, Jennifer Rexford, and Mung.
Towards a More Fair and Robust Internet Backbone Year 1 Status Report Rene Cruz, Tara Javidi, Bill Lin Center for Networked Systems University of California,
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
System & Network Reading Group On Selfish Routing In Internet-Like Evironments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang.
Mike Freedman Fall 2012 COS 561: Advanced Computer Networks Traffic Engineering.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
TeXCP: Protecting Providers’ Networks from Unexpected Failures & Traffic Spikes Dina Katabi MIT - CSAIL nms.csail.mit.edu/~dina.
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
1 Chapter 4: Internetworking (IP Routing) Dr. Rocky K. C. Chang 16 March 2004.
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
CSci5221: Intra-Domain Traffic Engineering 1 Intra-Domain Traffic Engineering Traffic Engineering (TE) – MPLS and traffic engineering (will go over very.
System & Network Reading Group On Selfish Routing In Internet-Like Evironments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang.
P4P: Proactive Provider Assistance for P2P Haiyong Xie Yale University.
Impact of Interference on Multi-hop Wireless Network Performance
Presented by Tae-Seok Kim
Jian Wu (University of Michigan)
ECE 544: Traffic engineering (supplement)
Backbone Traffic Engineering
Achieving Resilient Routing in the Internet
Presentation transcript:

COPE: Traffic Engineering in Dynamic Networks Hao Wang, Haiyong Xie, Lili Qiu, Yang Richard Yang, Yin Zhang, Albert Greenberg Yale University UT Austin AT&T Labs - Research ACM SIGCOMM 2006

2 Traffic Engineering (TE) Objective –Adapting the routing of traffic to avoid congestion and make more efficient use of network resource Motivation –High cost of network assets & highly competitive nature of ISP market –Routing influences efficiency of network resource utilization Latency, loss rate, congestion, … Two components –Understand traffic demands –Configure routing protocols This paper focuses on intra-domain TE –But the basic approach may also apply in inter- domain TE and network optimization in general

3 Challenge: Unpredictable Traffic Internet traffic is highly unpredictable! –Can be relatively stable most of the time … –However, usually contains spikes that ramp up extremely quickly We identified sudden traffic spikes in the traces of several networks –Unpredictable traffic variations have been observed and studied by other researchers [ Teixeira et al. ’04, Uhlig & Bonaventure ’02, Xu et al. ’05 ] –Confirmed by operators of several large networks via survey –Abrupt traffic changes often occur when service is most valuable! Many possible causes for traffic unpredictability –Worms/viruses, DoS attacks, flash crowds, BGP routing changes [ Teixeira et al. ’05, Agarwal et al. ’05 ], failures in other networks, load balancing by multihomed customers, TE by peers … TE needs to handle unpredictable traffic –Otherwise, links and/or routers may get unnecessarily overloaded Long delay, high loss, reduced throughput, violation of SLA –Customers can remember bad experiences really well …

4 Existing TE Solutions Prediction-based TE –Examples: Off-line: –Single predicted TM [ Sharad et al. ’05 ] –Multiple predicted TMs [ Zhang et al. ’05 ] On-line: MATE [ Elwalid et al. ’01 ] & TeXCP [ Kandula et al. ’05 ] –Pro: Works great when traffic is predictable –Con: May pay a high penalty when real traffic deviates substantially from the prediction Oblivious routing –Examples: Oblivious routing [ Racke ’02, Azar et al. ’03, Applegate et al. ’03 ] Valiant load-balancing [ Kodialam et al. ’05, Zhang & McKeown ’04 ] –Pro: Provides worst-case performance bounds –Con: May be sub-optimal for normal traffic The optimal oblivious ratio of several real network topologies studied in [Applegate et al ’03] is ~2

5 Our Approach: COPE Common-case Optimization with Penalty Envelope C X Bound for set X Optimize for set C min f max d  C P C (f, d) s.t. (1) f is a routing (2)  x  X:  P X (f, x)  PE C: common-case (predicted) TMs X: all TMs of interest P C (f,d): common-case penalty function P X (f,x): worst-case penalty function PE: penalty envelope

6 Model Network topology: graph G = (V,E) –V : set of routers –E : set of network links Traffic matrices (TMs) –A TM is a set of demands: d = { d ab | a,b  V } –d ab : traffic demand from a to b –Can extend to point-to-multipoint demands MPLS-style, link-based routing –f = { f ab (i,j) | a,b  V, (i,j)  E } –f ab (i,j) : the fraction of demand from a to b (i.e., d ab ) that is routed through link (i,j) –Paper includes ideas on how to approximate OSPF- style (i.e., shortest path implementable) routing

7 Routing Performance Metrics Maximum Link Utilization (MLU): Optimal Utilization Performance Ratio

8 COPE Instantiation C: convex hull of multiple past TMs –A linear predictor predicts the next TM as a convex combination of past TMs (e.g., EWMA) –Aggregation of all possible linear predictors  the convex hull X: all possible non-negative TMs –Can add access capacity constraints or use a bigger convex hull P C (f,d): penalty function for common cases –maximum link utilization: U(f,d) –performance ratio: PR(f,d) P X (f,x): penalty function for worst cases –performance ratio: PR(f,x) PE: penalty envelope –PE =  min f max x  X P X (f,x) –  1 controls the size of PE w.r.t. the optimal worst-case penalty  =1  oblivious routing  =   prediction-based TE min f max d  C P C (f, d) s.t. (1) f is a routing; and (2)  x  X:  P C (f, x)  PE

9 Current COPE Implementation 1.Collect TMs continuously 2.Compute COPE routing for the next day by solving a linear program (LP) –Common-case optimization Common case: convex hull of multiple past TMs –All TMs in previous day + same/previous days in last week Minimize either MLU or PR over the convex hull –Penalty envelope Bounded PR over all possible nonnegative TMs –See paper for details of our LP formulation 3.Install COPE routing Currently done once per day  an off-line solution –Can be made on-line (e.g., recompute routing upon detection of significant changes in TM)

10 COPE Illustrated Prediction-based TE Best common-case + Poor worst-case Oblivious Routing Poor common-case + Best worst-case COPE Good common-case + Bounded worst-case Spectrum of TE with unpredictable traffic Position controllable by penalty envelope There are enough unexpected cases  Penalty envelope is required The worst unexpected case too unlikely to occur  Too wasteful to “optimize” for the worst-case (at the cost of poor common-case performance)

11 Evaluation Methodology TE Algorithms –COPE: COPE with P C (f,d) = PR(f,d) (i.e. performance ratio) –COPE-MLU: COPE with P C (f,d) = U(f,d) (i.e. max link utilization) –Oblivious routing: min f max x PR(f,x) (  COPE with  =1) –Dynamic: optimize routing for TM in previous interval –Peak: peak interval of previous day + prev/same days in last week –Multi: all intervals in previous day + prev/same days in last week –Optimal: requires an oracle Dataset –US-ISP hourly PoP-level TMs for a tier-1 ISP (1 month in 2005) Optimal oblivious ratio: 2.045; default penalty envelope: 2.5 –Abilene 5-min router-level TMs on Abilene (6 months: Mar – Sep. 2004) Optimal oblivious ratio: 1.853; default penalty envelope: 2.0

12 US-ISP: Performance Ratio Common cases: COPE is close to optimal/dynamic and much better than others Unexpected cases: COPE beats even OR and is much better than others

13 US-ISP: Maximum Link Utilization Common cases: COPE is close to optimal/dynamic and much better than others Unexpected cases: COPE beats even OR and is much better than others

14 Abilene: Performance Ratio Common cases: COPE is close to optimal/dynamic and much better than others Unexpected cases: COPE is close to OR and much better than others

15 Abilene: MLU in Common Cases Common cases: COPE is close to optimal/dynamic and much better than others

16 Abilene: MLU in Unexpected Cases Unexpected cases: COPE is close to OR and much better than others

17 US-ISP: Sensitivity to PE COPE is insensitive to PE; even a small margin in PE can significantly improve the common-case performance

18 COPE with Interdomain Routing Motivation –Changes in availability of interdomain routes can cause significant shifts of traffic within the domain E.g. when a peering link fails, all traffic through that link is rerouted Challenges –Point-to-multipoint demands  need to find splitting ratios among exit points –The set of exit points may change  topology itself is dynamic –Too many prefixes  cannot enumerate all possible exit point changes

19 COPE with Interdomain Routing: A Two-Step Approach 1.Apply COPE on an extended topology to derive good splitting ratios Group dest prefixes with same set of exit points into a virtual node Derive pseudo demands destined to each virtual node by merging demands to prefixes that belong to this virtual node Connect virtual node to corresponding peer using virtual link with infinite BW Compute extended topology G’ as G’ = intradomain topology + peers + peering links + virtual nodes + virtual links Apply COPE to compute routing on G’ for the pseudo demands Derive splitting ratios based on the routes 2.Apply COPE on point-to-point demands to compute intradomain routing Use the splitting ratios obtained in Step 1 to map point-to-multipoint demands into point-to-point demands intradomain topology peer virtual

20 Preliminary Evaluation COPE can significantly limit the impact of peering link failures

21 Conclusions & Future Work COPE = Common-case Optimization with Penalty Envelope COPE works! –Common cases: close to optimal; much better than oblivious routing and prediction-based TE with comparable overhead –Unexpected cases: much better than prediction-based TE, and sometimes may beat oblivious routing –COPE is insensitive to the size of the penalty envelope; even a small margin in PE improves common-case performance a lot –COPE can be extended to cope with interdomain routes Lots of ongoing & future work –Efficient implementation of COPE –COPE with MPLS and VPN –COPE with OSPF –COPE with online TE –COPE for other network optimization problems

22 Thank you!