Data Center Routing – Traffic Engineering Yao Lu Rui Zhang ECE 260C VLSI Advanced Topics.

Slides:



Advertisements
Similar presentations
A Centralized Scheduling Algorithm based on Multi-path Routing in WiMax Mesh Network Yang Cao, Zhimin Liu and Yi Yang International Conference on Wireless.
Advertisements

COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks By C. K. Toh.
Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis Miri, Sivasankar Radhakrishnan, Vikram Subramanya, and Amin Vahdat Department.
PRESENTED BY: TING WANG PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Radhika Niranjan Mysore, Andreas Pamboris, Nathan.
1 Traffic Engineering (TE). 2 Network Congestion Causes of congestion –Lack of network resources –Uneven distribution of traffic caused by current dynamic.
1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
Routing Basics By Craig Lindstrom. Overview Routing Process Routing Process Default Routing Default Routing Static Routing Static Routing Dynamic Routing.
CISCO NETWORKING ACADEMY Chabot College ELEC Routed and Routing Protocols.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
Traffic Engineering With Traditional IP Routing Protocols
December 20, 2004MPLS: TE and Restoration1 MPLS: Traffic Engineering and Restoration Routing Zartash Afzal Uzmi Computer Science and Engineering Lahore.
LCN 2007, Dublin 1 Non-bifurcated Routing in Wireless Multi- hop Mesh Networks by Abdullah-Al Mahmood and Ehab S. Elmallah Department of Computing Science.
Rethinking Internet Traffic Management: From Multiple Decompositions to a Practical Protocol Jiayue He Princeton University Joint work with Martin Suchara,
On Multi-Path Routing Aditya Akella 03/25/02. What is Multi-Path Routing?  Dynamically route traffic Multiple paths to a destination Path taken dependant.
Routing.
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Helios: A Hybrid Electrical/Optical Switch Architecture for Modular Data Centers Nathan Farrington George Porter, Sivasankar Radhakrishnan,
1 Semester 2 Module 6 Routing and Routing Protocols YuDa college of business James Chen
Distributed Quality-of-Service Routing of Best Constrained Shortest Paths. Abdelhamid MELLOUK, Said HOCEINI, Farid BAGUENINE, Mustapha CHEURFA Computers.
CISCO NETWORKING ACADEMY Chabot College ELEC IP Routing Protocol Highlights.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jennifer Rexford Princeton University With Jiayue He, Rui Zhang-Shen, Ying Li,
Internet Traffic Engineering by Optimizing OSPF Weights Bernard Fortz (Universit é Libre de Bruxelles) Mikkel Thorup (AT&T Labs-Research) Presented by.
L13. Shortest path routing D. Moltchanov, TUT, Spring 2008 D. Moltchanov, TUT, Spring 2014.
Lecture 15. IGP and MPLS D. Moltchanov, TUT, Spring 2008 D. Moltchanov, TUT, Spring 2015.
Routing and Routing Protocols Routing Protocols Overview.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 2 Module 6 Routing and Routing Protocols.
Network Aware Resource Allocation in Distributed Clouds.
1. 2 Anatomy of an IP Packet IP packets consist of the data from upper layers plus an IP header. The IP header consists of the following:
CCNA 1 Module 10 Routing Fundamentals and Subnets.
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
Improving Capacity and Flexibility of Wireless Mesh Networks by Interface Switching Yunxia Feng, Minglu Li and Min-You Wu Presented by: Yunxia Feng Dept.
Group 3 Sandeep Chinni Arif Khan Venkat Rajiv. Delay Tolerant Networks Path from source to destination is not present at any single point in time. Combining.
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
Department of Computer Science A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares Alexander Loukissas Amin Vahdat SIGCOMM’08 Reporter:
Hedera: Dynamic Flow Scheduling for Data Center Networks Mohammad Al-Fares Sivasankar Radhakrishnan Barath Raghavan Nelson Huang Amin Vahdat Mohammad Al-Fares.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jiayue He, Rui Zhang-Shen, Ying Li, Cheng-Yen Lee, Jennifer Rexford, and Mung.
CCNA 1 v3.0 Module 10 Routing Fundamentals and Subnets.
Routing Networks and Protocols Prepared by: TGK First Prepared on: Last Modified on: Quality checked by: Copyright 2009 Asia Pacific Institute of Information.
Symbiotic Routing in Future Data Centers Hussam Abu-Libdeh Paolo Costa Antony Rowstron Greg O’Shea Austin Donnelly MICROSOFT RESEARCH Presented By Deng.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
1 Version 3.1 Module 6 Routed & Routing Protocols.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Routing and Routing Protocols PJC CCNA Semester 2 Ver. 3.0 by William Kelly.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
HELSINKI UNIVERSITY OF TECHNOLOGY Visa Holopainen 1/18.
Optimization Problems in Wireless Coding Networks Alex Sprintson Computer Engineering Group Department of Electrical and Computer Engineering.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
1 Transport Bandwidth Allocation, Intro to Network Layer 4/3/2012.
Prof. Alfred J Bird, Ph.D., NBCT Office – Science 3rd floor – S Office Hours – Monday and Thursday.
PATH DIVERSITY WITH FORWARD ERROR CORRECTION SYSTEM FOR PACKET SWITCHED NETWORKS Thinh Nguyen and Avideh Zakhor IEEE INFOCOM 2003.
-1/16- Maximum Battery Life Routing to Support Ubiquitous Mobile Computing in Wireless Ad Hoc Networks C.-K. Toh, Georgia Institute of Technology IEEE.
Prof. Alfred J Bird, Ph.D., NBCT Office – McCormick 3rd floor 607 Office Hours – Monday 3:00 to 4:00 and.
Network Layer COMPUTER NETWORKS Networking Standards (Network LAYER)
Yiting Xia, T. S. Eugene Ng Rice University
Data Center Network Architectures
Constraint-Based Routing
ECE 544: Traffic engineering (supplement)
TCP Congestion Control
Routing.
ISP and Egress Path Selection for Multihomed Networks
Backbone Traffic Engineering
Routing.
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Towards Predictable Datacenter Networks
Presentation transcript:

Data Center Routing – Traffic Engineering Yao Lu Rui Zhang ECE 260C VLSI Advanced Topics

Outline What is routing/traditional routing algorithm What is data center Difference between data center and the Internet Some Recent work in data center TE Open questions/proposals

What is routing

Traditional routing algorithm RIP (Routing Information Protocol) IGRP (Interior Gateway Routing Protocol) EIGRP (Enhanced Interior Gateway Routing Protocol) OSPF (Open Shortest Path First) IS-IS (Intermediate System-to-Intermediate System) BGP (Border Gateway Protocol)

What is data center Nowadays, 40% of the total Internet traffic goes to Google[1]

Difference between data center and the Internet Design Goal – latency, reliability, throughput, energy, etc. Properties – Well-structured topology – Movability of the locations of sources and destinations – Global knowledge of the whole data center network

Recent work Equal-Cost Multi-Path (ECMP)[7] Valiant Load Balancing (VLB)[6] CamCube[5] Hedera[8] Joint VM Placement and Routing (JVMPR)[4]

ECMP Many equal cost paths going up to the core switches Only one path down from each core switch Randomly allocate paths to flows using hash of the flow DS

VLB Goal – Guarantee equal-spread load- balancing in a mesh network Method – Bouncing individual packets from a source switch in the mesh off of randomly chosen intermediate “core” switches, which finally forward those packets to their destination switch.

Camcube 3D Torus Topology Offer Camcube API – To let service/application to design its own routing protocal Core services – Basic routing algorithm link state-based protocol

Hedera Detect Large Flows – Flows that need bandwidth but are network-limited Estimate Flow Demands – Use min-max fairness to allocate flows between src-dst pairs Place Flows – Use estimated demands to heuristically find better placement of large flows on the ECMP paths Estimate Flow Demands Place Flows Detect Large Flows

Hedera Large Flow Detection – Scheduler continually polls edge switches for flow byte- counts – Flows exceeding B/s threshold are “ large ” > %10 of hosts ’ link capacity (i.e. > 100Mbps)

Hedera Demand Estimation – Goal Estimate available bandwidth to allocate – Method Using min-max fairness, given traffic matrix of large flows, modify each flow ’ s size at it source and destination iteratively… – Sender equally distributes bandwidth among outgoing flows that are not receiver-limited – Network-limited receivers decrease exceeded capacity equally between incoming flows – Repeat until all flows converge

Hedera A B C X Y FlowEstimateConv. ? AXAX AYAY BYBY CYCY Sender Available Unconv. BW FlowsShare A121/2 B111 C111 Senders

Hedera RecvRL? Non-SL Flows Share XNo-- YYes31/3 Receivers FlowEstimateConv. ? AXAX1/2 AYAY BYBY1 CYCY1 A B C X Y

Hedera FlowEstimateConv. ? AXAX1/2 AYAY1/3Yes BYBY1/3Yes CYCY1/3Yes Sender Available Unconv. BW FlowsShare A2/31 B000 C000 Senders A B C X Y

Hedera FlowEstimateConv. ? AXAX2/3Yes AYAY1/3Yes BYBY1/3Yes CYCY1/3Yes RecvRL? Non-SL Flows Share XNo-- Y -- Receivers A B C X Y

Hedera Flow Placement – Goal Find a good allocation of paths for the set of large flows, such that the average bisection bandwidth of the flows is maximized – Method Global First Fit: – Greedily choose path that has sufficient unreserved b/w Simulated Annealing: – Iteratively find a globally better mapping of paths to flows

Hedera Global First Hit – New flow detected, linearly search all possible paths from S  D – Place flow on first path whose component links can fit that flow

Hedera Simulated Annealing – 4 specifications State space Neighboring states Energy Temperature Simple example: Minimizing f(x) F(x)

Hedera State: All possible mapping of flows to paths – Constrained to reduce state space size – Flows to a destination constrained to use same core Neighbor State: Swap paths between 2 hosts – Within same pod Function/Energy: Total exceeded b/w capacity – Using the estimated demand of flows – Minimize the exceeded capacity Temperature: Iterations left – Fixed number of iterations (1000s)

Hedera

JVMPR Joint VM Placement and Routing Goal: Efficient traffic engineering under dynamic arrivals and departures of jobs – One method : Localizing traffic by flexible VM placement node utilization – Another method : Avoiding congestion by intelligent routing link utilization coupled with each other

JVMPR Figure1:The left structure is the existing VMs and traffic The middle structure is good VM placement with high congestion The right structure is a worse placement with lower congestion existing VM VM we need to add

JVMPR JVMPR consider placement and routing at the same time It develops an approximation algorithm that leverages the specific structure of the joint design problem

JVMPR Placement and Route Selection – Placement: The feasible decision space for VM placement is – Routing : The feasible decision space for routing is

JVMPR Optimize Resource Utilization – cost net : Network cost Measure the congestion – cost node : Node cost Operating cost induced by a swith or a machine – Goal: Minimize the total cost

JVMPR Any problem? Yes! – The number of jobs is not fixed – Jobs enter or depart the system dynamically Better way: Online solution – Static problem setting to a dynamic environment – Key idea: Perform local re-optimization

JVMPR Online solution algorithm – Upon a new job arrival, assign the new job to one configuration accoridng to the transition probability – Upon a job departure, pick one job and migrate it to new machines according to the transition probability

JVMPR Why dynamic JVMPR solution is appealing? – We do not require VM migrations when new jobs arrive and at most one job migration when jobs depart – The computation of migration probability only requires local information

JVMPR Fig. Performance comparison Max Core Switch Utilization Percentage of elephant flows

JVMPR What is the price we pay for it? – The approximated Markov chain no longer converges to the exact stationary distribution But to a neighborhood around it – Need a lot computation

Summary AlgorithmTopology MovabilityGlobal knowledgeOther idea ECMPY VLBY CamCubeY Y HederaY Y JVMPRYY

Summary AlgorithmProsCons ECMP1. Simple 2. Works great with mice flow 1. Might cause congestion with elephant flows VLB1. Simple 2. Works great with mice flow 1. Might cause congestion with elephant flows CamCube1. Flexible1. Optimization per service/ application, no global optimization is considered Hedera1. Can deal with both mice flow and elephant flow 1. Algorithm cannot guarantee global optimal 2. Assumptions when doing Demand Estimation may not hold JVMPR1. Cost is low 2. Computation only need local information 1.Need a lot computation 2.It is a kind of approximation

Open questions/proposals Imperfection of current algorithms – Hedera Large flow detection too simple Demand estimation only considered TCP flows – JVMPR Demand a lot of computation It is approximation Not fully take advantage of the nice features of data center – Combine topology, movability and VM placement together Add VM placement consideration into Hedera

Reference [1] internet/ [2] Moy, John T. OSPF: anatomy of an Internet routing protocol. Addison-Wesley Professional, [3] Chen, Kai, Chengchen Hu, Xin Zhang, Kai Zheng, Yan Chen, and Athanasios V. Vasilakos. "Survey on routing in data centers: insights and future directions." Network, IEEE 25, no. 4 (2011): [4] Jiang, Joe Wenjie, Tian Lan, Sangtae Ha, Minghua Chen, and Mung Chiang. "Joint VM placement and routing for data center traffic engineering." In INFOCOM, 2012 Proceedings IEEE, pp IEEE, [5] Abu-Libdeh, Hussam, Paolo Costa, Antony Rowstron, Greg O'Shea, and Austin Donnelly. "Symbiotic routing in future data centers." ACM SIGCOMM Computer Communication Review 41, no. 4 (2011): [6] Farrington, Nathan, George Porter, Sivasankar Radhakrishnan, Hamid Hajabdolali Bazzaz, Vikram Subramanya, Yeshaiahu Fainman, George Papen, and Amin Vahdat. "Helios: a hybrid electrical/optical switch architecture for modular data centers." ACM SIGCOMM Computer Communication Review 41, no. 4 (2011): [7] Hopps, Christian E. "Analysis of an equal-cost multi-path algorithm." (2000). [8] Al-Fares, Mohammad, Sivasankar Radhakrishnan, Barath Raghavan, Nelson Huang, and Amin Vahdat. "Hedera: Dynamic Flow Scheduling for Data Center Networks." In NSDI, vol. 10, pp

Thank you!