How I Learned to Stop Worrying About the Core and Love the Edge

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Stable Load Control with Load Prediction in Multipath Packet Forwarding IlKyu Park, Youngseok Lee, and Yanghee Choi Proc. 15 th IEEE Int l conf. on Information.
Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
VCRIB: Virtual Cloud Rule Information Base Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan HotCloud 2012.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric
PARIS: ProActive Routing In Scalable Data Centers Dushyant Arora, Theophilus Benson, Jennifer Rexford Princeton University.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. Presented by: Vinuthna Nalluri Shiva Srivastava.
Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis Miri, Sivasankar Radhakrishnan, Vikram Subramanya, and Amin Vahdat Department.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
* Mellanox Technologies LTD, + Technion - EE Department
LocalFlow: Simple, Local Flow Routing in Data Centers Siddhartha Sen, DIMACS 2011 Joint work with Sunghwan Ihm, Kay Ousterhout, and Mike Freedman Princeton.
Rethinking Traffic Management: Using Optimization Decomposition to Derive New Architectures Jennifer Rexford Princeton University Jiayue He, Ma’ayan Bresler,
ProActive Routing In Scalable Data Centers with PARIS Joint work with Dushyant Arora + and Jennifer Rexford* + Arista Networks *Princeton University Theophilus.
Information-Agnostic Flow Scheduling for Commodity Data Centers
CONGA: Distributed Congestion-Aware Load Balancing for Datacenters
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
Jennifer Rexford Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks Data.
1 Latency Equalization: A Programmable Routing Service Primitive Minlan Yu Joint work with Marina Thottan, Li Li at Bell Labs.
Advanced Network Architecture Research Group 2001/11/149 th International Conference on Network Protocols Scalable Socket Buffer Tuning for High-Performance.
Curbing Delays in Datacenters: Need Time to Save Time? Mohammad Alizadeh Sachin Katti, Balaji Prabhakar Insieme Networks Stanford University 1.
Software-Defined Networks Jennifer Rexford Princeton University.
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1.
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Wei Bai with Li Chen, Kai Chen, Dongsu Han, Chen Tian, Hao Wang SING HKUST Information-Agnostic Flow Scheduling for Commodity Data Centers 1 SJTU,
Presto: Edge-based Load Balancing for Fast Datacenter Networks
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Data Center Load Balancing T Seminar Kristian Hartikainen Aalto University, Helsinki, Finland
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
6.888: Lecture 4 Data Center Load Balancing Mohammad Alizadeh Spring
VL2: A Scalable and Flexible Data Center Network
COS 561: Advanced Computer Networks
Building Efficient and Reliable Software-Defined Networks
HULA: Scalable Load Balancing Using Programmable Data Planes
CIS 700-5: The Design and Implementation of Cloud Networks
Resilient Datacenter Load Balancing in the Wild
OTCP: SDN-Managed Congestion Control for Data Center Networks
HyGenICC: Hypervisor-based Generic IP Congestion Control for Virtualized Data Centers Conference Paper in Proceedings of ICC16 By Ahmed M. Abdelmoniem,
How I Learned to Stop Worrying About the Core and Love the Edge
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
TCP-in-UDP draft-welzl-irtf-iccrg-tcp-in-udp-00.txt
Improving Datacenter Performance and Robustness with Multipath TCP
Tapping Into The Unutilized Router Processing Power
Bridging the Theory-Practice Gap in Multi-Commodity Flow Routing
Congestion-Aware Load Balancing at the Virtual Edge
Hamed Rezaei, Mojtaba Malekpourshahraki, Balajee Vamanan
Congestion Control in Software Define Data Center Network
Multi-hop Coflow Routing and Scheduling in Data Centers
AMP: A Better Multipath TCP for Data Center Networks
VL2: A Scalable and Flexible Data Center Network
Network Control Jennifer Rexford
Elmo: Source-Routed Multicast for Public Clouds
Data Center Architectures
Centralized Arbitration for Data Centers
Data Center Networks Mohammad Alizadeh Fall 2018
Congestion-Aware Load Balancing at the Virtual Edge
2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,
Transport Layer Identification of P2P Traffic
Toward Self-Driving Networks
Toward Self-Driving Networks
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Jennifer Rexford Princeton University
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Reconciling Zero-conf with Efficiency in Enterprises
Data Center Traffic Engineering
Presentation transcript:

How I Learned to Stop Worrying About the Core and Love the Edge CLOVE How I Learned to Stop Worrying About the Core and Love the Edge Aditi Ghag2 Naga Katta1,2, Mukesh Hira2, Changhoon Kim3, Isaac Keslassy2,4, Jennifer Rexford1 1Princeton University, 2VMware, 3Barefoot Networks, 4Technion

Data center load balancing today Equal-Cost Multi-Path (ECMP) routing: Path(packet) = hash(5-tuple{src,dst IP + src,dst port + protocol number}) . . . . . . … Servers Leaf Switches Spine Hash collisions Coarse-grained Congestion-oblivious

Proposed load balancing schemes Centralized load balancing Hedera, MicroTE, SWAN, Fastpass Slow reaction time Routes computation overhead Central Controller vSwitch vSwitch Hypervisors

Proposed load balancing schemes In-network load balancing CONGA, HULA Needs custom ASIC data center fabric High capital cost vSwitch vSwitch Hypervisors

Proposed load balancing schemes End-host load balancing PRESTO Non-standard shadow mac labels based forwarding Controller intervention in case of asymmetry MPTCP Incast collapse Guest VM network stack changes vSwitch vSwitch Hypervisors

vSwitch as the sweet spot Network switches with ECMP using 5-tuple Spine switches Outer transport source ports are used for ECMP traffic distribution Leaf switches Eth IP TCP Overlay vSwitch vSwitch Payload IP Eth

CLOVE design Path discovery Load-balancing flowlets vSwitch load-balancing

vSwitch Load balancing Path Discovery Load balancing flowlets vSwitch Load balancing Outer transport source port maps to network path Standard ECMP in the physical network H1 to H2 DPort: Fixed SPort: P3 Overlay Data H1 to H2 DPort: Fixed SPort: P2 Overlay Data H1 to H2 DPort: Fixed SPort: P1 Overlay Data H1 to H2 DPort: Fixed SPort: P4 Overlay Data Hypervisor learns source port to path mapping Dst SPort H2 5001 5002 5003 5004 vSwitch vSwitch H1 H2 Hypervisor H1 Hypervisor H2

vSwitch Load balancing Path Discovery Load balancing flowlets vSwitch Load balancing Eth IP TCP src 5003 Overlay Data Eth IP TCP src 5004 Overlay Data Eth IP TCP src 5002 Overlay Data Eth IP TCP src 5001 Overlay Data Dst SPort H2 5001 5002 5003 5004 vSwitch vSwitch Flowlet gap H1 H2

Edge-Flowlet Flowlet splitting in vSwitch Select learnt source port for flowlets at random Physical switches forward flowlets using ECMP

vSwitch Load balancing CLOVE-ECN Path Discovery Load balancing flowlets Congestion-aware balancing based on ECN feedback 2. Switches mark ECN on data packets vSwitch Hypervisor H1 Hypervisor H2 Data Path weight table 1. Src vSwitch detects and forwards flowlets 4. Return packet carries ECN and src port for forward path 3. Dst vSwitch relays ECN and src port to src vSwitch Dst SPort Wt H2 5001 0.1 5002 0.3 5003 5004 Dst SPort Wt H2 5001 0.25 5002 5003 5004 5. Src vSwitch adjusts path weights for the src port

vSwitch Load balancing CLOVE-INT Path Discovery Load balancing flowlets Utilization-aware balancing based on INT feedback 2. Switches add requested link utilization vSwitch Hypervisor H1 Hypervisor H2 Data 1. Src vSwitch adds INT instructions to flowlets 4. Return packet carries link utilization for forward path 3. Dst vSwitch relays link utilization and src port to src vSwitch Dst SPort Util H2 5001 40 5002 30 5003 50 5004 10 5. Src vSwitch updates link utilizations 6. Src vSwitch forwards flowlets on least utilized paths

Performance evaluation setup … 16 Clients Spine1 Spine2 Leaf1 4 x 4 Gbps 16 x 1 Gbps 16 Servers Leaf2 Implementation in ns-2 simulator Realistic Web Search Workload Client on Leaf1 <-> server on Leaf2

Symmetric topology 1.4x lower FCT than ECMP 1.1x higher FCT than CONGA CLOVE-ECN captures 82% of the performance gain between ECMP and CONGA

Asymmetric topology 3x lower FCT than ECMP 1.2x higher FCT than CONGA CLOVE-ECN captures 80% of the performance gain between ECMP and CONGA

CLOVE highlights Captures 80% of the performance gain of CONGA No changes to data center infrastructure or guest VM Adapts to asymmetry without any controller input Scalable due to distributed state Modular implementation

Future work Use packet latency to infer congestion Adapt flowlet-gap to network conditions Fine-tune congestion-management algorithm Stability Analyze processing overhead

THANK YOU Questions?