Congestion-Aware Load Balancing at the Virtual Edge

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Stable Load Control with Load Prediction in Multipath Packet Forwarding IlKyu Park, Youngseok Lee, and Yanghee Choi Proc. 15 th IEEE Int l conf. on Information.
Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
DOT – Distributed OpenFlow Testbed
NCCA 2014 Performance Evaluation of Non-Tunneling Edge-Overlay Model on 40GbE Environment Nagoya Institute of Technology, Japan Ryota Kawashima and Hiroshi.
PARIS: ProActive Routing In Scalable Data Centers Dushyant Arora, Theophilus Benson, Jennifer Rexford Princeton University.
PFabric: Minimal Near-Optimal Datacenter Transport Mohammad Alizadeh Shuang Yang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker.
Flowspace revisited OpenFlow Basics Flow Table Entries Switch Port MAC src MAC dst Eth type VLAN ID IP Src IP Dst IP Prot L4 sport L4 dport Rule Action.
LocalFlow: Simple, Local Flow Routing in Data Centers Siddhartha Sen, DIMACS 2011 Joint work with Sunghwan Ihm, Kay Ousterhout, and Mike Freedman Princeton.
Alan Shieh Cornell University Srikanth Kandula Albert Greenberg Changhoon Kim Microsoft Research Seawall: Performance Isolation for Cloud Datacenter Networks.
Information-Agnostic Flow Scheduling for Commodity Data Centers
CONGA: Distributed Congestion-Aware Load Balancing for Datacenters
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #12 LSNAT - Load Sharing NAT (RFC 2391)
1 Latency Equalization: A Programmable Routing Service Primitive Minlan Yu Joint work with Marina Thottan, Li Li at Bell Labs.
Curbing Delays in Datacenters: Need Time to Save Time? Mohammad Alizadeh Sachin Katti, Balaji Prabhakar Insieme Networks Stanford University 1.
Software-Defined Networks Jennifer Rexford Princeton University.
OpenFlow: Enabling Innovation in Campus Networks
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Wei Bai with Li Chen, Kai Chen, Dongsu Han, Chen Tian, Hao Wang SING HKUST Information-Agnostic Flow Scheduling for Commodity Data Centers 1 SJTU,
Presto: Edge-based Load Balancing for Fast Datacenter Networks
Data Center Load Balancing T Seminar Kristian Hartikainen Aalto University, Helsinki, Finland
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
6.888: Lecture 4 Data Center Load Balancing Mohammad Alizadeh Spring
Ananta: Cloud Scale Load Balancing Presenter: Donghwi Kim 1.
AVS Brazos : IPv6. Agenda AVS IPv6 background Packet flows TSO/TCO Configuration Demo Troubleshooting tips Appendix.
VL2: A Scalable and Flexible Data Center Network
COS 561: Advanced Computer Networks
Building Efficient and Reliable Software-Defined Networks
Networking in Datacenters EECS 398 Winter 2017
HULA: Scalable Load Balancing Using Programmable Data Planes
CIS 700-5: The Design and Implementation of Cloud Networks
Resilient Datacenter Load Balancing in the Wild
How I Learned to Stop Worrying About the Core and Love the Edge
Internet Networking recitation #9
OTCP: SDN-Managed Congestion Control for Data Center Networks
HyGenICC: Hypervisor-based Generic IP Congestion Control for Virtualized Data Centers Conference Paper in Proceedings of ICC16 By Ahmed M. Abdelmoniem,
How I Learned to Stop Worrying About the Core and Love the Edge
Openflow-based Multipath Switching in Wide Area Networks
ECE 544: Traffic engineering (supplement)
TCP-in-UDP draft-welzl-irtf-iccrg-tcp-in-udp-00.txt
Improving Datacenter Performance and Robustness with Multipath TCP
Tapping Into The Unutilized Router Processing Power
Bridging the Theory-Practice Gap in Multi-Commodity Flow Routing
Congestion-Aware Load Balancing at the Virtual Edge
Hamed Rezaei, Mojtaba Malekpourshahraki, Balajee Vamanan
Network Virtualization
Congestion Control in Software Define Data Center Network
NTHU CS5421 Cloud Computing
AMP: A Better Multipath TCP for Data Center Networks
VL2: A Scalable and Flexible Data Center Network
Internet Networking recitation #10
Network Control Jennifer Rexford
Elmo: Source-Routed Multicast for Public Clouds
Data Center Architectures
Centralized Arbitration for Data Centers
Data Center Networks Mohammad Alizadeh Fall 2018
Lecture 17, Computer Networks (198:552)
2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,
Toward Self-Driving Networks
Toward Self-Driving Networks
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Jennifer Rexford Princeton University
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Reconciling Zero-conf with Efficiency in Enterprises
Presentation transcript:

Congestion-Aware Load Balancing at the Virtual Edge CLOVE Aran Bergman (Technion, VMware) Naga Katta, Aditi Ghag, Mukesh Hira, Changhoon Kim, Isaac Keslassy, Jennifer Rexford

Data center load balancing today Equal-Cost Multi-Path (ECMP) routing: Path(packet) = hash(packet’s 5-tuple) src,dst IP + src,dst port + protocol . . . . . . … Spine Switches Coarse-grained Elephant Hash collisions Congestion-oblivious Leaf Switches Servers

Previously proposed load balancing schemes vSwitch vSwitch Hypervisors

Previously proposed load balancing schemes Centralized load balancing Hedera, Fastpass, MicroTE, SWAN Control-driven feedback Slow reaction time Routes computation overhead Scalability Issues Central Controller vSwitch vSwitch Hypervisors

Previously proposed load balancing schemes In-network load balancing CONGA, HULA, LetFlow, DRILL Needs custom ASIC data center fabric High capital cost Controller Involvement may still be required vSwitch vSwitch Hypervisors

Previously proposed load balancing schemes End-host load balancing Presto Congestion Oblivious Controller intervention in case of topology asymmetry MPTCP Incast collapse Guest VM network stack changes Hermes (SIGCOMM’17) Concurrent effort vSwitch vSwitch Hypervisors

vSwitch as the sweet spot Easy to implement No Switch HW changes No guest VM changes Spine switches Leaf switches vSwitch vSwitch

CLOVE assumptions Clove operates over a DC Overlay – e.g., Stateless Transport Tunneling (STT) Network switches with ECMP using 5-tuple Spine switches Outer transport header is used for ECMP traffic distribution Leaf switches Eth IP TCP Overlay vSwitch vSwitch Payload IP Eth

CLOVE in 1 slide Path discovery using traceroute probes Load-balancing flowlets [FLARE ‘05] vSwitch switching between paths based on RTT-scale feedback Explicit Congestion Notification - ECN In-band Network Telemetry - INT

Path Discovery Load balancing flowlets vSwitch Load balancing Outer transport source port (with ECMP) maps to network path Standard ECMP in the physical network H1 to H2 DPort: Fixed SPort: P3 Overlay Data H1 to H2 DPort: Fixed SPort: P2 Overlay Data H1 to H2 DPort: Fixed SPort: P1 Overlay Data H1 to H2 DPort: Fixed SPort: P4 Overlay Data Hypervisor learns source port to path mapping Dst SPort H2 P1 P2 P3 P4 vSwitch vSwitch ECMP-based source routing H1 H2 Hypervisor H1 Hypervisor H2

vSwitch Load balancing Path Discovery Load balancing flowlets Scheme 1: Edge-Flowlet vSwitch Load balancing Eth IP TCP src P3 Overlay Data Eth IP TCP src P4 Overlay Data Eth IP TCP src P2 Overlay Data Eth IP TCP src P1 Overlay Data Dst SPort H2 P1 P2 P3 P4 vSwitch vSwitch Flowlet gap H1 H2

vSwitch Load balancing Scheme2: CLOVE-ECN Path Discovery Load balancing flowlets Congestion-aware balancing based on ECN feedback 2. Switches mark ECN on data packets vSwitch Hypervisor H1 Hypervisor H2 Data Path weight table 1. Src vSwitch detects and forwards flowlets 4. Return packet carries ECN and src port for forward path 3. Dst vSwitch relays ECN and src port to src vSwitch Dst SPort Wt H2 P1 0.1 P2 0.3 P3 P4 Dst SPort Wt H2 P1 0.25 P2 P3 P4 5. Src vSwitch adjusts path weights for the src port

vSwitch Load balancing Scheme 3: CLOVE-INT Path Discovery Load balancing flowlets Utilization-aware balancing based on INT feedback 2. Switches add requested link utilization vSwitch Hypervisor H1 Hypervisor H2 Data 1. Src vSwitch adds INT instructions to flowlets 4. Return packet carries path utilization for forward path 3. Dst vSwitch relays path utilization and src port to src vSwitch Dst SPort Util H2 P1 40 P2 30 P3 50 P4 10 5. Src vSwitch updates path utilizations 6. Src vSwitch forwards flowlets on least utilized paths

Performance evaluation setup Asymmetric Setup … 16 Clients Spine1 Spine2 Leaf1 4 x 40 Gbps 16 x 10 Gbps 16 Servers Leaf2 2-tier leaf-spine symmetric topology Web Search Workload Client on Leaf1 <-> server on Leaf2 Measure Average Flow Completion Time (FCT) Compare Edge-Flowlet and Clove-ECN to ECMP, MPTCP and Presto

Symmetric topology … Better 1.8x 2.5x

Asymmetric topology … Better 5x 12x

Incast Workload Better        CLOVE-ECN outperforms MPTCP on Incast Workloads

NS2 Simulation with CONGA – Asymmetric … 3x lower FCT than ECMP Better 1.2x higher FCT than CONGA CLOVE-ECN captures 80% of the performance gain between ECMP and CONGA

CLOVE highlights Captures 80% of the performance gain of CONGA No changes to network hardware, VMs, applications  Adapts to asymmetry within the data plane Scalable due to distributed state

THANK YOU Questions?

Parameter Sweeping Empirical values for (flowlet-threshold, ECN-threshold) are 1RTT, 20pkts