Improving Datacenter Performance and Robustness with Multipath TCP

Slides:



Advertisements
Similar presentations
Congestion Control and Fairness Models Nick Feamster CS 4251 Computer Networking II Spring 2008.
Advertisements

Data Center Networking with Multipath TCP
Ramin Khalili (T-Labs/TUB) Nicolas Gast (LCA2-EPFL)
A Switch-Based Approach to Starvation in Data Centers Alex Shpiner and Isaac Keslassy Department of Electrical Engineering, Technion. Gabi Bracha, Eyal.
Opportunistic Mobility with Multipath TCP
Towards Predictable Datacenter Networks
Restless bandits and congestion control Mark Handley, Costin Raiciu, Damon Wischik UCL.
Multipath TCP Costin Raiciu University Politehnica of Bucharest Joint work with: Mark Handley, Damon Wischik, University College London Olivier Bonaventure,
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
TDTS21 Advanced Networking
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
Utilizing Datacenter Networks: Dealing with Flow Collisions Costin Raiciu Department of Computer Science University Politehnica of Bucharest.
Resource Pooling A system exhibits complete resource pooling if it behaves as if there was a single pooled resource. The Internet has many mechanisms for.
60 GHz Flyways: Adding multi-Gbps wireless links to data centers
Datacenter Network Topologies
Virtual Layer 2: A Scalable and Flexible Data-Center Network Work with Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Parantap Lahiri,
Data Center Basics (ENCS 691K – Chapter 5)
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
A Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network Architecture.
Datacenter Networks Mike Freedman COS 461: Computer Networks
Routing of Outgoing Packets with MP-TCP draft-handley-mptcp-routing-00 Mark Handley Costin Raiciu Marcelo Bagnulo.
Second year review Resource Pooling Damon Wischik, UCL.
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Curbing Delays in Datacenters: Need Time to Save Time? Mohammad Alizadeh Sachin Katti, Balaji Prabhakar Insieme Networks Stanford University 1.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
Networking the Cloud Presenter: b 電機三 姜慧如.
Multipath TCP design, and application to data centers Damon Wischik, Mark Handley, Costin Raiciu, Christopher Pluntke.
VL2 – A Scalable & Flexible Data Center Network Authors: Greenberg et al Presenter: Syed M Irteza – LUMS CS678: 2 April 2013.
Network Aware Resource Allocation in Distributed Clouds.
Routing & Architecture
David G. Andersen CMU Guohui Wang, T. S. Eugene Ng Rice Michael Kaminsky, Dina Papagiannaki, Michael A. Kozuch, Michael Ryan Intel Labs Pittsburgh 1 c-Through:
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
IMPROVING DATACENTER PERFORMANCE AND ROBUSTNESS WITH MULTIPATH TCP Costin Raiciu†, Sebastien Barre‡, Christopher Pluntke†, Adam Greenhalgh†, Damon Wischik†,
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
Congestion control for Multipath TCP (MPTCP) Damon Wischik Costin Raiciu Adam Greenhalgh Mark Handley THE ROYAL SOCIETY.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
Approaches to Improve Data Center Performance through Networking - Gurubaran.
Data Centers and Cloud Computing 1. 2 Data Centers 3.
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
R2C2: A Network Stack for Rack-scale Computers Paolo Costa, Hitesh Ballani, Kaveh Razavi, Ian Kash Microsoft Research Cambridge EECS 582 – W161.
VL2: A Scalable and Flexible Data Center Network
Chen Qian, Xin Li University of Kentucky
Yiting Xia, T. S. Eugene Ng Rice University
Lecture 2: Cloud Computing
Confluent vs. Splittable Flows
Data Center Network Architectures
Multipath TCP and the Resource Pooling Principle
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Multipath TCP in SDN-enabled LEO Satellite Networks
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
Improving Datacenter Performance and Robustness with Multipath TCP
Multipath TCP Yifan Peng Oct 11, 2012
ECF: an MPTCP Scheduler to Manage Heterogeneous Paths
AMP: A Better Multipath TCP for Data Center Networks
Jellyfish: Networking Data Centers Randomly
Data Center Architectures
Centralized Arbitration for Data Centers
Acknowledgement: slides include content from Hedera and MP-TCP authors
Resource Pooling A system exhibits complete resource pooling if it behaves as if there was a single pooled resource. I propose ‘extent of resource pooling’
2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,
In-network computation
Towards Predictable Datacenter Networks
Data Center Traffic Engineering
Presentation transcript:

Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu Department of Computer Science University Politehnica of Bucharest Sebastien Barre (UCL-BE), Christopher Pluntke (UCL), Adam Greenhalgh (UCL), Damon Wischik (UCL) and Mark Handley (UCL) Thanks to:

This is the wrong place to start Motivation Datacenter apps are distributed across thousands of machines Want any machine to play any role To achieve this: Use dense parallel datacenter topologies Map each flow to a path Problem: Naïve random allocation gives poor performance Improving performance adds complexity This is the wrong place to start

Multipath topologies need multipath transport Contributions Multipath topologies need multipath transport Multipath transport enables better topologies

To satisfy demand, modern datacenters provide many parallel paths … Traditional Topologies are tree-based Poor performance Not fault tolerant Shift towards multipath topologies: FatTree, BCube, VL2, Cisco, EC2

Fat Tree Topology [Fares et al., 2008; Clos, 1953] K=4 Aggregation Switches 1Gbps K Pods with K Switches each 1Gbps Show multiple paths between servers Say that network is rearrangeably non blocking Clos Racks of servers 5

Fat Tree Topology [Fares et al., 2008; Clos, 1953] K=4 Aggregation Switches K Pods with K Switches each Show multiple paths between servers Say that network is rearrangeably non blocking Clos Racks of servers 6

Collisions Show multiple paths between servers Say that network is rearrangeably non blocking Clos 7

Single-path TCP collisions reduce throughput

Collision

Not fair

Not fair

No matter how you do it, mapping each flow to a path is the wrong goal

Instead, we should pool capacity from different links

Instead, we should pool capacity from different links

Instead, we should pool capacity from different links

Instead, we should pool capacity from different links

Multipath Transport

Multipath Transport can pool datacenter networks Instead of using one path for each flow, use many random paths Don’t worry about collisions. Just don’t send (much) traffic on colliding paths

Multipath TCP Primer [IETF MPTCP WG] MPTCP is a drop in replacement for TCP MPTCP spreads application data over multiple subflows

Multipath TCP: Congestion Control [NSDI, 2011]

MPTCP better utilizes the FatTree network Fairness matters for apps!

MPTCP on EC2 Amazon EC2: infrastructure as a service We can borrow virtual machines by the hour These run in Amazon data centers worldwide We can boot our own kernel A few availability zones have multipath topologies 2-8 paths available between hosts not on the same machine or in the same rack Available via ECMP

Amazon EC2 Experiment 40 medium CPU instances running MPTCP For 12 hours, we sequentially ran all-to-all iperf cycling through: TCP MPTCP (2 and 4 subflows)

MPTCP improves performance on EC2 Same Rack

What do the benefits depend on?

How many subflows are needed? How does the topology affect results? How does the traffic matrix affect results?

At most 8 subflows are needed Total Throughput TCP

MPTCP improves fairness in VL2 topologies Fairness is important: Jobs finish when the slowest worker finishes

MPTCP improves throughput and fairness in BCube

Oversubscribed Topologies To saturate full bisectional bandwidth: There must be no traffic locality All hosts must send at the same time Host links must not be bottlenecks It makes sense to under-provision the network core This is what happens in practice Does MPTCP still provide benefits?

Performance improvements depend on traffic matrix Underloaded Sweet Spot Overloaded Increase Load

What is an optimal datacenter topology for multipath transport?

In single homed topologies: Hosts links are often bottlenecks ToR switch failures wipe out tens of hosts for days Show multiple paths between servers Say that network is rearrangeably non blocking Clos Multi-homing servers is the obvious way forward 37

Fat Tree Topology Show multiple paths between servers Say that network is rearrangeably non blocking Clos 38

Fat Tree Topology Upper Pod Switch ToR Switch Servers

Dual Homed Fat Tree Topology Upper Pod Switch ToR Switch Servers

Is DHFT any better than Fat Tree? Not for traffic matrices that fully utilize the core Let’s examine random traffic patterns Other TMs in the paper

DHFT provides significant improvements when core is not overloaded Core Underloaded Core Overloaded

Summary “One flow, one path” thinking has constrained datacenter design Collisions, unfairness, limited utilization Multipath transport enables resource pooling in datacenter networks: Improves throughput Improves fairness Improves robustness “One flow, many paths” frees designers to consider topologies that offer improved performance for similar cost

Backup Slides

Effect of MPTCP on short flows Flow sizes from VL2 dataset MPTCP enabled for long flows only (timer) Oversubscribed Fat Tree topology Results: TCP/ECMP Completion time: 79ms Core Utilization: 25% MPTCP 97ms 65%

MPTCP vs Centralized Dynamic Scheduling Centralized Scheduling MPTCP Animate tough part of the graph Infinite Scheduling Interval 46

Centralized Scheduling: Setting the Threshold Throughput Hope 1Gbps 17% worse than multipath TCP App Limited 100Mbps

Centralized Scheduling: Setting the Threshold Throughput 1Gbps 21% worse than multipath TCP App Limited 100Mbps Hope

Centralized Scheduling: Setting the Threshold Throughput 1Gbps 500Mbps 45% 51% 100Mbps 17% 21%

Effect of Locality in the Dual Homed Fat Tree 50

Overloaded Fat Tree: better fairness with Multipath TCP

VL2 Topology [Greenberg et al, 2009, Clos topology] 10Gbps … 10Gbps 20 hosts

BCube Topology [Guo et al, 2009]