MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Improving Datacenter Performance and Robustness with Multipath TCP
Deconstructing Datacenter Packet Transport Mohammad Alizadeh, Shuang Yang, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker Stanford University.
Mohammad Alizadeh, Albert Greenberg, David A. Maltz, Jitendra Padhye Parveen Patel, Balaji Prabhakar, Sudipta Sengupta, Murari Sridharan Presented by Shaddi.
Jaringan Komputer Lanjut Packet Switching Network.
Principles of Congestion Control Chapter 3.6 Computer Networking: A top-down approach.
1 Transport Protocols & TCP CSE 3213 Fall April 2015.
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
Hui Zhang, Fall Computer Networking TCP Enhancements.
PFabric: Minimal Near-Optimal Datacenter Transport Mohammad Alizadeh Shuang Yang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker.
By Arjuna Sathiaseelan Tomasz Radzik Department of Computer Science King’s College London EPDN: Explicit Packet Drop Notification and its uses.
Flowlet Switching Srikanth Kandula Shan Sinha & Dina Katabi.
TCP Congestion Control Dina Katabi & Sam Madden nms.csail.mit.edu/~dina 6.033, Spring 2014.
Congestion Control: TCP & DC-TCP Swarun Kumar With Slides From: Prof. Katabi, Alizadeh et al.
Introduction 1 Lecture 14 Transport Layer (Transmission Control Protocol) slides are modified from J. Kurose & K. Ross University of Nevada – Reno Computer.
School of Information Technologies TCP Congestion Control NETS3303/3603 Week 9.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-4690: Experimental Networking Informal Quiz: TCP Shiv Kalyanaraman:
The War Between Mice and Elephants Presented By Eric Wang Liang Guo and Ibrahim Matta Boston University ICNP
Datacenter Network Topologies
Chapter 3 Transport Layer slides are modified from J. Kurose & K. Ross CPE 400 / 600 Computer Communication Networks Lecture 12.
Congestion Control Tanenbaum 5.3, /12/2015Congestion Control (A Loss Based Technique: TCP)2 What? Why? Congestion occurs when –there is no reservation.
Transport Layer 3-1 Transport Layer r To learn about transport layer protocols in the Internet: m TCP: connection-oriented protocol m Reliability protocol.
Defense: Christopher Francis, Rumou duan Data Center TCP (DCTCP) 1.
A Switch-Based Approach to Starvation in Data Centers Alex Shpiner Joint work with Isaac Keslassy Faculty of Electrical Engineering Faculty of Electrical.
1 Chapter 3 Transport Layer. 2 Chapter 3 outline 3.1 Transport-layer services 3.2 Multiplexing and demultiplexing 3.3 Connectionless transport: UDP 3.4.
Data Communication and Networks
Dennis Ippoliti 12/6/ MULTI-PATH ROUTING A packet by packet multi-path routing approach.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Lecture 1, 1Spring 2003, COM1337/3501Computer Communication Networks Rajmohan Rajaraman COM1337/3501 Textbook: Computer Networks: A Systems Approach, L.
Path selection Packet scheduling and multipath Sebastian Siikavirta and Antti aalto.
Introduction 1 Lecture 14 Transport Layer (Congestion Control) slides are modified from J. Kurose & K. Ross University of Nevada – Reno Computer Science.
TCP: flow and congestion control. Flow Control Flow Control is a technique for speed-matching of transmitter and receiver. Flow control ensures that a.
IA-TCP A Rate Based Incast- Avoidance Algorithm for TCP in Data Center Networks Communications (ICC), 2012 IEEE International Conference on 曾奕勳.
TCP & Data Center Networking
Designing a DHT for low latency and high throughput Robert Vollmann P2P Information Systems.
3: Transport Layer3b-1 Principles of Congestion Control Congestion: r informally: “too many sources sending too much data too fast for network to handle”
Transport Layer3-1 Chapter 3 outline r 3.1 Transport-layer services r 3.2 Multiplexing and demultiplexing r 3.3 Connectionless transport: UDP r 3.4 Principles.
Multipath TCP design, and application to data centers Damon Wischik, Mark Handley, Costin Raiciu, Christopher Pluntke.
Detail: Reducing the Flow Completion Time Tail in Datacenter Networks SIGCOMM PIGGY.
Understanding the Performance of TCP Pacing Amit Aggarwal, Stefan Savage, Thomas Anderson Department of Computer Science and Engineering University of.
1 Lecture 14 High-speed TCP connections Wraparound Keeping the pipeline full Estimating RTT Fairness of TCP congestion control Internet resource allocation.
HighSpeed TCP for High Bandwidth-Delay Product Networks Raj Kettimuthu.
Analysis of TCP Latency over Wireless Links Supporting FEC/ARQ-SR for Error Recovery Raja Abdelmoumen, Mohammad Malli, Chadi Barakat PLANETE group, INRIA.
Chapter 24 Transport Control Protocol (TCP) Layer 4 protocol Responsible for reliable end-to-end transmission Provides illusion of reliable network to.
Transport Layer 3-1 Chapter 3 Transport Layer Computer Networking: A Top Down Approach 6 th edition Jim Kurose, Keith Ross Addison-Wesley March
CS640: Introduction to Computer Networks Aditya Akella Lecture 15 TCP – III Reliability and Implementation Issues.
Computer Networking Lecture 18 – More TCP & Congestion Control.
CS640: Introduction to Computer Networks Aditya Akella Lecture 15 TCP – III Reliability and Implementation Issues.
Transport Layer3-1 Chapter 3 outline r 3.1 Transport-layer services r 3.2 Multiplexing and demultiplexing r 3.3 Connectionless transport: UDP r 3.4 Principles.
Transport Layer3-1 Chapter 3 outline r 3.1 Transport-layer services r 3.2 Multiplexing and demultiplexing r 3.3 Connectionless transport: UDP r 3.4 Principles.
Chapter 11.4 END-TO-END ISSUES. Optical Internet Optical technology Protocol translates availability of gigabit bandwidth in user-perceived QoS.
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
Revisiting Transport Congestion Control Jian He UT Austin 1.
By, Nirnimesh Ghose, Master of Science,
Chapter 3 outline 3.1 transport-layer services
COMP 431 Internet Services & Protocols
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
Chapter 3 outline 3.1 Transport-layer services
Improving Datacenter Performance and Robustness with Multipath TCP
Hamed Rezaei, Mojtaba Malekpourshahraki, Balajee Vamanan
AMP: A Better Multipath TCP for Data Center Networks
Centralized Arbitration for Data Centers
TCP Congestion Control
Lecture 17, Computer Networks (198:552)
Transport Layer: Congestion Control
Chapter 3 outline 3.1 Transport-layer services
AMP: An Adaptive Multipath TCP for Data Center Networks
Presentation transcript:

MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex, UK IEEE INFOCOM 2016

Data Centre Importance Support diverse applications with diverse communication patterns and requirements – Some apps are bandwidth hungry (online file storage) – Other apps are latency sensitive (online search) The DC Performance is directly impacted the revenue of many companies – Amazon sales dropped by 1% by adding 100ms latency – Online brokers could lose 4M US dollars per millisecond if they fall 5ms behind their competitors 2

Data Center Network Properties Short flow dominance – 99% of flows are short flows (size < 100MB) – Majority of short flows are query flows with deadline in their flow completion times (size < 1MB – e.g. 50KB) – 90% of total bytes come from long flows (size > 100MB) Traffic pattern is very bursty – Bursty traffic pattern is originated from short flows Low latency and high bandwidth – Latency is in the order of microsecond (e.g μs) – Minimum link capacity is 1Gbps 3

Prob 1: Persistent Congestion 4 Core Host Two or more long flows collide on their hashes and end up on the same output port – Increasing the RTT and packet drop probability – Inefficient use of network recourses Aggr ToR ½ rate Long Flow 1 Long Flow 2

Prob 2: Transient Congestion 5 One or more long flow(s) collides with several (bursty) short flows – Increasing the RTT and packet drop probability – Inefficient use of the network resources Long Flow Core Aggr ToR Host Aggr ToR Aggr Host ToR Host ToR ½ rate Timeout Short Flow

Existing Solutions 6 Persistent Congestion MPTCP (SIGCOMM ’11) Hedera (NSDI ’10) Good for Elephant Flows Transient Congestion DCTCP (SIGCOMM ’10) D 2 TCP (SIGCOMM ’12) Good for Mice Flows No universal solution to these problems

Contribution Maximum MultiPath TCP (MMPTCP) – Build on standard MultiPath TCP (MPTCP) High goodput for long flows – ~200% increase compared to TCP Low flow completion time for short flows – ~10% in mean and ~400% in standard deviation compared to MPTCP Incremental deployment – No change into the network and application layers 7

MPTCP Overview 8 Core Host Aggr ToR Aggr ToR MPTCP opens multiple subflows at connection startup Each subflow has its own sequence number space MPTCP moves its traffic from the most congested path(s) to the least congested one(s)

MPTCP: Good for Long Flows 9 More subflows -> Better load balancing -> High Goodput

Host MPTCP: Bad for Short Flows 10 Core Aggr ToR SF1 SF2 SF3 SF4 Packet drop An entire MPTCP connection needs to wait until SF1 recovers its lost packet via a timeout ~ 200ms

MPTCP: Bad for Short Flows 11 More subflows -> Less pkts per subflow -> More Timeouts

Host MMPTCP: Good for All Flows 12 Core Aggr ToR

MMPTCP Operates in Two Phases 1.Starts a connection with one subflow – Randomises traffic on per-packet basis – Recovers lost packets over a single sequence space 2.Opens more subflows when a threshold reaches (e.g. 1MB) – MPTCP congestion control govern the data transmission – The initial subflow is deactivated at this point 13

MMPTCP Key Features Handles bursty traffic patterns gracefully Decreases the flow completion time of short flows compared to MPTCP Increases the throughput of long flows Incrementally deployable 14 MMPTCP achieves its goals by exploiting all parallel paths in the data centre faric

Packet Reordering in Phase 1 Spurious retransmissions may occur due to out-of-order packets – Existing solutions: RR-TCP, Eifel and so on – Not sufficient for latency sensitive short flows Our solution – Increase the dupack threshold based on the number of parallel paths between a src-dst pair – Perfectly works for VL2 and FatTree 15

Simulation Setup 16 A FatTree topology with 4:1 oversubscription ratio (K=8) A Permutation traffic matrix 1/3 of nodes send continuous traffic (long flows) 2/3 of nodes send short flows based on a Poisson arrival MMPTCP switching threshold of 100KB Link rate of 100Mbps and link delay of 20us

Flow Completion Time (FCT) 17 MPTCP, 8 subflows Mean FCT: 125ms Mean Stdev: 425ms MMPTCP Mean FCT: 116ms Mean Stdev: 101ms

Fast ReTx and Timeout 18 MPTCP, 8 subflows Mean FCT: 125ms Mean Stdev: 425ms MMPTCP Mean FCT: 116ms Mean Stdev: 101ms

Hotspot Hotspots occur for several reasons: – Contention between traffic flowing from the Internet to data centres (and vice versa) – Hardware failures or cable faults Simulation Setup: – Mean Short flow arrival rate of 2560/sec (Poisson) – Transport protocols under examination:  MMPTCP  MPTCP  TCP 19

Hotspot (Results)

Final Remarks MMPTCP is an extension of MPTCP – High burst tolerance – Low latency for short flows – High throughput for long flows – Incremental deployment 21

Thank You! 22