© British Telecommunications plc 1 Network Performance Isolation in Data Centres using ConEx Congestion Policing draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02.

Slides:



Advertisements
Similar presentations
QoS Strategy in DiffServ aware MPLS environment Teerapat Sanguankotchakorn, D.Eng. Telecommunications Program, School of Advanced Technologies Asian Institute.
Advertisements

End-host Perspectives on Congestion Management Murari Sridharan CONEX BOF, IETF 76, Hiroshima.
Using Edge-To-Edge Feedback Control to Make Assured Service More Assured in DiffServ Networks K.R.R.Kumar, A.L.Ananda, Lillykutty Jacob Centre for Internet.
Tunnel congestion Feedback (draft-wei-tunnel-congestion-feedback-01) Xinpeng Wei Lei Zhu Lingli Deng Huawei Huawei China Mobile IETF 89 London, UK.
Nicolas Simar – DANTE : Premium IP and LBE transparency on GEANT QoS on GÉANT Premium IP and Less than Best Effort.
Guidelines for Adding Congestion Notification to Protocols that Encapsulate IP draft-briscoe-tsvwg-ecn-encap-guidelines-00 Bob Briscoe IETF-80 Mar 2011.
ConEx Abstract Protocol What’s the Credit marking for? draft-mathis-conex-abstract-mech-00.txt draft-mathis-conex-abstract-mech-00.txt apologies from Bob.
Advanced Computer Networking Congestion Control for High Bandwidth-Delay Product Environments (XCP Algorithm) 1.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-6600: Internet Protocols Informal Quiz #11 Shivkumar Kalyanaraman: GOOGLE: “Shiv RPI”
ACN: IntServ and DiffServ1 Integrated Service (IntServ) versus Differentiated Service (Diffserv) Information taken from Kurose and Ross textbook “ Computer.
Congestion Control and Resource Allocation
CSE 401N Multimedia Networking-2 Lecture-19. Improving QOS in IP Networks Thus far: “making the best of best effort” Future: next generation Internet.
ACN: Congestion Control1 Congestion Control and Resource Allocation.
School of Information Technologies IP Quality of Service NETS3303/3603 Weeks
CS 268: Lecture 11 (Differentiated Services) Ion Stoica March 6, 2001.
Congestion Control for High Bandwidth-Delay Product Environments Dina Katabi Mark Handley Charlie Rohrs.
MPLS And The Data Center Adrian Farrel Old Dog Consulting / Juniper Networks
Tiziana FerrariQuality of Service for Remote Control in the High Energy Physics Experiments CHEP, 07 Feb Quality of Service for Remote Control in.
Low-Rate TCP-Targeted DoS Attack Disrupts Internet Routing
Initial ConEx Deployment Examples draft-briscoe-conex-initial-deploy-00.txt draft-briscoe-conex-initial-deploy-00.txt apologies from Bob Briscoe, BT presented.
QoS Architectures for Connectionless Networks
CSE QoS in IP. CSE Improving QOS in IP Networks Thus far: “making the best of best effort”
ConEx Concepts and Abstract Mechanism draft-ietf-conex-abstract-mech-07.txt draft-ietf-conex-abstract-mech-07.txt Matt Mathis, Google Bob Briscoe, BT IETF-87.
Byte and Packet Congestion Notification draft-ietf-tsvwg-byte-pkt-congest-02.txt draft-ietf-tsvwg-byte-pkt-congest-02.txt Bob Briscoe, BT IETF-78 tsvwg.
Improving Network I/O Virtualization for Cloud Computing.
Tunnelling of Explicit Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-08.txt draft-briscoe-tsvwg-ecn-tunnel-08.txt Bob Briscoe, BT IETF-77 tsvwg.
1 Guidelines for Adding Congestion Notification to Protocols that Encapsulate IP draft-briscoe-tsvwg-ecn-encap-guidelines-01 Bob Briscoe IETF-85 Nov 2012.
ConEx Concepts and Abstract Mechanism draft-mathis-conex-abstract-mech-00.txt draft-mathis-conex-abstract-mech-00.txt Matt Mathis, Google Bob Briscoe,
Link Scheduling & Queuing COS 461: Computer Networks
Fluid-based Analysis of a Network of AQM Routers Supporting TCP Flows with an Application to RED Vishal Misra Wei-Bo Gong Don Towsley University of Massachusetts,
CS 447 Network & Data Communication QoS (Quality of Service) & DiffServ Introduction Department of Computer Science Southern Illinois University Edwardsville.
Congestion marking for low delay (& admission control) Bob Briscoe BT Research Mar 2005.
Capacity Sharing Bringing Together Cost, Value and Control Bob Briscoe Chief Researcher BT Oct 2011 This work was partly funded by Trilogy, a research.
Multimedia Wireless Networks: Technologies, Standards, and QoS Chapter 3. QoS Mechanisms TTM8100 Slides edited by Steinar Andresen.
Byte and Packet Congestion Notification draft-briscoe-tsvwg-byte-pkt-mark-00.txt draft-briscoe-tsvwg-byte-pkt-mark-00.txt Bob Briscoe, BT & UCL IETF-69.
1 Congestion Control Computer Networks. 2 Where are we?
Congestion exposure BoF candidate protocol: re-ECN Bob Briscoe Chief Researcher, BT Nov 2009 This work is partly funded by Trilogy, a research project.
Tunnelling of Explicit Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-03.txt draft-briscoe-tsvwg-ecn-tunnel-03.txt Bob Briscoe, BT IETF-75 saag.
Tunnelling of Explicit Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-02.txt draft-briscoe-tsvwg-ecn-tunnel-02.txt Bob Briscoe, BT IETF-74 tsvwg.
Bjorn Landfeldt, The University of Sydney 1 NETS3303 Networked Systems.
© British Telecommunications plc 1 nice traffic management without new protocols Bob Briscoe Chief Researcher, BT Oct 2012.
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 - Queuing and Basics of QoS.
Guaranteed QoS Synthesiser (GQS) Bob Briscoe, Peter Hovell BT Research Jan 2005.
Solving this Internet resource sharing problem... and the next, and the next Bob Briscoe Chief Researcher BT Group (& UCL) Lou Burness, Toby Moncaster,
Initial ConEx Deployment Examples draft-briscoe-conex-initial-deploy-02.txt draft-briscoe-conex-initial-deploy-02.txt Bob Briscoe, BT Dirk Kutscher, NEC.
Network Performance Isolation in Data Centres using Congestion Policing draft-briscoe-conex-data-centre-01.txt draft-briscoe-conex-data-centre-01.txt Bob.
ConEx Concepts and Abstract Mechanism draft-ietf-conex-abstract-mech-01.txt draft-ietf-conex-abstract-mech-01.txt Matt Mathis, Google Bob Briscoe, BT IETF-80.
Support for ECN and PCN in MPLS networks draft-davie-ecn-mpls-00.txt Bruce Davie Cisco Systems Bob Briscoe June Tay BT Research.
Byte and Packet Congestion Notification draft-briscoe-tsvwg-byte-pkt-mark-02.txt draft-briscoe-tsvwg-byte-pkt-mark-02.txt Bob Briscoe, BT & UCL IETF-71.
Tunnel-based mechanisms for datacenter latency control Xinpeng Wei.
Initial ConEx Deployment Examples draft-briscoe-conex-initial-deploy-00.txt draft-briscoe-conex-initial-deploy-00.txt apologies from Bob Briscoe, BT presented.
Guidelines for Adding Congestion Notification to Protocols that Encapsulate IP (draft-ietf-tsvwg-ecn-encap-guidelines-04) Bob Briscoe (Simula Research.
Providing QoS in IP Networks
Philip Eardley, Bob Briscoe, Dave Songhurst - BT Francois Le Faucheur, Anna Charny, Vassilis Liatsos – Cisco Kwok-Ho Chan, Joe Babiarz, Stephen Dudley.
Layered Encapsulation of Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-01.txt draft-briscoe-tsvwg-ecn-tunnel-01.txt Bob Briscoe, BT IETF-72 tsvwg.
Internet Quality of Service
Advanced Computer Networks
Instructor Materials Chapter 6: Quality of Service
Support for ECN and PCN in MPLS networks
Bob Briscoe, BT IETF-73 pcn Nov 2008
Bob Briscoe Simula Research Laboratory
Buffer Management in a Switch
Bob Briscoe, BT Murari Sridharan, Microsoft IETF-84 ConEx Jul 2012
Bob Briscoe, BT IETF-72 tsvwg Jul 2008
Congestion Control and Resource Allocation
CONEX BoF.
EE 122: Lecture 18 (Differentiated Services)
ECN Experimentation draft-black-ecn-experimentation
EE 122: Differentiated Services
Congestion Control and Resource Allocation
Presentation transcript:

© British Telecommunications plc 1 Network Performance Isolation in Data Centres using ConEx Congestion Policing draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02 draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02 Bob Briscoe Chief Researcher, BT IRTF DCLC Jul 2014 Bob Briscoe’s work is part-funded by the European Community under its Seventh Framework Programme through the Trilogy 2 project (ICT )

© British Telecommunications plc 2 purpose of talk work proposal for the data centre latency control r-g –data centre queuing delay control –designed for global scope (inter-data-centre,... Inter-net) –this talk: adds first step: intra-data centre without any new protocols started in the IETF congestion exposure (ConEx) w-g generalised for initial deployment without ConEx –and even without ECN end-to-end –now even without ECN on switches (in slides, not draft)

© British Telecommunications plc 3 Network Performance Isolation in Data Centres An important problem –isolating between tenants, or departments –virtualisation isolates CPU / memory / storage –but network and I/O system is highly multiplexed & distributed SDN-based (edge) capacity partitioning* –configuration churn: nightmare at scale –poor use of capacity edge-based weighted round robin (or WFQ) –More common –but biases towards heavy hitters (no concept of time) bit-rate time bit-rate time * every problem in computer science can be solved by not thinking then hiding the resulting mess under a layer of abstraction

© British Telecommunications plc 4 Outline Design – First Step edge bottlenecks by capacity design Edge policing like Diffserv –but congestion policing (per guest) isolation within FIFO queue no config on switches hosts switches VM sender VM receiver congestion policer guest OS hypervisor switching w

© British Telecommunications plc 5 bottleneck congestion policer foreach pkt { i = classify_user(pkt) d i += w i *(t now -t i ) //fill t i = t now d i -= s * p //drain if (d i <0) {drop(pkt)} } s : packet size p : drop prob of AQM in a well-provisioned link, policer rarely intervenes but whenever needed, it limits queue growth network policer incoming packet stream meter w1w1 w2w2 wiwi … d i (t) cici congestion token bucket Y FIFO buffer p(t) AQM … outgoing packet stream

© British Telecommunications plc 6 wiwi d i (t) cici kw i C AQM p(t) policer main congestion token bucket meter congestion burst limiter police if either bucket empties actually each bucket needs to be two buckets to limit bursts of congestion similar code –except 2 token buckets... if (d i1 <0 || d i2 <0) {drop(pkt)}

© British Telecommunications plc 7 performance isolation outcome WRR or WFQ congestion policer –with unequal traffic loads congestion policer –treats equal traffic loads equivalently to WRR time rate time rate time rate

© British Telecommunications plc 8 Outline Design edge and core queue control Edge policing like Diffserv –but congestion policing (per-guest) Hose model intra-class isolation in all FIFO queues FIFO ECN marking on L3 switches no other config on switches hosts switches VM sender VM receiver TEP / congestion policer TEP / audit guest OS hypervisor switching w

© British Telecommunications plc 9 trusted path congestion feedback Initial deployment –all under control of infrastructure admin ECN on guest hosts: optional –ECN enabled across tunnel ConEx on guest hosts: optional –any ConEx-enabled packet doesn’t require tunnel feedback details – see spare slide or draft infrastructure policer transport sender transport receiver TEP feedback between tunnel endpoints Non-ConEx packets infrastructure policeraudit transport sender transport receiver ConEx packets

© British Telecommunications plc 10 Features of Solution Network performance isolation between tenants No loss of LAN-like multiplexing benefits work-conserving Zero (tenant-related) switch configuration No change to existing switch implementations Weighted performance differentiation Simplest possible contract per-tenant network-wide allowance tenant can freely move VMs around without changing allowance sender constraint, but with transferable allowance Transport-Agnostic Extensible to wide-area and inter-data-centre interconnect

© British Telecommunications plc 11 call for interest implementation in hypervisors evaluation

Network Performance Isolation in Data Centres using congestion policing draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02 draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02 Q&A & spare slides

© British Telecommunications plc 13 measuring contribution to congestion = bytes weighted by congestion level = bytes dropped (or ECN-marked) = ‘congestion-volume’ as simple to measure as volume bit-rate time congestion time 10GB 0.01% congestion 1MB 1% congestion 1MB 300MB 100MB 3MB 1% 0.01%

unilateral deployment technique for data centre operator for e2e transports that don’t support ECN, the operator can: 1.at encap: alter 00 to 10 in outer 2.at interior buffers: turn on ECN defers any drops until egress audit just before egress can see packets to be dropped 14 exploits: widespread edge-edge tunnels in multi-tenant DCs to isolate forwarding a side-effect of standard tunnelling (IP-in-IP or any ECN link encap) DSDS ECNECN ingress egress DSDS ECNECN DSDS ECNECN DSDS ECNECN DSDS ECNECN DSDS ECNECN E congested network element DSDS ECNECN DSDS ECNECN E    drop A audit A pol- icer for e2e transports that don’t support ConEx, the operator can create its own trusted feedback: 3.at decap: only for Not-ConEx packets, feedback aggregate congestion marking counters: CE outer, Not-ECT inner = loss CE outer, ECT inner = ECN exporter collector IPFIX meter 3