Internet capacity sharing: Fairer, Simpler, Faster? Bob Briscoe Chief Researcher, BT Mar 2010 This work is partly funded by Trilogy, a research project.

Slides:



Advertisements
Similar presentations
End-host Perspectives on Congestion Management Murari Sridharan CONEX BOF, IETF 76, Hiroshima.
Advertisements

CONEX BoF. Welcome to CONEX! Chairs: –Leslie Daigle –Philip Eardley Scribe Note well MORE INFO: -ECN.
Using Edge-To-Edge Feedback Control to Make Assured Service More Assured in DiffServ Networks K.R.R.Kumar, A.L.Ananda, Lillykutty Jacob Centre for Internet.
Re-ECN: Adding Accountability for Causing Congestion to TCP/IP draft-briscoe-tsvwg-re-ecn-tcp-03 Bob Briscoe, BT & UCL Arnaud Jacquet, Alessandro Salvatori.
Philip Eardley, Bob Briscoe, Dave Songhurst - BT Francois Le Faucheur, Anna Charny, Vassilis Liatsos – Cisco Kwok-Ho Chan, Joe Babiarz, Stephen Dudley.
The cost of freedom Bob Briscoe Chief Researcher, BT Group and UCL Dec 2006.
Florin Dinu T. S. Eugene Ng Rice University Inferring a Network Congestion Map with Traffic Overhead 0 zero.
Guidelines for Adding Congestion Notification to Protocols that Encapsulate IP draft-briscoe-tsvwg-ecn-encap-guidelines-00 Bob Briscoe IETF-80 Mar 2011.
Usage cases for Congestion Accounting Bob Briscoe Chief Researcher, BT Oct 2009 This work is partly funded by Trilogy, a research project supported by.
Top-Down Network Design Chapter Thirteen Optimizing Your Network Design Copyright 2010 Cisco Press & Priscilla Oppenheimer.
ConEx Abstract Protocol What’s the Credit marking for? draft-mathis-conex-abstract-mech-00.txt draft-mathis-conex-abstract-mech-00.txt apologies from Bob.
Mending the Internet value chain... …in one bit Internet capacity sharing & QoS Bob Briscoe Chief Researcher, BT Oct 2009 This work is partly funded by.
© British Telecommunications plc 1 Network Performance Isolation in Data Centres using ConEx Congestion Policing draft-briscoe-conex-policing-01 draft-briscoe-conex-data-centre-02.
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Internet Networking Spring 2003 Tutorial 11 Explicit Congestion Notification (RFC 3168) Limited Transmit (RFC 3042)
Staying focused on the big unsolved problems e.g. resource accountability Bob Briscoe Chief Researcher, BT Group Sep 2008.
1 Internet Networking Spring 2003 Tutorial 11 Explicit Congestion Notification (RFC 3168)
DoS-resistant Internet - progress Bob Briscoe Jun 2005.
A Real-Time Video Multicast Architecture for Assured Forwarding Services Ashraf Matrawy, Ioannis Lambadaris IEEE TRANSACTIONS ON MULTIMEDIA, AUGUST 2005.
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #8 Explicit Congestion Notification (RFC 3168) Limited Transmit.
QoS interconnect best without effort Bob Briscoe Chief Researcher BT Sep 2009 This work is partly funded by Trilogy, a research project supported by the.
Internet cost transparency mending value chain incentives Bob Briscoe Chief Researcher, BT Sep 2009 This work is partly funded by Trilogy, a research project.
1 Proposed Additional Use Cases for Congestion Exposure draft-mcdysan-conex-other-usecases-00.txt Dave McDysan.
Fixing the Internet for sustainable business models Bob Briscoe Chief Researcher, BT Group Dec 2008.
Controlling Internet Quality with Price Market Managed Multi-service Internet Bob Briscoe BTexact Research, Edge Lab, University College London & M3I Technical.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential Security Level: Slide title :40-47pt Slide subtitle :26-30pt Color::white Corporate Font : FrutigerNext.
Peer-to-peer (p2p) value & cost Bob Briscoe Chief Researcher BT Group Sep 2008.
Collective delusions behind how capacity gets shared Bob Briscoe with Toby Moncaster & Lou Burness presented by Dirk Trossen Jan 2008.
Quality of Service (QoS)
Top-Down Network Design Chapter Thirteen Optimizing Your Network Design Oppenheimer.
Internet resource sharing: a way forward? Bob Briscoe Chief Researcher, BT May 2009 This work is partly funded by Trilogy, a research project supported.
ConEx Concepts and Uses Toby Moncaster John Leslie (JLC) Bob Briscoe (BT) Rich Woundy (ComCast) draft-moncaster-conex-concepts-uses-01.
Interconnect QoS business requirements Bob Briscoe BT Group CTO.
Internet resource sharing: a way forward? Bob Briscoe Chief Researcher, BT May 2009 This work is partly funded by Trilogy, a research project supported.
Congestion marking for low delay (& admission control) Bob Briscoe BT Research Mar 2005.
Capacity Sharing Bringing Together Cost, Value and Control Bob Briscoe Chief Researcher BT Oct 2011 This work was partly funded by Trilogy, a research.
Controlling Internet Quality with Price Market Managed Multiservice Internet Bob Briscoe BT Research, Edge Lab, University College London & M3I Technical.
Pushing Packet Processing to the Edge Scheduling in Optics is the Wrong Answer for Fine-Grained Resource Sharing Bob Briscoe Chief Researcher, BT Group.
ﺑﺴﻢﺍﷲﺍﻠﺭﺣﻣﻥﺍﻠﺭﺣﻳﻡ. Group Members Nadia Malik01 Malik Fawad03.
Re’Arch 2008 Policing Freedom… to use the Internet Resource Pool Arnaud.Jacquet, Bob.Briscoe, Toby.Moncaster December
Social & Economic Control of Networks Bob Briscoe Chief Researcher BT Networks Research Centre Jul 2007.
Congestion exposure BoF candidate protocol: re-ECN Bob Briscoe Chief Researcher, BT Nov 2009 This work is partly funded by Trilogy, a research project.
Tunnelling of Explicit Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-03.txt draft-briscoe-tsvwg-ecn-tunnel-03.txt Bob Briscoe, BT IETF-75 saag.
The speed of sharing stretching Internet access Bob Briscoe Chief Researcher, BT Apr 2009 This work is partly funded by Trilogy, a research project supported.
Design for tussle Bob Briscoe Chief Researcher, BT Jun 2009 re-architecting the Internet.
© British Telecommunications plc 1 nice traffic management without new protocols Bob Briscoe Chief Researcher, BT Oct 2012.
Downstream knowledge upstream re-feedback Bob Briscoe Arnaud Jacquet, Carla Di Cairano- Gilfedder, Andrea Soppera & Martin Koyabe BT Research.
QoS interconnect best without effort Bob Briscoe Chief Researcher BT Sep 2009 This work is partly funded by Trilogy, a research project supported by the.
Guaranteed QoS Synthesiser (GQS) Bob Briscoe, Peter Hovell BT Research Jan 2005.
Re-ECN: Adding Accountability for Causing Congestion to TCP/IP Bob Briscoe, BT & UCL Arnaud Jacquet, BT Alessandro Salvatori, BT CRN DoS w-g, Apr 2006.
CONEX BoF. Welcome to CONEX! Chairs: –Leslie Daigle –Philip Eardley Scribe Note well.
Session QoS vs bulk QoS Bob Briscoe Chief Researcher, BT Group Oct 2008.
Making stuff real re-feedback Bob Briscoe, BT Research Nov 2005 CRN DoS resistant Internet w-g.
Solving this Internet resource sharing problem... and the next, and the next Bob Briscoe Chief Researcher BT Group (& UCL) Lou Burness, Toby Moncaster,
Thoughts on the Evolution of TCP in the Internet (version 2) Sally Floyd ICIR Wednesday Lunch March 17,
Interconnect QoS settlements & impairments Bob Briscoe BT Group CTO.
Network Performance Isolation in Data Centres using Congestion Policing draft-briscoe-conex-data-centre-01.txt draft-briscoe-conex-data-centre-01.txt Bob.
Explicit Allocation of Best-Effort Service Goal: Allocate different rates to different users during congestion Can charge different prices to different.
Guidelines for Adding Congestion Notification to Protocols that Encapsulate IP (draft-ietf-tsvwg-ecn-encap-guidelines-04) Bob Briscoe (Simula Research.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Layered Encapsulation of Congestion Notification draft-briscoe-tsvwg-ecn-tunnel-01.txt draft-briscoe-tsvwg-ecn-tunnel-01.txt Bob Briscoe, BT IETF-72 tsvwg.
Internet Networking recitation #9
Bob Briscoe Simula Research Laboratory
Top-Down Network Design Chapter Thirteen Optimizing Your Network Design Copyright 2010 Cisco Press & Priscilla Oppenheimer.
Bob Briscoe, BT Murari Sridharan, Microsoft IETF-84 ConEx Jul 2012
Bob Briscoe, BT IETF-72 tsvwg Jul 2008
CONEX BoF.
Internet Networking recitation #10
Congestion Control, Quality of Service, & Internetworking
Flow Rate Fairness Many slides are borrowed from Bob Briscoe
Presentation transcript:

Internet capacity sharing: Fairer, Simpler, Faster? Bob Briscoe Chief Researcher, BT Mar 2010 This work is partly funded by Trilogy, a research project supported by the European Community

2 how to share the capacity of the Internet? the job of hosts using end-to-end protocols (e.g. TCP variants)? dynamic response to congestion from TCP-like protocols is fine but the way they share capacity is very wrong ISP's homespun alternatives have silently overridden TCP result: blocks, throttles & deep packet inspection if it’s new, it won’t get through (if it’s big, it won’t either) need a common goal for networks and hosts since 2006 IETF transport area consensus reversed ‘TCP-friendly’ was useful, but not a way forward rewrite of IETF capacity sharing architecture in process not just design-time: run-time, involving network approach: hosts still control capacity sharing by detecting congestion but using weighted variants of existing congestion controls (weighted TCP) similar dynamics, different shares give incentive for apps to set weights taking everyone into account backed by enforcement – simple policing at ingress of internetwork Internet topology visualization produced by Walrus (Courtesy of Young Hyun, CAIDA)CAIDA 2

moving mountains IETF since 2006 IETF support for TCP capacity sharing has collapsed to zero agree TCP dynamics correct, but sharing goal wrong many thought leaders support our new direction – not universally – yet! rewrite of IETF capacity sharing architecture in process IETF delegated process to IRTF design team – eventually IAB Oct’09 – Mar’10 formation of IETF working group: “congestion exposure” (ConEx) contentious: requires addition to IP (v4 & v6) IESG now ready to ratify, but not giving up last bit in IPv4 (yet!) >40 offers of significant help on list; individuals from Microsoft, Nokia, Cisco, Huawei, Alcatel-Lucent, NEC, Ericsson, NSN, Sandvine, Comcast, Verizon, … glossary IETF Internet Engineering Task Force IESG Internet Engineering Steering Group IAB Internet Architecture Board IRTF Internet Research Task Force

4 moving mountains ptII the global ICT industry GIIC: ~50 CxOs of the major global ICT corporations Apr ‘09: then BT CTO proposed GIIC endorses BT solution Sep ’09: expert review: public policy, commercial & technical Jan ‘10: GIIC published favourable assessment report manifesto in process: member lobbying & stds positions technical media coverage (ZDnet, PCWorld, Guardian, c’t, …) prompts near-universally reasonable reader postings on broadband speed, quality, pricing, net neutrality!

5 how Internet sharing ‘works’ endemic congestion & voluntary restraint those who take most, get most voluntarily polite algorithm in endpoints ‘TCP-friendliness’: a game of chicken – taking all and holding your ground pays or start more ‘TCP-friendly’ flows than anyone else (Web: x2, p2p: x5-100) or transfer more bytes for longer than anyone else (file transfer x200) net effect of both (p2p: x1,000-20,000 higher traffic intensity) flow 1 flow 2 bandwidth 2 bandwidth 1 capacity time (VoIP, VoD Joost 700kbps) unresponsive flow 3

6 no traditional sharing approaches harness end-system flexibility… over time light usage can go much faster hardly affects completion time of heavy usage NOTE: weighted sharing doesn't imply differentiated network service Just weighted aggressiveness of end-system's rate response to congestion cf. LEDBAT bit-rate time bit-rate time 1. TCP 4. DPI weighted sharing bit-rate time 2. WFQ bit-rate time 3. volume cap bit-rate time

7 no congestion across whole path  feeble transport protocol to complete ASAP, transfers should sense path bottleneck & fill it the trick congestion signal without impairment explicit congestion notification (ECN); update to IP in 2001 mark more packets as queue builds then tiny queuing delay and tiny loss for all traffic no need to avoid congestion to prevent impairment so far, gain too small to overcome deployment barriers congestion is not evil congestion signals are healthy time bit- rate time bit- rate 

8 measuring contribution to congestion user’s contribution to congestion congestion-volume = bytes marked can transfer v high volume but keep congestion-volume v low similar trick for video streaming not just two classes file sizes competing for a bottleneck span ~7 orders of magnitude bit-rate time congestion time 10GB 0.01% marking 1MB 1% marking 1MB 300MB 100MB 3MB 1% 0.01%

9 powerful resource accountability metric congestion-volume volume weighted by congestion when sent intuition contribution to congestion some ISPs count volume only during peak like counting (100% x volume) during peak and (0% x volume) otherwise congestion-volume counts p · x i over time a dual metric of customers to ISPs (too much traffic) and ISPs to customers (too little capacity) a)cost to other users of your traffic b)marginal cost of equipment upgrade so it wouldn’t have been congested so traffic wouldn’t have affected others competitive market matches a) & b) congestion, p = loss or marking fraction note: diagram is conceptual congestion volume & capital cost of equipment would be accumulated over time bit-rate x a bit-rate x b

'Cost': Congestion-Volume: Total TCP Loss [Byte] Initial results measured on Naples Uni net Each point is a user correlation coefficient: 0.43 Volume: Total TCP Traffic Volume [Byte] 100% 10% 1% 0.1% 0.01% 0.001% average congestion fraction WARNING: Requires validation with more sample data  volume  congestion-volume

11 incentivise shift to scalable performance regime as we move beyond TCP, window-equality no longer guides us we need a new framework to adjudicate sharing between overshoots at start-up and long-running flows between sluggish or aggressive recovery after congestion events to take account of run-time usage – bytes transferred, no’s of flows w t other flow arrivals are not reduced potential window without other flow arrivals w t w t other flow arrivals today a few years later TCP a scalable TCP

12 congestion-volume captures (un)fairness during dynamics time, t flow rate, x i x1x1 x2x2 congestion, p congestion bit rate, p x i v1v1 v2v2 area: congestion volume, v i =  p x i dt

13 incentive to avoid congestion policing only necessary at edge only throttles traffic when your contribution to congestion in the cloud exceeds your allowance if only... ingress net could see congestion... flat fee congestion policing bulk congestion policer Internet 0.3% congestion 0% 0.1% 2 Mb/s 0.3Mb/s 6 Mb/s Acceptable Use Policy 'congestion-volume' allowance: €15/month Allows ~70GB per day of data in typical conditions...but it can't the Internet wasn't designed this way path congestion only visible to end-points, not to network

14 Feedback path Data packet flow Sender Receiver +1+1 Routers Networks 1. Congested queue debit marks some packets 2. Receiver feeds back debit marks 3. Sender re-inserts feedback (re-feedback) into the forward data flow as credit marks congestion exposure standard ECN + re-inserted feedback (re-feedback) = re-ECN no changes required to IP data forwarding

15 congestion exposure with ECN & re-ECN measurable upstream, downstream and path congestion sender re-inserts feedback by marking packets black at any point on path,diff betw fractions of black & red bytes is downstream congestion forwarding unchanged (ECN) black marking e2e but visible at net layer for accountability 0% re-ECN fraction re-feedback 3% black – red resource index NANA NBNB R1R1 S1S1 2.6% 0.4% red (ECN) 3% feedback ECNECN Identification = C RERE X10Offset = 0 IPv6 ext hdr or IPv4 header

egress dropper (sketch) drop enough traffic to make fraction of red = black goodput best when receiver & sender both honest about feedback & re-feedback per flow state, but can re-route mid-flow (soft-state) short deterministic time-out (e.g. after >1s idle) 0 …i… n 2% code-point rate 3% 98% 2% 95% cheating sender or receiver understates black = = egress dropper NANA NBNB NDND R1R1 S1S1 policer dropper x 2/3

17 incentivise care with overshoot ‘pre-feedback’ or ‘cautious’ credit marks green : worth same as black byte for byte network gives no leeway to transport transport risks brief packet drop for any understatement advance to cover risk of congestion e.g. when opening up window makes transport internalise risk of harm to others basis for flow state mgmt on servers & middleboxes key to DDoS mitigation

18 battery optimisation applications & services transport layer on end-points usage costs currently visible here internetwork layer once usage costs revealed here ISPs won't need deep packet inspection for cost control link layer can remove bit-rate limits in shared access: passive optical, cable, wireless, cellular... the neck of the hourglass...but for control smooth quality video >2x more videos QoS mechanism simple – just go faster novel service & app behaviours traffic engin’g intra & inter QoS interconnect trivial hi-speed allowable network DDoS natural protection server DDoS protection shared medium access delegate upwards low latency always congestion policing simpler access technologies potential resilience using multi-paths access unbundling at IP layer! background transfers incentivised viable interface to Internetwork layer

19 would Microsoft set aside development time? incentives to cooperate across Internet value chain (another talk) content industry, CDNs, app & OS authors, network wholesalers & retailers, Internet companies, end-customers, business, residential what’s in it for Microsoft? ConEx certain to bring new deployment challenges intent: free host choice between ConEx & non-ConEx packets choice driven by performance, freedom and resilience market targeted Windows release as a performance leap? the feel of an enterprise LAN cf. DCTCP in the data centre not just immediate gains on upgrade continuing gains, as ISPs / enterprises… deploy AQM / ECN give ConEx traffic free pass thru old blocks and throttles withhold capacity growth from legacy non-ConEx traffic mounting pressure to ditch older Windows releases

20 summary network and host co-operation congestion-volume a metric to express and resolve conflicting interests robust to self-interest and malice ambitious but simple but deployment hurdles inevitable new horizons for the Internet if we take the challenge

21 more info... The whole story in 7 pages Bob Briscoe, “Internet Fairer is Faster", BT White Paper (Jun 2009)...this formed the basis of: Bob Briscoe, "A Fairer, Faster Internet Protocol", IEEE Spectrum (Dec 2008)A Fairer, Faster Internet Protocol Slaying myths about fair sharing of capacity [Briscoe07] Bob Briscoe, "Flow Rate Fairness: Dismantling a Religion" ACM Computer Communications Review 37(2) (Apr 2007)Flow Rate Fairness: Dismantling a Religion How wrong Internet capacity sharing is and why it's causing an arms race Bob Briscoe et al, "Problem Statement: Transport Protocols Don't Have To Do Fairness", IETF Internet Draft (Jul 2008)Problem Statement: Transport Protocols Don't Have To Do Fairness re-ECN protocol spec Bob Briscoe et al, “Adding Accountability for Causing Congestion to TCP/IP” IETF Internet Draft (Mar 2009)Adding Accountability for Causing Congestion to TCP/IP Re-architecting the Internet: The Trilogy project Trilogywww.trilogy-project.org IRTF Internet Capacity Sharing Architecture design team re-ECN & re-feedback project page: Congestion Exposure (ConEx) IETF ‘BoF’: subscribe:, post: implementation (linux or ns2)

22 Internet capacity sharing: Fairer, Simpler, Faster? discuss...

23 packet headers data 1 probability drop mark ave queue length ACKnowledgement packets network transport data probabilistic packet marking algorithm on all egress interfaces marked packet marked ACK explicit congestion notification (ECN) ECN bits 6 & 7 of IP DS byte 00:Not ECN Capable Transport (ECT) 01 or 10:ECN Capable Transport - no Congestion Experienced (sender initialises) 11:ECN Capable Transport - and Congestion Experienced (CE) DSCP IETF proposed std: RFC3168 Sep 2001 most recent change to IPv4&6 IETF proposed std: RFC3168 Sep 2001 most recent change to IPv4&6

24 Feedback path Data packet flow Sender Receiver +1+1 Routers Networks 1. Congested queue debit marks some packets 2. Receiver feeds back debit marks 3. Sender re-inserts feedback (re-feedback) into the forward data flow as credit marks 4. Outcome: End-points still do congestion control But sender has to reveal congestion it will cause Then networks can limit excessive congestion 5. Cheaters will be persistently in debt So network can discard their packets (In this diagram no-one is cheating) congestion exposure standard ECN + re-inserted feedback (re-feedback) = re-ECN no changes required to IP data forwarding

25 best without effort did you notice the interconnected QoS mechanism? endpoints ensure tiny queuing delay & loss for all traffic if your app wants more bit-rate, it just goes faster effects seen in bulk metric at every border (for SLAs, AUPs) simple – and all the right support for operations

26 NDND NANA NBNB NCNC legend: routing money and simple internalisation of all externalities re-ECN downstream congestion marking [%] bit rate area = instantaneous downstream congestion- volume just two counters at border, one for each direction meter monthly bulk volume of packet markings = aggregate money in flows without measuring flows 0|0|2|7|6|0|5 € € € highly congested link lightly congested link

27 congestion competition – inter-domain routing if congestion → profit for a network, why not fake it? upstream networks will route round more highly congested paths N A can see relative costs of paths to R 1 thru N B & N C the issue of monopoly paths incentivise new provision as long as competitive physical layer (access regulation), no problem in network layer NANA NBNB NCNC NDND R1R1 S1S1 ? down- stream route cost resource sequence index, i faked congestio n ? routin g choice

28 main steps to deploy re-feedback / re- ECN hosts (minor) addition to TCP/IP stack of sending device or sender proxy in network network turn on explicit congestion notification in data forwarding –already standardised in IP & MPLS –standards required for meshed network technologies at layer 2 (ECN in IP sufficient for point to point links) deploy simple active policing functions at customer interfaces around participating networks passive metering functions at inter-domain borders new phase of Internet evolution starts customer contracts & interconnect contracts endpoint applications and transports requires update to the IP standard (v4 & v6) in progress at IETF using bits in IPv4 header or IPv6 extension header summary rather than control sharing in the access links, pass congestion info & control upwards