France Télécom R&D Engineering for QoS and the limits of service differentiation Jim Roberts IWQoS June 2000.

Slides:



Advertisements
Similar presentations
DISTRIBUTED MULTIMEDIA SYSTEMS
Advertisements

Opportunistic Scheduling Algorithms for Wireless Networks
1 IK1500 Communication Systems IK1330 Lecture 3: Networking Anders Västberg
EE 4272Spring, 2003 Chapter 12 Congestion in Data Networks Effect of Congestion Control  Ideal Performance  Practical Performance Congestion Control.
William Stallings Data and Computer Communications 7 th Edition Chapter 13 Congestion in Data Networks.
24-1 Chapter 24. Congestion Control and Quality of Service (part 1) 23.1 Data Traffic 23.2 Congestion 23.3 Congestion Control 23.4 Two Examples.
Engineering Internet QoS
Abhay.K.Parekh and Robert G.Gallager Laboratory for Information and Decision Systems Massachusetts Institute of Technology IEEE INFOCOM 1992.
Courtesy: Nick McKeown, Stanford 1 Intro to Quality of Service Tahir Azim.
Priority Scheduling and Buffer Management for ATM Traffic Shaping Authors: Todd Lizambri, Fernando Duran and Shukri Wakid Present: Hongming Wu.
CS 4700 / CS 5700 Network Fundamentals Lecture 12: Router-Aided Congestion Control (Drop it like it’s hot) Revised 3/18/13.
Recent Progress on a Statistical Network Calculus Jorg Liebeherr Department of Computer Science University of Virginia.
Fair queueing and congestion control Jim Roberts (France Telecom) Joint work with Jordan Augé Workshop on Congestion Control Hamilton Institute, Sept 2005.
IP traffic and QoS control : the need for flow aware networking Jim Roberts France Telecom R&D NSF-COST Workshop.
Differentiated Services. Service Differentiation in the Internet Different applications have varying bandwidth, delay, and reliability requirements How.
A Case for Relative Differentiated Services and the Proportional Differentiation Model Constantinos Dovrolis Parameswaran Ramanathan University of Wisconsin-Madison.
AQM for Congestion Control1 A Study of Active Queue Management for Congestion Control Victor Firoiu Marty Borden.
End-to-End Analysis of Distributed Video-on-Demand Systems Padmavathi Mundur, Robert Simon, and Arun K. Sood IEEE Transactions on Multimedia, February.
Comparing flow-oblivious and flow-aware adaptive routing Sara Oueslati and Jim Roberts France Telecom R&D CISS 2006 Princeton March 2006.
Generalized Processing Sharing (GPS) Is work conserving Is a fluid model Service Guarantee –GPS discipline can provide an end-to-end bounded- delay service.
Bandwidth sharing: objectives and algorithms Jim Roberts France Télécom - CNET Laurent Massoulié Microsoft Research.
T.Sharon-A.Frank 1 Multimedia on the Internet. 2 T.Sharon-A.Frank Is the Internet Real-Time (MM)?
Katz, Stoica F04 EECS 122: Introduction to Computer Networks Packet Scheduling and QoS Computer Science Division Department of Electrical Engineering and.
Multiple Sender Distributed Video Streaming Thinh Nguyen, Avideh Zakhor appears on “IEEE Transactions On Multimedia, vol. 6, no. 2, April, 2004”
End-to-End Analysis of Distributed Video-on-Demand Systems P. Mundur, R. Simon, and A. K. Sood IEEE Transactions on Multimedia, Vol. 6, No. 1, Feb 2004.
1 Traffic Sensitive Quality of Service Controller Masters Thesis Submitted by :Abhishek Kumar Advisors: Prof Mark Claypool Prof Robert Kinicki Reader:
Traffic Sensitive Active Queue Management - Mark Claypool, Robert Kinicki, Abhishek Kumar Dept. of Computer Science Worcester Polytechnic Institute Presenter.
Traffic Characterization Dr. Abdulaziz Almulhem. Almulhem©20012 Agenda Traffic characterization Switching techniques Internetworking, again.
1 Proportional differentiations provisioning Packet Scheduling & Buffer Management Yang Chen LANDER CSE Department SUNY at Buffalo.
France Télécom R&D Engineering for QoS and the limits of service differentiation Jim Roberts IWQoS June 2000.
Ch. 28 Q and A IS 333 Spring Q1 Q: What is network latency? 1.Changes in delay and duration of the changes 2.time required to transfer data across.
Network Planning & Capacity Management Frank Yeong-Sung Lin ( 林永松 ) Department of Information Management National Taiwan University Taipei, Taiwan, R.O.C.
Packet Scheduling From Ion Stoica. 2 Packet Scheduling  Decide when and what packet to send on output link -Usually implemented at output interface 1.
A Generalized Processor Sharing Approach to Flow Control in Integrated Services Networks: The Single-Node Case Abhay K. Parekh, Member, IEEE, and Robert.
Bell Labs Advanced Technologies EMEAAT Proprietary Information © 2004 Lucent Technologies1 Overview contributions for D27 Lucent Netherlands Richa Malhotra.
CSE QoS in IP. CSE Improving QOS in IP Networks Thus far: “making the best of best effort”
France Télécom R&D Tequila Workshop Jan 2001 The statistical nature of traffic and its impact on the realisability of QoS guarantees Jim Roberts, France.
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 - Queuing and Basics of QoS.
Distributed Multimedia March 19, Distributed Multimedia What is Distributed Multimedia?  Large quantities of distributed data  Typically streamed.
Univ. of TehranAdv. topics in Computer Network1 Advanced topics in Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Computer Networks Performance Metrics. Performance Metrics Outline Generic Performance Metrics Network performance Measures Components of Hop and End-to-End.
Salim Hariri HPDC Laboratory Enhanced General Switch Management Protocol Salim Hariri Department of Electrical and Computer.
A T M (QoS).
1 Chapters 8 Overview of Queuing Analysis. Chapter 8 Overview of Queuing Analysis 2 Projected vs. Actual Response Time.
© Jörg Liebeherr, Quality-of-Service Architectures for the Internet Integrated Services (IntServ)
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 - Queuing and Basics of QoS.
ECS5365 Lecture 6 ATM Traffic and Network Management
Scheduling CS 218 Fall 02 - Keshav Chpt 9 Nov 5, 2003 Problem: given N packet streams contending for the same channel, how to schedule pkt transmissions?
Maciej Stasiak, Mariusz Głąbowski Arkadiusz Wiśniewski, Piotr Zwierzykowski Model of the Nodes in the Packet Network Chapter 10.
Explicit Allocation of Best-Effort Service Goal: Allocate different rates to different users during congestion Can charge different prices to different.
Random Early Detection (RED) Router notifies source before congestion happens - just drop the packet (TCP will timeout and adjust its window) - could make.
1 Queuing Delay and Queuing Analysis. RECALL: Delays in Packet Switched (e.g. IP) Networks End-to-end delay (simplified) = End-to-end delay (simplified)
BSnetworks.pptTKK/ComNet Research Seminar, SRPT Applied to Bandwidth Sharing Networks (to appear in Annals of Operations Research) Samuli Aalto.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Providing QoS in IP Networks
Scheduling for QoS Management. Engineering Internet QoS2 Outline  What is Queue Management and Scheduling?  Goals of scheduling  Fairness (Conservation.
Integrated Services & RSVP Types of pplications Basic approach in IntServ Key components Service models.
1 Lecture 15 Internet resource allocation and QoS Resource Reservation Protocol Integrated Services Differentiated Services.
Tango1 Considering End-to-End QoS Constraints in IP Network Design and Planning M.Ajmone Marsan, M. Garetto, E. Leonardi. M. Mellia, E. Wille Dipartimento.
QoS & Queuing Theory CS352.
Topics discussed in this section:
CSE679: Multimedia and Networking
Insensitivity in IP network performance
CprE 458/558: Real-Time Systems
Provision of Multimedia Services in based Networks
Lightweight but Powerful QoS Solution for Embedded Systems
CSE 550 Computer Network Design
Chapter-5 Traffic Engineering.
EECS 122: Introduction to Computer Networks Packet Scheduling and QoS
کنترل جریان امیدرضا معروضی.
Presentation transcript:

France Télécom R&D Engineering for QoS and the limits of service differentiation Jim Roberts IWQoS June 2000

quality of service transparency response time accessibility service model resource sharing priorities,... network engineering provisioning routing,... feasible technology a viable business model The central role of QoS

Engineering for QoS: a probabilistic point of view è statistical characterization of traffic ‡ notions of expected demand and random processes ‡ for packets, bursts, flows, aggregates è QoS in statistical terms ‡ transparency: Pr [packet loss], mean delay, Pr [delay > x],... ‡ response time: E [response time],... ‡ accessibility:Pr [blocking],... è QoS engineering, based on a three-way relationship: performance capacity demand

Outline è traffic characteristics è QoS engineering for streaming flows è QoS engineering for elastic traffic è service differentiation

Internet traffic is self-similar è a self-similar process ‡ variability at all time scales è due to: ‡ infinite variance of flow size ‡ TCP induced burstiness è a practical consequence ‡ difficult to characterize a traffic aggregate Ethernet traffic, Bellcore 1989

Traffic on a US backbone link (Thomson et al, 1997) è traffic intensity is predictable... è... and stationary in the busy hour

Traffic on a French backbone link è traffic intensity is predictable... è... and stationary in the busy hour 12h 18h 00h 06h tue wed thu fri sat sun mon

IP flows è a flow = one instance of a given application ‡ a "continuous flow" of packets ‡ basically two kinds of flow, streaming and elastic è streaming flows ‡ audio and video, real time and playback ‡ rate and duration are intrinsic characteristics ‡ not rate adaptive (an assumption) ‡ QoS  negligible loss, delay, jitter è elastic flows ‡ digital documents ( Web pages, files,...) ‡ rate and duration are measures of performance ‡ QoS  adequate throughput (response time)

Flow traffic characteristics è streaming flows ‡ constant or variable rate compressed audio (O[10 3 bps]) compressed video (O[10 6 bps]) ‡ highly variable duration ‡ a Poisson flow arrival process (?) è elastic flows ‡ infinite variance size distribution ‡ rate adaptive ‡ a Poisson flow arrival process (??) variable rate video

Modelling traffic demand è stream traffic demand ‡ arrival rate x bit rate x duration è elastic traffic demand ‡ arrival rate x size è a stationary process in the "busy hour" ‡ eg, Poisson flow arrivals, independent flow size busy hour traffic demand Mbit/s time of day

Outline è traffic characteristics è QoS engineering for streaming flows è QoS engineering for elastic traffic è service differentiation

Open loop control for streaming traffic è Open loop control -- a "traffic contract" ‡ QoS guarantees rely on ‡ traffic descriptors + admission control + policing è time scale decomposition for performance analysis ‡ packet scale ‡ burst scale ‡ flow scale user-network interface network-network interface user-network interface

Packet scale: a superposition of constant rate flows è constant rate flows ‡ packet size/inter-packet interval = flow rate ‡ maximum packet size = MTU è buffer size for negligible overflow? ‡ over all phase alignments... ‡ assuming independence between flows è worst case assumptions: ‡ many low rate flows ‡ MTU-sized packets è  buffer sizing for M/D MTU /1 queue ‡ Pr [queue > x] ~ C e -r x buffer size log Pr [saturation] M/D MTU /1 increasing number, increasing pkt size

The "negligible jitter conjecture" è constant rate flows acquire jitter ‡ notably in multiplexer queues è conjecture:  if all flows are initially CBR and in all queues:  flow rates < service rate ‡ they never acquire sufficient jitter to become worse for performance than a Poisson stream of MTU packets è M/D MTU /1 buffer sizing remains conservative

Burst scale: fluid queueing models è assume flows have an instantaneous rate ‡ eg, rate of on/off sources è bufferless or buffered multiplexing  Pr [arrival rate > service rate] <  ‡ E [arrival rate] < service rate packets bursts arrival rate

Pr [rate overload] log Pr [saturation] buffer size 0 0 Buffered multiplexing performance: impact of burst parameters

log Pr [saturation] buffer size 0 0 burst length shorter longer

Buffered multiplexing performance: impact of burst parameters burst length less variable more variable log Pr [saturation] buffer size 0 0

Buffered multiplexing performance: impact of burst parameters burst length short range dependence long range dependence log Pr [saturation] buffer size 0 0

Choice of token bucket parameters? r b è the token bucket is a virtual Q ‡ service rate r ‡ buffer size b è non-conformance depends on ‡ burst size and variability ‡ and long range dependence è a difficult choice for conformance ‡ r >> mean rate... ‡...or b very large b' non- conformance probability b

output rate C combined input rate  t time Bufferless multiplexing: alias "rate envelope multiplexing"  provisioning and admission control to ensure Pr [  t >C] <  è performance depends only on stationary rate distribution  loss rate  E [(  t -C) + ] / E [  t ] è insensitivity to self-similarity

Efficiency of bufferless multiplexing è small amplitude of rate variations... ‡ peak rate << link rate (eg, 1%) è... or low utilisation ‡ overall mean rate << link rate è we may have both in an integrated network ‡ priority to streaming traffic ‡ residue shared by elastic flows

Flow scale: admission control è accept new flow only if transparency preserved ‡ given flow traffic descriptor ‡ current link status è no satisfactory solution for buffered multiplexing ‡ (we do not consider deterministic guarantees) ‡ unpredictable statistical performance è measurement-based control for bufferless multiplexing ‡ given flow peak rate ‡ current measured rate (instantaneous rate, mean, variance,...) è uncritical decision threshold if streaming traffic is light ‡ in an integrated network

Provisioning for negligible blocking è "classical" teletraffic theory; assume M/M/m/m  Poisson arrivals, rate ‡ constant rate per flow r  mean duration 1/    mean demand, A = (  r bits/s è blocking probability for capacity C ‡ B = E(C/r,A/r) ‡ E(m,a) is Erlang's formula: E(m,a)= m=C/r, a=A/r ‡  scale economies è generalizations exist: ‡ for different rates ‡ for variable rates  utilization (  =a/m) for E(m,a) = 0.01 m

Outline è traffic characteristics è QoS engineering for streaming flows è QoS engineering for elastic traffic è service differentiation

Closed loop control for elastic traffic è reactive control ‡ end-to-end protocols (eg, TCP) ‡ queue management è time scale decomposition for performance analysis ‡ packet scale ‡ flow scale

è a multi-fractal arrival process ‡ but loss and bandwidth related by TCP (cf. Padhye et al.) ‡ ‡ thus, p = B -1 (p): ie, loss rate depends on bandwidth share ( B ~ p -1/2 ) ? B(p) loss rate p congestion avoidance Packet scale: bandwidth and loss rate

Packet scale: bandwidth sharing è reactive control (TCP, scheduling) shares bottleneck bandwidth unequally ‡ depending on RTT, protocol implementation, etc. ‡ and differentiated services parameters è optimal sharing in a network: objectives and algorithms... ‡ max-min fairness, proportional fairness, max utility,... è... but response time depends more on traffic process than the static sharing algorithm! route 0 route 1route L Example: a linear network

Flow scale: performance of a bottleneck link è assume perfect fair shares ‡ link rate C, n elastic flows  ‡ each flow served at rate C/n è assume Poisson flow arrivals ‡ an M/G/1 processor sharing queue  load,  = arrival rate x size / C è performance insensitive to size distribution  Pr [n transfers] =  n (1-  )  E [response time] = size / C(1-  )  instability if  > 1 ‡ i.e., unbounded response time ‡ stabilized by aborted transfers... ‡... or by admission control Throughput C   a processor sharing queue fair shares link capacity C

Generalizations of PS model è non-Poisson arrivals ‡ Poisson sessions ‡ Bernoulli feedback è discriminatory processor sharing  weight  i for class i flows  service rate   i è rate limitations (same for all flows) ‡ maximum rate per flow (eg, access rate) ‡ minimum rate per flow (by admission control) Poisson session arrivals p 1-p flows think time transfer processor sharing infinite server

Admission control can be useful... to prevent disasters at sea !

Admission control can also be useful for IP flows è improve efficiency of TCP ‡ reduce retransmissions overhead... ‡... by maintaining throughput è prevent instability  due to overload (  > 1)... ‡...and retransmissions è avoid aborted transfers ‡ user impatience ‡ "broken connections" è a means for service differentiation...

Choosing an admission control threshold è N = the maximum number of flows admitted  negligible blocking when   è M/G/1/N processor sharing system ‡ min bandwidth = C/N  Pr [blocking] =  N (1 -  )/(1 -  N+1 )  (1 - 1/  for  >  è uncritical choice of threshold ‡ eg, 1% of link capacity (N=100) N E [Response time]/size  = 0.9  = N Blocking probability  = 0.9  = 1.5

backbone link (rate C) access links (rate<<C) throughput C  access rate Impact of access rate on backbone sharing è TCP throughput is limited by access rate... ‡ modem, DSL, cable è... and by server performance è  backbone link is a bottleneck only if saturated!  ie, if  > 1

Provisioning for negligible blocking for elastic flows è "elastic" teletraffic theory; assume M/G/1/m  Poisson arrivals, rate ‡ mean size s è blocking probability for capacity C  utilization  = s/C ‡ m = admission ctl limit  B( ,m) =  m (1-  )/(1-  m+1 ) è impact of access rate ‡ C/access rate = m  B( ,m)  E(m,  m) - Erlang  utilization (  ) for B = 0.01 m E(m,  m)

Outline è traffic characteristics è QoS engineering for streaming flows è QoS engineering for elastic traffic è service differentiation

Service differentiation è discriminating between stream and elastic flows ‡ transparency for streaming flows ‡ response time for elastic flows è discriminating between stream flows ‡ different delay and loss requirements ‡... or the best quality for all? è discriminating between elastic flows ‡ different response time requirements ‡... but how?

Integrating streaming and elastic traffic è priority to packets of streaming flows ‡ low utilization  negligible loss and delay è elastic flows use all remaining capacity ‡ better response times ‡ per flow fair queueing (?) è to prevent overload ‡ flow based admission control... ‡...and adaptive routing è an identical admission criterion for streaming and elastic flows ‡ available rate > R elastic streaming

Differentiation for stream traffic è different delays? ‡ priority queues, WFQ,... ‡ but what guarantees? è different loss? ‡ different utilization (CBQ,..) ‡ "spatial queue priority" partial buffer sharing, push out è or negligible loss and delay for all ‡ elastic-stream integration.. ‡... and low stream utilization loss delay loss

Differentiation for elastic traffic è different utilization ‡ separate pipes ‡ class based queuing è different per flow shares ‡ WFQ ‡ impact of RTT,... è discrimination in overload ‡ impact of aborts (?) ‡ or by admission control throughput C access rate  1 st class 3 rd class 2 nd class throughput C access rate 

Different accessibility è block class 1 when N 1 =100 flows in progress - block class 2 when N 2 flows in progress è class 1: higher priority than class N 1 0 Blocking probability  = 0.9  = 1.5

Different accessibility è block class 1 when N 1 =100 flows in progress - block class 2 when N 2 =50 flows in progress  underload: both classes have negligible blocking (B 1  B 2  0) è in overload: discrimination is effective  if  1 < 1 <  1 +  2, B 1  0, B 2  (  1 +  2 -1)/  2  if 1 <  1,  2 B 1  (  1 -1)/  1, B 2  1 B1B1 B2B  1 =  2 = N2N2 B2B2 B1B  1 =  2 = N2N2 1 B2B10B2B10 0  1 =  2 = N2N2 1

Service differentiation and pricing è different QoS requires different prices... ‡ or users will always choose the best è...but streaming and elastic applications are qualitatively different ‡ choose streaming class for transparency ‡ choose elastic class for throughput è  no need for streaming/elastic price differentiation ? è different prices exploit different "willingness to pay"... ‡ bringing greater economic efficiency è...but QoS is not stable or predictable ‡ depends on route, time of day,.. ‡ and on factors outside network control: access, server, other networks,... è  network QoS is not a sound basis for price discrimination

Pricing to pay for the network è fix a price per byte ‡ to cover the cost of infrastructure and operation è estimate demand ‡ at that price è provision network to handle that demand ‡ with excellent quality of service demand time of day $$$ capacity $$$ demand time of day capacity optimal price  revenue = cost

Outline è traffic characteristics è QoS engineering for streaming flows è QoS engineering for elastic traffic è service differentiation è conclusions

Conclusions è a statistical characterization of demand ‡ a stationary random process in the busy period ‡ a flow level characterization (streaming and elastic flows) è transparency for streaming flows ‡ rate envelope ("bufferless") multiplexing ‡ the "negligible jitter conjecture" è response time for elastic flows ‡ a "processor sharing" flow scale model ‡ instability in overload (i.e., E [demand]> capacity) è service differentiation ‡ distinguish streaming and elastic classes ‡ limited scope for within-class differentiation ‡ flow admission control in case of overload C 