Xiaowei Yang xwy@cs.duke.edu CS 356: Computer Network Architectures Lecture 18: Quality of Service (QoS) Xiaowei Yang xwy@cs.duke.edu.

Slides:



Advertisements
Similar presentations
Quality of Service CS 457 Presentation Xue Gu Nov 15, 2001.
Advertisements

Spring 2003CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
Spring 2000CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 – QoS.
Xiaowei Yang CS 356: Computer Network Architectures Lecture 19: Integrated Services and Differentiated Services Xiaowei Yang
15-744: Computer Networking L-18 QOS - IntServ. QOS & IntServ QOS IntServ Architecture Assigned reading [She95] Fundamental Design Issues for the Future.
QoS: IntServ and DiffServ Supplemental Slides Aditya Akella 02/26/2007.
CSE Computer Networks Prof. Aaron Striegel Department of Computer Science & Engineering University of Notre Dame Lecture 20 – March 25, 2010.
Differentiated Services. Service Differentiation in the Internet Different applications have varying bandwidth, delay, and reliability requirements How.
Fall 2006CS 5611 Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion Quality of Service.
1 Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion.
15-441: Computer Networking Lecture 18: QoS Thanks to David Anderson and Srini Seshan.
Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion.
CS Summer 2003 Lecture 8. CS Summer 2003 Populating LFIB with LDP Assigned/Learned Labels Changes in the LFIB may be triggered routing or.
Spring 2002CS 4611 Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion.
Spring 2003CS 4611 Congestion Control Outline Queuing Discipline Reacting to Congestion Avoiding Congestion.
CSE 401N Multimedia Networking-2 Lecture-19. Improving QOS in IP Networks Thus far: “making the best of best effort” Future: next generation Internet.
15-744: Computer Networking L-18 QOS - IntServ. QOS & IntServ QOS IntServ Architecture Assigned reading [She95] Fundamental Design Issues for the Future.
1 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
15-744: Computer Networking
School of Information Technologies IP Quality of Service NETS3303/3603 Weeks
15-744: Computer Networking L-22 QOS - IntServ. L -22; © Srinivasan Seshan, QOS & IntServ QOS IntServ Architecture Assigned reading [She95]
CSc 461/561 CSc 461/561 Multimedia Systems Part C: 3. QoS.
Spring 2002CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
Internet Quality of Service. Quality of Service (QoS) The best-effort model, in which the network tries to deliver data from source to destination but.
24-1 Chapter 24. Congestion Control and Quality of Service part Quality of Service 23.6 Techniques to Improve QoS 23.7 Integrated Services 23.8.
Computer Networking Queue Management and Quality of Service (QOS)
Computer Networking Intserv, Diffserv, RSVP.
Quality of Service. Overview Why QoS? When QoS? One model: Integrated services Contrast to Differentiated Services (more modern; more practical; not covered)
Integrated Services Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot December 2010 December 2010.
Integrated Services (RFC 1633) r Architecture for providing QoS guarantees to individual application sessions r Call setup: a session requiring QoS guarantees.
CS 356: Computer Network Architectures Lecture 17: Network Resource Allocation Chapter Xiaowei Yang
CSE679: QoS Infrastructure to Support Multimedia Communications r Principles r Policing r Scheduling r RSVP r Integrated and Differentiated Services.
CSE QoS in IP. CSE Improving QOS in IP Networks Thus far: “making the best of best effort”
Computer Networking Intserv, Diffserv, RSVP.
QOS مظفر بگ محمدی دانشگاه ایلام. 2 Why a New Service Model? Best effort clearly insufficient –Some applications need more assurances from the network.
CSC 336 Data Communications and Networking Lecture 8d: Congestion Control : RSVP Dr. Cheer-Sun Yang Spring 2001.
1 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services MPLS.
CSE Computer Networks Prof. Aaron Striegel Department of Computer Science & Engineering University of Notre Dame Lecture 20 – March 25, 2010.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 21 – Case Studies for Multimedia Network Support (Layer 3) Klara Nahrstedt Spring 2009.
© Jörg Liebeherr, Quality-of-Service Architectures for the Internet Integrated Services (IntServ)
9.7 Other Congestion Related Issues Outline Queuing Discipline Avoiding Congestion.
Network Support for QoS – DiffServ and IntServ Hongli Luo CEIT, IPFW.
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 - Queuing and Basics of QoS.
CS640: Introduction to Computer Networks Aditya Akella Lecture 21 – QoS.
EE 122: Lecture 15 (Quality of Service) Ion Stoica October 25, 2001.
Advance Computer Networking L-7 QoS. QoS IntServ DiffServ Assigned reading [ [She95] Fundamental Design Issues for the Future Internet [CSZ92] Supporting.
1 Lecture, November 27, 2002 TCP Other Internet Protocols; Internet Traffic Scalability of Virtual Circuit Networks QoS.
Ch 6. Multimedia Networking Myungchul Kim
Spring 2015© CS 438 Staff - University of Illinois1 Next Topic: Vacation Planning UIUC Chicago Monterey San Francisco Chicago to San Francisco: ALL FLIGHTS.
Random Early Detection (RED) Router notifies source before congestion happens - just drop the packet (TCP will timeout and adjust its window) - could make.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
EE 122: Integrated Services Ion Stoica November 13, 2002.
Congestion Avoidance Created by M Bateman, A Ruddle & C Allison As part of the TCP View project.
CIS679: RSVP r Review of Last Lecture r RSVP. Review of Last Lecture r Scheduling: m Decide the order of packet transmission r Resource configuration.
Univ. of TehranIntroduction to Computer Network1 An Introduction Computer Networks An Introduction to Computer Networks University of Tehran Dept. of EE.
Spring 2002CS 3321 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Integrated Services & RSVP Types of pplications Basic approach in IntServ Key components Service models.
1 Lecture 15 Internet resource allocation and QoS Resource Reservation Protocol Integrated Services Differentiated Services.
Other Methods of Dealing with Congestion
Chapter 6 Congestion Avoidance
CS 268: Computer Networking
Taxonomy of network applications
Advanced Computer Networks
QoS Guarantees introduction call admission traffic specification
Other Methods of Dealing with Congestion
Advanced Computer Networks
Other Methods of Dealing with Congestion
University of Houston Quality of Service Datacom II Lecture 3
Presentation transcript:

Xiaowei Yang xwy@cs.duke.edu CS 356: Computer Network Architectures Lecture 18: Quality of Service (QoS) Xiaowei Yang xwy@cs.duke.edu

Overview Network Resource Allocation Congestion Avoidance Why QoS? Architectural considerations Approaches to QoS Fine-grained: Integrated services RSVP Coarse-grained: Differentiated services Next lecture Fine-grained: guarantees for each individual flow 2

Administrivia Five minutes pop quiz Lab 2 due midnight Hw2 out, due before next Thursday’s class

Design Space for resource allocation Router-based vs. Host-based Reservation-based vs. Feedback-based Window-based vs. Rate-based

Properties of Fair Queuing Work conserving Max-min fair

Weighted Fair Queuing Different queues get different weights Take wi amount of bits from a queue in each round Fi = Si+Pi / wi Quality of service

Deficit Round Robin (DRR) WFQ: extracting min is O(log Q) DRR: O(1) rather than O(log Q) Each queue is allowed to send Q bytes per round If Q bytes are not sent (because packet is too large) deficit counter of queue keeps track of unused portion If queue is empty, deficit counter is reset to 0 Similar behavior as FQ but computationally simpler

Unused quantum saved for the next round How to set quantum size? to offset packet size unfairness Unused quantum saved for the next round How to set quantum size? Too small Too large

Congestion Avoidance

Design goals Predict when congestion is going to happen Reduce sending rate before buffer overflows Not widely deployed Reducing queuing delay and packet loss are not essential

Mechanisms Router+host joint control Router: Early signaling of congestion Host: react to congestion signals Case studies: DECbit, Random Early Detection Host: Source-based congestion avoidance Host detects early congestion Case study: TCP Vegas

DECbit Add a congestion bit to a packet header A router sets the bit if its average queue length is non-zero Queue length is measured over a busy+idle interval If less than 50% of packets in one window do not have the bit set A host increases its congest window by 1 packet Otherwise Decreases by 0.875 AIMD

Random Early Detection Random early detection (Floyd93) Goal: operate at the “knee” Problem: very hard to tune (why) RED is generalized by Active Queue Managment (AQM) A router measures average queue length using exponential weighted averaging algorithm: AvgLen = (1-Weight) * AvgLen + Weight * SampleQueueLen

RED algorithm If AvgLen ≤ MinThreshold avg_qlen p min_thresh 1 max_thresh If AvgLen ≤ MinThreshold Enqueue packet If MinThreshold < AvgLen < MaxThreshold Calculate dropping probability P Drop the arriving packet with probability P If MaxThreshold ≤ AvgLen Drop the arriving packet

Even out packet drops TempP = MaxP x (AvgLen – Min)/(Max-Min) avg_qlen TempP min_thresh 1 max_thresh TempP = MaxP x (AvgLen – Min)/(Max-Min) P = TempP / (1 – count * TempP) Count keeps track of how many newly arriving packets have been queued when min < Avglen < max It keeps drop evenly distributed over time, even if packets arrive in burst

An example MaxP = 0.02 AvgLen is half way between min and max thresholds TempP = 0.01 A burst of 1000 packets arrive With TempP, 10 packets may be discarded uniformly randomly among the 1000 packets With P, they are likely to be more evently spaced out, as P gradually increases if previous packets are not discarded

Explicit Congestion Notification A new IETF standard We use two bits in IP header (ECN bits) for routers to signal congestion back to TCP senders TCP halves its window size as if it suffers a packet drop Use a Congestion Experience (CE) bit to signal congestion, instead of a packet drop Why is it better than a drop? AQM is used for packet marking CE=1 X ECE=1 CWR=1

Source-based congestion avoidance TCP Vegas Detect increases in queuing delay Reduces sending rate Details Record baseRTT (minimum seen) Compute ExpectedRate = cwnd/BaseRTT Diff = ExpectedRate - ActualRate When Diff < α, incr cwnd linearly, when Diff > β, decr cwnd linearly α < β

cwnd Top: congestion window Bottom: expected (colred) and actual (black) throughput The shaded area is the region between the alpha and beta thresholds.

Summary The problem of network resource allocation Case studies TCP congestion control Fair queuing Congestion avoidance Active queue management Source-based congestion avoidance

Overview Network Resource Allocation Congestion Avoidance Why QoS? Architectural considerations Approaches to QoS Fine-grained: Integrated services RSVP Coarse-grained: Differentiated services Next lecture Fine-grained: guarantees for each individual flow 22

Motivation Internet currently provides one single class of “best-effort” service No assurance about delivery Many existing applications are elastic Tolerate delays and losses Can adapt to congestion “Real-time” applications may be inelastic 23

Inelastic Applications Continuous media applications Lower and upper limit on acceptable performance Below which video and audio are not intelligible Internet telephones, teleconferencing with high delay (200 - 300ms) impair human interactions Hard real-time applications Require hard limits on performance E.g., industrial control applications Internet surgery 24

Design question #1: Why a New Service Model? What is the basic objective of network design? Maximize total bandwidth? Minimize latency? Maximize ISP’s revenues? the designer’s choice: Maximize social welfare: the total utility given to users What does utility vs. bandwidth look like? Must be non-decreasing function Shape depends on application 25

Utility Curve Shapes Stay to the right and you are fine for all curves BW U Elastic BW U Hard real-time BW U Delay-adaptive Stay to the right and you are fine for all curves 26

Playback Applications Sample signal  packetize  transmit  buffer  playback Fits most multimedia applications Performance concern: Jitter: variation in end-to-end delay Delay = fixed + variable = (propagation + packetization) + queuing Solution: Playback point – delay introduced by buffer to hide network jitter 27

Characteristics of Playback Applications In general lower delay is preferable Doesn’t matter when packet arrives as long as it is before playback point Network guarantees (e.g., bound on jitter) would make it easier to set playback point Applications can tolerate some loss 29

Applications Variations Rigid and adaptive applications Delay adaptive Rigid: set fixed playback point Adaptive: adapt playback point E.g. Shortening silence for voice applications Rate adaptive Loss tolerant and intolerant applications Four combinations Move playback point around based on observed network delay Gamble that network conditions will be the same as in the past Are prepared to deal with errors in their estimate Will have an earlier playback point than rigid applications How to adapt: 30

Applications Variations Really only two classes of applications 1) Intolerant and rigid Tolerant and adaptive Other combinations make little sense 3) Intolerant and adaptive - Cannot adapt without interruption Tolerant and rigid - Missed opportunity to improve delay 32

Design question 2: How to maximize V =  U(si) Choice #1: add more pipes Discuss later Choice #2: fix the bandwidth but offer different services Q: can differentiated services improve V?

If all users’ utility functions are elastic Does equal allocation of bandwidth maximize total utility? Bandwidth  si = B Max  U(si) 34

Design question: is Admission Control needed? If U(bandwidth) is concave  elastic applications Incremental utility is decreasing with increasing bandwidth U(x) = log(xp) V = nlog(B/n) p= logBpn1-p Is always advantageous to have more flows with lower bandwidth No need of admission control; This is why the Internet works! And fairness makes sense BW U Elastic 35

Utility Curves – Inelastic traffic BW U Delay-adaptive BW U Hard real-time Does equal allocation of bandwidth maximize total utility? 36

Is Admission Control needed? If U is convex  inelastic applications U(number of flows) is no longer monotonically increasing Need admission control to maximize total utility Admission control  deciding when the addition of new people would result in reduction of utility Basically avoids overload BW U Delay-adaptive 37

Incentives Who should be given what service? Users have incentives to cheat Pricing seems to be a reasonable choice But usage-based charging may not be well received by users

Over provisioning Pros: simple Cons Not cost effective Bursty traffic leads to a high peak/average ratio E.g., normal users versus leading edge users It might be easier to block heavy users

Comments End-to-end QoS has not happened Why? Can you think of any mechanism to make it happen?

Overview Why QOS? Approaches to QoS Architectural considerations Fine-grained: Integrated services RSVP Coarse-grained: Differentiated services Next lecture Fine-grained: guarantees for each individual flow 41

Components of Integrated Services Service classes What does the network promise? Service interface How does the application describe what it wants? Establishing the guarantee How is the promise communicated to/from the network How is admission of new applications controlled? Packet scheduling How does the network meet promises? 42

1. Service classes What kind of promises/services should network offer? Depends on the characteristics of the applications that will use the network …. 43

Service classes Guaranteed service For intolerant and rigid applications Fixed guarantee, network meets commitment as long as clients send at match traffic agreement Controlled load service For tolerant and adaptive applications Emulate lightly loaded networks Datagram/best effort service Networks do not introduce loss or delay unnecessarily Two components If conditions do not change, commit to current service If conditions change, take steps to deliver consistent performance (help apps minimize playback delay) Implicit assumption – network does not change much over time 44

Components of Integrated Services Type of commitment What does the network promise? Service interface How does the application describe what it wants? Establishing the guarantee How is the promise communicated to/from the network How is admission of new applications controlled? Packet scheduling How does the network meet promises? 45

Service interfaces Flowspecs TSpec: a flow’s traffic characteristics Difficult: bandwidth varies RSpec: the service requested from the network Service dependent E.g. controlled load

A Token Bucket Filter Operation: If bucket fills, tokens are discarded Tokens enter bucket at rate r Operation: If bucket fills, tokens are discarded Sending a packet of size P uses P tokens If bucket has P tokens, packet sent at max rate, else must wait for tokens to accumulate Bucket depth b: capacity of bucket 47

Token Bucket Operations Tokens Tokens Tokens Overflow Packet Packet Enough tokens  packet goes through, tokens removed Not enough tokens  wait for tokens to accumulate 48

Token Bucket Characteristics In the long run, rate is limited to r In the short run, a burst of size b can be sent Amount of traffic entering at interval T is bounded by: Traffic = b + r*T Information useful to admission algorithm 49

Token Bucket Specs BW Flow B 2 1 Flow A 1 2 3 Time Flow A: r = 1 MBps, B=1 byte 1 Flow A Flow B: r = 1 MBps, B=1MB 1 2 3 Time 50

TSpec TokenBucketRate TokenBucketSize PeakRate MinimumPolicedUnit MaximumPacketSize

Service Interfaces: RSpec Guaranteed Traffic TokenRate and DelayVariation Or DelayVariation and Latency Controlled load Type of service 52

Components of Integrated Services Type of commitment What does the network promise? Service interface How does the application describe what it wants? Establishing the guarantee How is the promise communicated to/from the network How is admission of new applications controlled? Packet scheduling How does the network meet promises? 53

RSVP Goals Used on connectionless networks Robust Should not replicate routing functionality Should co-exist with route changes Support for multicast Modular design – should be generic “signaling” protocol Approaches Receiver-oriented Soft-state Different receivers have different capabilities and want different QOS Changes in group membership should not be expensive Reservations should be aggregate – I.e. each receiver in group should not have to reserve Should be able to switch allocated resource to different senders 54

RSVP Service Model Make reservations for simplex data streams Receiver decides whether to make reservation Control msgs in IP datagrams (proto #46) PATH/RESV sent periodically to refresh soft state One pass: Failed requests return error messages - receiver must try again No e2e ack for success 55

PATH Messages PATH messages carry sender’s Tspec Token bucket parameters Routers note the direction PATH messages arrived and set up reverse path to sender Receivers send RESV messages that follow reverse path and setup reservations If reservation cannot be made, user gets an error 56

RESV Messages Forwarded via reverse path of PATH A receiver sends RESV messages TSpec from the sender Rspec 57

Admission control Router performs admission control and reserves resources If request rejected, send error message to receiver Guaranteed service: a yes/no based on available bandwidth Controlled load: heuristics If delay has not exceeded the bound last time after admitting a similar flow, let it in

Soft State to Adapt to Routing Changes Problems: Routing protocol makes routing changes Solution: PATH and RESV messages sent periodically Non-refreshed state times out automatically Ex: a link fails. How is a new reservation established? 59

Merging multicast reservations A requests a delay < 100ms B requests a delay < 200ms

Components of Integrated Services Type of commitment What does the network promise? Service interface How does the application describe what it wants? Establishing the guarantee How is the promise communicated to/from the network How is admission of new applications controlled? Packet scheduling How does the network meet promises? 61

Packet classification and scheduling Map a packet to a service class (src addr, dst addr, proto, src port, dst port) Use scheduling algorithms to provide the service An implementation issue

Scheduling for Guaranteed Traffic Use WFQ at the routers Q: will DRR work? Each flow is assigned to its individual queue Parekh’s bound for worst case queuing delay = b/r b = bucket depth r = rate of arrival 63

Controlled Load Service Goals: Isolation Isolates well-behaved from misbehaving sources Sharing Mixing of different sources in a way beneficial to all Possible Mechanisms: WFQ Aggregate multiple flows into one WFQ 64

Unified Scheduling Scheduling: use WFQ in routers Guaranteed Service Controlled Load Class I Controlled Load Class II Each guaranteed flow gets its own queue All controlled load service flows and best effort aggregates in single separate queue Controlled load classes Worst case delay for classes separated by order of magnitude When high priority needs extra bandwidth – steals it from lower class Best effort traffic acts as lowest priority class Best Effort Scheduling: use WFQ in routers 65

Scalability A lot of requests and state! ISPs feel it is not the right service model for them! Per-flow reservation/queue OC-48 link 2.5Gbps 64Kbps audio stream  39,000 flows Reservation and state needs to be stored in memory, and refreshed periodically Classify, police, nd queue each flows

Comments on RSVP Not widely deployed as a commercial service Used for other purposes Setting up MPLS tunnels etc.

Summary Why QOS? Approaches to QoS Next lecture: Architectural considerations Approaches to QoS Fine-grained: Integrated services RSVP Coarse-grained: Differentiated services Next lecture: DiffServ Net Neutrality Fine-grained: guarantees for each individual flow 68