Integrated Services High-speed networks have enabled new applications - they also need “deliver on time” assurances from network Applications that are.

Slides:



Advertisements
Similar presentations
DISTRIBUTED MULTIMEDIA SYSTEMS
Advertisements

Quality of Service CS 457 Presentation Xue Gu Nov 15, 2001.
Spring 2003CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
Spring 2000CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
CS640: Introduction to Computer Networks Aditya Akella Lecture 20 – QoS.
Quality of Service Requirements
Engineering Internet QoS
Chapter 30 Quality of Service
15-744: Computer Networking L-18 QOS - IntServ. QOS & IntServ QOS IntServ Architecture Assigned reading [She95] Fundamental Design Issues for the Future.
T.Sharon-A.Frank 1 Multimedia Quality of Service (QoS)
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 25 Multimedia.
High Speed Networks and Internets : Multimedia Transportation and Quality of Service Meejeong Lee.
Comparison and Analysis of FIFO, PQ, and WFQ Disciplines on multimedia
1 Call Admission Control Carey Williamson Department of Computer Science University of Calgary.
Supporting Real-time Applications in An Integrated Service Packet Network CSZ Sigcomm 92.
A Case for Relative Differentiated Services and the Proportional Differentiation Model Constantinos Dovrolis Parameswaran Ramanathan University of Wisconsin-Madison.
15-441: Computer Networking Lecture 18: QoS Thanks to David Anderson and Srini Seshan.
Multiple constraints QoS Routing Given: - a (real time) connection request with specified QoS requirements (e.g., Bdw, Delay, Jitter, packet loss, path.
Ncue-csie1 A QoS Guaranteed Multipolling Scheme for Voice Traffic in IEEE Wireless LANs Der-Jiunn Deng 、 Chong-Shuo Fan 、 Chao-Yang Lin Speaker:
T.Sharon-A.Frank 1 Multimedia on the Internet. 2 T.Sharon-A.Frank Is the Internet Real-Time (MM)?
Current Research Topics -Sigcomm Sessions -QoS -Network analysis & security -Multicast -giga/tera bit routers /fast classification -web performance -TCP.
CSE 401N Multimedia Networking-2 Lecture-19. Improving QOS in IP Networks Thus far: “making the best of best effort” Future: next generation Internet.
15-744: Computer Networking L-18 QOS - IntServ. QOS & IntServ QOS IntServ Architecture Assigned reading [She95] Fundamental Design Issues for the Future.
Self-Similarity in Network Traffic Kevin Henkener 5/29/2002.
1 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
School of Information Technologies IP Quality of Service NETS3303/3603 Weeks
15-744: Computer Networking L-22 QOS - IntServ. L -22; © Srinivasan Seshan, QOS & IntServ QOS IntServ Architecture Assigned reading [She95]
CSc 461/561 CSc 461/561 Multimedia Systems Part C: 3. QoS.
Spring 2002CS 4611 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services.
Traffic Management & QoS. Quality of Service (QoS) J The collective effect of service performances which determine the degree of satisfaction of a user.
Internet Quality of Service. Quality of Service (QoS) The best-effort model, in which the network tries to deliver data from source to destination but.
24-1 Chapter 24. Congestion Control and Quality of Service part Quality of Service 23.6 Techniques to Improve QoS 23.7 Integrated Services 23.8.
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
Quality of Service. Overview Why QoS? When QoS? One model: Integrated services Contrast to Differentiated Services (more modern; more practical; not covered)
Integrated Services (RFC 1633) r Architecture for providing QoS guarantees to individual application sessions r Call setup: a session requiring QoS guarantees.
CSE QoS in IP. CSE Improving QOS in IP Networks Thus far: “making the best of best effort”
QOS مظفر بگ محمدی دانشگاه ایلام. 2 Why a New Service Model? Best effort clearly insufficient –Some applications need more assurances from the network.
CSC 336 Data Communications and Networking Lecture 8d: Congestion Control : RSVP Dr. Cheer-Sun Yang Spring 2001.
Distributed Multimedia March 19, Distributed Multimedia What is Distributed Multimedia?  Large quantities of distributed data  Typically streamed.
Distributed Multimedia Systems David Immordino. Introduction 4 A multimedia application is a real-time system responsible for the delivering and receiving.
1 Quality of Service Outline Realtime Applications Integrated Services Differentiated Services MPLS.
Quality of Service Karrie Karahalios Spring 2007.
Wolfgang EffelsbergUniversity of Mannheim1 Differentiated Services for the Internet Wolfgang Effelsberg University of Mannheim September 2001.
CSE Computer Networks Prof. Aaron Striegel Department of Computer Science & Engineering University of Notre Dame Lecture 20 – March 25, 2010.
A T M (QoS).
Beyond Best-Effort Service Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot November 2010 November.
CS 447 Network & Data Communication QoS (Quality of Service) & DiffServ Introduction Department of Computer Science Southern Illinois University Edwardsville.
Competitive Queue Policies for Differentiated Services Seminar in Packet Networks1 Competitive Queue Policies for Differentiated Services William.
TCP Trunking: Design, Implementation and Performance H.T. Kung and S. Y. Wang.
Packet switching network Data is divided into packets. Transfer of information as payload in data packets Packets undergo random delays & possible loss.
CS640: Introduction to Computer Networks Aditya Akella Lecture 21 – QoS.
EE 122: Lecture 15 (Quality of Service) Ion Stoica October 25, 2001.
Explicit Allocation of Best-Effort Service Goal: Allocate different rates to different users during congestion Can charge different prices to different.
Random Early Detection (RED) Router notifies source before congestion happens - just drop the packet (TCP will timeout and adjust its window) - could make.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
© 2006 Cisco Systems, Inc. All rights reserved. 3.2: Implementing QoS.
Univ. of TehranIntroduction to Computer Network1 An Introduction Computer Networks An Introduction to Computer Networks University of Tehran Dept. of EE.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Providing QoS in IP Networks
Integrated Services & RSVP Types of pplications Basic approach in IntServ Key components Service models.
Instructor Materials Chapter 6: Quality of Service
Topics discussed in this section:
Taxonomy of network applications
Advanced Computer Networks
CprE 458/558: Real-Time Systems
EE 122: Quality of Service and Resource Allocation
Computer Science Division
Congestion Control, Quality of Service, & Internetworking
University of Houston Quality of Service Datacom II Lecture 3
Presentation transcript:

Integrated Services High-speed networks have enabled new applications - they also need “deliver on time” assurances from network Applications that are sensitive to timeliness of data are called real-time applications - voice, video, industrial control, stock quotes, … Timeliness guarantees must come from inside the network - end-hosts can not correct late packets like they can correct for lost packets Need more than best-effort - IETF extensions to best-effort model

QoS Controls Admission Control: can requested QoS be met while honoring previously made QoS commitments to already accepted calls? This requires the application to specify its traffic pattern (through a service interface) Traffic Shaping/Policing: make sure the connection obeys its specified traffic once admitted Path Selection: select a path that is likely to satisfy requested QoS Flow Setup: communicate to intermediate routers on selected path the call’s requirements so that they reserve necessary resources (buffers, bandwidth, etc.) Packet Scheduling: manage the packets in the router queue so that they receive the service that has been requested

Traffic Classes Network should match offered service to source requirements Example: telnet requires low bandwidth and low delay - utility (or level of satisfaction) increases with decrease in delay - network should provide a low-delay service - or, telnet belongs to the low-delay traffic class Traffic classes encompass both user requirements and network service offerings

Traffic Classes (cont’d) A basic division: guaranteed (real-time) service and best-effort (elastic) - like flying with reservation or standby Guaranteed service - utility is zero unless application gets a minimum level of service quality (bandwidth, delay, loss) - open-loop flow control with admission control (goal is to lock sender and receiver into a common clocking regime) - e.g. telephony, remote sensing, interactive multiplayer games Best-effort service - send and pray - closed-loop flow control with no admission control (willing to adapt to whatever QoS is available) - e.g. , net news

IETF IntServ Traffic Classes Based on sensitivity to delay Guaranteed - intolerant: typically non-adaptive, e.g. telephony/interactive voice with fixed playback point - tolerant: typically adaptive, e.g. adaptive playback audio- or video-streaming application minimizing offset delay ==> less delay, but higher loss rate Best-effort - interactive burst (e.g. paging, messaging, ) - interactive bulk (e.g. ftp) - asynchronous bulk (e.g. net news, junk traffic)

IETF GS Subclasses Both subclasses require some bandwidth guarantee Tolerant GS - nominal mean delay, but can tolerate “occasional” variation - not specified what this means exactly! - called predictive or controlled-load service - through admission control, attempts to provide traffic delivery within the same bounds as an unloaded network - it really is this imprecise! Intolerant GS - need a worst-case delay bound - called guaranteed service (real deal!)

Scheduling Algorithms for GS Characterize source by “average” rate and minimal burst size using token bucket -- also called Linear Bounded Arrival Process (LBAP) Conformance if there is always enough tokens whenever a packet is generated. Non-conforming packets are dropped or tagged Use WFQ to reserve bandwidth at average rate Pros: - may use less bandwidth than with peak rate - can get an end-to-end delay guarantee (isolation) Cons: - for low delay bound, may need to reserve at high rate (coupling) - implementation complexity (timestamp calculation and priority queue) - can waste bandwidth; worst-case rarely happens!

Why not use WFQ for Predictive Service? Goal: minimize actual measured delay bounds WFQ provides isolation A burst by one source causes a sharp increase in delay (jitter) seen by that source In FIFO, bursts are multiplexed, thus a burst affects other sources but sees less delay (burst sharing => less jitter) Average delay is same (cf. conservation law), but 99.9 percentile delays are much smaller under FIFO Isolate different classes using WFQ, and use FIFO to benefit from sharing within each class FIFO+ reduces the accumulation of jitter over multiple hops by serving packets in order of expected arrival times under average service

Unified Scheduling WFQ with weights assigned to each guaranteed flow and to the predictive+datagram pseudo flow 0 Multiple priorities within flow 0 FIFO+ within each priority level Priority scheduling shifts jitter from higher priority to lower priority Put datagram traffic at lowest priority level Each priority level has a target delay bound Make jitter shifted from higher priority classes small by choosing target delay bounds widely spaced (at least order of magnitude), thus good isolation

Admission Control For guaranteed service, source asks the network for rate of token bucket, and uses P-G worst-case delay bound For predictive service, source specifies its token bucket parameters, and flow is admitted if - sum of its rate and measured guaranteed+predictive rate < 0.9 capacity (say, 10% for datagram) - for each lower or equal priority, the delay bound is not violated - measured quantities should be conservative to control delay violations Network utilization can be increased in the presence of predictive flows, since measured bounds are smaller

Worst-Case Delay for Predictive Effect of new predictive flow on same priority traffic: - sum of bucket sizes up to that level (including new bucket size) divided by minimum capacity leftover from higher levels ==> delay bound increase depends on new bucket size Effect of new predictive flow on lower priority traffic: - delay bound increase depends on new bucket size and new rate, which decreases leftover capacity ==> does most harm! Effect of guaranteed flow on predictive traffic: - delay bound increase depends on new rate

Approximation Replace worst-case parameters (bucket size and rate) of existing flows by measured delays and usage rates Measure delay for every packet, and update maximum delay every T unless - a new flow is added (a new T is started) - delay of a packet exceeds current value, then back off Measure utilization every S < T, and update maximum utilization every T unless - a new flow is added (a new T is started) - current value is exceeded

Tradeoffs Conservative through delay back off and burst utilization Once estimates are increased, they stay high until T expires Larger T means fewer delay violations and lower network utilization Even worse with shorter flow lifetimes

Traffic Models How users or aggregates of users typically behave - e.g. how long a user uses a modem - e.g. average size of a file transfer Models change with network usage We can only guess about the future Two types of models - measurements - educated guesses

Telephone Traffic Models How are calls placed? - call arrival model - studies show that time between calls is drawn from an exponential distribution - call arrival process is therefore Poisson How long are calls held? - usually modeled as exponential - however, measurement studies show it to be heavy tailed (e.g. Pareto distributed) - means a significant number of calls last a very long time

Internet Traffic Modeling A few applications account for most of the traffic - WWW, FTP, telnet A common approach is to model applications (this ignores distribution of destination!) - time between application invocations - connection duration - number of bytes transferred - packet interarrival distribution Little consensus on models. E.g. some found interarrival times between telnet and FTP sessions to follow exponential distribution, some a generalization of it called Weibull But two important features

Internet Traffic Models: Features LAN connections differ from WAN connections - higher bandwidth (more bytes/call) - longer holding times (free and higher bandwidth!) Many parameters are heavy-tailed - e.g. number of bytes in call, call duration - means a few calls are responsible for most of the traffic - these calls must be well managed - also means that even aggregates with many calls are not smooth (self-similar or LRD traffic) - can have long bursts New models appear all the time, to account for rapidly changing traffic mix

Benefits of Predictive Service Depending on traffic burstiness, utilization gains range from twice to order of magnitude: higher gain with more bursty sources A larger average lifetime to T ratio yields higher utilization but less reliable delay bound LRD traffic can be effectively handled with measurement-based admission control, as long as there is enough room to accommodate bursts (e.g. by lowering utilization target or increase T to reduce delay violations) A side general note on admission control: - flows traversing longer paths have higher chance of being rejected - also, more demanding flows - may want to implement some policy? May implement dynamic estimation or T, to account for burstiness?