FINDING THE OPTIMAL QUANTUM SIZE Revisiting the M/G/1 Round-Robin Queue VARUN GUPTA Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
VARUN GUPTA Carnegie Mellon University 1 Partly based on joint work with: Anshul Gandhi Mor Harchol-Balter Mike Kozuch (CMU) (CMU) (Intel Research)
Advertisements

VARUN GUPTA Carnegie Mellon University 1 With: Mor Harchol-Balter (CMU)
© 2004, D. J. Foreman 1 Scheduling & Dispatching.
Master Theorem Chen Dan Dong Feb. 19, 2013
Scheduling Criteria CPU utilization – keep the CPU as busy as possible (from 0% to 100%) Throughput – # of processes that complete their execution per.
Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 1  Load-balancing problem  migrate (large) jobs from a busy server to an underloaded.
Explicit Preemption Placement for Real- Time Conditional Code via Graph Grammars and Dynamic Programming Bo Peng, Nathan Fisher, and Marko Bertogna Department.
CPU Scheduling CPU Scheduler Performance metrics for CPU scheduling
IFIP Performance 2011 Best Paper
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
1 Statistical Analysis of Packet Buffer Architectures Gireesh Shrimali, Isaac Keslassy, Nick McKeown
Ceng Operating Systems Chapter 2.2 : Process Scheduling Process concept  Process scheduling Interprocess communication Deadlocks Threads.
Queuing Theory For Dummies
University of Pennsylvania 10/24/00CSE 3801 Disk Scheduling CSE 380 Lecture Note 11a Insup Lee.
ECS 152A Acknowledgement: slides from S. Kalyanaraman & B.Sikdar
Cs238 CPU Scheduling Dr. Alan R. Davis. CPU Scheduling The objective of multiprogramming is to have some process running at all times, to maximize CPU.
1 Queueing Theory H Plan: –Introduce basics of Queueing Theory –Define notation and terminology used –Discuss properties of queuing models –Show examples.
Join-the-Shortest-Queue (JSQ) Routing in Web Server Farms
Fundamental Characteristics of Queues with Fluctuating Load VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ. Alan Scheller-Wolf Carnegie.
Fluid level in tandem queues with an On/Off source VARUN GUPTA Carnegie Mellon University Joint work with PETER HARRISON Imperial College.
Operating System Process Scheduling (Ch 4.2, )
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
1 Scheduling Algorithms FCFS First-Come, First-Served Round-robin SJF Multilevel Feedback Queues.
Fundamental Characteristics of Queues with Fluctuating Load (appeared in SIGMETRICS 2006) VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ.
Effect of higher moments of job size distribution on the performance of an M/G/k system VARUN GUPTA Joint work with: Mor Harchol-Balter Carnegie Mellon.
Load Balancing and Switch Scheduling Xiangheng Liu EE 384Y Final Presentation June 4, 2003.
1 Chapter 7 Dynamic Job Shops Advantages/Disadvantages Planning, Control and Scheduling Open Queuing Network Model.
1Chapter 05, Fall 2008 CPU Scheduling The CPU scheduler (sometimes called the dispatcher or short-term scheduler): Selects a process from the ready queue.
Location Models For Airline Hubs Behaving as M/D/C Queues By: Shuxing Cheng Yi-Chieh Han Emile White.
Introduction to Queuing Theory
CPU Scheduling Chapter 6 Chapter 6.
Computer Architecture and Operating Systems CS 3230: Operating System Section Lecture OS-3 CPU Scheduling Department of Computer Science and Software Engineering.
MM Process Management Karrie Karahalios Spring 2007 (based off slides created by Brian Bailey)
Scheduling Chap 2. Scheduling Introduction to Scheduling (1) Bursts of CPU usage alternate with periods of I/O wait –a CPU-bound process –an I/O bound.
Operating Systems Lecture Notes CPU Scheduling Matthew Dailey Some material © Silberschatz, Galvin, and Gagne, 2002.
Aalto_inria2.pptINRIA Sophia Antipolis, France, On the Gittins index in the M/G/1 queue Samuli Aalto (TKK) in cooperation with Urtzi Ayesta.
1 Chapter 7 Dynamic Job Shops Advantages/Disadvantages Planning, Control and Scheduling Open Queuing Network Model.
Aalto.pptACM Sigmetrics 2007, San Diego, CA, June Mean Delay Optimization for the M/G/1 Queue with Pareto Type Service Times Samuli Aalto.
CPU Scheduling CSCI 444/544 Operating Systems Fall 2008.
1 Our focus  scheduling a single CPU among all the processes in the system  Key Criteria: Maximize CPU utilization Maximize throughput Minimize waiting.
On the variance curve of outputs for some queues and networks Yoni Nazarathy Gideon Weiss Yoav Kerner CWI Amsterdam March 2009.
Process A program in execution. But, What does it mean to be “in execution”?
Lecture 7: Scheduling preemptive/non-preemptive scheduler CPU bursts
The Asymptotic Variance of Departures in Critically Loaded Queues Yoni Nazarathy * EURANDOM, Eindhoven University of Technology, The Netherlands. (As of.
Order Optimal Delay for Opportunistic Scheduling In Multi-User Wireless Uplinks and Downlinks Michael J. Neely University of Southern California
1 Real-Time Scheduling. 2Today Operating System task scheduling –Traditional (non-real-time) scheduling –Real-time scheduling.
Analysis of SRPT Scheduling: Investigating Unfairness Nikhil Bansal (Joint work with Mor Harchol-Balter)
Non-Preemptive Buffer Management for Latency Sensitive Packets Moran Feldman Technion Seffi Naor Technion.
Lecture Topics: 11/15 CPU scheduling: –Scheduling goals and algorithms.
Reduced Rate Switching in Optical Routers using Prediction Ritesh K. Madan, Yang Jiao EE384Y Course Project.
CS333 Intro to Operating Systems Jonathan Walpole.
Purpose of Operating System Part 2 Monil Adhikari.
BSnetworks.pptTKK/ComNet Research Seminar, SRPT Applied to Bandwidth Sharing Networks (to appear in Annals of Operations Research) Samuli Aalto.
1 CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling.
CPU Scheduling Operating Systems CS 550. Last Time Deadlock Detection and Recovery Methods to handle deadlock – Ignore it! – Detect and Recover – Avoidance.
Lecture 4 CPU scheduling. Basic Concepts Single Process  one process at a time Maximum CPU utilization obtained with multiprogramming CPU idle :waiting.
Online Bipartite Matching with Augmentations Presentation by Henry Lin Joint work with Kamalika Chaudhuri, Costis Daskalakis, and Robert Kleinberg.
CPU scheduling.  Single Process  one process at a time  Maximum CPU utilization obtained with multiprogramming  CPU idle :waiting time is wasted 2.
Chapter 7 Scheduling Chien-Chung Shen CIS/UD
Scheduling.
Theory of Computational Complexity M1 Takao Inoshita Iwama & Ito Lab Graduate School of Informatics, Kyoto University.
CPU Scheduling Scheduling processes (or kernel-level threads) onto the cpu is one of the most important OS functions. The cpu is an expensive resource.
Load Balancing and Data centers
Serve Assignment Policies
Process Scheduling B.Ramamurthy 9/16/2018.
Lecture 23: Process Scheduling for Interactive Systems
Generalized Jackson Networks (Approximate Decomposition Methods)
Scheduling & Dispatching
Scheduling & Dispatching
Presentation transcript:

FINDING THE OPTIMAL QUANTUM SIZE Revisiting the M/G/1 Round-Robin Queue VARUN GUPTA Carnegie Mellon University

2 M/G/1/RR Jobs served for q units at a time Poisson arrivals Incomplete jobs

3 M/G/1/RR Jobs served for q units at a time Poisson arrivals Incomplete jobs

4 M/G/1/RR arrival rate = job sizes i.i.d. ~ S load Jobs served for q units at a time Completed jobs Incomplete jobs Poisson arrivals Squared coefficient of variability (SCV) of job sizes: C 2  0

5 M/G/1/RR M/G/1/PS M/G/1/FCFS q q → 0 q =   Preemptions cause overhead (e.g. OS scheduling)  Variable job sizes cause long delays small qlarge q preemption overheads job size variability

6 GOAL Optimal operating q for M/G/1/RR with overheads and high C 2 SUBGOAL Sensitivity Analysis: Effect of q and C 2 on M/G/1/RR performance (no switching overheads) PRIOR WORK Lots of exact analysis: [Wolff70], [Sakata et al.71], [Brown78]  No closed-form solutions/bounds  No simple expressions for interplay of q and C 2 effect of preemption overheads

7 Outline  Effect of q and C 2 on mean response time  Approximate analysis  Bounds for M/G/1/RR  Choosing the optimal quantum size No preemption overheads Effect of preemption overheads

8 Approximate sensitivity analysis of M/G/1/RR Approximation assumption 1: Service quantum ~ Exp(1/q) Approximation assumption 2: Job size distribution

9 Approximate sensitivity analysis of M/G/1/RR Monotonic in q –Increases from E[T PS ] → E[T FCFS ] Monotonic in C 2 –Increases from E[T PS ] → E[T PS ](1+ q) For high C 2 : E[T RR* ] ≈ E[T PS ](1+ q)

10 Outline  Effect of q and C 2 on mean response time Approximate analysis  Bounds for M/G/1/RR  Choosing the optimal quantum size

11 THEOREM: M/G/1/RR bounds Assumption: job sizes  {0,q,…,Kq} Lower bound is TIGHT: E[ S ] = iq Upper bound is TIGHT within (1+  /K): Job sizes  {0,Kq}

12 Outline Effect of q and C 2 on mean response time Approximate analysis Bounds for M/G/1/RR  Choosing the optimal quantum size

13 Optimizing q Preemption overhead = h 1. 2.Minimize E[T RR ] upper bound from Theorem: Common case:

14 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 h=0

15 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 approximation q* optimum q 1% h=0

16 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 1% 2.5% h=0 approximation q* optimum q

17 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 1% 2.5% 5% h=0 approximation q* optimum q

18 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 1% 2.5% 5% 7.5% h=0 approximation q* optimum q

19 service quantum (q) Mean response time Job size distribution: H 2 (balanced means)E[ S ] = 1, C 2 = 19,  = 0.8 1% 2.5% 5% 7.5% 10% h=0 approximation q* optimum q

20 Outline Effect of q and C 2 on mean response time Approximate analysis Bounds for M/G/1/RR Choosing the optimal quantum size

Conclusion/Contributions Simple approximation and bounds for M/G/1/RR Optimal quantum size for handling highly variable job sizes under preemption overheads q

22 Bounds – Proof outline D i = mean delay for i th quantum of service D = [D 1 D 2... D K ] D is the fixed point of a monotone linear system: D T = A P D T +b D1D1 D2D2 f 1 =0 f 2 =0 D D’D’ D*D* Sufficient condition for lower bound: D ’ T  A P D ’ T +b for all P Sufficient condition for upper bound: D * T  A P D * T +b for all P

23 Optimizing q Preemption overhead = h q’ = q+h, E[ S ]’ → E[ S ] (1+h/q),  ’ = E[ S ]’ Heavy traffic:Small overhead: