Download presentation
Presentation is loading. Please wait.
1
SE350: Operating Systems Lecture 6: Scheduling
2
Main Points Definitions Uniprocessor policies Multiprocessor policies
Response time, throughput, scheduling policy, … Uniprocessor policies FIFO, SJF, Round Robin, … Multiprocessor policies Scheduling sequential application Scheduling multithreaded applications
3
Definitions Task/Job Latency/response time Throughput Overhead
User request: e.g., mouse click, web request, shell command, … Latency/response time How long does a task take to complete? Throughput How many tasks can be done per unit of time? Overhead How much extra work is done by the scheduler? 1. Minimize response time: elapsed time to do an operation (or job) Response time is what the user sees: elapsed time to echo a keystroke in editor compile a program run a large scientific problem 2. Maximize throughput: operations (or jobs) per second Throughput is related to response time, but they're not identical -- for example, I’ll show that minimizing response time will lead you to do more context switching than you would if you were only concerned with throughput. Two parts to maximizing throughput a. Minimize overhead (for example, context switching) b. Efficient use of system resources (not only CPU, but disk, memory, etc.) What does CPU scheduling have to do with efficient use of the disk? A lot! Have to have CPU to make a disk request. 3. Fair: share CPU among users in some equitable way Tradeoff: will argue you can get better average response time by making system less fair. What does fairness mean? Minimize # of angry phone calls? Minimize my response time? Minimize average response time? We will argue fairness is a tradeoff against average response time; can get better average response time by making system less fair. Sort of like capitalism.
4
Definitions (cont.) Fairness Predictability Workload
How equal is the performance received by different users? Predictability How consistent is the performance over time? Workload Set of tasks for system to perform Preemptive scheduler If we can take resources away from a running task
5
Definitions (cont.) Work-conserving Scheduling algorithm
Resource is used whenever there is a task to run For non-preemptive schedulers, work-conserving is not always better Scheduling algorithm Takes a workload as input Decides which tasks to do first Performance metric (throughput, latency) as output Only preemptive, work-conserving schedulers to be considered
6
Important Notes Minimizing Average Response Time (ART) may lead to more context switching which hurts throughput Maximizing throughput has two parts Minimizing overhead (e.g., context switching) Utilizing system resources (e.g., CPU, disk, memory) Minimizing ART could make system less fair Delaying some tasks to make others faster
7
Main Points Definitions Uniprocessor policies Multiprocessor policies
Response time, throughput, scheduling policy, … Uniprocessor policies FIFO, SJF, Round Robin, … Multiprocessor policies Scheduling sequential application Scheduling multithreaded applications
8
First In First Out (FIFO)
Schedule tasks in the order they arrive Run them until they complete or yield the processor E.g., a supermarket line On what workloads is FIFO particularly bad? Short tasks land in line behind very long tasks FIFO leads to bad ART E.g., getting stuck behind someone with a full cart who insists on paying in pennies On what workloads is FIFO particularly good? All tasks have similar length and similar priority E.g., memcashed, Facebook cache of friend lists, …
9
Shortest Job First (SJF)
Always run the task that has the shortest remaining amount of work to do Often called Shortest Remaining Time First (SRTF) Requires knowledge of the future, usually impractical Right policy for bandwidth-constrained web service Service time could be predicted for static content ART is dominated by (frequent) requests to short pages Bandwidth costs are dominated by (less frequent) requests to large pages Large requests will be starved if web server is overloaded Requires its own set of solutions What counts as “shortest” is the remaining time left on the task, not its original length.
10
FIFO vs. SJF SJF runs short tasks first: huge effect on short tasks, negligible effect on long task
11
Questions SJF is optimal policy for minimizing ART! Why?
Consider alternative policy P (not SJF) that is optimal Because P is not SJF, at some point it chooses to run a task that is longer than the shortest task in the queue Keep the order of tasks the same, but run the shorter task first ⇒ reduced ART ⇒ contradiction! Does SJF have any downsides? Starvation, context switches, and variance in response time Some task might take forever! E.g., would it work if a supermarket uses SJF? Customers could game the system: come with one item at a time
12
Starvation and Sample Bias
Suppose you want to compare scheduling policies Create some infinite sequence of arriving tasks Start measuring Stop at some point Compute ART for finished tasks between start and stop Is this valid or invalid? SJF and FIFO would complete a different set of tasks Their ARTs are not directly comparable E.g., suppose you stopped at any point in FIFO vs. SJF slide Solution?
13
Solutions for Sample Bias
For both systems, measure for long enough that # of completed tasks >> # of uncompleted tasks Start and stop system in idle periods Idle period: no work to do If algorithms are work-conserving, both will complete the same tasks
14
Round Robin (RR) Each task runs on the processor for a fixed period of time (time quantum, slice, interval) If a task does not complete, it goes back in line Need to pick a time quantum What if time quantum is too long? Infinite? Approximates FIFO What if time quantum is too short? One instruction? Approximates SJF Can we combine best of both worlds? RR approximates SJF by moving long tasks to the end of the line.
15
Round Robin Short vs. Long Time Quantum
Approximation depends on granularity
16
Round Robin vs. FIFO Assuming zero-cost time slice, is Round Robin always better than FIFO for ART? No! For tasks with same size, RR hurts ART Show of hands! What’s the worst case for RR? Same sized jobs – then you are time-slicing for no purpose. Worse, this is nearly pessimal for average response time.
17
Round Robin vs. FIFO and SJF
Everything is fair, but ART is bad: all threads finish very late! If we are running streaming video, RR results in predictable, stable rate of progress FIFO is optimal in this case SJF maximizes variance, but RR minimizes it
18
Is Round Robin Always Fair?
Show of hands! After all, RR ensures we don’t starve, and gives everyone a turn, but lets short tasks complete before long tasks. Would this do under SJF? Every time the task returns to the CPU, it would get scheduled immediately! One iteratively needs I/O, others only need CPU I/O task has to wait its turn for CPU It gets a tiny fraction of the performance it could optimally get Other tasks get almost as much as they would if the I/O task wasn’t there Shorter quantum helps, but it increases overhead
19
What is Fairness? FIFO? Equal share of the CPU?
What if some tasks don’t need their full share? Minimize worst case divergence? Performance if no one else was running Performance under scheduling algorithm What is the slow down for previous slide if i/O bound task computes for 1 ms and the uses the disk for 10 ms and time quantum is 100ms - almost twenty!
20
Max-Min Fairness (MMF)
Maximize the minimum allocation given to a task If any task needs less than an equal share, schedule the smallest of these first Split the remaining time using max-min If all remaining tasks need at least equal share, split evenly Max-min could lead to too many context switches For tasks with same size, run one instruction of each task We can approximate Max-Min by allowing tasks to get ahead of their fair share by one time quantum At the end of each quantum, schedule the task with least accumulated allocation On previous slide, what would happen if we used max-min fairness? Then I/O task would be scheduled immediately – it’s always the one using less than its equal share. The disk-bound process needed only 10% of the processor to keep busy, but Round Robin only gave it 0.5% of the processor, while each of the two compute-bound processes received nearly 50%. Max-min fairness would assign 10% of the processor to the I/O-bound process, and it would split the remainder equally between the two compute-bound processes, with 45% each
21
Multi-level Feedback Queue (MFQ)
Goals: Responsiveness: run short tasks quickly (like SJF) Low overhead: minimize preemptions (like FIFO) Starvation freedom: make progress on all tasks (like RR) High/low priority: differ maintenance tasks Fairness: Assign max-min fair share to equal priority tasks Not perfect at any of them! Used in Linux (and probably Windows, MacOS)
22
Multi-level Feedback Queue (cont.)
MFQ is an extension of Round Robin Instead of one, set of queues each with a separate priority High priority queues have short time slices Low priority queues have long time slices Scheduler picks first thread in highest priority queue Tasks at higher priority levels preempt lower priority tasks Tasks start in highest priority queue If time slice expires, task drops one level Tasks stay at same level if they yield CPU because of I/O Tasks receiving less than their fair share go up one level
23
Multi-level Feedback Queue (cont.)
What strategy should you adopt if you knew you were running on an MFQ system? Grad school: wrote an AI to play Othello. Both sides run on a uniprocessor – so if I could slow down the other task, I’d win! Periodically print a dot to the console screen (back when printing to the screen was I/O).
24
Uniprocessor Summary (1)
FIFO is simple and minimizes overhead If tasks are variable in size, FIFO can have poor ART If tasks are equal in size, FIFO optimizes ART Considering only the processor, SJF optimizes ART SJF is pessimal in terms of variance in response time If tasks are variable in size, RR approximates SJF If tasks are equal in size, RR will have poor ART Tasks that intermix processor and I/O benefit from SJF and can do poorly under Round Robin
25
Uniprocessor Summary (2)
MMF can improve response time for I/O-bound tasks RR and MMF both avoid starvation By manipulating the assignment of tasks to priority queues, an MFQ scheduler can achieve a balance between responsiveness, low overhead, and fairness
26
Main Points Definitions Uniprocessor policies Multiprocessor policies
Response time, throughput, scheduling policy, … Uniprocessor policies FIFO, SJF, Round Robin, … Multiprocessor policies Scheduling sequential application Scheduling multithreaded applications
27
Multiprocessor Scheduling
What would happen if we used MFQ on a multiprocessor? Contention for scheduler spinlock Cache slowdown due to ready list data structure pinging from one CPU to another Limited cache reuse: thread’s data from last time it ran is often still in its old cache
28
Per-Processor Affinity Scheduling for Sequential Programs
Each processor has its own ready list Protected by a per-processor spinlock Put threads back on the ready list where it had most recently run E.g., when I/O completes, or on CV.signal() Idle processors can steal work from other processors Only if queue lengths are persistent enough to compensate for cache reloading for the migrated threads Per-processor data structures must be protected by locks In the common case the next processor to use the data will be the last one to have written it, minimizing cache coherence overhead and lock contention
29
Per-Processor Multi-level Feedback with Affinity Scheduling
30
Oblivious Scheduling for Parallel Programs
Each processor time-slices its ready list independently Each thread is treated as a separate entity If program uses locks and CVs, correctness is not affected Performance could be affected A thread could be time-sliced while others from same program are running
31
Bulk Synchronous Parallelism
A model for writing parallel programs. Eg., PageRank At each step, the computation is limited by the slowest processor to complete that step
32
Preemption and Multithreaded Programs
Preempting a thread in the middle of the chain can stall all of the processors in the chain More generally, preempting a thread on the critical path, however, will slow down the end result With “spin-then-wait”, preempting lock holder makes other tasks spin-wait until lock holder is re- scheduled
33
Gang Scheduling Schedule the same task at the same time everywhere. Problems? What about single threaded processes? You would waste time on the other processors. Threads from the same process are scheduled at exactly the same time Threads are time sliced together to provide a chance for other processes to run
34
Space Sharing Each process is assigned a subset of the processors
Minimizes processor context switches
35
How Many Processors Does a Process Need?
But we might also have non-perfect parallelism – e.g., for non-equal BSP tasks Or if you have a single lock that’s contended. For these last two cases, you’d rather give the system fewer processors and avoid time-slicing. There are overheads E.g., creating extra threads, synchronization, communication Overheads shift the curve down
36
Amdahl’s Law (G. Amdahl 1967)
Estimates upper bounds on speedups
37
What Portion of Code is Parallelizable?
Expert programmers may not know! Fortunately, we can measure speedup Set Measure Calculate To use Amdahl’s law, we need to know what portion of our code/algorithm is parallelizable. You’d still need to figure out x If you’re interested in this problem, heck out Zahedi et al., Amdahl's Law in the Datacenter Era: A Market for Fair Processor Allocation, HPCA’18 Fig. criticallyrated.files.wordpress.com ? Karp-Flatt Metric [Allen Karp and Horace Flatt 1990]
38
Acknowledgment This lecture is a slightly modified version of the one prepared by Tom Anderson
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.