1 Task Assignment with Unknown Duration Mor Harchol-Balter Carnegie Mellon.

Slides:



Advertisements
Similar presentations
Scheduling Introduction to Scheduling
Advertisements

VARUN GUPTA Carnegie Mellon University 1 With: Mor Harchol-Balter (CMU)
Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 1  Load-balancing problem  migrate (large) jobs from a busy server to an underloaded.
Page 1 Alan Scheller-Wolf Lunteren, The Netherlands January 16, 2013 Things I Thought I Knew about Queueing Theory, but was Wrong About (Part 2, Service.
Scheduling. Main Points Scheduling policy: what to do next, when there are multiple threads ready to run – Or multiple packets to send, or web requests.
Anshul Gandhi (Carnegie Mellon University) Varun Gupta (CMU), Mor Harchol-Balter (CMU) Michael Kozuch (Intel, Pittsburgh)
1 Size-Based Scheduling Policies with Inaccurate Scheduling Information Dong Lu *, Huanyuan Sheng +, Peter A. Dinda * * Prescience Lab, Dept. of Computer.
Software Diversity for Information Security Gaurav Kataria Carnegie Mellon University.
Load Balancing of Elastic Traffic in Heterogeneous Wireless Networks Abdulfetah Khalid, Samuli Aalto and Pasi Lassila
1. What is the probability that a randomly selected person is a woman who likes P.E.? 2. Given that you select a man, what is the probability that he likes.
Web Server Request Scheduling Mingwei Gong Department of Computer Science University of Calgary November 16, 2004.
How Useful is Old Information? On the Management and Efficiency of Cloud Based Services seminar MICHAEL MITZENMACHER November 2010 Computer Science faculty.
1 IOE/MFG 543 Chapter 11: Stochastic single machine models with release dates.
Chapter 3 Operating Systems. Chapter 3 Operating Systems 3.1 The Evolution of Operating Systems 3.1 The Evolution of Operating Systems 3.2 Operating System.
Optimization for QoS on Systems with Tasks Deadlines Luis Fernando Orleans Pedro Nuno Furtado.
Uniprocessor Scheduling Chapter 9. Aim of Scheduling The key to multiprogramming is scheduling Scheduling is done to meet the goals of –Response time.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies Dong Lu* + Peter Dinda* Yi Qiao* Huanyuan Sheng* *Northwestern.
Research Paper Example Exploiting Process Lifetime Distributions for Dynamic Load Balancing Mor Harchol-Balter Allen Downey SIGMETRICS 2006.
Looking at the Server-side of P2P Systems Yi Qiao, Dong Lu, Fabian E. Bustamante and Peter A. Dinda Department of Computer Science Northwestern University.
Cs238 CPU Scheduling Dr. Alan R. Davis. CPU Scheduling The objective of multiprogramming is to have some process running at all times, to maximize CPU.
1 Connection Scheduling in Web Servers Mor Harchol-Balter School of Computer Science Carnegie Mellon
Performance Evaluation
Join-the-Shortest-Queue (JSQ) Routing in Web Server Farms
5: CPU-Scheduling1 Jerry Breecher OPERATING SYSTEMS SCHEDULING.
Fundamental Characteristics of Queues with Fluctuating Load VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ. Alan Scheller-Wolf Carnegie.
1 TCOM 501: Networking Theory & Fundamentals Lectures 9 & 10 M/G/1 Queue Prof. Yannis A. Korilis.
Carnegie Mellon University Computer Science Department 1 CLASSIFYING SCHEDULING POLICIES WITH RESPECT TO HIGHER MOMENTS OF CONDITIONAL RESPONSE TIME Adam.
Scheduling task with heavy tail distributions Eddie Aronovich.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Emery Berger University of Massachusetts, Amherst Operating Systems CMPSCI 377 Lecture.
1 Mor Harchol-Balter Carnegie Mellon University Joint work with Bianca Schroeder.
Operating System Process Scheduling (Ch 4.2, )
1 Scheduling Algorithms FCFS First-Come, First-Served Round-robin SJF Multilevel Feedback Queues.
Fundamental Characteristics of Queues with Fluctuating Load (appeared in SIGMETRICS 2006) VARUN GUPTA Joint with: Mor Harchol-Balter Carnegie Mellon Univ.
Computer Systems Design
ORF Electronic Commerce Spring 2009 April 15, 2009 Week 10 Capacity Analysis Deterministic Model –Assume: each HTTP request takes T r (request time)
1Chapter 05, Fall 2008 CPU Scheduling The CPU scheduler (sometimes called the dispatcher or short-term scheduler): Selects a process from the ready queue.
1 Chapters 9 Self-SimilarTraffic. Chapter 9 – Self-Similar Traffic 2 Introduction- Motivation Validity of the queuing models we have studied depends on.
Rensselaer Polytechnic Institute CSCI-4210 – Operating Systems David Goldschmidt, Ph.D.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Lecture 5 Operating Systems.
OPERATING SYSTEMS CPU SCHEDULING.  Introduction to CPU scheduling Introduction to CPU scheduling  Dispatcher Dispatcher  Terms used in CPU scheduling.
Chapter 9 Uniprocessor Scheduling Spring, 2011 School of Computer Science & Engineering Chung-Ang University.
1 Mor Harchol-Balter Computer Science Dept, CMU What Analytical Performance Modeling Teaches Us About Computer Systems Design.
1 Mor Harchol-Balter Carnegie Mellon University Computer Science Heavy Tails: Performance Models & Scheduling Disciplines.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
1 Chapter 5 Flow Lines Types Issues in Design and Operation Models of Asynchronous Lines –Infinite or Finite Buffers Models of Synchronous (Indexing) Lines.
Chapter 2 Machine Interference Model Long Run Analysis Deterministic Model Markov Model.
1 Scheduling Processes. 2 Processes Each process has state, that includes its text and data, procedure call stack, etc. This state resides in memory.
Chapter 6 Scheduling. Basic concepts Goal is maximum utilization –what does this mean? –cpu pegged at 100% ?? Most programs are I/O bound Thus some other.
임규찬. 1. Abstract 2. Introduction 3. Design Goals 4. Sample-Based Scheduling for Parallel Jobs 5. Implements.
Aalto.pptACM Sigmetrics 2007, San Diego, CA, June Mean Delay Optimization for the M/G/1 Queue with Pareto Type Service Times Samuli Aalto.
1 Nasser Alsaedi. The ultimate goal for any computer system design are reliable execution of task and on time delivery of service. To increase system.
1 The Effect of Heavy-Tailed Job Size Distributions on System Design Mor Harchol-Balter MIT Laboratory for Computer Science.
Process A program in execution. But, What does it mean to be “in execution”?
Lecture 7: Scheduling preemptive/non-preemptive scheduler CPU bursts
Chapter 5 Processor Scheduling Introduction Processor (CPU) scheduling is the sharing of the processor(s) among the processes in the ready queue.
Uniprocessor Scheduling
Scheduling. Main Points Scheduling policy: what to do next, when there are multiple threads ready to run – Or multiple packets to send, or web requests.
Analysis of SRPT Scheduling: Investigating Unfairness Nikhil Bansal (Joint work with Mor Harchol-Balter)
Modelling job allocation where service duration is unknown Nigel Thomas University of Newcastle upon Tyne.
Basic Queuing Insights Nico M. van Dijk “Why queuing never vanishes” European Journal of Operational Research 99 (1997)
1 Mor Harchol-Balter Carnegie Mellon with Nikhil Bansal with Bianca Schroeder with Mukesh Agrawal.
Queuing Theory Important definition in Queuing Theory: We define various terms, which are used in our queue model. 1.Queue length: The number of customers.
Lecture 5 Scheduling. Today CPSC Tyson Kendon Updates Assignment 1 Assignment 2 Concept Review Scheduling Processes Concepts Algorithms.
Web Servers load balancing with adjusted health-check time slot.
Load Balancing and Data centers
Serve Assignment Policies
Dynamic Graph Partitioning Algorithm
Queuing Theory Non-Markov Systems
Presentation transcript:

1 Task Assignment with Unknown Duration Mor Harchol-Balter Carnegie Mellon

2 Distributed Server Load Balancer employs TAP (Task Assignment Policy): rule for assigning jobs to hosts Age-old Question: What’s a good TAP ? L.B Large # jobs

3 The Model L.B. Large # jobs FCFS Processing requirement (size) of job is not known. Jobs are not preemptible. Jobs queued at a host are processed in FCFS order. Hosts are identical. Motivation for model: Distributed servers for supercomputing, where each host is a multi-processor.

4 1. Round-Robin 2. Random 3. Shortest-Queue Send job to host with fewest number jobs. 4. Least-Work-Left Central-Queue Send job to host with Host grabs next job when free. least total work left. L.B Which TAP is best (given model)? “best” -- minimize mean waiting time

5 1. Round-Robin 2. Random 3. Shortest-Queue Send job to host with fewest number jobs. 4. Least-Work-Left Central-Queue Send job to host with Host grabs next job least total work left. when free. L.B Which TAP is best (given model)? “best” -- minimize mean waiting time Known: Optimal for exponentially- distributed sizes.

6 But real jobs do NOT have exponentially-distributed sizes! They have heavy-tailed sizes.

7 Unix process CPU lifetime measurements [Harchol-Balter, Downey TOCS 97] We measured over 1 million UNIX processes. Instructional, research, and sys. admin. machines. Job of cpu age x has probability 1/2 of using another x. Fraction of jobs with CPU duration > x Duration ( x secs) (log-log plot) Pr{Size > x } = 1 x

8 Bounded Pareto (heavy-tailed) distribution Properties: Decreasing Failure Rate Very high variance! Heavy-tail property -- Miniscule fraction (<1%) of the very largest jobs comprise half the load. 1 job size 0   Pr{}Sizexx  , 0 2 : degree of variability more variable & more heavy-tailed less variable & less heavy-tailed   minmax

9 1. Round-Robin 2. Random 3. Shortest-Queue Send job to host with fewest number jobs. 4. Least-Work-Left Central-Queue Send job to host with Host grabs next job least total work left. when free. L.B Which TAP is best for heavy-tailed job sizes? “best” -- minimize mean waiting time Known: Optimal for exponentially- distributed sizes.

10 The TAGS algorithm “Task Assignment by Guessing Size” When job at host j reaches size s j, then job is killed and restarted from scratch at host j+1 s3s3 s2s2 s1s1 Outside Arrivals Host 1 Host 2 Host 3 Host 4

11 3 Flavors of TAGS How to choose the cutoffs: s 1, s 2, s 3, … TAGS-opt-meanslowdown TAGS-opt-meanwaitingtime TAGS-opt-fairness

12 TAGS is counterintuitive TAGS wastes resources … non-workconserving Big jobs seem unfairly penalized … yet somehow turns out to be fair? TAGS always operates under unbalanced load.

13 Results of Analysis 2 hosts only -- system load =.5 Random Least-Work-Left TAGS-opt-fairness Random Least-Work-Left TAGS-opt-slowdown

14 Results of Analysis 2 hosts only -- system load =.5 Random Least-Work-Left TAGS-opt-waitingtime

15 More Results 4 hosts -- system load =.3 Random Least-Work-Left TAGS

16 More Results New metric: Server Expansion Server expansion = number of hosts we would have to add to system to get mean slowdown down to 2 or 3. (Initial system: 2 hosts, system load =.7) TAGS Least-Work-Left

17 WHY does TAGS work so well? 1) Reduction of variance of job size distribution 2) Load Unbalancing

18 WHY does TAGS work so well? 1) Reduction of variance of job size distribution: TAGS reduces variance of job size distribution at the hosts. No other policy does this!  EW E X ( - ) {} {} 2 21  Second moment of Job Size Distribution Recall, P-K formula for M/G/1 queue: FCFS Mean Waiting Time

19 WHY does TAGS work so well? 2) Load Unbalancing: All other policies aim to balance the load. TAGS unbalances load. TAGS-opt-slowdownTAGS-opt-fairness Host 1 Host 2 Host 1 This is fair? YES

20 Conclusion This research challenges our common wisdom: Load unbalancing may be better than load balancing. It may be worthwhile to waste resources by restarting a job from scratch at a new machine … even if the new machine has a much higher load than the original machine! A policy which appears to greatly penalize large jobs may actually be fair.