Bag-of-Tasks Scheduling under Budget Constraints Ana-Maria Oprescu, Thilo Kielman Presented by Bryan Rosander.

Slides:



Advertisements
Similar presentations
CPU Scheduling.
Advertisements

Hadi Goudarzi and Massoud Pedram
SLA-Oriented Resource Provisioning for Cloud Computing
Introduction Background Knowledge Workload Modeling Related Work Conclusion & Future Work Modeling Many-Task Computing Workloads on a Petaflop Blue Gene.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Chapter 5 CPU Scheduling. CPU Scheduling Topics: Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling.
Scheduling in Batch Systems
CS 3013 & CS 502 Summer 2006 Scheduling1 The art and science of allocating the CPU and other resources to processes.
Chapter 6: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Chapter 6: CPU Scheduling Basic.
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
Wk 2 – Scheduling 1 CS502 Spring 2006 Scheduling The art and science of allocating the CPU and other resources to processes.
Job scheduling Queue discipline.
Ana-Maria Oprescu, Thilo Kielmann (Vrije University) Presented By Gal Cohen Cloud Computing Seminar CS Technion, Spring 2012.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
Chapter 5: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Basic Concepts Maximum CPU utilization.
OPERATING SYSTEMS CPU SCHEDULING.  Introduction to CPU scheduling Introduction to CPU scheduling  Dispatcher Dispatcher  Terms used in CPU scheduling.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Fault-Tolerant Workflow Scheduling Using Spot Instances on Clouds Deepak Poola, Kotagiri Ramamohanarao, and Rajkumar Buyya Cloud Computing and Distributed.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
SOFTWARE / HARDWARE PARTITIONING TECHNIQUES SHaPES: A New Approach.
Scheduling policies for real- time embedded systems.
Operating Systems Process Management.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
Lecture 7: Scheduling preemptive/non-preemptive scheduler CPU bursts
Chapter 5: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 5: CPU Scheduling Basic Concepts Scheduling Criteria.
1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.
Chapter 5: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Chapter 5: CPU Scheduling Basic.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
1 Real-Time Scheduling. 2Today Operating System task scheduling –Traditional (non-real-time) scheduling –Real-time scheduling.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
1 CS.217 Operating System By Ajarn..Sutapart Sappajak,METC,MSIT Chapter 5 CPU Scheduling Slide 1 Chapter 5 CPU Scheduling.
6.1 CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time Scheduling Algorithm Evaluation.
Scalable and Coordinated Scheduling for Cloud-Scale computing
University of Texas at Arlington Scheduling and Load Balancing on the NASA Information Power Grid Sajal K. Das, Shailendra Kumar, Manish Arora Department.
Operating Systems Scheduling. Scheduling Short term scheduler (CPU Scheduler) –Whenever the CPU becomes idle, a process must be selected for execution.
Lecture 4 CPU scheduling. Basic Concepts Single Process  one process at a time Maximum CPU utilization obtained with multiprogramming CPU idle :waiting.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
CPU Scheduling G.Anuradha Reference : Galvin. CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Real-Time.
CPU scheduling.  Single Process  one process at a time  Maximum CPU utilization obtained with multiprogramming  CPU idle :waiting time is wasted 2.
Basic Concepts Maximum CPU utilization obtained with multiprogramming
1 Chapter 5: CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms.
Resource Provision for Batch and Interactive Workloads in Data Centers Ting-Wei Chang, Pangfeng Liu Department of Computer Science and Information Engineering,
Green cloud computing 2 Cs 595 Lecture 15.
Wayne Wolf Dept. of EE Princeton University
Chapter 6: CPU Scheduling
Anne Pratoomtong ECE734, Spring2002
CPU Scheduling G.Anuradha
Module 5: CPU Scheduling
Lecture 21: Introduction to Process Scheduling
3: CPU Scheduling Basic Concepts Scheduling Criteria
Chapter5: CPU Scheduling
CSE8380 Parallel and Distributed Processing Presentation
Chapter 5: CPU Scheduling
Process scheduling Chapter 5.
Chapter 6: CPU Scheduling
CPU SCHEDULING.
Chapter 6: CPU Scheduling
Chapter 5: CPU Scheduling
Planning and Scheduling
Lecture 21: Introduction to Process Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Presentation transcript:

Bag-of-Tasks Scheduling under Budget Constraints Ana-Maria Oprescu, Thilo Kielman Presented by Bryan Rosander

Bag-of-Tasks Background Bag-of-Task problems are common and computationally expensive Condor provides a framework for "High-Throughput Computing" o Opportunistic batch processing  Use idle computing resources with little to no user intervention o Condor project started in 1988 o Allowed both scientists and small businesses to take advantage of already purchased computing power

Cloud Computing? Well suited to cloud computing Pay-for-use computing power o Allows users to choose from several different classes of computing power at different price points o Makes computationally intensive tasks possible for those that can't afford data-centers or supercomputers Cloud services don't offer much guidance May not know computing cost characteristics of each job

Budget-Aware Cloud Batch Processing Cloud Computing makes high throughput computing feasible for those without the resources to purchase the hardware o Allows users to choose from several different classes of computing power BaTS is a "budget-constrained scheduler" o Capable of scheduling large bags of tasks o Can utilize multiple clouds with different characteristics o Does not need prior knowledge about tasks or completion times o Will complete tasks within given budget or terminate when it is determined unfeasible o Will attempt to minimize run-time without violating budget constraint

BaTS - The Algorithm Assumptions: o Tasks in a bag are independent of each-other o Tasks can be preempted if necessary to reconfigure cloud environment o There is some unknown distribution of execution times o We know the number of tasks to be executed o Machines belong to multiple categories and machines within the same category are homogeneous o Category pricing is available and consistent "Scheduling large bags of tasks onto multiple cloud platforms" BaTS is run on a master machine (can be outside cloud environment)

Sampling Phase Sampling with replacement done on a per cluster level with a subset of size n n must be <= 0.05 * N in practice (actual upper bound given by below formula) Use modified cumulative moving average of task execution times from sample

Goal: Dynamic programming:

Rest of Run Update plan at regular intervals (at least 5 minutes apart) Continual refinement of estimated task completion time Ensure that machines do not become under utilized Machine cost incurred at start of ATU but jobs may not finish until end o Must look at how many tasks will still be undone when each machine runs out of time

Use previous cumulative average for time values, using this formula for unfinished tasks Use formulas on the left to see if the tasks should be finished within the constraints

Testing Emulated different types of clouds on DAS-3 (Distributed ASCI Supercomputer 3) multi-cluster system Requests for machines are handled realistically, with significant delay Medium size workload o 1000 tasks with normal distribution with mean minutes 15 and std deviation as the square root of 5 minutes o Cluster 0 is $3 per machine per ATU, cluster 1 varies as follows  S1-1 is the same in price and performance  S1-4 is the same price, 4 times as fast  S4-1 is 4 times as expensive, with same speed  S3-4 is 3 times as expensive and 4 times as fast  S4-3 is 4 times as expensive and 3 times as fast

Conclusions BaTS winds up with slower execution times than RR (round robin) given the same budget (due to the sampling phase) BaTS is capable of staying within budget when possible or terminating early if not When given smaller budgets, BaTS is cheaper but slower than RR Would be helpful to find a way to suggest suitable budgets for tasks High complexity of algorithm would be prohibitive to drastically increasing number of classes of computers, number of possible workers