On Scheduling in Map-Reduce and Flow-Shops

Slides:



Advertisements
Similar presentations
Alexander Kononov Sobolev Institute of Mathematics Siberian Branch of Russian Academy of Science Novosibirsk, Russia.
Advertisements

Coordination Mechanisms for Unrelated Machine Scheduling Yossi Azar joint work with Kamal Jain Vahab Mirrokni.
Machine scheduling Job 1Job 3 Job 4 Job 5Machine 1 Machine 2 time 0C max Job 2.
CALTECH CS137 Fall DeHon 1 CS137: Electronic Design Automation Day 19: November 21, 2005 Scheduling Introduction.
Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
COT 4600 Operating Systems Fall 2009 Dan C. Marinescu Office: HEC 439 B Office hours: Tu-Th 3:00-4:00 PM.
Properties of SPT schedules Eric Angel, Evripidis Bampis, Fanny Pascual LaMI, university of Evry, France MISTA 2005.
Thomas Moscibroda Distributed Systems Research, Redmond Onur Mutlu
Scheduling with Outliers Ravishankar Krishnaswamy (Carnegie Mellon University) Joint work with Anupam Gupta, Amit Kumar and Danny Segev.
Spring, Scheduling Operations. Spring, Scheduling Problems in Operations Job Shop Scheduling. Personnel Scheduling Facilities Scheduling.
Online Algorithms Motivation and Definitions Paging Problem Competitive Analysis Online Load Balancing.
Krakow, Jan. 9, Outline: 1. Online bidding 2. Cow-path 3. Incremental medians (size approximation) 4. Incremental medians (cost approximation) 5.
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
CSE 421 Algorithms Richard Anderson Lecture 6 Greedy Algorithms.
1 Scheduling on Heterogeneous Machines: Minimize Total Energy + Flowtime Ravishankar Krishnaswamy Carnegie Mellon University Joint work with Anupam Gupta.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
Ecole Polytechnique, Nov 11, List Scheduling on Related Machines processors Related machines: machines may have different speeds  0.25 
Algorithms and Economics of Networks: Coordination Mechanisms Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Impact of Problem Centralization on Distributed Constraint Optimization Algorithms John P. Davin and Pragnesh Jay Modi Carnegie Mellon University School.
Assignment 4. (Due on Dec 2. 2:30 p.m.) This time, Prof. Yao and I can explain the questions, but we will NOT tell you how to solve the problems. Question.
Minimizing Flow Time on Multiple Machines Nikhil Bansal IBM Research, T.J. Watson.
Improved results for a memory allocation problem Rob van Stee University of Karlsruhe Germany Leah Epstein University of Haifa Israel WADS 2007 WAOA 2007.
Minimizing Makespan and Preemption Costs on a System of Uniform Machines Hadas Shachnai Bell Labs and The Technion IIT Tami Tamir Univ. of Washington Gerhard.
1 The Santa Claus Problem (Maximizing the minimum load on unrelated machines) Nikhil Bansal (IBM) Maxim Sviridenko (IBM)
Approximation Algorithms for Stochastic Combinatorial Optimization Part I: Multistage problems Anupam Gupta Carnegie Mellon University.
Throughput Competitive Online Routing Baruch Awerbuch Yossi Azar Serge Plotkin.
Yossi Azar Tel Aviv University Joint work with Ilan Cohen Serving in the Dark 1.
Approximation schemes Bin packing problem. Bin Packing problem Given n items with sizes a 1,…,a n  (0,1]. Find a packing in unit-sized bins that minimizes.
1 Server Scheduling in the L p norm Nikhil Bansal (CMU) Kirk Pruhs (Univ. of Pittsburgh)
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
Scheduling policies for real- time embedded systems.
Packing Rectangles into Bins Nikhil Bansal (CMU) Joint with Maxim Sviridenko (IBM)
Competitive Queue Policies for Differentiated Services Seminar in Packet Networks1 Competitive Queue Policies for Differentiated Services William.
Approximation Schemes Open Shop Problem. O||C max and Om||C max {J 1,..., J n } is set of jobs. {M 1,..., M m } is set of machines. J i : {O i1,..., O.
Outline Introduction Minimizing the makespan Minimizing total flowtime
Towards Proactive Replanning for Multi-Robot Teams Brennan Sellner and Reid Simmons 5th International Workshop on Planning and Scheduling for Space October.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
Network-Aware Scheduling for Data-Parallel Jobs: Plan When You Can
Advanced Operating Systems - Spring 2009 Lecture 14 – February 25, 2009 Dan C. Marinescu Office: HEC 439 B. Office.
Static Process Scheduling
Analysis of cooperation in multi-organization Scheduling Pierre-François Dutot (Grenoble University) Krzysztof Rzadca (Polish-Japanese school, Warsaw)
Introductory Seminar on Research CIS5935 Fall 2008 Ted Baker.
Concurrency and Performance Based on slides by Henri Casanova.
Scheduling Parallel DAG Jobs to Minimize the Average Flow Time K. Agrawal, J. Li, K. Lu, B. Moseley.
Clustering Data Streams
Networks and Operating Systems: Exercise Session 2
CHAPTER 8 Operations Scheduling
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Mapping Techniques Dr. Xiao Qin Auburn University.
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Data Partition Dr. Xiao Qin Auburn University.
Maximum Matching in the Online Batch-Arrival Model
Haim Kaplan and Uri Zwick
Process Scheduling B.Ramamurthy 9/16/2018.
Greedy Algorithms / Interval Scheduling Yin Tat Lee
CSCI1600: Embedded and Real Time Software
Dan C. Marinescu Office: HEC 439 B. Office hours: M, Wd 3 – 4:30 PM.
COT 4600 Operating Systems Spring 2011
Lecture 11 Overview Self-Reducibility.
Scheduling Algorithms to Minimize Session Delays
Coarse Grained Parallel Selection
Process Scheduling B.Ramamurthy 2/23/2019.
A Unified Approach to Approximating Resource Allocation and Scheduling
Selfish Load Balancing
Process Scheduling B.Ramamurthy 4/11/2019.
Process Scheduling B.Ramamurthy 4/7/2019.
Data Placement Problems in Database Applications
Clustering.
Zero-Skew Trees Zero-Skew Tree: rooted tree in which all root-to-leaf paths have the same length Used in VLSI clock routing & network multicasting.
List Scheduling Given a list of jobs (each with a specified processing time), assign them to processors to minimize makespan (max load) In Graham’s notation:
CSCI1600: Embedded and Real Time Software
Non-clairvoyant Precedence Constrained Scheduling
Presentation transcript:

On Scheduling in Map-Reduce and Flow-Shops Tamás Sarlós Yahoo! Research Joint work with: Ben Moseley, Anirban Dasgupta, Ravi Kumar

Model Related work in scheduling Results Proof Open problems Overview Model Related work in scheduling Results Proof Open problems 9/20/2018

Map-Reduce A Map-Reduce job is a set of map and reduce tasks 9/20/2018

A Typical Cluster 9/20/2018

Life of a Job 9/20/2018

Model A Map-Reduce job is a set of map and reduce tasks may have an arrival time Schedule Map tasks run on map machines Reduce tasks run on reduce machines Reduce tasks of a job J are scheduled after all maps of J are completed Each task runs on one machine Jobs are independent 9/20/2018

Simplifications We ignore The shuffle phase The network topology Data locality stylized unrelated machines instead Space usage of intermediate data emitted by maps Machine, network, and task failures Dependencies between jobs Strategic behavior of users, etc. Task run times are assumed to be known in advance No speculative execution 9/20/2018

Background - Scheduling Objectives Metrics of interest in a shared cluster Total completion time of jobs Total flow (= completion – arrival) time Not makespan Known: Shortest Remaining Processing Time is optimal for flow time on a single machine with preemption Flow time on identical parallel machines with or without preemption is NP hard, no O(1) approx [GK 07, LR 07] Thus flow time is hopeless? No, we show 2 approaches to work it around 9/20/2018

Background - Online Identical Machines We allow preemption Competitive ratio: flow time of online schedule / offline schedule Strong lower bound for competitive ratio for flow time log( min{P, n/N} ) N machines, n jobs, 1 task/job, P = max/min runtime [LR ‘07] Resource augmentation Our machines have 1+ε speed 1+ε speed O(1) competitive algorithm for flow time [AA ‘07] 9/20/2018

Background - Flow Shops Flow shop = Map-Reduce with 1 map and 1 reduce task / job Our model = Generalized 2 stage flexible flow shop Makespan in flow shops Johnson’s algorithm for 1 map & 1 reduce machine PTAS by Schuurman & Woeginger for multiple machines (flexible FS) Prior results on total completion time in flow shops Trivial 2 approximation for 1 map & 1 reduce machine [GS ‘78] None for multiple machines Flow time is still hard 9/20/2018

Results – Offline Identical Machines All jobs arrive at time 0 12 approx for total completion time Simulate shortest job first for map tasks on a single NM speed machine Simulate SJF for reduce task on a single NR speed machine Width of a job := max { map finish time, reduce finish time, length of longest task } Run map tasks by width increasing For each job delay reduce tasks till time 2 * width Run reduces by width increasing 9/20/2018

Proof Sketch All tasks finish by availability + 2 * width  Reduce finishes by 4 * width Σ width ≤ Σ map completion time + Σ reduce completion time + Σ length of longest task ≤ 3 * OPT Just mention busy schedule argument 9/20/2018

Results – Online Identical Machines Simulate shortest remaining processing time for map and reduce tasks on a single NM and NR speed machine If all simulated maps and reduces of job J are finished then Job width := max { length of longest task, max{map finish time, reduce finish time} – arrival time } k := log of width // Online load balancing Assign map tasks to minimize imbalance in class k map work else if all maps finished in the new schedule Assign reduce tasks to minimize imbalance in class k work end if On each map and reduce machine run the tasks whose job has minimum width Note that this is an online schedule 9/20/2018

Results – Online Identical Machines Theorem: The previous slide is an 1+ε speed competitive online algorithm for total flow time Remark: Note that ε appears in the analysis only 9/20/2018

Results – QPTAS for flow shops 1 map and reduce task / job Offline, arrival times, total completion time Techniques are based on Afrati et al. ’99 Main ideas Time intervals of size (1+ε)t Round processing times to (1+ε) powers Structural modifications let us schedule most jobs when they are small compared to the interval Guess the order of the rest Dynamic program indexed by count vectors of arrived, partially, and completely done jobs in an interval Use an algorithm for makespan as a subroutine for testing feasibility of scheduling an interval Tell that QPTAS comes from the number of distinct jobs types 9/20/2018

Open Problems 1 Evaluation Rent or buy a cluster Discrete event simulation Hadoop src/contrib/mumak We lose a factor 2 if there is no distinction between map and reduce machines. Can we do better? PTAS for flow shops irrespective of processing times? Is there a PTAS for multiple map and reduce tasks / job for identical machines? Is there an (1+ε) speed 1/ε or O(1) competitive online algorithm? Evaluation is TODO, not open 9/20/2018

Open Problems 2 – Unrelated Machines Run time of a task varies per machine arbitrarily Standard in scheduling, this can model Minimum memory requirements “Data locality” Theorem: Assuming 1 map and 1 reduce task per job There is a 6 competitive offline algorithm for total completion time There exists a (1+ε) speed competitive online algorithm for total flow time with arrival times Hard with multiple tasks / jobs We need a more realistic model, undo simplifications Challenge the audience for an easier and more realistic model 9/20/2018

Thank you! 9/20/2018