Scheduling with Outliers Ravishankar Krishnaswamy (Carnegie Mellon University) Joint work with Anupam Gupta, Amit Kumar and Danny Segev.

Slides:



Advertisements
Similar presentations
Alexander Kononov Sobolev Institute of Mathematics Siberian Branch of Russian Academy of Science Novosibirsk, Russia.
Advertisements

On the Complexity of Scheduling
Minimizing Average Flow-Time Naveen Garg IIT Delhi Joint work with Amit Kumar, Jivi Chadha,V Muralidhara, S. Anand.
ECE 667 Synthesis and Verification of Digital Circuits
 Review: The Greedy Method
Greedy Algorithms.
Truthful Mechanism Design for Multi-Dimensional Scheduling via Cycle Monotonicity Ron Lavi IE&M, The Technion Chaitanya Swamy U. of Waterloo and.
GRAPH BALANCING. Scheduling on Unrelated Machines J1 J2 J3 J4 J5 M1 M2 M3.
Approximation Algorithms for Capacitated Set Cover Ravishankar Krishnaswamy (joint work with Nikhil Bansal and Barna Saha)
Online Scheduling with Known Arrival Times Nicholas G Hall (Ohio State University) Marc E Posner (Ohio State University) Chris N Potts (University of Southampton)
Progress in Linear Programming Based Branch-and-Bound Algorithms
Optimization of thermal processes2007/2008 Optimization of thermal processes Maciej Marek Czestochowa University of Technology Institute of Thermal Machinery.
A polylogarithmic approximation of the minimum bisection Robert Krauthgamer The Hebrew University Joint work with Uri Feige.
1 Better Scalable Algorithms for Broadcast Scheduling Ravishankar Krishnaswamy Carnegie Mellon University Joint work with Nikhil Bansal and Viswanath Nagarajan.
THE SINGLE MACHINE EARLY/TARDY PROBLEM* PENG SI OW & THOMAS E. MORTON IE Paper Presentation A. İrfan Mahmutoğulları *Ow, P. S., & Morton, T. E. (1989).
June 3, 2015Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
An O(1) Approximation Algorithm for Generalized Min-Sum Set Cover Ravishankar Krishnaswamy Carnegie Mellon University joint work with Nikhil Bansal (IBM)
Computational problems, algorithms, runtime, hardness
Tirgul 10 Rehearsal about Universal Hashing Solving two problems from theoretical exercises: –T2 q. 1 –T3 q. 2.
Parameterized Approximation Scheme for the Multiple Knapsack Problem by Klaus Jansen (SODA’09) Speaker: Yue Wang 04/14/2009.
1 Approximation Algorithms for Demand- Robust and Stochastic Min-Cut Problems Vineet Goyal Carnegie Mellon University Based on, [Golovin, G, Ravi] (STACS’06)
Rounding Technique We can simplify scheduling problem structure by rounding job sizes, release dates.
Truthful Mechanisms for One-parameter Agents Aaron Archer, Eva Tardos Presented by: Ittai Abraham.
CSE 421 Algorithms Richard Anderson Lecture 6 Greedy Algorithms.
1 Scheduling on Heterogeneous Machines: Minimize Total Energy + Flowtime Ravishankar Krishnaswamy Carnegie Mellon University Joint work with Anupam Gupta.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
1 Scheduling Jobs with Varying Parallelizability Ravishankar Krishnaswamy Carnegie Mellon University.
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
Distributed Combinatorial Optimization
1 Introduction to Approximation Algorithms Lecture 15: Mar 5.
Minimizing Flow Time on Multiple Machines Nikhil Bansal IBM Research, T.J. Watson.
Improved results for a memory allocation problem Rob van Stee University of Karlsruhe Germany Leah Epstein University of Haifa Israel WADS 2007 WAOA 2007.
Minimizing Makespan and Preemption Costs on a System of Uniform Machines Hadas Shachnai Bell Labs and The Technion IIT Tami Tamir Univ. of Washington Gerhard.
1 The Santa Claus Problem (Maximizing the minimum load on unrelated machines) Nikhil Bansal (IBM) Maxim Sviridenko (IBM)
Operational Research & ManagementOperations Scheduling Flow Shop Scheduling 1.Flexible Flow Shop 2.Flexible Assembly Systems (unpaced) 3.Paced Assembly.
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
Fair Allocation with Succinct Representation Azarakhsh Malekian (NWU) Joint Work with Saeed Alaei, Ravi Kumar, Erik Vee UMDYahoo! Research.
SVM by Sequential Minimal Optimization (SMO)
Bold Stroke January 13, 2003 Advanced Algorithms CS 539/441 OR In Search Of Efficient General Solutions Joe Hoffert
Yossi Azar Tel Aviv University Joint work with Ilan Cohen Serving in the Dark 1.
1 Chapter 5 Flow Lines Types Issues in Design and Operation Models of Asynchronous Lines –Infinite or Finite Buffers Models of Synchronous (Indexing) Lines.
Approximation schemes Bin packing problem. Bin Packing problem Given n items with sizes a 1,…,a n  (0,1]. Find a packing in unit-sized bins that minimizes.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
1 Server Scheduling in the L p norm Nikhil Bansal (CMU) Kirk Pruhs (Univ. of Pittsburgh)
Batch Scheduling of Conflicting Jobs Hadas Shachnai The Technion Based on joint papers with L. Epstein, M. M. Halldórsson and A. Levin.
Techniques for truthful scheduling Rob van Stee Max Planck Institute for Informatics (MPII) Germany.
Princeton University COS 423 Theory of Algorithms Spring 2001 Kevin Wayne Approximation Algorithms These lecture slides are adapted from CLRS.
1 Short Term Scheduling. 2  Planning horizon is short  Multiple unique jobs (tasks) with varying processing times and due dates  Multiple unique jobs.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Solving the Maximum Cardinality Bin Packing Problem with a Weight Annealing-Based Algorithm Kok-Hua Loh University of Maryland Bruce Golden University.
Dynamic Programming.  Decomposes a problem into a series of sub- problems  Builds up correct solutions to larger and larger sub- problems  Examples.
Algorithmic Mechanism Design Shuchi Chawla 11/7/2001.
Inapproximability of the Multi- Level Facility Location Problem Ravishankar Krishnaswamy Carnegie Mellon University (joint with Maxim Sviridenko)
1 Approximation Algorithms for Generalized Min-Sum Set Cover Ravishankar Krishnaswamy Carnegie Mellon University joint work with Nikhil Bansal and Anupam.
1 Approximation Algorithms for Generalized Scheduling Problems Ravishankar Krishnaswamy Carnegie Mellon University joint work with Nikhil Bansal, Anupam.
Problems in Combinatorial Optimization. Linear Programming.
Sorting Algorithms Written by J.J. Shepherd. Sorting Review For each one of these sorting problems we are assuming ascending order so smallest to largest.
CS 361 – Chapter 10 “Greedy algorithms” It’s a strategy of solving some problems –Need to make a series of choices –Each choice is made to maximize current.
Scheduling Parallel DAG Jobs to Minimize the Average Flow Time K. Agrawal, J. Li, K. Lu, B. Moseley.
Tuesday, March 19 The Network Simplex Method for Solving the Minimum Cost Flow Problem Handouts: Lecture Notes Warning: there is a lot to the network.
Linear program Separation Oracle. Rounding We consider a single-machine scheduling problem, and see another way of rounding fractional solutions to integer.
Data Driven Resource Allocation for Distributed Learning
Haim Kaplan and Uri Zwick
On Scheduling in Map-Reduce and Flow-Shops
Integer Programming (정수계획법)
Dynamic and Online Algorithms for Set Cover
Integer Programming (정수계획법)
Flexible Assembly Systems
Non-clairvoyant Precedence Constrained Scheduling
Presentation transcript:

Scheduling with Outliers Ravishankar Krishnaswamy (Carnegie Mellon University) Joint work with Anupam Gupta, Amit Kumar and Danny Segev

Introduction Classical Scheduling Problems – Given jobs and machines – Find best schedule according to some objective Simple Example – N jobs, M machines. – Job j has a processing time of p j – Find schedule of minimum makespan Minimize maximal load on any machine.

A possible issue What if there are some rogue jobs? – They dominate objective value – Algorithms focus on handling these – Ignore effects of others For example, – Straggler job might slow down response time of all jobs – If we discard that job, other jobs finish much faster – Commonly seen in computers

Overcoming this.. Ignore these rogue jobs Scheduling with outliers – Or possibly, scheduling without liars? More Formally – Each job comes with a penalty if we discard it – Discard a total penalty of R – Schedule the others to optimize given objective

Outliers vs “Prize-Collecting” Prize-Collecting Model – Penalty of jobs left out figures in objective function – Minimize objective of scheduled jobs + penalty of outliers Outlier Model – Hard bound on penalty – leave out some jobs, while scheduling the others – Both model similar concept – Prize-Collecting combines two different measures – Can solve PC if we solve outlier problem.

Problems Studied Makespan/Generalized Assignment – n jobs and m unrelated machines – Job j has processing time p ij and cost c ij on machine i – Job j also has penalty r j – Goal is to minimize makespan while leaving out jobs of total penalty R Non-Outlier Setting: (C,2T)-approximation algorithm

Problems Studied Weighted Sum of Completion Times – n jobs and m unrelated machines – Job j has processing time p ij on machine i – Job j also has penalty r j – Goal is to minimize average completion time of the jobs while leaving out jobs of total penalty R Non-Outlier Setting: 2-approximation algorithm

Problems Studied Average Flow Time – n jobs and m identical machines – Job j has processing time p j and arrival time a j – Goal is to minimize average flow time of the jobs F j = C j – a j or the time for which j is present in the system while leaving out jobs of total penalty R Non-Outlier Setting: O(log P)-approximation algorithm

Our Results Generalized Assignment / Makespan A deterministic [C(1+є), 3T] approximation algorithm Generalized Assignment / Makespan A deterministic [C(1+є), 3T] approximation algorithm Weighted Sum of Completion Times A randomized constant factor approximation algorithm for the general case An FPTAS in the case of single machine sum of completion times Weighted Sum of Completion Times A randomized constant factor approximation algorithm for the general case An FPTAS in the case of single machine sum of completion times Average Flow Time (Preemptive) A deterministic O(log P) approximation algorithm when all penalties are unit Average Flow Time (Preemptive) A deterministic O(log P) approximation algorithm when all penalties are unit

An LP Formulation Adapted from Garg and Kumar [ICALP 06] x jt :: extent of job j is scheduled in time slot [t,t+1] y j :: fraction of j scheduled f j ::fractional flow time of j

Rounding: Some Obstacles For sum of completion times and makespan – We can use ½ point of any job effectively Does not quite work for flow time (α C j – a j ) >> α (C j – a j ) Such techniques need “speed-up” of α Without speed-up, we really need to work inside LP schedule

How can the LP cheat? 2k2k 2 k-1 2 k …… 2 k+1 2k2k 2 k … M LP Schedule: fraction ½ of each large job in the corresponding gray intervals fraction 1 of each small job in the blue intervals LP Cost is roughly 2 k + M Requirement: k/2 + M jobs

How can the LP cheat? 2k2k 2 k-1 2 k …… 2 k+1 2k2k 2 k … M Integral Schedule: once jobs M + k/2 jobs are chosen, SRPT is optimal all small jobs will be chosen k/2 large jobs all wait for period of M Integral Cost is ( M.k ) Requirement: k/2 + M jobs Give up globally; Work locally

Rounding 1: Local Swap Consider two jobs of processing times 2 k Let y 1 and y 2 denote their fractional extents in LP To make the schedule integral, suppose we swap Δ fraction of J 2 with equal fraction of J 1 J1J1 J2J2 a1a1 a2a2 Observation: LP cost increase is roughly Δ (a 2 – a 1 ) Δ

Local Swap Continued Can perform such swaps and ensure that – Each time instant t is charged at most 1 in total Good if job sizes are powers of two – Any point charged is not empty time – Total charge is upper bounded by LP OPT – Can get desired O(log P)-approximation algorithm How do we handle fact that all jobs are not 2 k ?

Handling General Sizes Group jobs into buckets. Look at one such bucket If j 2 has larger processing time – There is sufficient space to replace it by equal fraction of j 1 – Same argument as in previous slide If j 2 has smaller processing time – Not enough space – Schedule j 2 over j 1 ! – Might violate the release date of j 2 Still no good..  J1J1 J2J2 a1a1 a2a2

A Not-so-local Swap What’s the Problem? – Grow j for long time charging intervals till fraction 2/3 – Then j sees smaller job j’ scheduled to 2/3 – j’ eats j, but we’re still left with 1/3 of j – Cycle repeats… A Fix – Don’t be local -- Look Ahead – Avoid such issues – More complex charging argument

Ingredient 2: A Local Shift To fix the release date issue – Look at any job class – Consider all the time intervals where we schedule that class jobs – Shift the schedule by 2 k entirely within this interval Unfinished jobs increase by 2 per class Total extra cost: O(log P) LP OPT

Wrapping Up O(log P) approximation algorithm – flow-time on single machine with unit penalties – can be extended to identical machines Other results – O(1) for weighted completion times and makespan What about flow time with non-uniform penalties? Outlier versions of other problems?

Thank You!