Coordinated Workload Scheduling A New Application Domain for Mechanism Design Elie Krevat.

Slides:



Advertisements
Similar presentations
Combinatorial Auction
Advertisements

Cognitive Radio Communications and Networks: Principles and Practice By A. M. Wyglinski, M. Nekovee, Y. T. Hou (Elsevier, December 2009) 1 Chapter 17 Auction-based.
Testing Linear Pricing Algorithms for use in Ascending Combinatorial Auctions (A5) Giro Cavallo David Johnson Emrah Kostem.
Load Balancing Parallel Applications on Heterogeneous Platforms.
Algorithmic mechanism design Vincent Conitzer
SDN + Storage.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
An Approximate Truthful Mechanism for Combinatorial Auctions An Internet Mathematics paper by Aaron Archer, Christos Papadimitriou, Kunal Talwar and Éva.
Multi-item auctions with identical items limited supply: M items (M smaller than number of bidders, n). Three possible bidder types: –Unit-demand bidders.
CPU Scheduling Questions answered in this lecture: What is scheduling vs. allocation? What is preemptive vs. non-preemptive scheduling? What are FCFS,
A Sufficient Condition for Truthfulness with Single Parameter Agents Michael Zuckerman, Hebrew University 2006 Based on paper by Nir Andelman and Yishay.
Seminar In Game Theory Algorithms, TAU, Agenda  Introduction  Computational Complexity  Incentive Compatible Mechanism  LP Relaxation & Walrasian.
6.853: Topics in Algorithmic Game Theory Fall 2011 Matt Weinberg Lecture 24.
Xavier León PhD defense
Evolution of Cooperative problem-solving in an artificial economy by E. Baum and I. Durdanovic presented by Quang Duong.
Towards Feasibility Region Calculus: An End-to-end Schedulability Analysis of Real- Time Multistage Execution William Hawkins and Tarek Abdelzaher Presented.
Mechanism Design. Overview Incentives in teams (T. Groves (1973)) Algorithmic mechanism design (Nisan and Ronen (2000)) - Shortest Path - Task Scheduling.
Multicast Networks Profit Maximization and Strategyproofness David Kitchin, Amitabh Sinha Shuchi Chawla, Uday Rajan, Ramamoorthi Ravi ALADDIN Carnegie.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
Combinatorial Auction. Conbinatorial auction t 1 =20 t 2 =15 t 3 =6 f(t): the set X  F with the highest total value the mechanism decides the set of.
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
High Performance Computing 1 Parallelization Strategies and Load Balancing Some material borrowed from lectures of J. Demmel, UC Berkeley.
A Local Facility Location Algorithm Supervisor: Assaf Schuster Denis Krivitski Technion – Israel Institute of Technology.
Competitive Analysis of Incentive Compatible On-Line Auctions Ron Lavi and Noam Nisan SISL/IST, Cal-Tech Hebrew University.
Efficient agent-based selection of DiffServ SLAs over MPLS networks Thanasis G. Papaioannou a,b, Stelios Sartzetakis a, and George D. Stamoulis a,b presented.
Truthfulness and Approximation Kevin Lacker. Combinatorial Auctions Goals – Economically efficient – Computationally efficient Problems – Vickrey auction.
Pipelined Two Step Iterative Matching Algorithms for CIOQ Crossbar Switches Deng Pan and Yuanyuan Yang State University of New York, Stony Brook.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
AN INTRODUCTION TO THE OPERATIONAL ANALYSIS OF QUEUING NETWORK MODELS Peter J. Denning, Jeffrey P. Buzen, The Operational Analysis of Queueing Network.
Performance Evaluation of Parallel Processing. Why Performance?
Optimal Power Control, Rate Adaptation and Scheduling for UWB-Based Wireless Networked Control Systems Sinem Coleri Ergen (joint with Yalcin Sadi) Wireless.
ROBUST RESOURCE ALLOCATION OF DAGS IN A HETEROGENEOUS MULTI-CORE SYSTEM Luis Diego Briceño, Jay Smith, H. J. Siegel, Anthony A. Maciejewski, Paul Maxwell,
Combinatorial Auctions By: Shai Roitman
Message-Passing for Wireless Scheduling: an Experimental Study Paolo Giaccone (Politecnico di Torino) Devavrat Shah (MIT) ICCCN 2010 – Zurich August 2.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
CPU Scheduling CSCI 444/544 Operating Systems Fall 2008.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
Algorithmic Mechanism Design: an Introduction Approximate (one-parameter) mechanisms, with an application to combinatorial auctions Guido Proietti Dipartimento.
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
1 Real-Time Scheduling. 2Today Operating System task scheduling –Traditional (non-real-time) scheduling –Real-time scheduling.
CSCI1600: Embedded and Real Time Software Lecture 24: Real Time Scheduling II Steven Reiss, Fall 2015.
An Evaluation of Partitioners for Parallel SAMR Applications Sumir Chandra & Manish Parashar ECE Dept., Rutgers University Submitted to: Euro-Par 2001.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.
Static Process Scheduling
Analysis of cooperation in multi-organization Scheduling Pierre-François Dutot (Grenoble University) Krzysztof Rzadca (Polish-Japanese school, Warsaw)
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Algorithmic Mechanism Design Shuchi Chawla 11/7/2001.
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Combinatorial Auction. A single item auction t 1 =10 t 2 =12 t 3 =7 r 1 =11 r 2 =10 Social-choice function: the winner should be the guy having in mind.
Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.
Determining Optimal Processor Speeds for Periodic Real-Time Tasks with Different Power Characteristics H. Aydın, R. Melhem, D. Mossé, P.M. Alvarez University.
A Low Interference Channel Assignment Algorithm for Wireless Mesh Networks Can Que 1,2, Xinming Zhang 1, and Shifang Dai 1 1.Department of Computer Science.
Parallel Molecular Dynamics A case study : Programming for performance Laxmikant Kale
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Xinbing Wang*, Qian Zhang**
CLUSTER COMPUTING.
CSE8380 Parallel and Distributed Processing Presentation
CPU SCHEDULING.
Virtual-Time Round-Robin: An O(1) Proportional Share Scheduler
Market-based Dynamic Task Allocation in Mobile Surveillance Systems
Combinatorial Auction
Combinatorial Auction
Combinatorial Auction
Combinatorial Auction
Presentation transcript:

Coordinated Workload Scheduling A New Application Domain for Mechanism Design Elie Krevat

Introduction Distributed systems becoming larger, more complex Nodes perform computation and storage tasks Workloads enter system and are distributed across nodes Clients run many workloads, can pay for resources Nodes service many workloads (not dedicated) System provides QoS guarantees: Performance – load balance workloads to faster free nodes Efficiency – minimize cycles wasted when tasks available Fairness – nodes share resources across workloads

Benefits of Shared Storage Why cluster?Scaling, cost, and management. Why share?Slack sharing, economies of scale, uniformity.

Throughput Performance Insulation in Shared Storage Each of n workloads on a server: Executes efficiently within its portion of time (timeslice) Ideally: gets ≥ 1/ n of its standalone performance In practice: within a fraction of the ideal Argon project [Wachs07] provides bounds on efficiency across workloads for one server Problems extending to many servers (cluster-style) Synchronized workloads need coordination of schedules Performance of system limited by slowest node

Timeslice challenges 140 ms Workload 1 Workload 2 Workload 3 Workload 1 Workload ms Workload 4 Workload 1 Workload ms Server A Server B Server C

6 Cluster-style Storage Systems Client Switch Storage Servers R R R R 1 2 Data Block Data Fragment Synchronized Read Client now sends next batch of requests

Environment Assumptions One client per workload Bounded number W of workloads, N of nodes Constant set of workloads to be scheduled But mechanism might support changing set Communication doesn’t interfere with computation/storage tasks

Workload Distribution Settings Two alternative workload distribution settings Setting I: Free Workload Assignment Workloads can be freely assigned to many nodes Example: Embarrassingly parallel distributed apps Problem: Determine best set of nodes to assign Setting II: Fixed Workload Assignment Workloads must be assigned to fixed set of nodes Example: Cluster-style storage Problem: Coordinate responses of nodes with better timeslice scheduling

Computing Environments with Monetary Incentives Workloads pay for resources: Weather forecasting Seismic measurement simulations of oil fields Distributed systems sell resources Supercomputing centers sell resources Shared infrastructures Grid computing Individually-owned computers sell spare cycles for $$ May not have single administrative domain

Why Mechanism Design? Central coordinator(s) lack per-node information Different performance capabilities and revenue models Enforce cooperation and global QoS Efficiency and fairness not always goals of players Reduce scheduling problems to general mechanism Scheduling coordinated workloads is hard (proof later) Divide scheduling problems across nodes Design mechanism to produce coordination

Outline Background and Motivation  Mechanism I: Free Workload Assignment Mechanism II: Fixed Workload Assignment Conclusions

Revenue Model: Free Assignment Clients pay nodes directly after task Payment is per-workload Amount depends on many factors: Speed of response Number of requests/computations per timeslot Clients may also pay fixed cost to central scheduler Workloads want the best and fastest nodes Central scheduler doesn’t know load/speed of nodes Nodes are greedy and want lots of workloads May lie about load/speed if asked directly System Goal: Assign workloads to nodes that will respond fastest

Mechanism Design: VCG Run auction to decide which M nodes to assign FIFO approach to scheduling each workload Can also run combinatorial auction on bundles Nodes respond with bids Valuations depend on speed and current load Same factors that affect final payment Apply Vickrey-Clarke-Groves mechanism First auction iteration finds top M bids Remove Node X, recompute top M bids Additional auction iteration not actually necessary Difference between X’s bid and M+1 st bid is payment May also normalize payments to share wealth over nodes

Mechanism Results Incentive compatible Nodes have no incentive to lie, since if they over-report valuation for workload they’ll still be paid true valuation Global efficiency (i.e., best allocation for workload) Related to general task allocation problem [Nisan99] k tasks allocated to n agents Goal is to minimize completion time of last assignment (make-span) Valuation of agent is negation of total time spent on tasks Approximation/randomized algorithms exist for CA

Outline Background and Motivation Mechanism I: Free Workload Assignment  Mechanism II: Fixed Workload Assignment Conclusions

Revenue Model: Fixed Assignment Nodes paid by system at every timestep Payment is part of mechanism payment scheme System wants quick resolution of workload requests Nodes need monetary incentives to schedule fully coordinated workloads efficiently and fairly All M nodes service workload in same timeslice Uncoordinated workloads not important System Goals: Enforce coordination of workloads per timeslice Achieve fair distribution of resources Achieve efficient schedule allocations

Coordination is hard WkldNodes 1A,B 2B,C 3C,D 4A,B,D Reduce Max Independent Set problem to problem of scheduling max # of fully coordinated workloads per timeslice For every Node x i that services a workload w i, then w i has a dependency edge to all other workloads serviced by x i NP-Complete, but approximation algorithms exist For above example, max independent set is {1,3} NodeWklds A1,4 B1,2,4 C2,3 D3,4

Properties of Schedule Allocations Two types of schedule allocations Basic quanta timeslice allocation a Longer sequence of timeslices a tot Set of workloads serviced by node n x is S x # / timeslice quanta allocated per workload w i is q i Total quanta count Q x for each node n x Delay between consecutive workload schedules in allocation a tot is schedule distance d i,k k refers to schedule instance in a tot Average schedule distance d i,avg, per-node is d x,avg Maximum schedule distance d i,max, per-node is d x,max

Formulas for Schedule Allocation Properties

Possible Payment Scheme Node is paid max of P credits for each scheduled time quanta No credits for uncoordinated schedule For every cycle of time that workload isn’t scheduled, payment decreases by c (c << P) Node is fined F if starves workload over a period of quanta greater than Q thr Using derived properties of schedule allocations, each node calculates payments

Mechanism Design: Open Research Problem Goal is to improve efficiency and fairness Nodes compute their best allocations (through heuristics) using payment scheme that rewards efficiency/fairness Send valuations to central scheduler General mechanism determines best global allocation But coordination is hard optimization problem May be better suited only for central scheduler Expected properties of a mechanism: Nodes are players No additional utility past payments? Auctioned good may be single or total allocations Tradeoff is ability to adapt to changing workloads vs. better assessment of efficient allocations over longer time

Outline Background and Motivation Mechanism I: Free Workload Assignment Mechanism II: Fixed Workload Assignment  Conclusions

Conclusions Distributed systems environments provide new applications for mechanism design Goals of better global performance, efficiency, fairness Not always shared by individual nodes Model and analysis of 2 different distribution settings Free workload assignment solved with VCG Fixed workload assignment still open problem Revenue model and goals of mechanism vary Payment functions use derived allocation properties Coordination of workloads is hard optimization problem Motivation for further research in related areas