Download presentation
Presentation is loading. Please wait.
1
On Economics and the User-Scheduler Relationship in HPC and Grid Systems Cynthia Bailey Lee Advisor: Allan E. Snavely Department of Computer Science and Engineering San Diego Supercomputer Center University of California, San Diego November 14, 2007
2
Introduction The job submission routine: Edit job script, including resources needed and amount of time requested Submit job—typically, many questions remain: Did I request enough time? How long will the job wait in the queue? Eventually, job runs—more questions: I submitted to a ‘high-priority’ queue—was my wait time actually shorter than if I hadn’t? By how much? Was it worth it? Is this a satisfying relationship for either party? Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
3
Contributions of This Work 1)Falsified “The Padding Hypothesis” as the sole explanation for users’ inaccurate runtime requests 2)Quantified users’ valuation of turnaround by collecting actual users’ utility curves 3)Proposed a model for synthetically generating utility functions that draws on patterns seen in the actual user curves 4)A genetic algorithm-based scheduler that uses aggregate utility as an explicit objective function Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
4
“The Padding Hypothesis” The inaccuracy of users’ requested runtimes, relative to the actual runtime of jobs, is explained by users explicitly “padding” otherwise accurate runtime estimates in order to avoid the possibility of being killed by the scheduler. Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler SDSC users were asked to provide a “no-kill”/no-pressure estimate, with prizes for being accurate Users are able to self-identify as more or less accurate % Decrease 72
5
What is a Utility Function? 8 am 12–1pm 5 pm 8 am 9 am time u(t) Other factors: coordinate with other grid sites or sensors, paper deadlines, weather and hurricane prediction, … Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
6
Real Users' Functions Randomly-selected users of SDSC systems provided these data points for jobs they were submitting Utility is in terms of the SDSC charge unit (“SU”) Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
7
More Real Users' Functions Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
8
Model for Generating Synthetic Utility Functions Expected Linear Expected Exponential Step “Expected” refers to the fact that each point is chosen randomly (i.e. Most won't follow the pattern as cleanly as shown here) Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
9
Genetic Algorithm Scheduler Individuals: –permutations of the job queue ordering Mutation: –swap two randomly-selected jobs Reproduction: –zipper-like merging of parents (skip duplicates) Fitness: global utility of resulting schedule (approx.) J1J1 J2J2 J3J3 J4J4 J5J5 J1J1 J2J2 J3J3 J4J4 J5J5 J1J1 J2J2 J3J3 J4J4 J5J5 += J1J1 J1J1 J2J2 J2J2 J3J3 J1J1 J2J2 J3J3 J4J4 J5J5 Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
10
GA scheduler and Inaccurate Runtimes Schedulers compared: CONS = Conservative Backfilling EASY = Aggressive Backfilling PRIO = Priority FIFO (typical supercomputer priority scheduler) GA = genetic algorithm Workload is SDSC-BLUE from the Parallel Workloads Archive (Dror Feitelson) Load modified by scaling inter-arrival times Regular Load Heavy Load Introduction Runtime Inaccuracy Utility Functions Utility Model Scheduler
11
Related Work Very abbreviated—please see my publications for much more complete references Runtime request inaccuracy: Mu’alem and Feitelson 2001, Tsafrir 2005, Cirne and Berman 2001, Ward, Mahood and West 2002, (workaround via queue prediction) Brevik, Nurmi, Wolski 2006, (mitigate via grace period) Chiang, Arpaci-Dusseau and Vernon 2002 Utility functions: Chun and Culler 2004, Irwin, Grit and Chase 2004, Chen and Muhlethaler 1996 Market approach to scheduling: Wolski, Plank, Brevik and Bryan 2001, Stoica, Abdel-Wahab and Pothen 1995, Buyya, “Spawn” system: Waldburger, Hogg et al. 1992, “Tycoon” system: Lai et al. 2004, Popovici and Wilkes @ SC05, Singh, Kesselman, Deelman 2007, many, many more… Genetic algorithm in parallel job scheduling: Franke 2006 (many, many more in other scheduling domains)
12
For more information… Inaccurate runtime requests survey: Lee, C., Y. Schwartzman, J. Hardy, A. Snavely. “Are user runtime estimates inherently inaccurate?” Workshop on Job Scheduling Strategies for Parallel Processing, with SIGMETRICS, June 2004. Survey collecting SDSC users' utility curves: Lee, C. and A. Snavely. "On the User-Scheduler Dialogue: Studies of User-Provided Runtime Estimates and Utility Functions." International Journal of High Performance Computing Applications, vol. 20, 2006. Genetic algorithm scheduler and model for generating synthetic utility curves: Lee, C. and A. Snavely. “Precise and Realistic Utility Functions for User-Centric Performance Analysis of Schedulers.” HPDC-16, June 2007. Contact: Cynthia Lee, CL@SDSC.EDU
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.