Download presentation
Presentation is loading. Please wait.
Published byAmelia Cox Modified over 8 years ago
1
Holding slide prior to starting show
2
Scheduling Parametric Jobs on the Grid Jonathan Giddy J.P.Giddy@wesc.ac.uk
3
Parametric computation Scientifically: –Study the behaviour of output variables against a range of different input scenarios Computationally: –Execute an application multiple times, each time with a different combination of input parameters
4
Why use the Grid? Parametric computations –Require high performance computational resources –Require large numbers of computational resources –Generate large amounts of concurrency –Generate uncoupled computations –Tolerate high latencies
5
Nimrod/G CostDeadline
6
Minimise Cost Increasing price 7 4 1 8 5 2 6 3 Node 4 Node 3 Node 2 Node 1 Time Jobs Budget Cost 8 20 0 7 19 1 6 18 2 5 17 3 4 15 5 3 13 7 2 11 9 1 8 12 0 5 15
7
7 Minimise Time Increasing price 4 2 1 6 5 3 Node 4 Node 3 Node 2 Node 1 Time Jobs Budget Cost Budget / Job 8 20 0 2.5 7 19 1 2.71 6 17 3 2.83 5 16 4 3.2 4 13 7 3.25 3 11 9 3.67 2 8 12 4.0 1 4 16 4.0 0 20 8
8
Globus 1.1 GRAM API int globus_gram_client_job_check( char *resource_manager_contact, const char *description, const float conf_percentage, globus_gram_client_time_t *estimate, globus_gram_client_time_t *interval) Note: This is not yet implemented This function returns an estimate of the time it would take for a job of the description provided to reach an ACTIVE state.
9
Historical profiling Examine characteristics of all jobs in queue against historical profiles in order to determine expected start time of a job Returns start time and error estimate Warren Smith, Ian T. Foster, Valerie E. Taylor: Predicting Application Run Times Using Historical Information. Job Scheduling Strategies for Parallel Processing Workshop (JSSPP) 1998: 122-142
10
Information Overload Too many variables: –Number of CPUs –CPU speed –Processor architecture –Operating system –Real memory –Disk speed –Bandwidth –Latency –Other users
11
Extrapolation of completion rate A B C 2 jobs/hr 3 jobs/hr 6 jobs/hr 1 hr2 hr
12
0 10 20 30 40 50 60 70 80 02.557.51012.51517.520 Time Average No. Processors 20 Hour deadline 15 hour deadline 10 hour deadline
13
Assumptions Compute time >> Network time All jobs are the same length on any particular resource Price of a resource is constant over time Not much wriggle room during the end- game –Both scheduling schemes push up against the limit that they’re not minimising –Heuristic nature of completion time
14
What we really want… Guaranteed completion time –globus_gram_client_job_check() with teeth –Requires scheduler to internally reserve space for job in advance Advance reservation –As above, but with external interface
15
And this too… A real grid economy –Incentive for providers to provide resources –Incentive for consumers to describe requirements accurately –Incentive for consumers to use resources judiciously –Price mechanism budget as a timely global information parameter universally understood enables trade-offs in making QoS decisions
16
A final point Optimising is really hard in a wide area network –Requires centralised decision maker –Information is missing –Information is not contemporaneous –Information is out-of-date
17
Scalable information …is slow to change Budget and deadline are (relatively) constant and can be propagated far and wide in a timely manner Slow information comes from specifying requirements in the real world Satisfying (instead of optimising) a requirement is relatively simple –A resource can so it does –A resource can’t so it doesn’t
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.