Scheduling Architecture and Algorithms within ICENI Laurie Young, Stephen McGough, Steven Newhouse, John Darlington London e-Science Centre Department.

Slides:



Advertisements
Similar presentations
L ondon e-S cience C entre Application Scheduling in a Grid Environment Nine month progress talk Laurie Young.
Advertisements

Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Condor use in Department of Computing, Imperial College Stephen M c Gough, David McBride London e-Science Centre.
WS-JDML: A Web Service Interface for Job Submission and Monitoring Stephen M C Gough William Lee London e-Science Centre Department of Computing, Imperial.
ICENI: An Open Grid Services Architecture Implemented with Jini William Lee, Nathalie Furmento, Anthony Mayer, Steven Newhouse and John Darlington London.
Dialogue Policy Optimisation
H EURISTIC S OLVER  Builds and tests alternative fuel treatment schedules (solutions) at each iteration  In each iteration:  Evaluates the effects of.
Graph Theory ITEC 320 Lecture 21. Graph Theory Review Higher level usage of pointers –Factories –Flyweight –Disk pool Rationale Benefits / Downsides.
A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Communication Pattern Based Node Selection for Shared Networks
The Community Authorisation Service – CAS Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.1 Introduction.
WSN Simulation Template for OMNeT++
Workload Management Massimo Sgaravatto INFN Padova.
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Lecture 3: Computer Performance
EstiNet Network Simulator & Emulator 2014/06/ 尉遲仲涵.
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing.
Future UK e-Science Grid Middleware Dr Steven Newhouse London e-Science Centre Department of Computing, Imperial College London.
ICENI Overview & Grid Scheduling Laurie Young London e-Science Centre Department of Computing, Imperial College.
DISTRIBUTED COMPUTING
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Service-enabling Legacy Applications for the GENIE Project Sofia Panagiotidi, Jeremy Cohen, John Darlington, Marko Krznarić and Eleftheria Katsiri.
20 October 2006Workflow Optimization in Distributed Environments Dynamic Workflow Management Using Performance Data David W. Walker, Yan Huang, Omer F.
Using ICENI to run parameter sweep applications across multiple Grid resources Murtaza Gulamali Stephen McGough, Steven Newhouse, John Darlington London.
Your university or experiment logo here Caitriana Nicholson University of Glasgow Dynamic Data Replication in LCG 2008.
Batch Scheduling at LeSC with Sun Grid Engine David McBride Systems Programmer London e-Science Centre Department of Computing, Imperial College.
Grid Workload Management Massimo Sgaravatto INFN Padova.
Performance Architecture within ICENI Dr Andrew Stephen M c Gough Laurie Young, Ali Afzal, Steven Newhouse and John Darlington London e-Science Centre.
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Trust Propagation using Cellular Automata for UbiComp 28 th May 2004 —————— Dr. David Llewellyn-Jones, Prof. Madjid Merabti, Dr. Qi Shi, Dr. Bob Askwith.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Laboratório de Instrumentação e Física Experimental de Partículas GRID Activities at LIP Jorge Gomes - (LIP Computer Centre)
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
GridSAM - A Standards Based Approach to Job Submission Through Web Services William Lee and Stephen McGough London e-Science Centre Department of Computing,
October, 2000.A Self Organsing NN for Job Scheduling in Distributed Systems I.C. Legrand1 Iosif C. Legrand CALTECH.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
“Grids and eScience” Mark Hayes Technical Director - Cambridge eScience Centre GEFD Summer School 2003.
The UK eScience Grid (and other real Grids) Mark Hayes NIEeS Summer School 2003.
GridPP Presentation to AstroGrid 13 December 2001 Steve Lloyd Queen Mary University of London.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
A Software Framework for Distributed Services Michael M. McKerns and Michael A.G. Aivazis California Institute of Technology, Pasadena, CA Introduction.
Morgan Kaufmann Publishers
Predictable Workflow Deployment Service Stephen M C Gough Ali Afzal, Anthony Mayer, Steven Newhouse, Laurie Young London e-Science Centre Department of.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
RealityGrid Peter Coveney 1 and John Brooke 2 1. Centre for Computational Science, Department of Chemistry, Queen Mary, University of London 2. Manchester.
Utility Computing: Security & Trust Issues Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
A local search algorithm with repair procedure for the Roadef 2010 challenge Lauri Ahlroth, André Schumacher, Henri Tokola
Performance guided scheduling in GENIE through ICENI
BIT 3193 MULTIMEDIA DATABASE CHAPTER 5 : MULTIMEDIA DATABASE MANAGEMENT SYSTEM ARCHITECTURE.
Mapping of Scientific Workflow within the e-Protein project to Distributed Resources London e-Science Centre Department of Computing, Imperial College.
RealityGrid: An Integrated Approach to Middleware through ICENI Prof John Darlington London e-Science Centre, Imperial College London, UK.
Workflow Enactment in ICENI Dr Andrew Stephen M C Gough Laurie Young, Ali Afzal, Steven Newhouse and John Darlington London e-Science Centre 2 nd September.
GridQTL High Performance QTL analysis via the Grid/Cloud.
Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy
2004 Queue Scheduling and Advance Reservations with COSY Junwei Cao Falk Zimmermann C&C Research Laboratories NEC Europe Ltd.
18 May 2006CCGrid2006 Dynamic Workflow Management Using Performance Data Lican Huang, David W. Walker, Yan Huang, and Omer F. Rana Cardiff School of Computer.
Virtual Organisation Management in the Level 2 Grid Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College.
Workload Management Workpackage
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
Multiple Views of Workflow through ICENI
Laura Bright David Maier Portland State University
Wide Area Workload Management Work Package DATAGRID project
Presented By: Darlene Banta
Normal Form (Matrix) Games
Presentation transcript:

Scheduling Architecture and Algorithms within ICENI Laurie Young, Stephen McGough, Steven Newhouse, John Darlington London e-Science Centre Department of Computing, Imperial College London

Contents ICENI –Scheduling Architecture Scheduling Algorithms –Variety of different algorithms Experimental Results –Different policies –Different Grid sizes –Different Application Profiles

ICENI The Iceni, under Queen Boudicca, united the tribes of South-East England in a revolt against the occupying Roman forces in AD60. IC e-Science Networked Infrastructure Developed by LeSC Grid Middleware Group Collect and provide relevant Grid meta-data Use to define and develop higher-level services Interaction with other frameworks: OGSA, Jxta etc.

Component Applications Factory Design Generator Analyser Mesh Generator DRACS Mesh Generator Mesh Generator DRACS Each job is composed of multiple components. Each runs on a different resource Each component is connected to at least one other component. Data is passed along these connections

ICENI Scheduling Architecture ICENI Launching Framework Condor Launcher Globus Launcher Scheduling Framework Simulated Annealing Game Theory Schedule Evaluation Performance Repository Performance Model Statistical Prediction ICENI Scheduling Services Launching Framework Pluggable Launchers (SGE, Globus, Condor, ICENI) Scheduling Framework Pluggable Schedulers (Simulated Annealing, Game Theory Random, Best of n Random) Performance Framework Pluggable Performance Repositories (Perf. Models, Statistical Analysis)

Use a Benefit Function. Also called a Utility Function or Evaluation Function. A Benefit Function maps the metrics we are interested in to a single Benefit Value. Different benefit functions represent different optimisation preferences. Can set benefit to 0 if constraints (e.g. Budget) exceeded. Schedule Evaluation

Random / Best of n Random Random Best of n Random Random Scheduler Randomly selects a schedule Checks schedule can be executed Produces schedules very quickly Best of n Random Produces multiple random schedules Returns the best one Still very fast Better results than the random schedules

Simulated Annealing Monte Carlo method Generate schedule at random Modify current schedule Accept new schedule if better –If worse, accept with probability proportional to “temperature” and inversely proportional to benefit change Repeat, while reducing “temperature” Stop when no modifications to schedule accepted Random Simulated Annealing

Each component is a “Player” Each player has to choose best strategy (Grid resource) Each strategy has a benefit, depending on the strategy chosen by all other players. Players identify, then remove strategies guaranteed to never be optimal – “strictly dominated strategies” Produces the “Nash Equilibrium” Game Theory Player B 1234 Player A 1 6,36,47,38,4 2 4,73,94,75,6 3 4,55,57,46,5 4 7,45,56,47,6

Schedulers Random / Best of n Random Produces usable schedules fast. Game Theory Considers the scheduling problem as an economic problem. Simulated Annealing Algorithm for solving optimisation Problems Scheduling Policy Time Optimisation Best benefit from a schedule with the shortest execution time. Results show scheduling time + execution time. Cost Optimisation Best benefit from a schedule where the cost of using resources is low. Grid Description 4 Clusters of resources Saturn 16 Sparc III 750 MHz Processors 5Gbit Interconnects Rhea 8 Sparc III 900Mhz Processors 5Gbit Interconnects Viking T 16 node, 2GHz Pentium 4 1Gbit Interconnects Viking C 16 node, 2GHz Pentium 4 100Mbit Interconnects Simulated Scheduling Framework Consistent Interface Uses the same interface as the ICENI scheduling framework allowing the same schedule code to be used. Repeatability As the underlying description files never change the same experiment can be run many times. Application Description 21 DAG Applications Varying Depth DAG Depth between 2 and 7 Varying Complexity Between 2 and 8 Components Average component would take 2 minutes on an 2Ghz CPU Average communication between components would take 1 minute on a 100Mbit network Experiments Scheduler Simulated Scheduling Framework Grid and Application Description Files

Results (Cost Optimisation)

Results (Time Optimisation)

Summary ICENI Scheduling Architecture –Comprised of 3 services, using a pluggable architecture to allow different implementations to be used –Launcher implementations allow launching to different underlying execution environments. –Performance service enables execution time predictions –Scheduling service operates on information provided by other two services Decouples scheduler from application and environment

Summary Scheduling Algorithms –Four algorithms examined while varying: Grid Sizes Applications Policies –Simulated Annealing generally the best algorithm tested –Larger applications take longer to schedule and return –More choice in resources leads to: cheaper computation for users Longer return times for applications Increasing the Grid size can reduce or improve the quality of service experienced by the user

Acknowledgements Director: Professor John Darlington Technical Director: Dr Steven Newhouse Research Staff: –Anthony Mayer, Nathalie Furmento –Stephen McGough, James Stanton –Yong Xie, William Lee –Marko Krznaric, Murtaza Gulamali –Asif Saleem, Laurie Young, Gary Kong Contact:: – – Funding: –PPARC e-Science Studentship (PPA/S/E/2001/03335)