

GridSAM and the Job Submission Description Language Presented by Mike Mineter (Most) slides from Stephen McGough (Imperial College)

2 Acknowledgement Many slides were prepared and given by A. Stephen McGough (Imperial College London), in Edinburgh on 22 Feb

3 Outline
Overview
Job Submission Description Language (JSDL)
Grid Job Submission and Monitoring Service (GridSAM)
GridSAM and the NGS

London e-Science Centre Overview Running Jobs on the Grid

5 Context
[Diagram: middleware maps jobs / legacy code / binary executables to resources]

6 Stages to using the Grid – Classical View
[Diagram labels: security; write (code) to solve problem; “compile” against middleware; advertise; select resources; deploy to resources; middleware; submit to Grid; accounting; steering and visualisation; stage data]

7 What is wrong with this picture?
There are already many DRM (distributed resource management) systems (Condor, Globus…)
  Why do we need another one?
  We don’t. What we really need is for them all to be able to talk to each other
  Make life easy for all
  We need a service which makes systems look the same

8 To make life easy
We want to hide the heterogeneity of the Grid
[Diagram: User on one side, Grid resources on the other; caption: Hide heterogeneity by tight abstraction here]

Other Way… Standards Based Job Submission

10 If all DRM systems supported the same interface…
If we had:
  One interface definition for job submission
  One job description language
Then life would be easier!
We’re getting there
  JSDL is a proposed standard job submission description language
  OGSA-BES are proposing a basic execution service interface
One day hopefully everyone will support this
Till then…

JSDL 1.0 Primer
Ali Anjomshoaa, Fred Brisard, Michel Drescher, Donal K. Fellows, William Lee, An Ly, Steve McGough, Darren Pulsipher, Andreas Savva, Chris Smith

12 JSDL Introduction
JSDL stands for Job Submission Description Language
  A language for describing the requirements of computational jobs for submission to Grids and other systems.
A JSDL document describes the job requirements
  What to do, not how to do it
No Defaults
  All elements must be satisfied for the document to be satisfied
JSDL does not define a submission interface or what the results of a submission look like
JSDL 1.0 is published as GFD-R-P.56
  Includes description of JSDL elements and XML Schema
  Available at

13 JSDL Document
A JSDL document is an XML document
It may contain
  Generic (job) identification information
  Application description
  Resource requirements (main focus is computational jobs)
  Description of required data files
It is a template language
Open content language – compose-able with others
Out of scope, for JSDL version 1.0
  Scheduling
  Workflow
  Security
  …

14 JSDL: Conceptual relation with other standards
[Diagram: a workflow of jobs, each job described by JSDL together with RRL, SDL, WS-A, JLM, JPL, …]
RRL – Resource Requirements Language
SDL – Scheduling Description Language
WS-A – WS-Agreement
JLM – Job Lifetime Management
JPL – Job Policy Language

15 A few words on JSDL and BES
JSDL is a language
  No submission interface defined (on purpose)
  JSDL is independent of submission interfaces
BES is defining a Web Service interface which consumes JSDL documents
  This is not the only use of JSDL
  Though we do like it
[Diagram labels: BES Container; JSDL]

16 Hidden slides give more detail!

17 JSDL Document Structure Overview
[Schema outline; element names not preserved]
Note on cardinality markers: no marker [1..1], ? [0..1], * [0..n], + [1..n]
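The outline on this slide did not survive, so here is a minimal Python sketch of the top-level layout as I understand it from the JSDL 1.0 specification (GFD.56): a JobDefinition holding exactly one JobDescription, which in turn holds optional JobIdentification, Application and Resources elements (DataStaging may follow zero or more times). The namespace URI and element names are taken from that specification, not from the slide, and the helper names `q` and `build_skeleton` are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
JSDL_NS = "http://schemas.ggf.org/jsdl/2005/11/jsdl"


def q(tag):
    """Qualify a local element name with the JSDL namespace."""
    return f"{{{JSDL_NS}}}{tag}"


def build_skeleton():
    """Build an empty JobDefinition with the optional top-level children."""
    job_definition = ET.Element(q("JobDefinition"))
    job_description = ET.SubElement(job_definition, q("JobDescription"))  # [1..1]
    ET.SubElement(job_description, q("JobIdentification"))  # ? [0..1]
    ET.SubElement(job_description, q("Application"))        # ? [0..1]
    ET.SubElement(job_description, q("Resources"))          # ? [0..1]
    # DataStaging elements (* [0..n]) would be appended here when needed.
    return job_definition


skeleton = build_skeleton()
child_names = [child.tag.split("}", 1)[1] for child in skeleton[0]]
print(child_names)
```

The cardinality comments mirror the markers explained in the note on this slide.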

18 Job Identification Element
[Schema outline; element names not preserved]
Example: job name “My Gnuplot invocation”, description “Simple application …”
Extensibility point
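A minimal sketch of how the example on this slide might look when built programmatically. It assumes the JobName and Description child elements and the namespace URI from the JSDL 1.0 specification; the function name `job_identification` is hypothetical, and the description text is copied from the slide with its ellipsis intact.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
JSDL_NS = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
ET.register_namespace("jsdl", JSDL_NS)


def job_identification(name, description):
    """Return a JobIdentification element with a name and a description."""
    ident = ET.Element(f"{{{JSDL_NS}}}JobIdentification")
    ET.SubElement(ident, f"{{{JSDL_NS}}}JobName").text = name
    ET.SubElement(ident, f"{{{JSDL_NS}}}Description").text = description
    return ident


ident = job_identification("My Gnuplot invocation", "Simple application …")
ident_xml = ET.tostring(ident, encoding="unicode")
print(ident_xml)
```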

19 Application Element
[Schema outline; element names not preserved]
Example: gnuplot 5.7
  Use the gnuplot application v5.7 regardless of where it is installed on the target system
How do I define an executable explicitly?
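The gnuplot example can be sketched as an Application element that names the application and its version without saying where it is installed. The element names ApplicationName and ApplicationVersion and the namespace URI are assumptions drawn from the JSDL 1.0 specification; `requested_application` is a hypothetical helper that reads them back.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
NS = {"jsdl": "http://schemas.ggf.org/jsdl/2005/11/jsdl"}

APPLICATION_XML = """
<jsdl:Application xmlns:jsdl="http://schemas.ggf.org/jsdl/2005/11/jsdl">
  <jsdl:ApplicationName>gnuplot</jsdl:ApplicationName>
  <jsdl:ApplicationVersion>5.7</jsdl:ApplicationVersion>
</jsdl:Application>
"""


def requested_application(xml_text):
    """Return (name, version) requested by an Application element."""
    application = ET.fromstring(xml_text)
    name = application.findtext("jsdl:ApplicationName", namespaces=NS)
    version = application.findtext("jsdl:ApplicationVersion", namespaces=NS)
    return name, version


print(requested_application(APPLICATION_XML))
```

Nothing here identifies an executable path; that is what the POSIXApplication extension on the next slide is for.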

20 Application: POSIXApplication extension
[Schema outline; element names not preserved]
POSIXApplication is a normative JSDL extension
Defines standard POSIX elements
  stdin, stdout, stderr
  Working directory
  Command line arguments
  Environment variables
  POSIX limits (not shown here)

21 Hello World
A jsdl:JobDefinition, declaring the jsdl and jsdl-posix namespaces, whose POSIXApplication runs /bin/echo with the arguments hello world
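Since the markup of this example was lost, the following is a reconstruction sketch rather than the slide's exact text. It builds a Hello World job with the POSIXApplication extension and then recovers the command line from it. Both namespace URIs and the Executable and Argument element names are assumptions taken from the JSDL 1.0 specification; `hello_world_job` and `command_line` are hypothetical helpers.

```python
import xml.etree.ElementTree as ET

# Assumptions: JSDL 1.0 and POSIX-extension namespaces as given in GFD.56.
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
POSIX = "http://schemas.ggf.org/jsdl/2005/11/jsdl-posix"
ET.register_namespace("jsdl", JSDL)
ET.register_namespace("jsdl-posix", POSIX)


def hello_world_job():
    """Build a JobDefinition that runs /bin/echo hello world."""
    job = ET.Element(f"{{{JSDL}}}JobDefinition")
    description = ET.SubElement(job, f"{{{JSDL}}}JobDescription")
    application = ET.SubElement(description, f"{{{JSDL}}}Application")
    posix = ET.SubElement(application, f"{{{POSIX}}}POSIXApplication")
    ET.SubElement(posix, f"{{{POSIX}}}Executable").text = "/bin/echo"
    for word in ("hello", "world"):
        ET.SubElement(posix, f"{{{POSIX}}}Argument").text = word
    return job


def command_line(job):
    """Return [executable, arg1, ...] described by the POSIXApplication."""
    posix = job.find(f".//{{{POSIX}}}POSIXApplication")
    executable = posix.findtext(f"{{{POSIX}}}Executable")
    arguments = [arg.text for arg in posix.findall(f"{{{POSIX}}}Argument")]
    return [executable] + arguments


job = hello_world_job()
print(ET.tostring(job, encoding="unicode"))
print(command_line(job))
```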

22 Resource description requirements
Support simple descriptions of resource requirements
  NOT a comprehensive resource requirements language
  Avoided explicit heterogeneous or hierarchical descriptions
  Can be extended with other elements for richer or more abstract descriptions
Main target is compute jobs
  CPU, Memory, Filesystem/Disk, Operating system requirements
  Allow some flexibility for aggregate (Total*) requirements
    “I want 10 CPUs in total and each resource should have 2 or more”
  Very basic support for network requirements

23 Resources Element
[Schema outline; element names not preserved]
Example: One CPU and at least 2 Megabytes of memory
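One way the "one CPU and at least 2 Megabytes of memory" example could be expressed is sketched below. It assumes the TotalCPUCount and TotalPhysicalMemory elements with Exact and LowerBoundedRange children from the JSDL 1.0 specification, and that memory is given in bytes; the slide itself does not show which elements it used, and the `resources` helper is hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
ET.register_namespace("jsdl", JSDL)


def resources(cpu_count, min_memory_bytes):
    """Build a Resources element: exact CPU count, lower-bounded memory."""
    element = ET.Element(f"{{{JSDL}}}Resources")
    cpus = ET.SubElement(element, f"{{{JSDL}}}TotalCPUCount")
    ET.SubElement(cpus, f"{{{JSDL}}}Exact").text = str(float(cpu_count))
    memory = ET.SubElement(element, f"{{{JSDL}}}TotalPhysicalMemory")
    lower = ET.SubElement(memory, f"{{{JSDL}}}LowerBoundedRange")
    lower.text = str(float(min_memory_bytes))
    return element


# One CPU and at least 2 Megabytes (2 * 1024 * 1024 bytes) of memory.
one_cpu_2mb = resources(1, 2 * 1024 * 1024)
print(ET.tostring(one_cpu_2mb, encoding="unicode"))
```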

24 Relation of Individual* and Total* Resources elements
It is possible to combine Individual* and Total* elements to specify complex requirements
  “I want a total of 10 CPUs, 2 or more per resource”
Caveat: Not all Individual/Total combinations make sense
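The quoted requirement can be made concrete with a small check. This sketch assumes the IndividualCPUCount and TotalCPUCount element names from the JSDL 1.0 specification, and the interpretation that an allocation is a list of CPU counts, one per resource; `read_requirement` and `satisfies` are hypothetical helpers, not part of any JSDL tool.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
NS = {"jsdl": "http://schemas.ggf.org/jsdl/2005/11/jsdl"}

# "I want a total of 10 CPUs, 2 or more per resource"
RESOURCES_XML = """
<jsdl:Resources xmlns:jsdl="http://schemas.ggf.org/jsdl/2005/11/jsdl">
  <jsdl:IndividualCPUCount>
    <jsdl:LowerBoundedRange>2.0</jsdl:LowerBoundedRange>
  </jsdl:IndividualCPUCount>
  <jsdl:TotalCPUCount>
    <jsdl:Exact>10.0</jsdl:Exact>
  </jsdl:TotalCPUCount>
</jsdl:Resources>
"""


def read_requirement(xml_text):
    """Return (minimum CPUs per resource, total CPUs) from a Resources element."""
    res = ET.fromstring(xml_text)
    per_resource = float(res.findtext(
        "jsdl:IndividualCPUCount/jsdl:LowerBoundedRange", namespaces=NS))
    total = float(res.findtext("jsdl:TotalCPUCount/jsdl:Exact", namespaces=NS))
    return per_resource, total


def satisfies(cpus_per_resource, xml_text=RESOURCES_XML):
    """Check a candidate allocation (CPU count per resource) against the request."""
    per_resource, total = read_requirement(xml_text)
    return (sum(cpus_per_resource) == total
            and all(n >= per_resource for n in cpus_per_resource))


print(satisfies([2, 2, 2, 2, 2]), satisfies([4, 6]), satisfies([1, 9]))
```

Five resources with two CPUs each and two resources with four and six CPUs both qualify; an allocation containing a single-CPU resource does not.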

25 RangeValues
Define exact values (with an optional “epsilon” argument), left-open or right-open intervals and ranges.
Example: Between 512MB and 2GB of memory (inclusive)
Example: Between 2 and 16 processors
[Callouts on the examples: exact, epsilon, intervals, ranges]
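A rough sketch of how a consumer might evaluate such range values. It assumes the Exact (with an epsilon attribute), LowerBoundedRange, UpperBoundedRange and Range (LowerBound, UpperBound) elements and the exclusiveBound attribute from the JSDL 1.0 specification, and that a value matches if it satisfies any one alternative; the `matches` function is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"


def _exclusive(element):
    """True if the bound is marked exclusive (bounds are inclusive by default)."""
    return element.get("exclusiveBound") in ("true", "1")


def _above(x, bound):
    limit = float(bound.text)
    return x > limit or (x == limit and not _exclusive(bound))


def _below(x, bound):
    limit = float(bound.text)
    return x < limit or (x == limit and not _exclusive(bound))


def matches(range_value, x):
    """Return True if x satisfies any alternative inside a range-value element."""
    for alt in range_value:
        kind = alt.tag.split("}", 1)[-1]
        if kind == "Exact":
            epsilon = float(alt.get("epsilon", "0"))
            if abs(x - float(alt.text)) <= epsilon:
                return True
        elif kind == "LowerBoundedRange" and _above(x, alt):
            return True
        elif kind == "UpperBoundedRange" and _below(x, alt):
            return True
        elif kind == "Range":
            lower = alt.find(f"{{{JSDL}}}LowerBound")
            upper = alt.find(f"{{{JSDL}}}UpperBound")
            if _above(x, lower) and _below(x, upper):
                return True
    return False


# Between 512MB and 2GB of memory (inclusive), expressed in bytes.
MEMORY_XML = f"""
<jsdl:IndividualPhysicalMemory xmlns:jsdl="{JSDL}">
  <jsdl:Range>
    <jsdl:LowerBound>536870912.0</jsdl:LowerBound>
    <jsdl:UpperBound>2147483648.0</jsdl:UpperBound>
  </jsdl:Range>
</jsdl:IndividualPhysicalMemory>
"""
memory = ET.fromstring(MEMORY_XML)
print(matches(memory, 1024.0 ** 3), matches(memory, 3 * 1024.0 ** 3))
```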

26 JSDL Type Definitions
JSDL defines a small number of types
  As far as possible re-use existing standards
Example: OperatingSystemTypeEnumeration
  Basic value set defined based on CIM: Windows_XP, JavaVM, OS_390, LINUX, MACOS, Solaris, …
  CIM defines these as numbers; JSDL provides an XML definition
  Watching WS-CIM work
Similarly for values of other types: ProcessorArchitectureEnumeration based on ISA values
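To illustrate the idea of a closed, named value set, here is a small sketch that validates an operating system name against an enumeration. Only the six values listed on the slide are included (the real enumeration is longer), and the surrounding OperatingSystem, OperatingSystemType and OperatingSystemName element names are assumptions from the JSDL 1.0 specification; `parse_os` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET
from enum import Enum

# Assumption: JSDL 1.0 namespace as given in GFD.56.
NS = {"jsdl": "http://schemas.ggf.org/jsdl/2005/11/jsdl"}


class OperatingSystemName(Enum):
    """Subset of the enumeration: only the values named on the slide."""
    Windows_XP = "Windows_XP"
    JavaVM = "JavaVM"
    OS_390 = "OS_390"
    LINUX = "LINUX"
    MACOS = "MACOS"
    Solaris = "Solaris"


OS_XML = """
<jsdl:OperatingSystem xmlns:jsdl="http://schemas.ggf.org/jsdl/2005/11/jsdl">
  <jsdl:OperatingSystemType>
    <jsdl:OperatingSystemName>LINUX</jsdl:OperatingSystemName>
  </jsdl:OperatingSystemType>
</jsdl:OperatingSystem>
"""


def parse_os(xml_text):
    """Return the requested OS as an enum member; ValueError if unknown."""
    element = ET.fromstring(xml_text)
    name = element.findtext(
        "jsdl:OperatingSystemType/jsdl:OperatingSystemName", namespaces=NS)
    return OperatingSystemName(name)


print(parse_os(OS_XML))
```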

27 Data Staging Requirement
Previous statements included:
  “A JSDL document describes the job requirements: what to do, not how to do it”
  “Workflow is out of scope.”
But … data staging is a common requirement for any meaningful job submission
  Especially for batch job submission
  No standard to describe such data movements
Our solution
  Assume simple model: Stage-in – Execute – Stage-Out
  Files required for execution
    Files are staged-in before the job can start executing
  Files to preserve
    Files are staged-out after the job finishes execution
  More complex approaches can be used
    But this is outside JSDL
    You don’t need to use the JSDL Data Staging
[Diagram: Stage-In, Execute, Stage-Out]

28 DataStaging Element
[Schema outline; element names not preserved]
Example: Stage in a file (from a URL) and name it “control.txt”. In case it already exists, simply overwrite it. After the job is done, delete this file.
[Example values: control.txt, overwrite, true]
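The stage-in example can be sketched as follows. The FileName, CreationFlag, DeleteOnTermination, Source and URI element names are assumptions from the JSDL 1.0 specification; the source URL on the slide was not preserved, so `http://example.org/control.txt` is a placeholder, and `stage_in` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Assumption: JSDL 1.0 namespace as given in GFD.56.
JSDL = "http://schemas.ggf.org/jsdl/2005/11/jsdl"
ET.register_namespace("jsdl", JSDL)


def stage_in(file_name, source_uri, creation_flag="overwrite",
             delete_on_termination=True):
    """Build a DataStaging element that fetches source_uri before execution."""
    staging = ET.Element(f"{{{JSDL}}}DataStaging")
    ET.SubElement(staging, f"{{{JSDL}}}FileName").text = file_name
    ET.SubElement(staging, f"{{{JSDL}}}CreationFlag").text = creation_flag
    delete = ET.SubElement(staging, f"{{{JSDL}}}DeleteOnTermination")
    delete.text = "true" if delete_on_termination else "false"
    source = ET.SubElement(staging, f"{{{JSDL}}}Source")
    ET.SubElement(source, f"{{{JSDL}}}URI").text = source_uri
    return staging


# Placeholder URL: the one used on the slide is not known.
control = stage_in("control.txt", "http://example.org/control.txt")
print(ET.tostring(control, encoding="unicode"))
```

A stage-out entry would use a Target element in place of Source, following the Stage-in, Execute, Stage-Out model described on the previous slide.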

29 JSDL Adoption
The following projects have presented at GGF JSDL sessions and are known to have implementations of some version of JSDL; not necessarily 1.0.
  Business Grid
  Grid Programming Environment (GPE)
  GridSAM
  HPC-Europa
  Market for Computational Services
  NAREGI
  UniGrids
The following groups also said they are or will be implementing JSDL:
  DEISA GridBus Project (see OGSA Roadmap, section 8) gridMatrix (Cadence) (presentation) Nordugrid
Also within GGF a number of groups either use directly or have a strong interest or connection with JSDL:
  BES-WG, CDDLM-WG, DRMAA-WG, GRAAP-WG, OGSA-WG, RSS-WG
An up-to-date version of this list is on Gridforge:

30 JSDL Mappings
ARC (NorduGrid)
Condor
eNANOS
Fork
Globus 2
GRIA provider
Grid Resource Management System (GRMS)
JOb Scheduling Hierarchically (JOSH)
LSF
Sun Grid Engine
Unicore

GridSAM Job Submission and Monitoring Web Service Other way…

32 GridSAM Overview: Grid Job Submission and Monitoring Service
What is GridSAM?
  A Job Submission and Monitoring Web Service
  Funded by the Open Middleware Infrastructure Institute (OMII) managed programme
  V1.0 Available as part of the OMII 2.x release (v soon to be released)
  Open source (BSD)
  One of the first systems to support the GGF Job Submission Description Language (JSDL)

33 GridSAM Overview: Grid Job Submission and Monitoring Service
What is GridSAM to the resource owners?
  A Web Service to expose heterogeneous execution resources uniformly
    Single machine through Forking or SSH
    Condor Pool
    Grid Engine 6 through DRMAA
    Globus exposed resources
    OR use our plug-in API to implement …
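The idea of exposing different back ends uniformly can be illustrated with a small plug-in pattern. This is only a sketch: the `Launcher`, `ForkLauncher` and `DryRunLauncher` classes and the `submit` function are invented for illustration and are not GridSAM's plug-in API, which is a Java API not shown in these slides.

```python
import subprocess
import sys
from abc import ABC, abstractmethod


class Launcher(ABC):
    """Hypothetical plug-in interface; not GridSAM's actual API."""

    @abstractmethod
    def launch(self, argv):
        """Run the job and return (exit_code, stdout)."""


class ForkLauncher(Launcher):
    """Runs the job as a child process on the local machine."""

    def launch(self, argv):
        done = subprocess.run(argv, capture_output=True, text=True)
        return done.returncode, done.stdout


class DryRunLauncher(Launcher):
    """Stand-in for a batch back end (e.g. Condor); only records the command."""

    def __init__(self):
        self.submitted = []

    def launch(self, argv):
        self.submitted.append(list(argv))
        return 0, ""


def submit(launcher, argv):
    """The caller sees one interface whatever the back end is."""
    return launcher.launch(argv)


# Use the running Python interpreter as a portable stand-in for /bin/echo.
code, out = submit(ForkLauncher(), [sys.executable, "-c", "print('hello world')"])
print(code, out.strip())
```

Swapping `ForkLauncher()` for another `Launcher` subclass changes where the job runs without changing the calling code, which is the kind of uniformity the slide describes.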

34 GridSAM Overview: Grid Job Submission and Monitoring Service
What is GridSAM to end-users?
  A set of end-user tools and client-side APIs to interact with a GridSAM web service
    Submit and Start Jobs
    Monitor Jobs
    Terminate Jobs
    File transfer
    Client-side submission scripting
    Client-side Java API

35 What’s it not?
GridSAM is not
  a scheduling service
    That’s the role of the underlying launching mechanism
    That’s the role of a super-scheduler that brokers jobs to a set of GridSAM services
  a provisioning service
    GridSAM runs what it’s been told to run
    GridSAM does not resolve software dependencies and resource requirements

36 Deployment Scenario: Forking
[Diagram labels: HTTP + WS-Sec. / HTTPS + WS-Sec. / HTTPS mutual; Local FS; GSIFTP, FTP, WEBDAV, HTTP, …]

37 Deployment Scenario: Secure Shell (SSH)
[Diagram labels: HTTP + WS-Sec. / HTTPS + WS-Sec. / HTTPS mutual; GSIFTP, FTP, WEBDAV, HTTP, …; SFTP - FS]

38 Deployment Scenario: Condor Pool
[Diagram labels: Condor command-line wrapper; HTTP + WS-Sec. / HTTPS + WS-Sec. / HTTPS mutual; GSIFTP, FTP, WEBDAV, HTTP, …; Network FS]

39 Deployment Scenario: Globus 2.4.3

40 Deployment Scenario: Grid Engine 6
[Diagram labels: GSIFTP, FTP, WEBDAV, HTTP, …; Network FS]

41 Latest Features
Available in v2.0.0-rc1 (released 1/7/06)
  MPI Application through GT2 plugin
    Simple non-standard JSDL extension that extends with a element
  Authorisation based on JSDL structure
    Allow / deny submission based on a set of XPath rules and the identity of the submitter (e.g. distinguished name).
  Prototype Basic Execution Service (ogsa-bes) interface
    Demonstrated in the mini face-to-face in London last December
    Shown interoperability with the University of Virginia BES (.NET based) implementation.
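The XPath-based authorisation idea might work along the following lines. This is a sketch under assumptions: the rule format (XPath expression, set of distinguished names, decision), the first-match-wins and default-deny policy, and the example distinguished names are all invented for illustration and are not GridSAM's actual configuration format. It uses the limited XPath subset supported by Python's `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Assumptions: JSDL 1.0 and POSIX-extension namespaces as given in GFD.56.
NS = {
    "jsdl": "http://schemas.ggf.org/jsdl/2005/11/jsdl",
    "posix": "http://schemas.ggf.org/jsdl/2005/11/jsdl-posix",
}

# Hypothetical submitter identities (distinguished names).
ALICE = "/C=UK/O=eScience/OU=Imperial/CN=alice"
BOB = "/C=UK/O=eScience/OU=Imperial/CN=bob"

# Hypothetical rule format: (XPath, allowed identities or "*", allow?).
RULES = [
    # Anyone may run /bin/echo.
    (".//posix:POSIXApplication[posix:Executable='/bin/echo']", {"*"}, True),
    # Only alice may run any other POSIX application.
    (".//posix:POSIXApplication", {ALICE}, True),
]


def job_running(executable):
    """Build a minimal JSDL job that runs the given executable."""
    return ET.fromstring(f"""
<jsdl:JobDefinition xmlns:jsdl="{NS['jsdl']}" xmlns:posix="{NS['posix']}">
  <jsdl:JobDescription><jsdl:Application><posix:POSIXApplication>
    <posix:Executable>{executable}</posix:Executable>
  </posix:POSIXApplication></jsdl:Application></jsdl:JobDescription>
</jsdl:JobDefinition>""")


def authorise(job, submitter_dn, rules=RULES):
    """First rule that matches both the job and the submitter decides."""
    for xpath, identities, allow in rules:
        matches_job = bool(job.findall(xpath, NS))
        matches_user = "*" in identities or submitter_dn in identities
        if matches_job and matches_user:
            return allow
    return False  # default deny


print(authorise(job_running("/bin/echo"), BOB))
print(authorise(job_running("/bin/sh"), BOB))
```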

42 Upcoming Features
Job State Notification
  Integrate with FINS (WS-Eventing)
Resource Usage Service
  GGF RUS compliant service implementation for recording and querying usages
  Integrate with GridSAM to account for job resource usage
Basic Execution Service
  Continue tracking the changes in the ogsa-bes specification
  Support dual submission WS-interfaces

43 GridSAM and the NGS
Recently deployed by Belfast
But not in time to include practicals here

44 GridSAM client
Download from OMII-UK –
Trivial to install
For guidance on use see: _training_guide.htm#gridsam/gridsam_java_api_tutorial.htm

45 Further Information
Official Download
Project Information and Documentation

Questions?