London e-Science Centre Session 4: The GridSAM service A. Stephen McGough Imperial College London A. Stephen McGough Imperial College London.

Slides:



Advertisements
Similar presentations
March 6 th, 2009 OGF 25 Unicore 6 and IPv6 readiness and IPv6 readiness
Advertisements

GridSAM Overview Grid Job S ubmission A nd M onitoring Service What is GridSAM? Funded by the OMII Managed Programme (Started in Sept, 04) Client Perspective.
WS-JDML: A Web Service Interface for Job Submission and Monitoring Stephen M C Gough William Lee London e-Science Centre Department of Computing, Imperial.
17 March 2008Standards for Interoperable Grids 1 Job Execution Standards for Interoperable Grids: Experience from NextGRID and OMII-Europe Clive Davenhall.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
3rd Campus Grid SIG Meeting. Agenda Welcome OMII Requirements document Grid Data Group HTC Workshop Research Computing SIG? AOB Next meeting (AG)
CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
A JSDL Applications Repository and Data Staging Portal: Some New Parameter Sweep Developments and Data transfer Requirements David Meredith STFC e-Science.
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
London e-Science Centre Session 6: Distributed Computation Practical issues & Examples A. Stephen McGough Imperial College London Practical issues & Examples.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
W w w. h p c - e u r o p a. o r g The HPC-Europa project and GridSphere Dawid Szejnfeld Poznan Supercomputing.
W w w. h p c - e u r o p a. o r g Single Point of Access to Resources of HPC-Europa Krzysztof Kurowski, Jarek Nabrzyski, Ariel Oleksiak, Dawid Szejnfeld.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Grid Computing 7700 Fall 2005 Lecture 17: Resource Management Gabrielle Allen
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
1 AHE Server Deployment and Hosting Applications Stefan Zasada University College London.
Accounting for the Grid Usage Records and a Resource Usage Service.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
COMP3019 Coursework: Introduction to GridSAM Steve Crouch School of Electronics and Computer Science.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
JRA1/Job Submission and Monitoring Moreno Marzolla on behalf of JRA1/Job Submission Task INFN Sezione di Padova,
Leading the pervasive adoption of grid computing for research and industry © 2005 Global Grid Forum The information contained herein is subject to change.
Condor Birdbath Web Service interface to Condor
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
London e-Science Centre GridSAM A Standards Based Approach to Job Submission A. Stephen M C Gough Imperial College London A Standards Based Approach to.
Why do we need PGI? Shahbaz Memon Jülich Supercomputing Centre (JSC)
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
London e-Science Centre GridSAM Job Submission and Monitoring Web Service William Lee, Stephen McGough.
JSDL/BES Tutorial Moreno Marzolla, INFN Padova OMII-EU All Hands Meeting, Stockholm, jul 31—aug 2, 2007.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
GridSAM - A Standards Based Approach to Job Submission Through Web Services William Lee and Stephen McGough London e-Science Centre Department of Computing,
Todd Tannenbaum Computer Sciences Department University of Wisconsin-Madison Condor RoadMap.
Grids - the near future Mark Hayes NIEeS Summer School 2003.
1 Condor BirdBath SOAP Interface to Condor Charaka Goonatilake Department of Computer Science University College London
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
OGSA-UK: Putting the users first Steven Newhouse OMII Deputy Director.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Standards driven AAA for Job Management within the OMII-UK distribution Steven Newhouse Director, OMII-UK
Grid Interoperability Update on GridFTP tests Gregor von Laszewski
GridSAM: an Introduction Mike Mineter.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
Accounting in DataGrid HLR software demo Andrea Guarise Milano, September 11, 2001.
GridSAM and the Job Submission Description Language Presented by Mike Mineter (Most) slides from Stephen.
Holding slide prior to starting show. Processing Scientific Applications in the JINI-Based OGSA-Compliant Grid Yan Huang.
London e-Science Centre Activity Schema What we’ve discussed already A. Stephen M C Gough Imperial College London What we’ve discussed already A. Stephen.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Leading the pervasive adoption of grid computing for research and industry © 2005 Global Grid Forum The information contained herein is subject to change.
JRA1/Job Submission and Monitoring
Introduction to the Application Hosting Environment
Towards GLUE Schema 2.0 Sergio Andreozzi INFN-CNAF Bologna, Italy
OGSA Data Architecture Scenarios
Grid Systems: What do we need from web service standards?
Sergio Andreozzi Laurence Field Balazs Konya
Presentation transcript:

London e-Science Centre Session 4: The GridSAM service A. Stephen McGough Imperial College London A. Stephen McGough Imperial College London

London e-Science Centre 2 Outline  Overview  Other Way  JSDL  GridSAM  Overview  Other Way  JSDL  GridSAM

London e-Science Centre Overview Running Jobs on the Grid

London e-Science Centre 4 Context Middleware Map to resources jobs / legacy code / binary executables Resources

London e-Science Centre 5 security Stages to using the Grid – Classical View write (code) to solve problem “compile” against middleware advertise Select resources Deploy to resources middleware submit to Grid accounting Steering and visualisation Stage data

London e-Science Centre 6 What is wrong with this picture? There are already many DRM systems (Condor, Globus…)  Why do we need another one?  We don’t. What we really need is for them all to be able to talk to each other  Make life easy for all  We need a service which makes systems look the same There are already many DRM systems (Condor, Globus…)  Why do we need another one?  We don’t. What we really need is for them all to be able to talk to each other  Make life easy for all  We need a service which makes systems look the same

London e-Science Centre 7 To make life easy We want to hide the heterogeneity of the Grid User Grid resources Hide heterogeneity by tight abstraction here

London e-Science Centre Other Way… Standards Based Job Submission

London e-Science Centre 9 If all DRM systems supported the same interface…  If we had:  One interface definition for job submission  One job description language  Then life would be easier!  We’re getting there  JSDL is a proposed standard job submission description language  OGSA-BES are proposing a basic execution service interface  One day hopefully everyone will support this  Till then…  If we had:  One interface definition for job submission  One job description language  Then life would be easier!  We’re getting there  JSDL is a proposed standard job submission description language  OGSA-BES are proposing a basic execution service interface  One day hopefully everyone will support this  Till then…

London e-Science Centre JSDL 1.0 Primer Ali Anjomshoaa, Fred Brisard, Michel Drescher, Donal K. Fellows, William Lee, An Ly, Steve McGough, Darren Pulsipher, Andreas Savva, Chris Smith

London e-Science Centre 11 JSDL Introduction JSDL stands for Job Submission Description Language A language for describing the requirements of computational jobs for submission to Grids and other systems. A JSDL document describes the job requirements What to do, not how to do it No Defaults All elements must be satisfied for the document to be satisfied JSDL does not define a submission interface or what the results of a submission look like JSDL 1.0 is published as GFD-R-P.56 Includes description of JSDL elements and XML Schema Available at JSDL stands for Job Submission Description Language A language for describing the requirements of computational jobs for submission to Grids and other systems. A JSDL document describes the job requirements What to do, not how to do it No Defaults All elements must be satisfied for the document to be satisfied JSDL does not define a submission interface or what the results of a submission look like JSDL 1.0 is published as GFD-R-P.56 Includes description of JSDL elements and XML Schema Available at

London e-Science Centre 12 JSDL Document A JSDL document is an XML document It may contain Generic (job) identification information Application description Resource requirements (main focus is computational jobs) Description of required data files It is a template language Open content language – compose-able with others Out of scope, for JSDL version 1.0 Scheduling Workflow Security … A JSDL document is an XML document It may contain Generic (job) identification information Application description Resource requirements (main focus is computational jobs) Description of required data files It is a template language Open content language – compose-able with others Out of scope, for JSDL version 1.0 Scheduling Workflow Security …

London e-Science Centre 13 Workflow Job JSDL RRL SDL WS-A JLM JPL … … Job JSDL RRL SDL WS-A JLM JPL … … Job JSDL RRL SDL WS-A JLM JPL … … Job JSDL RRL SDL WS-A JLM JPL … … JSDL: Conceptual relation with other standards RRL - Resource Requirements Language SDL – Scheduling Description Language WS-A – WS-Agreement JLM – Job Lifetime Management JPL – Job Policy Language

London e-Science Centre 14 A few words on JSDL and BES JSDL is a language No submission interface defined (on purpose) JSDL is independent of submission interfaces BES is defining a Web Service interface which consumes JSDL documents This is not the only use of JSDL Though we do like it JSDL is a language No submission interface defined (on purpose) JSDL is independent of submission interfaces BES is defining a Web Service interface which consumes JSDL documents This is not the only use of JSDL Though we do like it BES Container JSDL

London e-Science Centre 15 JSDL Document Structure Overview ? * ? * Note: None[1..1] ?[0..1] * [0..n] + [1..n]

London e-Science Centre 16 Job Identification Element ? * ? ? * ? Example: My Gnuplot invocation Simple application … Extensibility point

London e-Science Centre 17 Application Element ? * ? * Example: gnuplot 5.7 Use the gnuplot application v5.7 regardless where it is installed on the target system How do I define an executable explicitly?

London e-Science Centre 18 Application: POSIXApplication extension * ? * … * ? * … POSIXApplication is a normative JSDL extension Defines standard POSIX elements stdin, stdout, stderr Working directory Command line arguments Environment variables POSIX limits (not shown here)

London e-Science Centre 19 Hello World <jsdl:JobDefinition xmlns:jsdl=“ xmlns:jsdl-posix= “ /bin/echo hello world <jsdl:JobDefinition xmlns:jsdl=“ xmlns:jsdl-posix= “ /bin/echo hello world

London e-Science Centre 20 Resource description requirements Support simple descriptions of resource requirements NOT a comprehensive resource requirements language Avoided explicit heterogeneous or hierarchical descriptions Can be extended with other elements for richer or more abstract descriptions Main target is compute jobs CPU, Memory, Filesystem/Disk, Operating system requirements Allow some flexibility for aggregate (Total*) requirements “I want 10 CPUs in total and each resource should have 2 or more” Very basic support for network requirements Support simple descriptions of resource requirements NOT a comprehensive resource requirements language Avoided explicit heterogeneous or hierarchical descriptions Can be extended with other elements for richer or more abstract descriptions Main target is compute jobs CPU, Memory, Filesystem/Disk, Operating system requirements Allow some flexibility for aggregate (Total*) requirements “I want 10 CPUs in total and each resource should have 2 or more” Very basic support for network requirements

London e-Science Centre 21 Resources Element ? * ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? * * ? * ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? * * Example: One CPU and at least 2 Megabytes of memory

London e-Science Centre 22 Relation of Individual* and Total* Resources elements It is possible to combine Individual* and Total* elements to specify complex requirements “I want a total of 10 CPUs, 2 or more per resource” Caveat: Not all Individual/Total combinations make sense It is possible to combine Individual* and Total* elements to specify complex requirements “I want a total of 10 CPUs, 2 or more per resource” Caveat: Not all Individual/Total combinations make sense

London e-Science Centre 23 RangeValues Example: Between 512MB and 2GB of memory (inclusive) exactepsilon intervalsranges Define exact values (with an optional “epsilon” argument), left- open or right-open intervals and ranges. Example: Between 2 and 16 processors

London e-Science Centre 24 JSDL Type Definitions Example: OperatingSystemTypeEnumeration JSDL defines a small number of types As far as possible re-use existing standards Example: OperatingSystemTypeEnumeration Basic value set defined based on CIM: Windows_XP, JavaVM, OS_390, LINUX, MACOS, Solaris, … CIM defines these as numbers; JSDL provides an XML definition Watching WS-CIM work Similarly for values of other types: ProcessorArchitectureEnumeration based on ISA values JSDL defines a small number of types As far as possible re-use existing standards Example: OperatingSystemTypeEnumeration Basic value set defined based on CIM: Windows_XP, JavaVM, OS_390, LINUX, MACOS, Solaris, … CIM defines these as numbers; JSDL provides an XML definition Watching WS-CIM work Similarly for values of other types: ProcessorArchitectureEnumeration based on ISA values

London e-Science Centre 25 Data Staging Requirement Previous statements included: “A JSDL document describes the job requirements What to do, not how to do it *” “Workflow is out of scope.” But … data staging is a common requirement for any meaningful job submission Especially for batch job submission No standard to describe such data movements Our solution Assume simple model: Stage-in – Execute – Stage-Out Files required for execution Files are staged-in before the job can start executing Files to preserve Files are staged-out after the job finishes execution More complex approaches can be used But this is outside JSDL You don’t need to use the JSDL Data Staging Previous statements included: “A JSDL document describes the job requirements What to do, not how to do it *” “Workflow is out of scope.” But … data staging is a common requirement for any meaningful job submission Especially for batch job submission No standard to describe such data movements Our solution Assume simple model: Stage-in – Execute – Stage-Out Files required for execution Files are staged-in before the job can start executing Files to preserve Files are staged-out after the job finishes execution More complex approaches can be used But this is outside JSDL You don’t need to use the JSDL Data Staging Stage-In Execute Stage-Out

London e-Science Centre 26 DataStaging Element ? ? * ? ? * Example: Stage in a file (from a URL) and name it “control.txt”. In case it already exists, simply overwrite it. After the job is done, delete this file. control.txt overwrite true

London e-Science Centre 27 JSDL Adoption The following projects have presented at GGF JSDL sessions and are known to have implementations of some version of JSDL; not necessarily 1.0. Business Grid Grid Programming Environment (GPE) GridSAM HPC-Europa Market for Computational Services NAREGI UniGrids The following groups also said they are or will be implementing JSDL: DEISA GridBus Project (see OGSA Roadmap, section 8) gridMatrix (Cadence) (presentation) Nordugrid Also within GGF a number of groups either use directly or have a strong interest or connection with JSDL: BES-WG, CDDLM-WG, DRMAA-WG, GRAAP-WG, OGSA-WG, RSS-WG An up-to-date version of this list is on Gridforge: The following projects have presented at GGF JSDL sessions and are known to have implementations of some version of JSDL; not necessarily 1.0. Business Grid Grid Programming Environment (GPE) GridSAM HPC-Europa Market for Computational Services NAREGI UniGrids The following groups also said they are or will be implementing JSDL: DEISA GridBus Project (see OGSA Roadmap, section 8) gridMatrix (Cadence) (presentation) Nordugrid Also within GGF a number of groups either use directly or have a strong interest or connection with JSDL: BES-WG, CDDLM-WG, DRMAA-WG, GRAAP-WG, OGSA-WG, RSS-WG An up-to-date version of this list is on Gridforge:

London e-Science Centre 28 JSDL Mappings ARC (NorduGrid) Condor eNANOS Fork Globus 2 GRIA provider Grid Resource Management System (GRMS) ARC (NorduGrid) Condor eNANOS Fork Globus 2 GRIA provider Grid Resource Management System (GRMS) JOb Scheduling Hierarchically (JOSH) LSF Sun Grid Engine Unicore

London e-Science Centre GridSAM Job Submission and Monitoring Web Service Other way…

London e-Science Centre 30 GridSAM Overview Grid Job Submission and Monitoring Service  What is GridSAM?  A Job Submission and Monitoring Web Service  Funded by the Open Middleware Infrastructure Institute (OMII) managed programme  V1.0 Available as part of the OMII 2.x release (v soon to be released)  Open source (BSD)  One of the first system to support the GGF Job Submission Description Language (JSDL)  What is GridSAM?  A Job Submission and Monitoring Web Service  Funded by the Open Middleware Infrastructure Institute (OMII) managed programme  V1.0 Available as part of the OMII 2.x release (v soon to be released)  Open source (BSD)  One of the first system to support the GGF Job Submission Description Language (JSDL)

London e-Science Centre 31 GridSAM Overview Grid Job Submission and Monitoring Service  What is GridSAM to the resource owners?  A Web Service to expose heterogeneous execution resources uniformly  Single machine through Forking or SSH  Condor Pool  Grid Engine 6 through DRMAA  Globus exposed resources  OR use our plug-in API to implement …  What is GridSAM to the resource owners?  A Web Service to expose heterogeneous execution resources uniformly  Single machine through Forking or SSH  Condor Pool  Grid Engine 6 through DRMAA  Globus exposed resources  OR use our plug-in API to implement …

London e-Science Centre 32 GridSAM Overview Grid Job Submission and Monitoring Service  What is GridSAM to end-users?  A set of end-user tools and client-side APIs to interact with a GridSAM web service  Submit and Start Jobs  Monitor Jobs  Terminate Jobs  File transfer  Client-side submission scripting  Client-side Java API  What is GridSAM to end-users?  A set of end-user tools and client-side APIs to interact with a GridSAM web service  Submit and Start Jobs  Monitor Jobs  Terminate Jobs  File transfer  Client-side submission scripting  Client-side Java API

London e-Science Centre 33 What’s not?  GridSAM is not  a scheduling service  That’s the role of the underlying launching mechanism  That’s the role of a super-scheduler that brokers jobs to a set of GridSAM services  a provisioning service  GridSAM runs what’s been told to run  GridSAM does not resolve software dependencies and resource requirements  GridSAM is not  a scheduling service  That’s the role of the underlying launching mechanism  That’s the role of a super-scheduler that brokers jobs to a set of GridSAM services  a provisioning service  GridSAM runs what’s been told to run  GridSAM does not resolve software dependencies and resource requirements

London e-Science Centre 34 Deployment Scenario: Forking HTTP + WS-Sec./ HTTPS + WS- Sec. / HTTPS mutual. Local FS Local FS GSIFTP FTP WEBDAV HTTP …

London e-Science Centre 35 Deployment Scenario: Secure Shell (SSH) HTTP + WS-Sec./ HTTPS + WS- Sec. / HTTPS mutual. GSIFTP FTP WEBDAV HTTP … SFTP - FS SFTP - FS

London e-Science Centre 36 Deployment Scenario: Condor Pool Condor command- line wrapper HTTP + WS-Sec./ HTTPS + WS-Sec. / HTTPS mutual. GSIFTP FTP WEBDAV HTTP … Network FS Network FS

London e-Science Centre 37 Deployment Scenario: Globus 2.4.3

London e-Science Centre 38 Deployment Scenario: Grid Engine 6 GSIFTP FTP WEBDAV HTTP … Network FS Network FS

London e-Science Centre 39 Latest Features  Available in v2.0.0-rc1 (released 1/7/06)  MPI Application through GT2 plugin  Simple non-standard JSDL extension that extends with a element  Authorisation based on JSDL structure  Allow / deny submission based on a set of XPath rules and the identities of the submitter (e.g. distinguished name).  Prototype Basic Execution Service (ogsa-bes) interface  Demonstrated in the mini face-to-face in London last December  Shown interoperability with the Uni. Of Virginia BES (.NET based) implementation.  Available in v2.0.0-rc1 (released 1/7/06)  MPI Application through GT2 plugin  Simple non-standard JSDL extension that extends with a element  Authorisation based on JSDL structure  Allow / deny submission based on a set of XPath rules and the identities of the submitter (e.g. distinguished name).  Prototype Basic Execution Service (ogsa-bes) interface  Demonstrated in the mini face-to-face in London last December  Shown interoperability with the Uni. Of Virginia BES (.NET based) implementation.

London e-Science Centre 40 Upcoming Features  Job State Notification  Integrate with FINS (WS-Eventing)  Resource Usage Service  GGF RUS compliant service implementation for recording and querying usages  Integrate with GridSAM to account for job resource usage  Basic Execution Service  Continue tracking the changes in the ogsa-bes specification  Support dual submission WS-interfaces  Job State Notification  Integrate with FINS (WS-Eventing)  Resource Usage Service  GGF RUS compliant service implementation for recording and querying usages  Integrate with GridSAM to account for job resource usage  Basic Execution Service  Continue tracking the changes in the ogsa-bes specification  Support dual submission WS-interfaces

London e-Science Centre 41 Further Information Official Download Project Information and Documentation Official Download Project Information and Documentation

London e-Science Centre Questions?