UI Interactions and Interfaces with the Workload Manager Components
DataGrid WP1
F. Pacini - Milan - 7 May 2001



Summary
- Interactions between UI and RB
- Interactions between UI and M&C
- Other Points

Interactions between UI and RB (1/10)
- The Job Submission UI contacts the RB when the following commands are issued by the user:
    - dg-job-submit
    - dg-list-job-match
    - dg-job-cancel
- Communication is performed via socket (TCP/IP).
- An agreement on the communication protocol is needed.

Interactions between UI and RB (2/10)
dg-job-submit
    dg-job-submit [-resource res_id] [-notify e_mail_address]
- The information flow from the UI to the RB consists of a job class-ad built from the job description file, plus a variable indicating the request type. The job class-ad object consists of a list of entries of the form “attribute = expression”. The following attributes are always present in the job class-ad (jobAd):
    - UserID
    - CertificateSubject
    - ExecutableName
    - Input
    - Output
    - Constraints

Interactions between UI and RB (3/10)
- If dg-job-submit has been issued with the “-resource” option, then the job class-ad contains the attribute:
    - ResourceID = res_id
  and the RB shall submit the job to the resource identified by “res_id” without going through the match-making process.
- If dg-job-submit has been issued with the “-notify” option, then the job class-ad contains the attribute:
    - UserContact = e_mail_address
  and the RB shall send a notification to e_mail_address at each job status transition. The content is the same as the output of the dg-job-status command (TBD).
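The way the UI could assemble a jobAd from these rules can be sketched as follows. This is a minimal illustration, not the WP1 implementation: the function names `build_job_ad` and `render_job_ad` are hypothetical, the jobAd is modelled as a plain Python dict, and the rendered text only imitates the “attribute = expression” form without claiming to be valid class-ad syntax.

```python
# Mandatory jobAd attributes, as listed on the dg-job-submit slide.
MANDATORY_ATTRIBUTES = (
    "UserID", "CertificateSubject", "ExecutableName",
    "Input", "Output", "Constraints",
)


def build_job_ad(base_attributes, resource_id=None, notify_address=None):
    """Return a jobAd as a dict mapping attribute names to expressions.

    Hypothetical helper: the dict representation is an assumption.
    """
    missing = [name for name in MANDATORY_ATTRIBUTES if name not in base_attributes]
    if missing:
        raise ValueError("missing mandatory attributes: " + ", ".join(missing))
    job_ad = dict(base_attributes)
    if resource_id is not None:
        # -resource option: the RB submits directly, skipping match-making.
        job_ad["ResourceID"] = resource_id
    if notify_address is not None:
        # -notify option: the RB notifies this address at each status transition.
        job_ad["UserContact"] = notify_address
    return job_ad


def render_job_ad(job_ad):
    """Render the jobAd as 'attribute = expression' lines (illustrative only)."""
    return "\n".join(f"{name} = {expr}" for name, expr in job_ad.items())
```

Calling `build_job_ad(attrs, resource_id="res_id")` adds only `ResourceID`; omitting both options yields a jobAd with just the mandatory attributes.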

Interactions between UI and RB (4/10)
- The variable indicating the request type, say requestType, is an enumeration with the following values:
    - JOB_SUBMIT
    - LIST_MATCH
  and is passed in this case with the JOB_SUBMIT value.
- In summary, the UI passes to the RB a structure with the following two fields:
    - requestType
    - jobAd

Interactions between UI and RB (5/10)
- The expected response from the RB consists of:
    - ReturnCode, a numeric code indicating the operation result
    - ReturnMessage, a string describing the operation result
    - dg_jobId, a string identifying the submitted job

UI to RB                      RB to UI
- requestType (JOB_SUBMIT)    - ReturnCode
- jobAd                       - ReturnMessage
                              - dg_jobId
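Since the slides leave the socket protocol open, the exchange above can only be sketched under assumptions. The example below assumes, purely for illustration, a JSON payload preceded by a 4-byte big-endian length prefix; the names `RequestType`, `encode_request`, `decode_request` and `make_submit_response` are hypothetical and do not come from the WP1 software.

```python
import json
import struct
from enum import Enum


class RequestType(Enum):
    """The two request types named in the slides."""
    JOB_SUBMIT = "JOB_SUBMIT"
    LIST_MATCH = "LIST_MATCH"


def encode_request(request_type, job_ad):
    """Serialize the two-field structure (requestType, jobAd) to bytes.

    Assumed framing: 4-byte big-endian length, then a UTF-8 JSON payload.
    """
    payload = json.dumps(
        {"requestType": request_type.value, "jobAd": job_ad}
    ).encode("utf-8")
    return struct.pack("!I", len(payload)) + payload


def decode_request(message):
    """Inverse of encode_request; returns (RequestType, jobAd dict)."""
    (length,) = struct.unpack("!I", message[:4])
    body = json.loads(message[4:4 + length].decode("utf-8"))
    return RequestType(body["requestType"]), body["jobAd"]


def make_submit_response(return_code, return_message, dg_job_id):
    """Build the RB reply to a JOB_SUBMIT request as a plain dict."""
    return {
        "ReturnCode": return_code,
        "ReturnMessage": return_message,
        "dg_jobId": dg_job_id,
    }
```

A length prefix is one common way to delimit messages on a TCP stream; any agreed protocol with unambiguous message boundaries would serve equally well.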

Interactions between UI and RB (6/10)
dg-list-job-match
    dg-list-job-match
- The information flow from the UI to the RB consists of a job class-ad built from the job description file, plus a variable indicating the request type. The job class-ad object consists of a list of entries of the form “attribute = expression”. The following attributes are always present in the job class-ad (jobAd):
    - UserID
    - CertificateSubject
    - ExecutableName
    - Input
    - Output
    - Constraints

Interactions between UI and RB (7/10)
- The variable indicating the request type, requestType, is passed in this case with the LIST_MATCH value.
- The jobAd associated with this requestType never contains either the ResourceID or the UserContact attribute.
- In this case the RB does not submit the job but only searches for resources compatible with the input jobAd.

Interactions between UI and RB (8/10)
- The expected response from the RB consists of:
    - ReturnCode, a numeric code indicating the operation result (0 stands for success)
    - ReturnMessage, a string describing the operation result
    - ResourceList, a list of ResourceIds, i.e. strings identifying the resources matching the jobAd

UI to RB                      RB to UI
- requestType (LIST_MATCH)    - ReturnCode
- jobAd                       - ReturnMessage
                              - ResourceList
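The LIST_MATCH behaviour described above can be illustrated with a toy RB-side handler. This is a sketch under strong assumptions: `list_match` is a hypothetical name, resources are modelled as a dict of attribute dicts, and the `Constraints` expression is replaced by a Python callable rather than a real class-ad expression. Only the convention that 0 means success comes from the slides; the non-zero code used here is arbitrary.

```python
def list_match(job_ad, resources):
    """Return (ReturnCode, ReturnMessage, ResourceList) for a LIST_MATCH request.

    job_ad    -- dict; job_ad["Constraints"] is assumed to be a callable
                 taking a resource description and returning True/False
    resources -- dict mapping ResourceId -> resource description dict
    """
    # A LIST_MATCH jobAd must not carry ResourceID or UserContact.
    forbidden = [name for name in ("ResourceID", "UserContact") if name in job_ad]
    if forbidden:
        return 1, "unexpected attributes: " + ", ".join(forbidden), []
    constraint = job_ad["Constraints"]
    # No job is submitted: the RB only collects the compatible resources.
    matches = [rid for rid, info in resources.items() if constraint(info)]
    return 0, "OK", matches
```

For example, with a constraint such as `lambda r: r["OpSys"] == "linux" and r["FreeCpus"] > 0`, only the resources whose description satisfies both conditions end up in the ResourceList.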

Interactions between UI and RB (9/10)
dg-job-cancel
    dg-job-cancel
- The information flow from the UI to the RB consists of:
    - userID, a string representing the user identifier
    - certSubject, a string containing the user certificate subject
    - dg_jobIdList, a list of strings representing the identifiers of the jobs to be canceled, as specified by the user
- If the dg-job-cancel command has been issued with the “-all” input parameter, then the dg_jobIdList passed to the RB will be empty, indicating that all jobs submitted by the user identified by userID have to be canceled.

Interactions between UI and RB (10/10)
- The expected response from the RB consists of:
    - ReturnCode, a numeric code indicating the operation result
    - ReturnMessage, a string describing the operation result
    - dg_jobIdList, a list of dg_jobIds, i.e. strings identifying the jobs effectively deleted

UI to RB                          RB to UI
- userID                          - ReturnCode
- certSubject                     - ReturnMessage
- dg_jobIdList (can be empty)     - dg_jobIdList
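The “empty list means all jobs” convention for dg-job-cancel can be sketched on the RB side as below. Everything here is illustrative: `cancel_jobs` is a hypothetical name, the RB job store is reduced to a dict from dg_jobId to owning userID, certificate checking is omitted, and the return codes (0 for full success, 1 otherwise) are assumptions.

```python
def cancel_jobs(job_table, user_id, dg_job_id_list):
    """Cancel jobs and return (ReturnCode, ReturnMessage, dg_jobIdList).

    job_table      -- dict mapping dg_jobId -> owning userID (mutated in place)
    dg_job_id_list -- job identifiers to cancel; an empty list means
                      "all jobs submitted by user_id" (the -all option)
    """
    if dg_job_id_list:
        targets = list(dg_job_id_list)
    else:
        targets = [jid for jid, owner in job_table.items() if owner == user_id]
    cancelled = []
    for job_id in targets:
        # Only jobs that exist and belong to the requesting user are removed.
        if job_table.get(job_id) == user_id:
            del job_table[job_id]
            cancelled.append(job_id)
    if len(cancelled) == len(targets):
        return 0, "OK", cancelled
    return 1, "some jobs could not be cancelled", cancelled
```

Returning the list of jobs actually removed, rather than echoing the request, lets the UI report exactly which cancellations took effect.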

Interactions between UI and M&C (1/11)
- The Job Submission UI contacts the M&C when the following commands are issued by the user:
    - dg-job-status
    - dg-get-logging-info
- To serve these requests the UI uses the bookkeeping and logging APIs made available by the M&C component (see the Cesnet L&B Service Document).
- The API implementation shall encompass network communication.

Interactions between UI and M&C (2/11)
dg-job-status
    dg-job-status | -all > [-full]
- The UI will use the provided L&B server API with the following input information:
    - UserId, a string representing the user identifier
    - dg_jobIdList, a list of dg_jobIds
    - InformationLevel, indicating the required information level (SHORT/FULL) according to the command option “-full”
- If the dg-job-status command has been issued with the “-all” input parameter, then the dg_jobIdList will be empty, indicating that status information about all jobs submitted by the user identified by userID is requested.

Interactions between UI and M&C (3/11)
- Returned information should encompass:
    - ReturnCode, a numeric code indicating the operation result
    - ReturnMessage, a string describing the operation result
    - JobsStatusInfo which, if the InformationLevel is SHORT, consists of (TBD):
        - userID
        - dg_jobID
        - jobStatus
        - ResourceID
        - Executable
        - input
        - output
        - submissionTime (when the job has been submitted from the UI)
        - scheduledTime (when the job has been submitted to the resource)
        - startRunningTime (when the job has started its execution)
        - StopRunningTime (when the job has completed its execution)

Interactions between UI and M&C (4/11)
- JobsStatusInfo which, if the InformationLevel is FULL, consists of (TBD):
    - ResourceManagementType
    - ResourceManagementVersion
    - Gramversion
    - Architecture
    - OpSys
    - traversalTime
    - TotalCpus
    - FreeCpus
    - RunningJobs
    - IdleJobs
    - MaxTotalJobs
    - MaxRunningJobs
    - Status
    - userID
    - dg_jobID
    - jobStatus
    - ResourceID
    - ResourceName
    - Executable
    - input
    - output
    - submissionTime
    - scheduledTime
    - startRunningTime
    - StopRunningTime
    - CpuTime
    - Rank
    - Constraint

Interactions between UI and M&C (5/11)
- In summary, with the dg-job-status command we have:

UI to M&C                         M&C to UI
- userID                          - ReturnCode
- dg_jobIdList (can be empty)     - ReturnMessage
- InformationLevel                - JobsStatusInfo
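How the InformationLevel could shape JobsStatusInfo can be sketched as a simple field selection. This is an assumption-laden illustration: `select_status_info` is a hypothetical helper, a job record is modelled as a dict, and the field lists in the slides are themselves marked TBD, so the SHORT set below merely copies the provisional list.

```python
# Provisional SHORT field set, copied from the (TBD) list in the slides.
SHORT_FIELDS = (
    "userID", "dg_jobID", "jobStatus", "ResourceID", "Executable",
    "input", "output", "submissionTime", "scheduledTime",
    "startRunningTime", "StopRunningTime",
)


def select_status_info(record, information_level):
    """Return the part of a job record matching the requested level.

    record            -- dict holding all known attributes of one job
    information_level -- "SHORT" or "FULL"
    """
    if information_level == "FULL":
        return dict(record)
    if information_level == "SHORT":
        # Keep only the SHORT attributes that are present in the record.
        return {name: record[name] for name in SHORT_FIELDS if name in record}
    raise ValueError("unknown InformationLevel: " + repr(information_level))
```

Whether this selection happens in the L&B server or in the UI is not settled by the slides; the sketch works the same in either place.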

Interactions between UI and M&C (6/11)
dg-get-logging-info
    dg-get-logging-info | -all > [-from T1] [-to T2] [-full]
- The UI will use the provided L&B server API with the following input information:
    - UserId, a string representing the user identifier
    - dg_jobIdList, a list of dg_jobIds
    - fromTime, timestamp
    - toTime, timestamp
    - InformationLevel, indicating the required information level (SHORT/FULL) according to the command option “-full”
- If the dg-get-logging-info command has been issued with the “-all” input parameter, then the dg_jobIdList will be empty.

Interactions between UI and M&C (7/11)
- Returned information should encompass:
    - ReturnCode, a numeric code indicating the operation result
    - ReturnMessage, a string describing the operation result
    - JobsLogInfo according to the requested InformationLevel
- In summary, with the dg-get-logging-info command we have:

UI to M&C                         M&C to UI
- userID                          - ReturnCode
- dg_jobIdList (can be empty)     - ReturnMessage
- fromTime                        - JobLogInfo
- toTime
- InformationLevel

Interactions between UI and M&C (8/11)
- JobLogInfo has to be defined:
    - userID
    - jobID
    - jobStatus
    - submissionTime
    - scheduledTime
    - startTime
    - finishTime
    - executable
    - executableSize
    - input data LFN
    - output data LFN
    - pendingReasons
    - Constraint
    - Rank
    - NumCpus
    - CpuTime
    - swapSpace
    - totalI/O
    - totalDataSpace
    - WallClockTime

Interactions between UI and M&C (9/11)
- RunWindows
- ResourcePriority
- MaxCpuTime
- MaxWallTime
- networkReq
- fromTime
- toTime
- RunWindows
- queuePriority
- MaxCpuTime
- MaxWallTime
- ResourceID
- ResourceName
- ResourceManagementType
- ResourceManagementVersion
- Gramversion
- executingHost
- Architecture
- OpSys
- traversalTime
- TotalCpus
- MaxTotalJobs
- MaxRunningJobs
- ResourceStatus

Interactions between UI and M&C (10/11)
- Some points from the L&B service document (pages 2, 3, 5):
    - SUBMITTED status: does it mean that the job is still in the UI?
    - The UI does not know the dg_jobId before the job is in the RB. How can it use the logging service?
    - CHKPT status: is checkpointing supported for PM9?
    - CLEARED status: who triggers transition 8? Who is going to log the JobClearedEvent?
    - GridScheduler, Condor-G and the Globus job-manager are expected to log: the UI too?
    - Modification of the Globus job-manager for logging: how much effort? Who does it?
    - JobDoneEvent: does Globus provide the job exit status?

Interactions between UI and M&C (11/11)
- Some points from the L&B service document (pages 10, 11, 13):
    - The L&B API does not seem to provide a way to select information by time.
        - Is it foreseen for PM9?
        - Shall the UI filter by time locally?
    - jobLog(): “level” vs “InformationLevel”
    - jobStatus(): are the “appropriate details” already defined?
    - Encapsulation of network communication in the L&B server API is needed.
    - JobSubmitEvent: the source of this log event is the GUI. Again the jobID is needed, but the UI does not know it.
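If the L&B API cannot select by time, local filtering in the UI could look roughly like the sketch below. It assumes, hypothetically, that each log event is a dict carrying a `timestamp` entry and that fromTime and toTime are inclusive bounds; neither detail is specified in the slides, and `filter_events_by_time` is an invented name.

```python
from datetime import datetime
from typing import Optional


def filter_events_by_time(events, from_time: Optional[datetime] = None,
                          to_time: Optional[datetime] = None):
    """Keep the events whose timestamp lies within [from_time, to_time].

    events    -- iterable of dicts, each with a "timestamp" datetime (assumed)
    from_time -- lower bound, or None for no lower bound (-from T1)
    to_time   -- upper bound, or None for no upper bound (-to T2)
    """
    selected = []
    for event in events:
        stamp = event["timestamp"]
        if from_time is not None and stamp < from_time:
            continue
        if to_time is not None and stamp > to_time:
            continue
        selected.append(event)
    return selected
```

Filtering in the UI is simple but means that all events for the requested jobs cross the network first; server-side selection would avoid that cost.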

Other Points (1/2)
- The information flow between the RB and the SS (the Condor-G wrapper) consists of class-ad objects.
    - The RB maintains a persistent queue of jobs submitted by the users through the UI. Jobs are described by jobAd objects and identified by their dg_jobIds.
    - Once a suitable resource for a job has been found by the RB through the match-making process, the jobAd is enriched with the found ResourceId and passed to the SS for submission.
- What information is returned to the RB?
    - A job handle (the condor_jobId) and the jobStatus (condor_rm command)?
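The hand-off from the RB queue to the SS can be sketched as below. This is only an illustration: an in-memory `deque` stands in for the persistent queue, `dispatch_next`, `find_resource` and `submit_to_ss` are hypothetical names, and the attribute is spelled `ResourceID` as on the dg-job-submit slides. What the SS actually returns is an open question in the slides, so the sketch simply passes its result through.

```python
from collections import deque


def dispatch_next(queue, find_resource, submit_to_ss):
    """Match the oldest queued job and hand it to the SS.

    queue         -- deque of (dg_jobId, jobAd dict) pairs, oldest first
    find_resource -- callable(jobAd) -> ResourceId or None (match-making)
    submit_to_ss  -- callable(dg_jobId, jobAd) -> whatever the SS returns
    Returns the SS result, or None if nothing was dispatched.
    """
    if not queue:
        return None
    dg_job_id, job_ad = queue[0]
    # A jobAd submitted with -resource already names its target resource.
    resource_id = job_ad.get("ResourceID") or find_resource(job_ad)
    if resource_id is None:
        return None  # no suitable resource yet; the job stays queued
    queue.popleft()
    enriched = dict(job_ad, ResourceID=resource_id)
    return submit_to_ss(dg_job_id, enriched)
```

In this sketch an unmatched job at the head blocks the jobs behind it; a real broker might instead scan the whole queue for dispatchable jobs.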

Other Points (2/2)
- How to detect job status transitions? Who does it?
    - Inspection of the Condor-G log files
    - globus_gram_client_job_status function
    - Other ways?
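Whichever source is chosen, one possible detection scheme is to compare successive status snapshots. The sketch below assumes a periodic poll that yields a dict from dg_jobId to status string; `detect_transitions` is a hypothetical helper, and how the snapshot is obtained (log inspection, a GRAM status call, or something else) is deliberately left out.

```python
def detect_transitions(previous, current):
    """Compare two status snapshots and list the changes.

    previous, current -- dicts mapping dg_jobId -> status string
    Returns a list of (dg_jobId, old_status, new_status) tuples, where
    old_status is None for jobs that were not in the previous snapshot.
    """
    transitions = []
    for job_id, new_status in current.items():
        old_status = previous.get(job_id)
        if old_status != new_status:
            transitions.append((job_id, old_status, new_status))
    return transitions
```

Each detected transition could then trigger a logging call or the e-mail notification requested with the “-notify” option. Polling can miss intermediate states that occur between two snapshots, which is one argument for reading an event log instead.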