Published by Joseph Oliver. Modified over 8 years ago.
1
Enabling Grids for E-sciencE
www.eu-egee.org
Workload Management & Simple Job Submission Practical
Shu-Ting Liao, APROC, ASGC
EGEE Tutorial
2
Outline
Workload Management
– Introduction
– User Interface
– JDL (Job Description Language)
– Job workflow
– Demo of job submission
– Practical
3
Workload Management System
The purpose of the Workload Management System (WMS) is:
- to accept user jobs
- to assign them to the most appropriate Computing Element
- to record their status
- to retrieve their output
The WMS used to be called the Resource Broker (RB). The new service is called gLite WMS.
4
Scope
[Diagram: the UI, WMS, File catalog, IS, SE, and CE & WN, with the JDL, Input Sandbox, and Output Sandbox exchanged through the UI.]
5
User Interface Node
The user’s interface to the Grid. Its basic functions let the user:
- list the computing resources
- submit a job
- get the job status
- cancel a job
- retrieve the output of a job
6
Job Description Language (JDL)
A job is described using the Job Description Language (JDL).
Example JDL file:
  Type = "Job";
  JobType = "Normal";
  Executable = "/bin/hostname";
  StdOutput = "hostname.out";
  StdError = "hostname.err";
  OutputSandbox = {"hostname.err","hostname.out"};
  Arguments = "-f";
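One way to put the example file on the UI is a shell here-document. This is only a sketch: the file name hostname.jdl is the one used on the demo slides, and the scratch directory is an assumption made so that nothing existing is overwritten.

```shell
# Work in a scratch directory (an assumption of this sketch).
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Write the example JDL from the slide. The quoted 'EOF' keeps the shell
# from expanding anything inside the file body.
cat > hostname.jdl <<'EOF'
Type = "Job";
JobType = "Normal";
Executable = "/bin/hostname";
StdOutput = "hostname.out";
StdError = "hostname.err";
OutputSandbox = {"hostname.err","hostname.out"};
Arguments = "-f";
EOF

# Show what was written.
cat hostname.jdl
```

Every JDL attribute ends with a semicolon, so a quick look at the printed file is usually enough to spot a missing one before submitting.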
7
Relevant JDL attributes
JobType
– Normal (sequential batch job), Collection, DAG, Parametric, Interactive, MPICH, Checkpointable
Executable
– The name of the executable (absolute path)
Arguments
– Job command-line arguments
StdInput, StdOutput, StdError
– Standard input/output/error of the job
Environment
– List of environment variables to be set for the binary
8
Relevant JDL attributes (2)
InputSandbox
– List of files on the UI local disk needed by the job to run
– The listed files will be staged to the remote resource
OutputSandbox
– List of files, generated by the job, which have to be retrieved
– Files will be transferred back
Requirements
– Required CE characteristics
Rank
– “Goodness” value for compatible CEs
ShallowRetryCount
– In case of a grid error, retry the job this many times (“shallow”: before the job is running)
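The attributes above could be combined as in the following sketch. None of the values come from the slides: myscript.sh, sandbox.jdl, std.out and std.err are hypothetical names, the retry count of 3 is arbitrary, and the Requirements and Rank expressions use GLUE 1.x attribute names (GlueCEStateStatus, GlueCEStateEstimatedResponseTime) that are assumed to be published by the information system in use.

```shell
# Scratch directory so the sketch does not touch existing files (assumption).
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical user script that will travel in the Input Sandbox.
cat > myscript.sh <<'EOF'
#!/bin/sh
hostname -f
EOF
chmod +x myscript.sh

# JDL using the sandbox, matchmaking and retry attributes.
cat > sandbox.jdl <<'EOF'
Type = "Job";
JobType = "Normal";
Executable = "myscript.sh";
StdOutput = "std.out";
StdError = "std.err";
InputSandbox = {"myscript.sh"};
OutputSandbox = {"std.out","std.err"};
Requirements = other.GlueCEStateStatus == "Production";
Rank = -other.GlueCEStateEstimatedResponseTime;
ShallowRetryCount = 3;
EOF

cat sandbox.jdl
```

Because the executable is shipped in the Input Sandbox, Executable names the file itself rather than an absolute path on the worker node.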
9
Job Control Commands
INFSO-RI-508833
WMS versions:
  LCG-2:    LCG-2 WMS
  NS:       gLite WMS via NS (gLite 3.0)
  WMProxy:  gLite WMS via WMProxy (gLite 3.1+, recommended)
Delegate proxy
  WMProxy:  glite-wms-job-delegate-proxy -d delegID
Submit
  LCG-2:    edg-job-submit [-o joblist] jdlfile
  NS:       glite-job-submit [-o joblist] jdlfile
  WMProxy:  glite-wms-job-submit [-d delegID] [-a] [-o joblist] jdlfile
Status
  LCG-2:    edg-job-status [-v verbosity] [-i joblist] jobIDs
  NS:       glite-job-status [-v verbosity] [-i joblist] jobIDs
  WMProxy:  glite-wms-job-status [-v verbosity] [-i joblist] jobIDs
Logging
  LCG-2:    edg-job-get-logging-info [-v verbosity] [-i joblist] jobIDs
  NS:       glite-job-logging-info [-v verbosity] [-i joblist] jobIDs
  WMProxy:  glite-wms-job-logging-info [-v verbosity] [-i joblist] jobIDs
Output
  LCG-2:    edg-job-get-output [-dir outdir] [-i joblist] jobIDs
  NS:       glite-job-output [-dir outdir] [-i joblist] jobIDs
  WMProxy:  glite-wms-job-output [-dir outdir] [-i joblist] jobIDs
Cancel
  LCG-2:    edg-job-cancel [-i joblist] jobID
  NS:       glite-job-cancel [-i joblist] jobID
  WMProxy:  glite-wms-job-cancel [-i joblist] jobID
Compatible resources
  LCG-2:    edg-job-list-match jdlfile
  NS:       glite-job-list-match jdlfile
  WMProxy:  glite-wms-job-list-match [-d delegID] [-a] jdlfile
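The naming pattern across the three WMS versions is regular enough to capture in a small helper, which may be handy in scripts that must run on more than one kind of UI. This is a sketch: the function name job_command and the keys lcg2, ns, wmproxy, logging, output and delegate are labels invented here; only the printed command names come from the slide.

```shell
# Print the command name for a WMS flavour and an operation.
job_command() {
    flavour=$1
    op=$2
    case "$flavour" in
        lcg2)    prefix=edg-job ;;
        ns)      prefix=glite-job ;;
        wmproxy) prefix=glite-wms-job ;;
        *) echo "unknown WMS flavour: $flavour" >&2; return 1 ;;
    esac
    case "$op" in
        submit|status|cancel|list-match)
            echo "$prefix-$op" ;;
        logging|output)
            # LCG-2 names carry an extra "get-"; the gLite ones do not.
            suffix=$op
            if [ "$op" = logging ]; then suffix=logging-info; fi
            if [ "$flavour" = lcg2 ]; then
                echo "$prefix-get-$suffix"
            else
                echo "$prefix-$suffix"
            fi ;;
        delegate)
            # Proxy delegation is listed only for WMProxy.
            if [ "$flavour" = wmproxy ]; then
                echo "$prefix-delegate-proxy"
            else
                echo "no delegation command for $flavour" >&2
                return 1
            fi ;;
        *) echo "unknown operation: $op" >&2; return 1 ;;
    esac
}

job_command wmproxy submit   # prints glite-wms-job-submit
job_command lcg2 logging     # prints edg-job-get-logging-info
```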
10
[Diagram: UI; WMS (WMProxy, Workload Manager, Job Controller - CondorG, RB storage); LFC; Information Service, fed with CE and SE characteristics and status; Computing Element; Storage Element. The job and its Input Sandbox files go from the UI to the WMS.]
glite-wms-job-submit myjob.jdl
WMProxy: responsible for accepting incoming requests.
Job status: submitted
11
[Diagram: same components as on the previous slide; the job is now inside the WMS.]
WM: responsible for taking the appropriate actions to satisfy the request.
Job status: submitted -> waiting
12
[Diagram: the Matchmaker/Broker is added to the WMS.]
Where must this job be executed?
Matchmaker: responsible for finding the “best” CE to which to submit a job.
Job status: submitted -> waiting
13
[Diagram: the Information Supermarket is added to the WMS alongside the Matchmaker/Broker.]
Information Supermarket: responsible for the resource information available to the Matchmaker.
Job status: submitted -> waiting
14
[Diagram: the Matchmaker/Broker and the Information Supermarket lead to the CE choice.]
Job status: submitted -> waiting
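The CE choice can be pictured as a filter followed by a sort: Requirements removes unsuitable CEs, and Rank orders the remaining ones. The pipeline below is a toy model of that idea, not the WMS implementation. The CE names, states and response times are invented, and the filter and rank used (state must be Production, rank is the negated estimated response time) are assumptions chosen for illustration.

```shell
# Hypothetical snapshot of resource information:
# CE name, state, estimated response time in seconds.
ce_info='ce01.example.org Production 1200
ce02.example.org Production 300
ce03.example.org Draining 10'

# Read CE records on standard input and print the best match.
best_ce() {
    # Requirements: keep only CEs whose state is Production.
    # Rank: negated response time, so the shortest wait ranks highest.
    awk '$2 == "Production" { print 0 - $3, $1 }' |
        sort -k1,1nr |
        head -n 1 |
        awk '{ print $2 }'
}

printf '%s\n' "$ce_info" | best_ce   # prints ce02.example.org
```

ce03 has the shortest response time but is dropped by the filter before ranking, which is why requirements and rank are kept as separate steps.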
15
[Diagram: the WMS now also shows the Task Queue.]
JC: responsible for the actual job management operations (done via CondorG).
Job status: submitted -> waiting -> ready
16
[Diagram: the job and its Input Sandbox files are sent to the Computing Element.]
Job status: submitted -> waiting -> ready -> scheduled
17
[Diagram: the job runs on the Computing Element with its Input Sandbox and performs “Grid enabled” data transfers/accesses.]
Job status: submitted -> waiting -> ready -> scheduled -> running
18
[Diagram: the Output Sandbox files are returned to the WMS.]
Job status: submitted -> waiting -> ready -> scheduled -> running -> done
19
[Diagram: the Output Sandbox is retrieved from the WMS by the UI.]
glite-wms-job-output
Job status: submitted -> waiting -> ready -> scheduled -> running -> done
20
[Diagram: the Output Sandbox files have been retrieved.]
Job status: submitted -> waiting -> ready -> scheduled -> running -> done -> cleared
21
Job monitoring
[Diagram: UI; Logging & Bookkeeping (LB); WMS (WMProxy, Workload Manager, Job Controller - CondorG, LB proxy); Computing Element. A log of job events flows to the LB, which provides the job status to the UI.]
LB: receives and stores job events; processes the corresponding job status.
Commands: glite-wms-job-status, glite-wms-job-logging-info
22
Job Submission - Demo 1
Create a simple JDL file.
Delegate the proxy:
  glite-wms-job-delegate-proxy -d $USER
23
Job Submission - Demo 2
List the CEs that can accept it:
  glite-wms-job-list-match -d $USER hostname.jdl
24
Job Submission - Demo 3
Submit it:
  glite-wms-job-submit -d $USER -o jobid hostname.jdl
25
Job Submission - Demo 4
Check its status until it is done:
  glite-wms-job-status -i jobid
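Repeating the status check by hand can be replaced by a small polling loop. The sketch below rests on several assumptions that are not on the slides: that the status command prints a line of the form "Current Status:     Done (Success)", that the jobid file holds a single job identifier so no interactive question is asked, and that Cancelled is a possible final state. The function names current_status and wait_for_job and the 30-second default interval are invented for this example.

```shell
# Extract the state word from status text read on standard input.
# Assumes a line such as "Current Status:     Done (Success)".
current_status() {
    sed -n 's/^Current Status:[[:space:]]*\([A-Za-z]*\).*/\1/p' | head -n 1
}

# Poll until the job reaches a state that will not change by itself.
# Needs the gLite UI commands, so it is only defined here, not called.
wait_for_job() {
    jobid_file=$1
    interval=${2:-30}
    while :; do
        state=$(glite-wms-job-status -i "$jobid_file" | current_status)
        echo "state: $state"
        case "$state" in
            Done|Aborted|Cancelled|Cleared) break ;;
        esac
        sleep "$interval"
    done
}

# Try the parser on canned text, so no grid access is needed.
printf 'Current Status:     Done (Success)\n' | current_status   # prints Done
```

On a UI the loop would be started with wait_for_job jobid, using the file written by the submit step.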
26
Possible job states
Flag        Meaning
SUBMITTED   submission logged in the LB
WAITING     job match making for resources
READY       job being sent to executing CE
SCHEDULED   job scheduled in the CE queue manager
RUNNING     job executing on a WN of the selected CE queue
DONE        job terminated without grid errors
CLEARED     job output retrieved
ABORTED     job aborted by middleware, check reason
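For scripts that report on many jobs, the table can be turned into a lookup. This is a minimal sketch with a hypothetical function name, state_meaning; it accepts both the short spellings WAIT and ABORT and the longer WAITING and ABORTED, and it ignores letter case, since tools may print the states differently.

```shell
# Print the meaning of a job state flag; return 1 for an unknown flag.
state_meaning() {
    # Upper-case the argument so "Done" and "DONE" are treated alike.
    flag=$(printf '%s' "$1" | tr '[:lower:]' '[:upper:]')
    case "$flag" in
        SUBMITTED)     echo "submission logged in the LB" ;;
        WAIT|WAITING)  echo "job match making for resources" ;;
        READY)         echo "job being sent to executing CE" ;;
        SCHEDULED)     echo "job scheduled in the CE queue manager" ;;
        RUNNING)       echo "job executing on a WN of the selected CE queue" ;;
        DONE)          echo "job terminated without grid errors" ;;
        CLEARED)       echo "job output retrieved" ;;
        ABORT|ABORTED) echo "job aborted by middleware, check reason" ;;
        *)             echo "unknown state: $1" >&2; return 1 ;;
    esac
}

state_meaning Scheduled   # prints: job scheduled in the CE queue manager
```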
27
Job Submission - Demo 5
Retrieve the output:
  glite-wms-job-output -i jobid
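The demo steps can be strung together in one script. The sketch below defaults to a dry run that only prints each command, so it can be read and tried on a machine without the gLite UI installed. The helper names run and demo, the DRY_RUN switch and the fallback delegation ID mydelegation are inventions of this example; the commands and their arguments are those of the demo slides, preceded by the VOMS proxy creation from the practical.

```shell
# DRY_RUN=1 (the default here) prints commands instead of running them.
DRY_RUN=${DRY_RUN:-1}

# Run a command, or only show it when in dry-run mode.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# The whole demo, in order.
demo() {
    # Delegation ID: the user name, or a hypothetical fallback.
    deleg=${USER:-mydelegation}
    run voms-proxy-init -voms gilda
    run glite-wms-job-delegate-proxy -d "$deleg"
    run glite-wms-job-list-match -d "$deleg" hostname.jdl
    run glite-wms-job-submit -d "$deleg" -o jobid hostname.jdl
    run glite-wms-job-status -i jobid
    run glite-wms-job-output -i jobid
}

demo
```

Setting DRY_RUN=0 on a real UI executes the same sequence; the status step would then normally be repeated until the job is done before the output is retrieved.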
28
Practical
Follow the links on the agenda page:
  https://grid.ct.infn.it/twiki/bin/view/GILDA/SimpleJobSubmission
User Interface: glite-tutor2.ct.infn.it
Before submitting a job, create a VOMS proxy:
  $ voms-proxy-init -voms gilda
Enter your pass phrase when prompted.
29
Reference
Job submit: explains step by step how to submit your job
  https://edms.cern.ch/document/498081/1.0
Job Description Language How To
  http://server11.infn.it/workload-grid/docs/DataGrid-01-TEN-0102-0_2-Document.pdf
Resource Broker: Resource Broker Architecture and APIs
  http://server11.infn.it/workload-grid/docs/20010613-RBArch-2.pdf
WMS: WP1 Workload Management Software - Administrator and User Guide
  http://server11.infn.it/workload-grid/docs/DataGrid-01-TEN-0118-1_2.pdf
WP1 internal documents: more complete list of documents
  http://server11.infn.it/workload-grid/internal-documents.html