M. Sgaravatto – n° 1 Overview of WP1 Workload Management System in EDG 2.x Massimo Sgaravatto INFN Padova - DataGrid WP1

M. Sgaravatto – n° 2 WMS release 2
- Addressed the reliability and scalability problems seen with EDG rel. 1.x
- Many improvements (in design and implementation) and bug fixes
- New functionalities introduced
  - User APIs
  - GUI
  - Support for interactive jobs
  - Job checkpointing
  - Support for parallel jobs
  - Gangmatching
  - Support for automatic output data upload and registration
  - …

M. Sgaravatto – n° 3 Interactive jobs
- Specified by setting JobType = “Interactive” in the JDL
- When an interactive job is executed, a window for the stdin, stdout and stderr streams is opened
  - Possibility to send the stdin to the job
  - Possibility to have the stderr and stdout of the job while it is running
- Possibility to start a window for the standard streams of a previously submitted interactive job with the command edg-job-attach

M. Sgaravatto – n° 4 Job checkpointing
- Checkpointing: saving the job state from time to time
  - Useful to prevent data loss due to unexpected failures
  - Approach: provide users with a “trivial” logical job checkpointing service
  - The user can save the state of the job (as defined by the application) from time to time
  - A job can be restarted from an intermediate (i.e. previously saved) job state
- Different from “classical” checkpointing (i.e. saving all the information related to a process: the process’s data and stack segments, open files, etc.)
  - Very difficult to apply (e.g. problems saving the state of open network connections)
  - Not necessary for many applications
- To submit a checkpointable job
  - Code must be instrumented (see next slides)
  - JobType=Checkpointable must be specified in the JDL

M. Sgaravatto – n° 5 Job checkpointing example
Example of application (e.g. HEP Monte Carlo simulation):

    int main ()
    {
        …
        for (int i = event; i < EVMAX; i++) {
            /* process event i */;
        }
        ...
        exit(0);
    }

M. Sgaravatto – n° 6 Job checkpointing example

    #include "checkpointing.h"

    int main ()
    {
        JobState state(JobState::job);
        event = state.getIntValue("first_event");
        PFN_of_file_on_SE = state.getStringValue("filename");
        ….
        var_n = state.getBoolValue("var_n");
        …
        for (int i = event; i < EVMAX; i++) {
            ...
            state.saveValue("first_event", i+1);
            state.saveValue("filename", PFN_of_file_on_SE);
            ...
            state.saveValue("var_n", value_n);
            state.saveState();
        }
        …
        exit(0);
    }

User code must be (easily) instrumented in order to exploit the checkpointing framework …

M. Sgaravatto – n° 7 Job checkpointing example
(Same instrumented code as on slide n° 6.)
- The user defines what a state is
- Defined as <name, value> pairs
- Must be “enough” to restart a computation from a previously saved state

M. Sgaravatto – n° 8 Job checkpointing example
(Same instrumented code as on slide n° 6.)
- The user can save the state of the job from time to time

M. Sgaravatto – n° 9 Job checkpointing example
(Same instrumented code as on slide n° 6.)
- Retrieval of the last saved state
- The job can restart from that point
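
The save/restart cycle described on these slides can be tried out without any Grid middleware. The sketch below is only an illustration under stated assumptions: MockJobState is a hypothetical in-memory stand-in, not the real JobState class from "checkpointing.h", and runJob, crash and the fallback argument of getIntValue are invented for the example.

```cpp
#include <map>
#include <string>

// Hypothetical in-memory stand-in for the EDG JobState class.
// The real class is declared in "checkpointing.h"; nothing here is its actual API
// beyond the method names that appear on the slides.
class MockJobState {
public:
    // Stage a <name, value> pair; it becomes durable only after saveState().
    void saveValue(const std::string& name, int value) { pending_[name] = value; }

    // Commit everything staged so far as the new "last saved state".
    void saveState() { saved_ = pending_; }

    // Read a value from the last saved state, or `fallback` if it was never saved.
    int getIntValue(const std::string& name, int fallback = 0) const {
        std::map<std::string, int>::const_iterator it = saved_.find(name);
        return it == saved_.end() ? fallback : it->second;
    }

    // Simulate a job failure: staged but uncommitted values are lost.
    void crash() { pending_ = saved_; }

private:
    std::map<std::string, int> pending_;
    std::map<std::string, int> saved_;
};

// Process events [first_event, ev_max), checkpointing after each event.
// The run is interrupted after `fail_after` events (negative = never).
// Returns the number of events processed in this run.
int runJob(MockJobState& state, int ev_max, int fail_after) {
    int processed = 0;
    for (int i = state.getIntValue("first_event"); i < ev_max; ++i) {
        if (processed == fail_after) {
            state.crash();
            return processed;
        }
        // ... process event i ...
        ++processed;
        state.saveValue("first_event", i + 1);
        state.saveState();
    }
    return processed;
}
```

Calling runJob(state, 10, 4) and then runJob(state, 10, -1) on the same object processes 4 events in the first run and only the remaining 6 in the second, which is the behaviour the slides describe: the restarted job resumes from the last saved state instead of from the beginning.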

M. Sgaravatto – n° 10 Job checkpointing scenarios
- Scenario 1
  - Job submitted to a CE
  - While the job runs, it saves its state from time to time
  - Job failure due to a Grid problem (e.g. CE problem)
  - Job resubmitted by the WMS, possibly to a different CE
  - Job restarts its computation from the last saved state
    - No need to restart from the beginning
    - The computation done up to that point is not lost
- Scenario 2
  - Job failure, but not detected by the Grid middleware
  - User can retrieve a saved state for the job (typically the last one)
    - edg-job-get-chkpt –o
  - User resubmits the job, specifying that it must start from a specific initial state (the retrieved one)
    - edg-job-submit –chkpt

M. Sgaravatto – n° 11 Submission of parallel jobs
- Possibility to submit MPI jobs
- MPICH implementation supported
- Only parallel jobs that run inside a single CE can be submitted
- Submitting a parallel job is very similar to submitting a normal job
  - It is only necessary to specify in the JDL:
    - JobType = “MPICH”
    - NodeNumber = n;
  - where n is the number of requested CPUs
- Matchmaking
  - The CE chosen by the RB must have the MPICH software installed and at least n CPUs in total
  - If two or more CEs satisfy all the requirements, the one with the highest number of free CPUs is chosen
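
The matchmaking rule for MPICH jobs stated above can be sketched in a few lines. This is a minimal illustration, not the Resource Broker's code: the MpiCe record and the selectCeForMpich function are hypothetical, and breaking ties in favour of the first CE listed is an assumption the slide does not cover.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified record of what is known about a CE.
struct MpiCe {
    std::string id;
    bool has_mpich;   // MPICH software installed on the CE
    int total_cpus;
    int free_cpus;
};

// Returns the index of the chosen CE in `ces`, or -1 if no CE qualifies.
// A CE qualifies if it has MPICH and at least `node_number` CPUs in total;
// among qualifying CEs the one with the most free CPUs wins
// (ties go to the first one listed, which is an assumption of this sketch).
int selectCeForMpich(const std::vector<MpiCe>& ces, int node_number) {
    int best = -1;
    for (std::size_t i = 0; i < ces.size(); ++i) {
        const MpiCe& ce = ces[i];
        if (!ce.has_mpich || ce.total_cpus < node_number) {
            continue;
        }
        if (best < 0 || ce.free_cpus > ces[best].free_cpus) {
            best = static_cast<int>(i);
        }
    }
    return best;
}
```

Note that the filter uses the total CPU count while the ranking uses the free CPU count, so a CE can be selected even when fewer than n CPUs are free at that moment.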

M. Sgaravatto – n° 12 Gangmatching
- Allows both CE and SE information to be taken into account in the matchmaking
- For example, to require a job to run on a CE close to an SE with at least 200 MB of available space:
  Requirements = anyMatch(other.storage.CloseSEs, target.GlueSAStateAvailableSpace > 200);
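
What the anyMatch requirement above asks for can be spelled out procedurally: a CE matches if at least one of its close SEs satisfies the space condition. The sketch below only illustrates that semantics; GangSe, GangCe and both functions are hypothetical names, and the real evaluation is done by the matchmaker on the published attributes.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified records for this sketch.
struct GangSe {
    std::string id;
    long available_space;  // same units as the threshold used in the requirement
};

struct GangCe {
    std::string id;
    std::vector<GangSe> close_ses;  // SEs published as close to this CE
};

// True if at least one close SE has more than `min_space` available,
// mirroring anyMatch(CloseSEs, AvailableSpace > min_space).
bool anyCloseSeHasSpace(const GangCe& ce, long min_space) {
    for (std::size_t i = 0; i < ce.close_ses.size(); ++i) {
        if (ce.close_ses[i].available_space > min_space) {
            return true;
        }
    }
    return false;
}

// Ids of all CEs that satisfy the requirement.
std::vector<std::string> matchingCes(const std::vector<GangCe>& ces, long min_space) {
    std::vector<std::string> result;
    for (std::size_t i = 0; i < ces.size(); ++i) {
        if (anyCloseSeHasSpace(ces[i], min_space)) {
            result.push_back(ces[i].id);
        }
    }
    return result;
}
```

A CE with no close SEs never matches, and the comparison is strict, as in the expression on the slide.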

M. Sgaravatto – n° 13 Output data registration

    OutputData = {
        [
            OutputFile = "filename1";
            LogicalFileName = "lfn:mylfn1";
            StorageElement = "testbed007.cnaf.infn.it"
        ],
        [
            OutputFile = "filename2"
        ],
        [
            OutputFile = "filename3";
            LogicalFileName = "lfn:mylfn2"
        ],
        [
            OutputFile = "filename4";
            StorageElement = "testbed007.cnaf.infn.it"
        ]
    }

The four entries, in order:
- filename1: both LFN and target SE specified
- filename2: neither LFN nor target SE specified
- filename3: only LFN specified
- filename4: only target SE specified
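
The four combinations shown on this slide can be told apart mechanically from which optional attributes are present. The following sketch is only an illustration: OutputDataEntry, OutputDataCase and classifyOutputData are hypothetical names, an empty string stands for "attribute not specified", and the sketch deliberately says nothing about what the WMS does in each case.

```cpp
#include <string>

// Hypothetical in-memory form of one OutputData entry.
// An empty string means the attribute was not given in the JDL.
struct OutputDataEntry {
    std::string output_file;
    std::string logical_file_name;
    std::string storage_element;
};

enum class OutputDataCase { LfnAndSe, NeitherLfnNorSe, LfnOnly, SeOnly };

// Classify an entry into one of the four cases listed on the slide.
OutputDataCase classifyOutputData(const OutputDataEntry& e) {
    const bool has_lfn = !e.logical_file_name.empty();
    const bool has_se = !e.storage_element.empty();
    if (has_lfn && has_se) return OutputDataCase::LfnAndSe;
    if (has_lfn) return OutputDataCase::LfnOnly;
    if (has_se) return OutputDataCase::SeOnly;
    return OutputDataCase::NeitherLfnNorSe;
}
```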

M. Sgaravatto – n° 14 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul 2003
Associated with: EDG 2.0.x, LCG-1

M. Sgaravatto – n° 15 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul, Aug 2003
Associated with: EDG 2.0.x, LCG-1
- Fix for proxy renewal with short proxies
- Fix for verbosity in log files
- Fix for EDG_LOCATION setting (needed for interactions with WP2 services)
- Man pages
- …

M. Sgaravatto – n° 16 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul, Aug, Aug 2003
Associated with: EDG 2.0.x, LCG-1
- Fix for LM DB corruption causing LM and JC failures (#1807)
- Fix for ‘done’ job status declared too early (#1808)
- Fix for automatic output data upload and registration
- …

M. Sgaravatto – n° 17 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul, Aug, Aug, Sep 2003
Associated with: EDG 2.0.x, LCG-1
- Minor fix for automatic output data upload and registration

M. Sgaravatto – n° 18 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul, Aug, Aug, Sep, Sep 2003
Associated with: EDG 2.0.x
- Fix for gangmatching
- Fix for log rotation
- Fix for proxy renewal DB corruption
- Much more useful error messages (no more “an helper failed”)
- Fix for CEId in JDL causing WM crash
- Fix for matchmaking problems when there are multiple closeSEs for a given CE
- Fix for edg-Brokerinfo getDataAccessProtocol
- …
Status: deployed in the EDG dev testbed; under deployment by LCG

M. Sgaravatto – n° 19 WP1 releases
Table columns: WP1 RPMs, Release date
Release dates: Jul, Aug, Aug, Sep, Sep, “Very soon!”
Associated with: EDG 2.0.x, EDG 2.1.x
- VOMS support
- Support for ACLs in L&B (to set who can query the status of a given job)
- Fix for matchmaking when attributes belonging to the Subcluster objectclass are used as rank
- …
Status: deployed in the EDG dev testbed; under deployment by LCG

M. Sgaravatto – n° 20 VOMS support
- VO taken from the VOMS user proxy
- Matchmaking performed with respect to the VO
  - As a consequence, it is no longer necessary to publish the list of authorized users in the information service (only the list of authorized VOs is needed)
- The WMS also works with non-VOMS proxies
  - If the VO cannot be obtained from the proxy, it is taken:
    - from the JDL (VirtualOrganisation attribute), if specified
    - from the --vo VO option, if specified
    - from the UI conf file
  - Matchmaking: if there are no matches with respect to the VO, matchmaking is tried with respect to the user DN
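
The fallback chain for determining the VO can be written as a short function. This is a sketch under assumptions: VoSources and resolveVo are hypothetical names, an empty string means "not available", and the precedence between the JDL attribute and the --vo option simply follows the order in which the slide lists them, which the slide does not state explicitly.

```cpp
#include <string>

// Hypothetical bundle of the places a VO name may come from.
// An empty string means that source did not provide a VO.
struct VoSources {
    std::string voms_proxy_vo;  // VO carried by a VOMS proxy (empty for a plain proxy)
    std::string jdl_vo;         // VirtualOrganisation attribute in the JDL
    std::string cli_vo;         // value of the --vo command-line option
    std::string ui_conf_vo;     // default from the UI configuration file
};

// Resolve the VO, trying each source in the order listed on the slide.
// Returns an empty string only if every source is empty.
std::string resolveVo(const VoSources& s) {
    if (!s.voms_proxy_vo.empty()) return s.voms_proxy_vo;
    if (!s.jdl_vo.empty()) return s.jdl_vo;
    if (!s.cli_vo.empty()) return s.cli_vo;
    return s.ui_conf_vo;
}
```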

M. Sgaravatto – n° 21 Other issues raised by applications
- “Cryptic” error messages
  - Much more useful error messages with WP1 RPMs >= …
    - No more “cannot plan (an helper failed)”
    - Examples of new error messages when jobs get aborted:
      - Cannot plan: BrokerHelper: Problems querying the information service bbq.mi.infn.it
      - Cannot plan: BrokerHelper: The user is not authorized on any resource currently registered in bbq.mi.infn.it
      - Cannot plan: BrokerHelper: no compatible resources
      - Cannot plan: BrokerHelper: All compatible resources are unavailable (problems during rank evaluation)
    - … and these are not failures in the WMS software …
  - The same will be done for edg-job-list-match

M. Sgaravatto – n° 22 Other issues raised by applications
- BrokerInfo responsibility
  - Writing and reading (edg-brokerinfo and brokerinfo APIs) are now completely up to WP1
    - In the past, brokerinfo reading was a joint WP1-WP2 effort
  - One open issue now: “replacement for old getSelectedFile”
- “RB queries all suitable CEs”
  - This means that only the CEs which match all the requirements are queried
  - Needed to see whether the CE is still available
  - Needed to get fresher information than what is published in GOUT/BDII

M. Sgaravatto – n° 23 Other issues raised by applications
- “Matchmaking: with multiple files will match on 1 even if the others are not available”
  - Nothing new: same algorithm used in release 1
  - Algorithm: choose the CE which meets all the requirements and which has the largest number of the files close to it
  - The application is then supposed to copy locally, or access remotely, the files that are not close
- Outbound connectivity for the WNs
  - We don’t think it is a real problem …
  - … in any case we are going to evaluate a different mechanism to transfer sandbox files between WN and RB, for which outbound connectivity is no longer required for the WNs
  - Outbound connectivity is still required for interactive jobs
  - What about other services (e.g. to copy files from/to a remote SE)?
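
The data-driven matchmaking rule quoted above explains why a job can be matched to a CE even when some input files are not close to it. A minimal sketch follows; CandidateCe, countCloseFiles and chooseCeByCloseness are hypothetical names, and breaking ties in favour of the first CE listed is an assumption of the sketch, not something the slide specifies.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// Hypothetical, simplified candidate record for this sketch.
struct CandidateCe {
    std::string id;
    bool meets_requirements;            // result of evaluating the JDL Requirements
    std::set<std::string> close_files;  // input files stored on SEs close to this CE
};

// Number of the job's input files that are close to the given CE.
int countCloseFiles(const CandidateCe& ce, const std::vector<std::string>& inputs) {
    int n = 0;
    for (std::size_t i = 0; i < inputs.size(); ++i) {
        if (ce.close_files.count(inputs[i]) > 0) {
            ++n;
        }
    }
    return n;
}

// Index of the CE that meets all requirements and has the most input files
// close to it, or -1 if no CE meets the requirements.
int chooseCeByCloseness(const std::vector<CandidateCe>& ces,
                        const std::vector<std::string>& inputs) {
    int best = -1;
    int best_count = -1;
    for (std::size_t i = 0; i < ces.size(); ++i) {
        if (!ces[i].meets_requirements) {
            continue;
        }
        const int count = countCloseFiles(ces[i], inputs);
        if (count > best_count) {
            best = static_cast<int>(i);
            best_count = count;
        }
    }
    return best;
}
```

With three input files, a CE that has only two of them close can still be chosen if no other suitable CE does better; the job is then expected to copy or remotely access the remaining file, as the slide says.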

M. Sgaravatto – n° 24 Other issues raised by applications
- “Cannot read JobWrapper output, both from Condor and from Maradona” problem
  - Background
    - The stdout of the wrapper script around the user job is transferred via Globus GASS from the CE node to the RB machine, in order to check whether the job executed successfully
    - Many causes of the “Cannot read …” problem were found and addressed in release 1, and the fixes are also in release 2
  - These were not problems in the WP1 software, but they were mostly addressed by WP1
    - To reduce the failure rate, the WMS software was modified to transfer the standard output of the job wrapper via gridftp as well, and not only via GASS: Maradona
  - This problem now appears only from time to time and only on some sites
    - Local configuration problems?
  - Warning: the problem appears if OutputData is used in the JDL with WP1 RPMs < … (the bug has since been fixed)

M. Sgaravatto – n° 25 Conclusions
- Thanks very much for your feedback
  - Your input has always been valuable and has contributed to improving the EDG WMS
- We hope you are happy enough with what we provided and with our level of support
- Waiting for feedback on some new functionalities that do not seem to have been tried by the applications yet
  - Many of them were not in the initial WP1 agenda, but were addressed because applications requested them
    - E.g. support for interactive jobs, strongly requested by WP8
    - E.g. support for MPI jobs, requested by WP9 and WP10
- WP1 near future (up to the end of the project)
  - Support and bug fixes
  - Refinements and new functionalities (e.g. DAGMan)
    - Detailed plans to be discussed at the WP1 meeting here in Heidelberg