Download presentation
Presentation is loading. Please wait.
Published byAnissa Lee Modified over 9 years ago
1
A proposal for standardizing the working environment for a LCG/EGEE job David Bouvet - Grid Computing team - CCIN2P3 HEPIX Karlsruhe 13/05/2005
2
David Bouvet – HEPIX Karlsruhe 13/05/2005 2 Motivation Problem raised some months ago by Jeff Templon: –D0 jobs encountered problems at Lyon due to different use of environment variables to address scratch/temp disk space Standard is defined for: –Environment Variables « IEEE Std 1003.1, 2004 POSIX Part 1: Base definitions, Amendment 8 » http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap08.html http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap08.html among which: HOME, PATH, PWD, SHELL, TMPDIR, USER –Batch Environment Services « IEEE Std 1003.1, 2004 POSIX Part 2: Shell and Utilities, Amendment 1 » http://www.opengroup.org/onlinepubs/009695399/utilities/xcu_chap03.html http://www.opengroup.org/onlinepubs/009695399/utilities/xcu_chap03.html PBS_ENVIRONMENT, PBS_JOBID, PBS_JOBNAME, PBS_QUEUE PBS_O_HOME, PBS_O_HOST, PBS_O_LOGNAME, PBS_O_PATH, PBS_O_QUEUE, PBS_O_SHELL, PBS_O_WORKDIR these variables are not directly used by the jobs There is no standard definition of environment variables for grid batch jobs Proposal for LCG/EGEE sites of a common definition of minimal set of environment variables for grid batch jobs
3
David Bouvet – HEPIX Karlsruhe 13/05/2005 3 Current status through several batch used on the grid Environment variables for grid batch job have been checked on several LCG/EGEE sites (among which all the LCG T1s) Conditions of test: ATLAS VO, short queue Batch system CEs distribution # CEs checked BQS32 CONDOR43 TORQUE7211 PBS3613 LSF54
4
David Bouvet – HEPIX Karlsruhe 13/05/2005 4 Current status: POSIX variables : defined : not defined on some sites not all these variables are defined on the various batch systems VariableBQSCONDORTORQUEPBSLSF POSIX basic: HOME PATH PWD SHELL TMPDIR USER POSIX batch
5
David Bouvet – HEPIX Karlsruhe 13/05/2005 5 Current status (cont.) : defined : not defined on some sites even for Globus, not all the sites define the same set of environment variables. VariableBQSCONDORTORQUEPBSLSF GLOBUS variables: GLOBUS_LOCATION GLOBUS_PATH GLOBUS_TCP_PORT_RANGE X509_USER_PROXY MYPROXY_SERVER (useful for proxy renewal)
6
David Bouvet – HEPIX Karlsruhe 13/05/2005 6 Current status: LCG environment variables (middleware related) (list from the LCG Users Guide) VariableDefinitionBQSCONDORTORQUEPBSLSF EDG_LOCATION Base of the installed EDG software LCG_LOCATION Base of the installed LCG software EDG_WL_JOBID Job ID (for a running job) in a WN EDG_WL_LOCATION Base of the EDG’s WMS software EDG_WL_PATH Path for EDG’s WMS commands EDG_WL_RB_BROKERINFO Location of the.BrokerInfo file in a WN LCG_GFAL_INFOSYS Location of the BDII for lcg-utils and GFAL LCG_CATALOG_TYPE Type of file catalog used (edg or lfc) for lcg-utils and GFAL LFC_HOST Location of the LFC catalog (only for catalog type lfc)
7
David Bouvet – HEPIX Karlsruhe 13/05/2005 7 Current status: LCG environment variables (job related) (list from the LCG Users Guide) VariableDefinitionBQSCONDORTORQUEPBSLSF EDG_TMP Temp directory LCG_TMP Temp directory VO_ _DEFAULT_SE Default SE defined for a CE in a WN VO_ _SW_DIR Base directory of the VO’s software in a WN possible uniformization to POSIX name: TMPDIR ?
8
David Bouvet – HEPIX Karlsruhe 13/05/2005 8 Current status: gLite environment variables gLite environment variables on WN (in config. files and scripts) from gLite installation guide: –GLITE_LOCATION /opt/glite –GLITE_LOCATION_VAR /var/glite –GLITE_LOCATION_LOG /var/log/glite –GLITE_LOCATION_TMP /tmp/glite GLITE_LOCATION_TMP another tmp directory to clean!
9
David Bouvet – HEPIX Karlsruhe 13/05/2005 9 Proposal for standardization Variable type DefinitionName POSIX Home directory of job user on WN HOME Temp directory TMPDIR (currently LCG_TMP, EDG_TMP, GLITE_LOCATION_TMP) PWD SHELL PATH Grid batch jobs Job working directory on WN GRID_WORKDIR Site name on which the job run (same as siteName in Information Provider) GRID_SITENAME WN hostname on which the job run GRID_HOSTNAME CE and queue names on which the job run (same as GlueCEUniqueID in Information Provider) GRID_CEID Job ID in local batch system GRID_LOCAL_JOBID Job ID on grid GRID_GLOBAL_JOBID (currently EDG_WL_JOBID) User’s DN of certificate GRID_USERID
10
David Bouvet – HEPIX Karlsruhe 13/05/2005 10 Proposal for standardization (cont.) Use of POSIX variable when existing –TMPDIR: POSIX variable which can replace LCG_TMP, EDG_TMP, GLITE_LOCATION_TMP –HOME: MPI jobs need a home directory
11
David Bouvet – HEPIX Karlsruhe 13/05/2005 11 Proposal for standardization (cont.) Minimal set of environment variable (not related to middleware). The naming convention must be independant of grid middleware name for grid job portability –GRID_WORKDIR –GRID_WORKDIR: work directory specific to the job (unix right 700) e.g.: /scratch/atlas0011293.ccwl0092 –GRID_SITENAME –GRID_SITENAME: to know on which site the job run (same as siteName in the Information System) e.g.: IN2P3-CC –GRID_HOSTNAME –GRID_HOSTNAME: could be useful to know the WN hostname for problem tracking (and parallel jobs?) e.g.: ccwl0006.in2p3.fr –GRID_CEID –GRID_CEID: CE and queue names on which the job run (same as GlueCEUniqueID in Information System) e.g.: heplnx201.pp.rl.ac.uk:2119/jobmanager-torque-short –GRID_LOCAL_JOBID –GRID_LOCAL_JOBID: useful for problem tracking (and parallel jobs?) lcg0509104420-07243e.g.: lcg0509104420-07243 –GRID_GLOBAL_JOBID –GRID_GLOBAL_JOBID: same as EDG_WL_JOBID for LCG e.g.: https://lxn1188.cern.ch:9000/HPMN2WVHurMlji-Fnqba0A –GRID_USERID –GRID_USERID: DN of user’s certificate (already exists on some sites) /e.g.: /O=GRID-FR/C=FR/O=CNRS/OU=CC-LYON/CN=David Bouvet/Email=dbouvet@in2p3.fr
12
David Bouvet – HEPIX Karlsruhe 13/05/2005 12 Proposal for standardization (cont.) When agreed on a set of variables and a naming convention, this standard should be implemented on all LCG/EGEE CEs. Based on today’s discussion, a document will be distributed to site administrators and applications. A possible deadline for discussion and beginning of deployment: end of June
13
David Bouvet – HEPIX Karlsruhe 13/05/2005 13 Proposal for standardization (discussion) Variable type DefinitionName Agreement on POSIX Home directory of job user on WN HOME Temp directory TMPDIR (currently LCG_TMP, EDG_TMP, GLITE_LOCATION_TMP) Grid batch jobs Job working directory on WN GRID_WORKDIR Site name on which the job run (same as siteName in Information Provider) GRID_SITENAME WN hostname on which the job run GRID_HOSTNAME CE and queue names on which the job run (same as GlueCEUniqueID in Information Provider) GRID_CEID Job ID in local batch system GRID_LOCAL_JOBID Job ID on grid GRID_GLOBAL_JOBID (currently EDG_WL_JOBID) User’s DN of certificate GRID_USERID
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.