Glexec/SCAS Pilot: IN2P3-CC status

Slides:



Advertisements
Similar presentations
2010/05/07 Glexec, CREAM CE Pierre Girard Réunion CAF
Advertisements

JSAGA2 Overview job desc. gLite plug-ins Globus plug-ins JSAGA hidemiddlewareheterogeneity (e.g. gLite, Globus, Unicore) JDLRSL.
A module to customize CREAM jobs according to site policies Tsukuba, KEK, 21 December 2010 Sylvain Reynaud JWGEN :
Pilots 2.0: DIRAC pilots for all the skies Federico Stagni, A.McNab, C.Luzzi, A.Tsaregorodtsev On behalf of the DIRAC consortium and the LHCb collaboration.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
11/30/2007 Overview of operations at CC-IN2P3 Exploitation team Reported by Philippe Olivero.
Grid infrastructure analysis with a simple flow model Andrey Demichev, Alexander Kryukov, Lev Shamardin, Grigory Shpiz Scobeltsyn Institute of Nuclear.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp - SWITCH EGI TF Prague.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Angela Poschlad (PPS-FZK), Antonio Retico.
Pilot Jobs John Gordon Management Board 23/10/2007.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Glexec, SCAS & CREAM. Milestones CREAM-CE capable of large-scale direct job submission Glexec & SCAS capable of large-scale use on WN in logging only.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EMI INFSO-RI Argus Policies in Action Valery Tschopp (SWITCH) on behalf of the Argus PT.
VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
LCG Support for Pilot Jobs John Gordon, STFC GDB December 2 nd 2009.
LCG Pilot Jobs and glexec John Gordon.
EMI INFSO-RI Argus The EMI Authorization Service Valery Tschopp (SWITCH) Argus Product Team.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks glexec/SCAS pilot service Status and short-term.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp (SWITCH) – Argus Product Team.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Why you should care about glexec OSG Site Administrator’s Meeting Written by Igor Sfiligoi Presented by Alain Roy Hint: It’s about security.
Job Priorities and Resource sharing in CMS A. Sciabà ECGI meeting on job priorities 15 May 2006.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Vendredi 27 avril 2007 Management of ATLAS CC-IN2P3 Specificities, issues and advice.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Simone Campana (CERN) Job Priorities: status.
Honolulu - Oct 31st, 2007 Using Glideins to Maximize Scientific Output 1 IEEE NSS 2007 Making Science in the Grid World - Using Glideins to Maximize Scientific.
DIRAC: Workload Management System Garonne Vincent, Tsaregorodtsev Andrei, Centre de Physique des Particules de Marseille Stockes-rees Ian, University of.
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win: Porting of gLite middleware to Windows platform
Grid2Win Porting of gLite middleware to Windows XP platform
AuthN and AuthZ in StoRM A short guide
gLExec and OS compatibility
SuperB – INFN-Bari Giacinto DONVITO.
The EDG Testbed Deployment Details
David Bouvet Fabio Hernandez IN2P3 Computing Centre - Lyon
Farida Naz Andrea Sciabà
Workload Management System
Glexec deployment models local credentials and grid identity mapping in the presence of complex schedulers David Groep NIKHEF.
INFN-GRID Workshop Bari, October, 26, 2004
John Gordon, STFC-RAL GDB 10 October 2007
Accounting at the T1/T2 Sites of the Italian Grid
Grid2Win: Porting of gLite middleware to Windows XP platform
Pierre Girard Réunion CMS
Grid2Win: Porting of gLite middleware to Windows XP platform
Grid services for CMS at CC-IN2P3
glexec/SCAS pilot service
CC IN2P3 - T1 for CMS: CSA07: production and transfer
Grid Deployment Board meeting, 8 November 2006, CERN
OpenGATE meeting/Grid tutorial, mars 9nd 2005
Short update on the latest gLite status
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
Gri2Win: Porting gLite to run under Windows XP Platform
LCG middleware and LHC experiments ARDA project
VMDIRAC status Vanessa HAMAR CC-IN2P3.
Pierre Girard ATLAS Visit
Danilo Dongiovanni INFN-CNAF
The GENIUS portal and the GILDA t-Infrastructure
GRID Workload Management System for CMS fall production
Information System (BDII)
gLite The EGEE Middleware Distribution
The LHCb Computing Data Challenge DC06
Presentation transcript:

Glexec/SCAS Pilot: IN2P3-CC status 07/09/2018 2009/04/08 Glexec/SCAS Pilot: IN2P3-CC status Pierre Girard CCIN2P3 T1-T2 2009-02-03

Grid deployment at CCIN2P3 Initial plan for pilot of Glexec/Scas 07/09/2018 Content Grid deployment at CCIN2P3 Initial plan for pilot of Glexec/Scas Setting-up issues Conclusion Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Grid Job Management at CCIN2P3 07/09/2018 Grid Job Management at CCIN2P3 Several Grid WN versions at time AFS Computing Element Computing Element Computing Element Computing Element Glite-WN-3.1.26-glexec Glite-WN-3.1.26-prod BQS Glite-WN-3.1.19-prod Anastasie Glite-WN-3.1.666-pps No MW locally on worker WN WN WN WN WN WN WN WN Globus4-WN Shared FS (afs.in2p3.fr) Computing Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Overview of grid job submission 07/09/2018 Overview of grid job submission Grid Job Credentials 1 RSL WN U-job Submit Glite-WN Computing Element lcg0507012233-1234.sh U-job SL4.5 4 Job Manager spawn 2 Local Job Wrapping lcg0507012233-1234.sh 3 BQS #!/bin/sh #PBS -q T #PBS -l M=2200MB #PBS -l T=3801600 #PBS -l scratch=16250MB #PBS -l platform=LINUX #PBS --share T1prod … qsub U-job Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

WN profile selection by BQS JobManager 07/09/2018 WN profile selection by BQS JobManager Grid Job Credentials 1 RSL WN U-job Submit BQS-JM config lcg0507012233-1234.sh Glite-WN Computing Element 6 U-job Dynamically link to WN profile SL4.5 BQS JM 5 rules spawn 2 Globus4-WN Glite-WN-3.1.666-pps Glite-WN-3.1.19-prod Glite-WN-3.1.26-prod Glite-WN-3.1.26-glexec Local Job Wrapping lcg0507012233-1234.sh 3 4 BQS Set WN profile qsub Glite-WN-3.1.26-glexec U-job AFS Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Last BQS JM enhancements 07/09/2018 Last BQS JM enhancements BQS JM control Submission policy (deny, accept) Forbearance management if BQS becomes unresponsive BQS JM Outputs BQS submission parameters Class: A (=short), G (=Medium), T (=Long), J (=verylong) Amount of {Mem, CPU, Scratch} Farm name Platform (SL3, SL4, SL5) Logical resources (list of) u_dcache_atlas, u_dcache_alice, u_OracleStress_atlas, … VO Share Wrapped data WN profile to be used profilesDirectory = /afs/in2p3.fr/grid/profiles/glite/3.1.25-0/SL4_64/WN32 Site Name AFS token (or not) Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Last BQS JM enhancements 07/09/2018 Last BQS JM enhancements BQS JM configuration capabilities (Most of) BQS JM outputs are determined according to configuration rules A rule is basically an assignment Ex.: SubmissionPolicy = ACCEPT But can be conditionned depending on some job input data (in the precedence order) Mapped account Mapped group CE queue Ex.: UserSubmissionPolicy_atlas050 = DENY # Specific requirements for ATLAS with queue verylong GroupVirtualQueueMaxMem_atlas_verylong = default GroupVirtualQueueMaxCPU_atlas_verylong = max GroupVirtualQueueMaxScratch_atlas_verylong = default Configuration syntax Is quite ugly Makes the condition combination not possible But, seems enough for now Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Glexec deployment at IN2P3-CC 07/09/2018 Glexec deployment at IN2P3-CC Glexec is a tool to be deployed on the WN to be used by the VOs to manage the « real user jobs » within a job pilot With a setuid capability (job pilot forks the « real user job » by using another account) Site authorization by « real user job » based on real user proxy How the deployment was planned Deploy the Glite-WN/Glexec relocated on AFS Use the configuration capabilities to redirect the pilot jobs to this deployment profilesDirectory = /afs/in2p3.fr/grid/profiles/glite/3.1.25-0/SL4_64/WN32 UserProfilesDirectory_dteam049 = /afs/in2p3.fr/grid/profiles/glite/3.1.25-0/SL4_32/WN32_GLEXEC Sounded easy… Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Glexec deployment Issues at IN2P3-CC 07/09/2018 Glexec deployment Issues at IN2P3-CC Glexec requires to be locally installed on Worker Configuration file absolute path hardcoded /opt/glite/etc/glite.conf Only one MW configuration possible Dynamic library configuration (due to « setuid ») /etc/ld.so.conf Only one MW installation possible Log configuration (syslog) Not so problematic for now Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Glexec deployment in use at IN2P3-CC 07/09/2018 Glexec deployment in use at IN2P3-CC We are part of the « SCAS Pilot Service » Asked to provide SCAS/glexec in production Load test for SCAS services by Atlas and Lhcb Deployment done Useable by both LHCb and Dteam Through the T1 CEs According to specific VOMS roles/groups But Deployment issues Break down our WN setup strategy Relocatable distribution was not ready (home-made) First tests with LHCb Were not satisfactory Raised some questions Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08

Glite-WN only Will be activated Glite-3.2.0 (SL5) at IN2P3-CC 07/09/2018 Glite-3.2.0 (SL5) at IN2P3-CC Glite-WN only Deployed on AFS Tested with a test CE on BQS Farm « lcg » Will be activated as soon as SL5 workers enter the production (done) A queue will be added to the T2 (T1?) CEs Pierre Girard - Glexec/SCAS: IN2P3-CC status 2009/04/08