DIRAC Pilot Jobs A. Casajus, R. Graciani, A. Tsaregorodtsev for the LHCb DIRAC team Pilot Framework and the DIRAC WMS DIRAC Workload Management System.

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyProxy and EGEE Ludek Matyska and Daniel.
Advertisements

1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Haga clic para cambiar el estilo de título Haga clic para modificar el estilo de subtítulo del patrón DIRAC Framework A.Casajus and R.Graciani (Universitat.
Workload Management Massimo Sgaravatto INFN Padova.
Stuart K. PatersonCHEP 2006 (13 th –17 th February 2006) Mumbai, India 1 from DIRAC.Client.Dirac import * dirac = Dirac() job = Job() job.setApplication('DaVinci',
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Pilots 2.0: DIRAC pilots for all the skies Federico Stagni, A.McNab, C.Luzzi, A.Tsaregorodtsev On behalf of the DIRAC consortium and the LHCb collaboration.
SSC2 and Update on Multi-user Pilot Jobs Framework Mingchao Ma, STFC – RAL HEPSysMan Meeting 20/06/2008.
DIRAC Web User Interface A.Casajus (Universitat de Barcelona) M.Sapunov (CPPM Marseille) On behalf of the LHCb DIRAC Team.
BESIII distributed computing and VMDIRAC
Computing and LHCb Raja Nandakumar. The LHCb experiment  Universe is made of matter  Still not clear why  Andrei Sakharov’s theory of cp-violation.
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
Grid Initiatives for e-Science virtual communities in Europe and Latin America DIRAC TEAM CPPM – CNRS DIRAC Grid Middleware.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
PanDA Multi-User Pilot Jobs Maxim Potekhin Brookhaven National Laboratory Open Science Grid WLCG GDB Meeting CERN March 11, 2009.
Use of Condor on the Open Science Grid Chris Green, OSG User Group / FNAL Condor Week, April
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
Pilot Jobs John Gordon Management Board 23/10/2007.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
Trusted Virtual Machine Images a step towards Cloud Computing for HEP? Tony Cass on behalf of the HEPiX Virtualisation Working Group October 19 th 2010.
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
EGEE is a project funded by the European Union under contract INFSO-RI Practical approaches to Grid workload management in the EGEE project Massimo.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
CHEP 2006, February 2006, Mumbai 1 LHCb use of batch systems A.Tsaregorodtsev, CPPM, Marseille HEPiX 2006, 4 April 2006, Rome.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
INFSO-RI Enabling Grids for E-sciencE The gLite File Transfer Service: Middleware Lessons Learned form Service Challenges Paolo.
EMI INFSO-RI Argus Policies in Action Valery Tschopp (SWITCH) on behalf of the Argus PT.
Mar 27, gLExec Accounting Solutions in OSG Gabriele Garzoglio gLExec Accounting Solutions in OSG Mar 27, 2008 Middleware Security Group Meeting Igor.
DIRAC Review (12 th December 2005)Stuart K. Paterson1 DIRAC Review Workload Management System.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
LCG Support for Pilot Jobs John Gordon, STFC GDB December 2 nd 2009.
Workload management, virtualisation, clouds & multicore Andrew Lahiff.
GRID Security & DIRAC A. Casajus R. Graciani A. Tsaregorodtsev.
WLCG Authentication & Authorisation LHCOPN/LHCONE Rome, 29 April 2014 David Kelsey STFC/RAL.
INFSO-RI Enabling Grids for E-sciencE glexec on worker nodes David Groep NIKHEF.
INFSO-RI Enabling Grids for E-sciencE Policy management and fair share in gLite Andrea Guarise HPDC 2006 Paris June 19th, 2006.
DIRAC Project A.Tsaregorodtsev (CPPM) on behalf of the LHCb DIRAC team A Community Grid Solution The DIRAC (Distributed Infrastructure with Remote Agent.
LHCb Pilot Job Tests. We have not started this test due to several reasons: our analysis job submission (ganga) was not interfaced in time to DIRAC3,
Proxy management mechanism and gLExec integration with the PanDA pilot Status and perspectives.
Shifters Jamboree Kaushik De ADC Jamboree, CERN December 4, 2014.
LCG Pilot Jobs + glexec John Gordon, STFC-RAL GDB 7 December 2007.
Ian Collier, STFC, Romain Wartel, CERN Maintaining Traceability in an Evolving Distributed Computing Environment Introduction Security.
Security and VO management enhancements in Panda Workload Management System Jose Caballero Maxim Potekhin Torre Wenaus Presented by Maxim Potekhin at HPDC08.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
EGEE-III INFSO-RI Enabling Grids for E-sciencE VO Authorization in EGEE Erwin Laure EGEE Technical Director Joint EGEE and OSG Workshop.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp (SWITCH) – Argus Product Team.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
Trusted Virtual Machine Images the HEPiX Point of View Tony Cass October 21 st 2011.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Multi-community e-Science service connecting grids & clouds R. Graciani 1, V. Méndez 2, T. Fifield 3, A. Tsaregordtsev 4 1 University of Barcelona 2 University.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
DIRAC: Workload Management System Garonne Vincent, Tsaregorodtsev Andrei, Centre de Physique des Particules de Marseille Stockes-rees Ian, University of.
Workload Management Workpackage
Design rationale and status of the org.glite.overlay component
CREAM-CE/HTCondor site
Grid Deployment Board meeting, 8 November 2006, CERN
Short update on the latest gLite status
WMS Options: DIRAC and GlideIN-WMS
Wide Area Workload Management Work Package DATAGRID project
Resource and Service Management on the Grid
The LHCb Computing Data Challenge DC06
Presentation transcript:

DIRAC Pilot Jobs A. Casajus, R. Graciani, A. Tsaregorodtsev for the LHCb DIRAC team Pilot Framework and the DIRAC WMS DIRAC Workload Management System Job State Update Job Manager Job Monitoring TaskQueue Director TaskQueue DB Job DB Matcher CEWMS Worker Node Pilot Job Watchdog Job Wrapper Application Summary: DIRAC WMS uses pilot jobs and late matching based on TaskQueues to provide LHCb with the best handle to applied VO policies and priorities for the use of its computing resources. Pilot Jobs are a special type of jobs, submitted to a computing resource, that only has the payload fully defined once it is safely running on the Worker Node. This strategy has several benefits with respect to the classic middleware push approach: Pilots create an overlay network that mask heterogeneity of underlying resources. Late binding of resource to payload allows to apply VO priorities independently of the local batch systems. Pull scheduling allows to find the payload that currently best fits the resources reserved by the pilot. Directors are the DIRAC component submitting pilots jobs to the resources. Generic Pilot Jobs are able to change the user identity as required by the payload. They are the latest advance in this field. A special DIRAC group (lhcb_pilot) and VOMS role (/lhcb/Role=pilot) are used for pilot job submission. This special group is later allowed to retrieve payload limited proxies (full glexec support already included). Security aspects on [90]. Pilot Jobs are a special type of jobs, submitted to a computing resource, that only has the payload fully defined once it is safely running on the Worker Node. This strategy has several benefits with respect to the classic middleware push approach: Pilots create an overlay network that mask heterogeneity of underlying resources. Late binding of resource to payload allows to apply VO priorities independently of the local batch systems. Pull scheduling allows to find the payload that currently best fits the resources reserved by the pilot. Directors are the DIRAC component submitting pilots jobs to the resources. Generic Pilot Jobs are able to change the user identity as required by the payload. They are the latest advance in this field. A special DIRAC group (lhcb_pilot) and VOMS role (/lhcb/Role=pilot) are used for pilot job submission. This special group is later allowed to retrieve payload limited proxies (full glexec support already included). Security aspects on [90]. TaskQueue DB Job Job DB lhcb_prod lhcb_test lhcb_user lhcb_admin Job In the DIRAC Workload Management System new user Tasks are organized into TaskQueues for optimizations purposes. TaskQueues are groups of tasks, pending for execution, with identical execution requirements: Owner, OwnerGroup, Allowed|Banned Sites, CPUTime,… They may have different priorities assigned by their owner. Each DIRAC group has a basic Priority assigned by the VO. This priority is distributed among all the TaskQueues of the same group using the priorities the Owner has assigned to each of his/her tasks. Directors submit pilot jobs to the resources (via gLite WMS) following the priorities and number of pending Tasks in the system. Pilot Jobs, once running at a given computing resource, request a pending task following the assigned priorities. In the DIRAC Workload Management System new user Tasks are organized into TaskQueues for optimizations purposes. TaskQueues are groups of tasks, pending for execution, with identical execution requirements: Owner, OwnerGroup, Allowed|Banned Sites, CPUTime,… They may have different priorities assigned by their owner. Each DIRAC group has a basic Priority assigned by the VO. This priority is distributed among all the TaskQueues of the same group using the priorities the Owner has assigned to each of his/her tasks. Directors submit pilot jobs to the resources (via gLite WMS) following the priorities and number of pending Tasks in the system. Pilot Jobs, once running at a given computing resource, request a pending task following the assigned priorities. DIRAC is the LHCb solution to interact with distributed computing resources. For Workload Management tasks DIRAC implements pull scheduling mechanism and was the first system to introduce the usage of Pilot Jobs in the Grid. Private Pilot Jobs include the full identity of the user and can only match tasks for this identity. They do not allow to apply VO policies since late binding is limited. Generic Pilot Jobs use a special identity and can match tasks from any user. Before executing the task the execution changes to the proper user identity (glexec). Private Pilot Jobs include the full identity of the user and can only match tasks for this identity. They do not allow to apply VO policies since late binding is limited. Generic Pilot Jobs use a special identity and can match tasks from any user. Before executing the task the execution changes to the proper user identity (glexec).