June 5, 2006gLexec1 gLexec (within the OSG and Fermilab) June 5, 2006 Keith Chadwick.

Slides:



Advertisements
Similar presentations
Dec 14, 20061/10 VO Services Project – Status Report Gabriele Garzoglio VO Services Project WBS Dec 14, 2006 OSG Executive Board Meeting Gabriele Garzoglio.
Advertisements

Role Based VO Authorization Services Ian Fisk Gabriele Carcassi July 20, 2005.
Dan Bradley Computer Sciences Department University of Wisconsin-Madison Schedd On The Side.
OSG AuthZ Architecture AuthZ Components Legend VO Management Services Grid Site GUMS Site Services SAZ CE Gatekeeper Prima Is Auth? Yes / No SE SRM gPlazma.
Implementing Finer Grained Authorization in the Open Science Grid Gabriele Carcassi, Ian Fisk, Gabriele, Garzoglio, Markus Lorch, Timur Perelmutov, Abhishek.
Andrew McNab - EDG Access Control - 14 Jan 2003 EU DataGrid security with GSI and Globus Andrew McNab University of Manchester
Campus Grids Report OSG Area Coordinator’s Meeting Dec 15, 2010 Dan Fraser (Derek Weitzel, Brian Bockelman)
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
May 8, 20071/15 VO Services Project – Status Report Gabriele Garzoglio VO Services Project – Status Report Overview and Plans May 8, 2007 Computing Division,
Apr 30, 20081/11 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting Apr 30, 2008 Gabriele Garzoglio.
Mine Altunay OSG Security Officer Open Science Grid: Security Gateway Security Summit January 28-30, 2008 San Diego Supercomputer Center.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
Use of Condor on the Open Science Grid Chris Green, OSG User Group / FNAL Condor Week, April
Mar 28, 20071/9 VO Services Project Gabriele Garzoglio The VO Services Project Don Petravick for Gabriele Garzoglio Computing Division, Fermilab ISGC 2007.
Evolution of the Open Science Grid Authentication Model Kevin Hill Fermilab OSG Security Team.
May 11, 20091/17 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting May 11, 2009 Gabriele Garzoglio.
Jan 10, 20091/16 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting Jan 10, 2009 Gabriele Garzoglio.
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
CHEP03 Mar 25Mary Thompson Fine-grained Authorization for Job and Resource Management using Akenti and Globus Mary Thompson LBL,Kate Keahey ANL, Sam Lang.
Mine Altunay July 30, 2007 Security and Privacy in OSG.
LCG Pilot Jobs + glexec John Gordon, STFC-RAL GDB 7 November 2007.
Legion - A Grid OS. Object Model Everything is object Core objects - processing resource– host object - stable storage - vault object - definition of.
Overview of Privilege Project at Fermilab (compilation of multiple talks and documents written by various authors) Tanya Levshina.
Role Based VO Authorization Services Ian Fisk Gabriele Carcassi July 20, 2005.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
VO Privilege Activity. The VO Privilege Project develops and implements fine-grained authorization to grid- enabled resources and services Started Spring.
OSG AuthZ components Dane Skow Gabriele Carcassi.
Mar 27, gLExec Accounting Solutions in OSG Gabriele Garzoglio gLExec Accounting Solutions in OSG Mar 27, 2008 Middleware Security Group Meeting Igor.
Jun 12, 20071/17 AuthZ Interoperability – Status and Plan Gabriele Garzoglio AuthZ Interoperability Status and Plans June 12, 2007 Middleware Security.
OSG Site Admin Workshop - Mar 2008Using gLExec to improve security1 OSG Site Administrators Workshop Using gLExec to improve security of Grid jobs by Alain.
LCG Support for Pilot Jobs John Gordon, STFC GDB December 2 nd 2009.
June 6, 2006OSG - Draft VO AUP1 Open Science Grid Trust as a Foundation June 6, 2006 Keith Chadwick.
Eileen Berman. Condor in the Fermilab Grid FacilitiesApril 30, 2008  Fermi National Accelerator Laboratory is a high energy physics laboratory outside.
INFSO-RI Enabling Grids for E-sciencE glexec on worker nodes David Groep NIKHEF.
An Introduction to Campus Grids 19-Apr-2010 Keith Chadwick & Steve Timm.
Parag Mhashilkar Computing Division, Fermi National Accelerator Laboratory.
Sep 17, 20081/16 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting Sep 17, 2008 Gabriele Garzoglio.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
Development of the Fermilab Open Science Enclave Policy and Baseline Keith Chadwick Fermilab Work supported by the U.S. Department of.
April 18, 2006FermiGrid Project1 FermiGrid Project Status April 18, 2006 Keith Chadwick.
OSG VO Security Policies and Requirements Mine Altunay OSG Security Team July 2007.
Why you should care about glexec OSG Site Administrator’s Meeting Written by Igor Sfiligoi Presented by Alain Roy Hint: It’s about security.
FermiGrid The Fermilab Campus Grid 28-Oct-2010 Keith Chadwick Work supported by the U.S. Department of Energy under contract No. DE-AC02-07CH11359.
Virtualization within FermiGrid Keith Chadwick Work supported by the U.S. Department of Energy under contract No. DE-AC02-07CH11359.
HEPiX Virtualisation working group Andrea Chierici INFN-CNAF Workshop CCR 2010.
1 Team Skill 3 Defining the System Part 1: Use Case Modeling Noureddine Abbadeni Al-Ain University of Science and Technology College of Engineering and.
Honolulu - Oct 31st, 2007 Using Glideins to Maximize Scientific Output 1 IEEE NSS 2007 Making Science in the Grid World - Using Glideins to Maximize Scientific.
How to integrate portals with EGI accounting system R.Graciani EGI TF 2012.
FermiGrid - PRIMA, VOMS, GUMS & SAZ Keith Chadwick Fermilab
OpenPBS – Distributed Workload Management System
Chapter 6: User-Defined Functions I
Duncan MacMichael & Galen Deal CSS 534 – Autumn 2016
Operating a glideinWMS frontend by Igor Sfiligoi (UCSD)
FermiGrid - PRIMA, VOMS, GUMS & SAZ
DJRA3.1 issues Olle Mulmo.
GWE Core Grid Wizard Enterprise (
Glexec deployment models local credentials and grid identity mapping in the presence of complex schedulers David Groep NIKHEF.
A Model for Grid User Management
f f FermiGrid – Site AuthoriZation (SAZ) Service
AuthZ Interop report out
John Gordon, STFC-RAL GDB 10 October 2007
Distribution and components
IS442 Information Systems Engineering
The New Virtual Organization Membership Service (VOMS)
Chapter 2: The Linux System Part 1
Outline Module 1 and 2 dealt with processes, scheduling and synchronization Next two modules will deal with memory and storage Processes require data to.
From Prototype to Production Grid
Multithreaded Programming
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

June 5, 2006gLexec1 gLexec (within the OSG and Fermilab) June 5, 2006 Keith Chadwick

June 5, 2006gLexec2 Background - Late Binding & Glide-Ins Some Virtual Organizations are using grid job glide-ins as a mechanism to accomplish “late binding”. In particular, CDF is using glide-ins* on their recently commissioned glide-CAF cluster at Fermilab: A job is submitted to a Globus gatekeeper, when the job begins execution, it has guaranteed access to a CPU. At this point the so-called “pilot” job calls “home” and brings in a production job for execution. * The term glide-in is used generically, and does not necessarily imply the use of Condor.

June 5, 2006gLexec3 Glide-Ins can do Multiple Tasks Nothing restricts the “pilot” job to running only one job - there may be reasons why the “pilot” job might run multiple jobs: Start stage-in of input datasets While staging input datasets run monte-carlo Pause monte-carlo to run analysis against input datasets Start stage-out of analysis results While staging out, resume monte-carlo However: Don Petravick: “Fermilab will view with extreme disfavor any “pilot” job which attempts to subvert the batch system scheduler by running multiple copies of CPU intensive code on systems with multiple CPUs.”

June 5, 2006gLexec4 What is the Issue? Vicky White, the head of the Fermilab Computing Division raised several concerns with the use of grid glide-ins on the CDF glide-caf grid farm during the January 2006 OSG Collaboration Meeting held in Gainesville, FL: The “pilot” job does pass through the entire gamut (such as it is) of the Fermilab grid authorization and accounting infrastructure, The glide-in jobs run by the “pilot” job bypass the authorization and accounting infrastructure in the Globus gatekeeper Furthermore, the real user of the CPU resources was not immediately apparent to management.

June 5, 2006gLexec5 The current state of Glide-Ins DN=Amber DN=Mark DN=Bob DN=Joe CDFCA F... DN=Igor Gatekeeper/Batch Queue WN Amber’s job runs as DN=Igor WN Amber’s job runs as DN=Igor GUMS SAZ (5) GUMS – DN to local UID mapper SAZ – site DN authorization Current modus operandi WN Mark’s job runs as DN=Igor WN Mark’s job runs as DN=Igor...

June 5, 2006gLexec6 gLexec - A Solution ? gsEXEC and gLexec, both of which are based on the Apache suEXEC module, have been implemented A “enhanced” gLexec has been selected to implement a solution to the glide-in problem. Fermilab and NIKHEF are collaborating on the design and implementation of the enhanced gLexec which will use plug-ins for LCAS/LCMAPS or GUMS/SAZ callouts together with reuse of code from Prima. Miron Livny has agreed to implement a “stub” gLexec module in Condor.

June 5, 2006gLexec7 gLexec - Block Diagram Block Diagram of job flow using gLexec: Child jobs are vetted through both GUMS and SAZ. gLexec can also communicate with accounting and auditing modules (if desired).

June 5, 2006gLexec8 Prima Software Components This diagram shows the current state of prima: The black box surrounding the entire thing is prima. The components in the blue box are those developed originally by Markus Lorch and now maintained by Vikram, et al. OSG people. The green box contains several standard open source libraries that are largely unmodified, except for libsaml which Markus added an extension to help in communicating with the Gatekeeper. The build order of these components is bottom to top. Igor modified libprima_authz and libprima_saml_support to remove the mutual dependency on each other, libprima_authz included the libprima_saml_support header file, and a function in libprima_saml_support required some parameters from libprima_authz. He separated these from each other so libprima_saml_support doesn't depend on libprima_authz. The Gatekeeper uses the authz callout to interface to the prima_saml_support shared object which uses libsaml to format the request, and the other green components to contact the GUMS server.

June 5, 2006gLexec9 gLexec Software Components The diagram shows the current state of the gLexec interface to GUMS and/or SAZ: The libprima_authz component has been replaced by lcas and lcmaps plugins (developed by NIKHEF) which will pass the parameters to libprima_saml_support which will then function as before. After Igor's changes, in the globus gatekeeper scenario, Vikram can compile everything up to and including libprima_saml_support, but the Makefile for the libprima_authz is missing some things which he should have done either yesterday or today. The goal is to make sure we can use the same code base and, ergo, not have maintain 2 sets of code - one for prima and one for glexec.

June 5, 2006gLexec10 Fermilab - Draft Pilot Job Definition Draft Pilot Job Definition: A Pilot Job is defined as any job where a Job Manager submits a job request to a grid batch system, but the application that is executed on the Worker Nodes was created by, and the input and output data is owned by, another user. Draft Pilot Job Requirement: A Pilot Job must correctly map the executed application and the input and output data to the actual owner of the application.

June 5, 2006gLexec11 Fermilab - Draft Pilot Job Policy Draft Pilot Job Policy: Any job that is discovered to be a Pilot Job which : Does not correctly perform the appropriate user mapping using the Site supplied utility (i.e., glexec), or Lacks a pre-agreed Trust relationship with Fermilab approved by the CSEXEC may be terminated immediately. In addition, the DN of the pilot may be placed on the Site Black List until the situation has been rectified. Furthermore any existing trust relationship with that VO may be reexamined.

June 5, 2006gLexec12 Fermilab - Draft Trust Relationships Policy Draft Trust Relationships Policy: Glide-in operation at Fermilab is allowed only by Virtual Organizations that have a valid Trust relationship with Fermilab. Trust relationships are granted and revoked by the Fermilab Computer Security Executive. Virtual Organizations hosted at Fermilab (CDF, CMS, D0, MINOS, LQCD, etc) have an initial valid grandfathered trust relationship, based on the established basis for management and operational security controls that exists within Fermilab. The granting of Trust relationships will be based on processes which have yet to defined. It is a goal of the OSG that this process is not intended to be heavyweight or overly burdensome.

June 5, 2006gLexec13 Fermilab - Site and VO Trust Model In the process of thinking about glide-ins, our thinking has morphed of a more general discussion of: "why a Site should trust a VO”. “why a VO should trust a Site”. These questions are also foundational for the larger question of “Trust” across the grid. At the present, such trust relationships only exist on the basis of, for lack of a better word, "small world" principles. The veracity and completeness of your statements, your responsiveness, an understanding of the technology you are using, and so forth. Companies like eBay, for example, make this more scalable, by evaluating the trust with a varieties of policies and technologies. This is an interesting area for further discussion and exploration. (see the talk on Trust and the draft OSG VO AUP for more discussion).

June 5, 2006gLexec14 Fin Any questions?