EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks GLUE Schema Configuration for SRM 2.2 Stephen.

Slides:



Advertisements
Similar presentations
How did we get here? (CMIS v0.5) F2F, January 2009.
Advertisements

70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Africa & Arabia ROC tutorial The GSTAT2 Grid Monitoring tool Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Experiences with the GLUE information schema.
INFSO-RI Enabling Grids for E-sciencE SRMv2.2 experience Sophie Lemaitre WLCG Workshop.
Don Quijote Data Management for the ATLAS Automatic Production System Miguel Branco – CERN ATC
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Middleware: gLite Information Systems (IS) EGEE Tutorial 23 rd APAN Meeting,
Data management in grid. Comparative analysis of storage systems in WLCG.
SRM 2.2: status of the implementations and GSSD 6 th March 2007 Flavia Donno, Maarten Litmaath INFN and IT/GD, CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Migration to the GLUE 2.0 information schema in the LCG/EGEE/EGI.
SRM 2.2 Issues Well, er, and 2.3 too Jens Jensen (STFC RAL/GridNet2) On behalf of GSM-WG OGF22, Cambridge, MA.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks R-GMA Now With Added Authorization Steve.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Bazaar Vision Ideas of RC/VO coordination,
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG GDB, CERN 10 December 2008.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Batch Systems and the Info (Dynamic) Provider.
SRM Monitoring 12 th April 2007 Mirco Ciriello INFN-Pisa.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
SRM & SE Jens G Jensen WP5 ATF, December Collaborators Rutherford Appleton (ATLAS datastore) CERN (CASTOR) Fermilab Jefferson Lab Lawrence Berkeley.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/06/07 SRM v2.2 working group update Results of the May workshop at FNAL
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Progress on first user scenarios Stephen.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Data management in LCG and EGEE David Smith.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update Authorization Service Christoph Witzig,
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical: The Information Systems.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Communication tools between Grid Virtual.
INFSO-RI Enabling Grids for E-sciencE SRMv2.2 in DPM Sophie Lemaitre Jean-Philippe.
Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG Management Board, CERN 25 November 2008.
Handling of T1D0 in CCRC’08 Tier-0 data handling Tier-1 data handling Experiment data handling Reprocessing Recalling files from tape Tier-0 data handling,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ian Bird All Activity Meeting, Sofia
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite configuration (plans) Robert Harakaly.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System Tutorial Laurence Field.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Management cluster summary David Smith JRA1 All Hands meeting, Catania, 7 March.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Storage Element Model and Proposal for Glue 1.3 Flavia Donno,
Grid Deployment Board 5 December 2007 GSSD Status Report Flavia Donno CERN/IT-GD.
1 SRM v2.2 Discussion of key concepts, methods and behaviour F. Donno CERN 11 February 2008.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Study on Authorization Christoph Witzig,
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
INFSO-RI Enabling Grids for E-sciencE BAR: The Current Status Charaka Palansuriya EPCC.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Architecture of LHC File Catalog Valeria Ardizzone INFN Catania – EGEE-II NA3/NA4.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid is a Bazaar of Resource Providers and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
User Domain Storage Elements SURL  TURL LFC Domain (LCG File Catalogue) SA1 – Data Grid Interoperation Enabling Grids for E-sciencE EGEE-III INFSO-RI
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
Status of the SRM 2.2 MoU extension
Practical: The Information Systems
Flavia Donno CERN GSSD Storage Workshop 3 July 2007
SRM v2.2 / v3 meeting report SRM v2.2 meeting Aug. 29
SRM Developers' Response to Enhancement Requests
SRM2 Migration Strategy
Stephen Burke egi.eu EGI TF Prague September 20th 2012
Information System (BDII)
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GLUE Schema Configuration for SRM 2.2 Stephen Burke, RAL GSSD, October 9 th 2007

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 2 Overview What is the schema for? Some history Details of the schema objects and attributes Summary References

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 3 Schema philosophy The schema is not a model of a system It publishes attributes which are needed to fulfil specific use cases –Finding services/resources and choosing between them  Client tools  Direct queries by users –Overall Grid monitoring (e.g. gstat)  Including summary information for managers  (Accounting – but only for a high-level view) Total information volume needs to be limited –Use the information system for a fast query covering all resources in the Grid –Use direct queries for detailed information The schema must cope with all implementations & configurations in a reasonable way –May need some simplifying assumptions –Formally, nearly all attributes are optional The schema must be implementable in LDAP!

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 4 Schema evolution The GLUE schema needs to be agreed between many parties, hence agreement can be hard to reach –The list is growing as GLUE is adopted more widely Deploying a schema update takes a long time (1-2 years) –Introduce Capability attributes – arbitrary text tags which give limited scope to add extra information –Also GlueServiceData – key/value pairs for services So far we have required upgrades to be backwards-compatible –Sites upgrade gradually –Client tools are adapted gradually –Accumulation of deprecated items, limits on structural change GLUE 2.0 is now being developed in the OGF framework –Not backwards-compatible –Will probably have to deploy in parallel –Get involved now if you want to influence it!

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 5 History GLUE schema first defined in 2003 Original SE schema (1.1) was for “classic SE” – simple disk server + gridftp –Plus other access protocols, e.g. rfio, file Version 1.2 defined in 2005 –Fairly small changes for SE:  New ControlProtocol, GlueService objects  Decouple SA ID from path  Introduce SAType (permanent, durable, volatile)  Introduce Capabilities for protocols –Classic SE was still the standard In practice many of the 1.2 changes are still unused

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 6 GLUE 1.3 Small window of opportunity to upgrade the schema at the end of 2006 –SRM 2.2 was the main reason for the upgrade –Must be backward compatible –Changes should be as small as possible –Should be motivated by real use cases  User queries, client tools, monitoring/accounting – but not SRM management Many discussions in the SRM mailing list, + two documents with proposals –Documents were iterated several times Face-to-face meeting agreed changes –Some things in the proposals were left out as being too complicated and not clearly motivated, notably the Storage Component (modelling the storage hardware)

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 7 Schema structure Storage Element (GlueSE) –Summary information for the whole SE Storage Library (GlueSL) –Storage hardware –Never used, obsolete since 1.2 Storage Space (Area) (GlueSA) –Area of storage assigned to one or more VOs –Schema allows multiple SAs per VO or multiple VOs per SA, but client tools generally don’t Access protocol (GlueSEAccessProtocol) –gridftp, rfio, dcap, file, … ControlProtocol (GlueSEControlProtocol) –Should be SRM endpoint, or dummy for classic SE –Not used in practice so far Service (GlueService) –Endpoints for all services, including SRM

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 8 GlueSE UniqueID: opaque ID (should not be assumed to be the hostname) Name: human-friendly name (no semantics) InformationServiceURL: URL for the SE GRIS (not used in practice) Architecture: disk, tape, multidisk, other (not used in practice) ImplementationName: DPM, dCache, CASTOR, … (useful!) ImplementationVersion: Overall version number (useful!) Status: Queueing, Production, Closed, Draining (useful!) TotalOnlineSize, TotalNearlineSize, UsedOnlineSize, UsedNearlineSize: Summary information for monitoring/management (useful, but need not be very accurate) NB Deprecated items omitted, but in general should still be published where backward compatibility is required Attributes in 1.1, 1.2, 1.3

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 9 AccessProtocol LocalID: Locally unique ID (no semantics) Type: rfio, dcap, … (used by M/W) Version: Protocol version (not used?) Endpoint: Protocol endpoint (not needed with SRM) Capability: Arbitrary text tags (not used so far) MaxStreams: Maximum number of parallel streams allowed (not used yet but may be useful?) Q: Where is the list of protocol types supported by SRM?

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 10 ControlProtocol LocalID: Locally unique ID (no semantics) Type: SRM, classic (formerly used srm_v1) Version: Protocol version (not always published correctly) Endpoint: URL of the SRM (not always published correctly) Capability: Arbitrary text tags to identify non-standard features (not used yet but may be useful?) Can have multiple CPs per SE, e.g. for SRM1 and 2 Note that clients currently take the SRM endpoint from GlueService. For anything which needs other information from the SE this is not ideal, as there is no good way to link GlueService to GlueSE – assuming a match between the hostname and the SEUniqueID is unreliable.

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 11 SA LocalID: Locally unique ID (no semantics) Name: Human-friendly name (no semantics) Path: SURL prefix to use when writing files (used by M/W) AccessControlBaseRule: Authorisation for writing (VO names or FQANs) RetentionPolicy: custodial, output, replica AccessLatency: online, nearline, offline ExpirationMode: neverExpire, warnWhenExpired, releaseWhenExpired Capability: Arbitrary text tags to identify non-standard features (not used yet) TotalOnlineSize, UsedOnlineSize, FreeOnlineSize, ReservedOnlineSize, TotalNearlineSize, UsedNearlineSize, FreeNearlineSize, ReservedNearlineSize: Various information about the size of the space (to be discussed …) GLUE 1.3 agreement was that GlueSA == SRM space, hence RetentionPolicy et al

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 12 VOInfo LocalID: Locally unique ID (no semantics) Name: Human-friendly name (no semantics) Path: SURL prefix to use when writing files (overrides SAPath if present) Tag: Space token description (unique per VO) AccessControlBaseRule: Authorisation for writing (same as or subset of the SA ACBR) Separate object due to LDAP restrictions, but logically part of parent SA Paths could be omitted (or /) but there appear to be many requests to be able to specify different default paths for writing ChangeSpaceForFiles means that you cannot in general do a reverse lookup from an existing SURL to an SA Allows different VOs/groups to share an SA with different space tokens/paths (common use case for sites with many “small” VOs)

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 13 GlueService UniqueID: opaque ID (can be tricky to ensure uniqueness) Name: Human-friendly name (no semantics) Type: SRM (formerly used srm_v1) Version: Protocol version (1.1.0 or 2.2.0) Endpoint: URL of the SRM Status: OK, Warning, Critical, Unknown, Other (currently not useful) StatusInfo: Text to explain the Status (currently not useful) WSDL: URL of a WSDL document (currently not defined) Semantics: URL of a manual or specification (currently not defined) StartTime: The service start time (currently not useful) Owner: VO name (not relevant for SRM) AccessControlBaseRule: Service authorisation (should be set?) Also legacy LCG attributes (never part of GLUE) Current information provider is static so dynamic attributes aren’t useful A dynamic information provider is in certification

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 14 GSSD example GlueSE: not yet updated for 1.3 –Name and Type are wrong, Port should be removed –New attributes should be added GlueSL: obsolete, not used by anything GlueSA: –“Old” SA (LocalID == VO name, no VOInfo) for backward-compatibility (need to check behaviour) –“New” SAs generally OK (free space = 0?) –Some deprecated attributes (size, lifetime) included for backward- compatibility GlueVOInfo: examples OK –Better to move the VOInfo objects after the corresponding SA –Need a VOInfo with Tag=default to mark the default space GlueService: OK, only ACBR is new in 1.3 –Remove deprecated attributes?

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 15 Summary GLUE 1.3 was intended to provide the minimum we need to survive for the next 1-2 years Driven by use cases (see documents) –We decided to exclude management use cases – assumed to be done directly and not via the schema –Can anyone give an important use case that is not supported by the current schema? We should publish all meaningful attributes unless there is a strong reason why not –We need to interoperate with other Grids, especially OSG –Size attributes will need a dedicated discussion about what is feasible –But we should not publish meaningless attributes! Do we have volunteers to write the info providers? What is planned on the client side – lcg-utils etc?

Enabling Grids for E-sciencE EGEE-II INFSO-RI GLUE schema configuration - GSSD 16 References GLUE 1.2: GLUE 1.3: GLUE 2: Input documents to SRM 1.3 discussion: e-wg/docman.root.background.specifications/doc14619http://forge.ogf.org/sf/docman/do/downloadDocument/projects.glu e-wg/docman.root.background.specifications/doc devel&P=214145&E=2&B=-- Boundary_%28ID_EryUZRmYbVia2QzE4Pzreg%29&N=GLUE-SE- 1.3-input-1.03.pdf&T=application%2Fdownloadhttp://listserv.fnal.gov/scripts/wa.exe?A3=ind0611c&L=srm- devel&P=214145&E=2&B=-- Boundary_%28ID_EryUZRmYbVia2QzE4Pzreg%29&N=GLUE-SE- 1.3-input-1.03.pdf&T=application%2Fdownload GSSD schema example: