Stephen Burke egi.eu EGI TF Prague September 20th 2012

Slides:



Advertisements
Similar presentations
MN Parcel Data Standard State Standards Process and Useful Insights DCDC 12/04/2009Mark Kotz.
Advertisements

July 11 th, 2005 Software Engineering with Reusable Components RiSE’s Seminars Sametinger’s book :: Chapters 16, 17 and 18 Fred Durão.
Africa & Arabia ROC tutorial The GSTAT2 Grid Monitoring tool Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Migration to the GLUE 2.0 information schema in the LCG/EGEE/EGI.
User Support Chapter 8. Overview Assumption/IDEALLY: If a system is properly design, it should be completely of ease to use, thus user will require little.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp - SWITCH EGI TF Prague.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
Oleg LODYGENSKY Etienne URBAH LAL, Univ Paris-Sud, IN2P3/CNRS, Orsay,
Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG GDB, CERN 10 December 2008.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
EMI INFSO-RI Accounting John Gordon (STFC) APEL PT Leader.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
WLCG Grid Deployment Board CERN, 14 May 2008 Storage Update Flavia Donno CERN/IT.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
WLCG Information System Status Maria Alandes Pradillo, CERN CERN IT Department, Support for Distributed Computing Group GDB 9 th September 2015.
Storage Accounting John Gordon STFC GDB, Lyon 6 th April2011 GDB January 2012.
Pledged and delivered resources to ALICE Grid computing in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI MPI VT report OMB Meeting 28 th February 2012.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Information system workshop Stephen Burke egi.eu EGI TF Madrid September.
EMI is partially funded by the European Commission under Grant Agreement RI EMI Status And Plans Laurence Field, CERN Towards an Integrated Information.
Human Computer Interaction Lecture 21 User Support
PeerWise Student Instructions
SOFTWARE TESTING Date: 29-Dec-2016 By: Ram Karthick.
The evolution of the information system in EGI/WLCG
EGI Operations Management Board
gLite Information System
Business Process Modeling
OGF PGI – EDGI Security Use Case and Requirements
CHAPTER 4 REPETITION CONTROL STRUCTURE / LOOPING
CS170 – Week 1 Lecture 3: Foundation Ismail abumuhfouz.
Lecture 1 Introduction Richard Gesick.
Introduction to the C Language
FOP: Buttons and Events
Practical: The Information Systems
Short term improvements to the Information System: a status report
Impact-Oriented Project Planning
SRM2 Migration Strategy
Proposal for obtaining installed capacity
Lecture 25 More Synchronized Data and Producer/Consumer Relationship
Software engineering – 1
Outline Introduction Objectives Motivation Expected Output
Dynamic SQL Writing Efficient Queries on the Fly
GLUE 2 Support in gLite Data Management
Discussions on group meeting
Malwarebytes Installation Issues Number Facing error with Malwarebytes software is not something unusual as most of the users use to face.
MAKE SDTM EASIER START WITH CDASH !
A conceptual model of grid resources and services
Updating GML datasets S-100 WG TSM September 2017
Chapter 13 Quality Management
Coding Concepts (Standards and Testing)
Chapter 6: Repetition Statements
Chapter 11 user support.
Focus of the Course Object-Oriented Software Development
Information system workshop
SDLC Phases Systems Design.
Geant4 Documentation Geant4 Workshop 4 October 2002 Dennis Wright
a middleware implementation
Chapter 10: Compilers and Language Translation
Information System (BDII)
IEEE- P2600 PP Guidelines Suggested Format and Content
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
Information Services Claudio Cherubino INFN Catania Bologna
Best Practices in Higher Education Student Data Warehousing Forum
Presentation transcript:

Stephen Burke egi.eu EGI TF Prague September 20th 2012 GLUE 2.0 Profile for EGI Stephen Burke egi.eu EGI TF Prague September 20th 2012

GLUE 2.0 Profile - EGI TF Prague Overview Why we needed a profile document Classification of use cases Classification of importance of publication Validation Installed capacity Document structure Review process Implementation GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Introduction The GLUE 2.0 schema is intentionally very flexible Many ways to use it, not necessarily interoperable Several SRM implementations Need a profile to specify how it should be used in EGI Currently “in EGI” means “in the BDII” – may change Detailed semantics of each attribute, what should and should not be published Monitoring tools should enforce the usage First public draft is available Some internal comments already Will need detailed review, especially by middleware developers Hope to converge by the end of October Likely to need updates in the light of experience GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Use cases The schema can potentially satisfy many use cases Various implications: importance, need for accuracy, update rate, latency, caching ... The BDII currently has all information, but we may want to have separate systems for different kinds of information EMIR? The profile document classifies each attribute as potentially useful for one or more of 5 categories of use case The fact that something can be used in a particular way doesn’t mean that it will be! GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Use case categories Service Discovery (SD): used as selection parameters in queries to find services, or service attributes returned by such queries Service Selection (SS): dynamic information used to choose a particular service, e.g. number of queued jobs Monitoring (M): information useful to monitor the overall state of the Grid, e.g. as used in gstat Oversight (O): high-level management information, e.g. installed capacity and service versions Diagnostic (D): information which may help diagnose problems with the services or in the info system itself GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Importance The schema document just classifies attributes as mandatory or optional For EGI we need more detailed guidance as to which attributes need to be published Again these are classified in 5 categories GLUE 2.0 Profile - EGI TF Prague

Importance categories Mandatory (M): information which must be published. Includes all attributes which are mandatory in the schema, but adds some which are important for EGI. Recommended (R): information which should be published unless there is a good reason, e.g. technically impossible or not meaningful in a particular case. Desirable (D): information which is likely to be useful, and should be published if reasonably practical, but which may be omitted. Optional (O): information which is truly optional. Typically these attributes have no use in the existing EGI infrastructure, or are only useful in specialised cases. Undesirable (U): information which may be damaging in the EGI context, e.g. due to large data volumes or exposure of sensitive information. GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Validation Validation of the published information has traditionally been a weak point LDAP has few internal constraints, and gives cryptic errors for the things it does check The definition of which values are allowed is fairly poor Many things just defined by the code or common practice The gstat tool has some checks, but with limited coverage and inflexible for updating the tests The current information system is full of mistakes! Circular problem: hard to persuade people to publish unused attributes correctly, hard to persuade people to use attributes if there are many errors For GLUE 2 we should try to do better GLUE 2.0 Profile - EGI TF Prague

Validation categories FATAL: errors which invalidate the structure of the information, e.g. unique IDs which are not unique. ERROR: values which are definitely incorrect. WARNING: values which are likely to be incorrect, or which are valid within the schema in general but invalid in the EGI context, e.g. very large numbers of jobs. INFO: values which are technically correct but may be wrong, e.g. strings in the wrong case, locations at (0,0), typos, values in the wrong units etc. It is therefore desirable to have heuristics to try to identify such mistakes – e.g. collect a list of commonly-published batch system names and flag names which are not in the list. GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Validation tools We need tools which validate the information in different situations: Interoperability tests in middleware development Service-specific tests, e.g. for CE and SE During middleware acceptance tests Inside the BDII before publication Checking the entire published information in a top-level BDII May be able to have a common set of tests which can be plugged in to different tools? Need to report errors to the right place Middleware bugs to developers, not system managers! GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Validation tests The profile document has many suggested validation tests Intended as guidance and not fixed rules Some things may be too hard to implement New/different tests always welcome INFO-level tests will probably have many false positives Need to be able to switch them off and/or update them GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Installed capacity Useful to know the total installed capacity (CPU power, disk and tape sizes) and the sharing between VOs WLCG has a document for GLUE 1: https://twiki.cern.ch/twiki/pub/LCG/WLCGCommonComputingReadinessChallenges/WLCG_GlueSchemaUsage-1.8.pdf Some things are difficult in GLUE 1, should be easier in GLUE 2 Profile document has guidance Computing power straightforward Logical CPUs, HEPSPEC-06, VO shares Storage more complex Including/excluding unavailable disk servers Treatment of cached files, parity/spare disks, T1D0, ... Shared space vs space tokens Logical vs physical view, double counting May need to iterate with developers GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Document structure Introduction explaining concepts Detailed section for each schema class Doesn’t repeat information from schema document unless needed for clarity Table giving information for every attribute, plus text notes if necessary E.g. for the Contact class: GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Review process Long, technical document, needs careful review Especially by developers May take some time to converge May well need updates after experience Document is versioned Compliance with a given version should be published GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Timetable Internal EGI review Some responses already Will update the document by the end of September So far requested changes seem fairly small External review Especially EMI Comments by the end of October “Final” document by the end of November Version 1.0 Likely to evolve with experience GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague Implementation Difficulty will vary – many things may be compliant already Developers can check compliance while reviewing the document Need to start validation – if only by hand Submit bugs as problems are found Need people to work on validation tools GLUE 2.0 Profile - EGI TF Prague

GLUE 2.0 Profile - EGI TF Prague References Draft profile document https://documents.egi.eu/public/ShowDocument?docid=1324 GLUE 2.0 specification http://www.ogf.org/documents/GFD.147.pdf LDAP rendering specification (draft) http://forge.ogf.org/sf/go/doc15518?nav=1 WLCG installed capacity document https://twiki.cern.ch/twiki/pub/LCG/WLCGCommonComputingReadinessChallenges/WLCG_GlueSchemaUsage-1.8.pdf GLUE 2.0 Profile - EGI TF Prague