The Grid Information System Maria Alandes Pradillo IT-SDC White Area Lecture, 4th June 2014.



- Introduction
  - What is the Information System
  - What is the GLUE schema
- The past
  - Software errors
  - Performance issues
  - Information volatility
  - Obsolete entries
  - Site misconfigurations
- The present
  - GLUE validation
- The future
  - Common solution
  - Cloud resources
- Open issues

June 2014 White Area Lectures - WLCG Information System

WHAT IS THE INFORMATION SYSTEM?


WLCG Information System Architecture

Every 2 minutes…

Behind the scenes

| Component | LOC | Language |
|---|---|---|
| bdii-update | 398 | Python |
| glite-info-provider-ldap | 461 | Perl |
| glite-info-provider-service | 513 | Perl |
| dpm-listspaces | 1426 | Python |
| info-dynamic-lsf | 1531 | Perl |
| fts-info-publisher | 204 | C++ |

Requirements
- Operating System: SL5 or SL6
- Python: 2.5 or 2.6
- OpenLDAP: 2.4
- RAM: 4GB for top BDII

Information Providers

| Component | Maintained by |
|---|---|
| CREAM CE | INFN |
| PBS | INFN |
| LSF | CERN |
| SGE | LIP/CESGA |
| SLURM | INFN |
| DPM | CERN |
| dCache | DESY |
| StoRM | INFN |
| Castor/EOS | CERN |
| FTS | CERN |
| glite-info-provider-service | RAL |

Is the BDII the only way to know about WLCG resources?
- Not really…
- Without taking into account monitoring tools, there are many information sources for different purposes within WLCG

WLCG Information Sources

| Tool | Maintained by | Description |
|---|---|---|
| BDII | WLCG | Static and dynamic information for EGI and OSG services |
| Gstat | Unmaintained? | Visualisation and validation tool based on the BDII information |
| GOCDB | EGI | EGI Service Registry (static information manually edited by sys admins) |
| OIM | OSG | OSG Service Registry (static information) |
| AGIS | ATLAS | ATLAS sites and services (collects information from BDII, GOCDB, OIM and allows for manual input by AGIS admins) |
| SiteDB | CMS | CMS sites, pledges, CEs and SEs (manual input by sys admins and SiteDB admins) |
| DIRAC | LHCb | LHCb sites and services |
| AliEN | ALICE | ALICE sites and services (manually maintained) |
| REBUS | WLCG | Pledge information, WLCG topology, capacity (taken from the BDII and manual input by federations) |
| APEL | EGI | Accounting information directly taken from CEs |

WHAT IS THE GLUE SCHEMA?


A common data model
[Figure panels: GLUE 1 Computing Resources; GLUE 2 Computing Resources]

LDAP trees in the BDII

Do you want to have a look at the information system?

Jacques Cousteau approach
- Get familiar with the GLUE schema
- Try with ldapsearch
- Examples available in the BDII Sys Admin guide

National Geographic approach
- Use any of the existing clients available in lxplus
  - GLUE 1.3
    - lcg-info
    - lcg-infosites
  - GLUE 2.0
    - ginfo
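For the ldapsearch route, a minimal Python sketch that only assembles the command line may help. It assumes the usual BDII conventions (port 2170, search base "o=grid" for GLUE 1 and "o=glue" for GLUE 2); the host name and the helper `build_ldapsearch` are hypothetical and not taken from the lecture.

```python
import shlex


def build_ldapsearch(host, base, ldap_filter, attributes=(), port=2170):
    """Return the argv list for an anonymous ldapsearch query against a BDII."""
    return [
        "ldapsearch",
        "-x",                            # simple (anonymous) bind
        "-LLL",                          # LDIF output without comments or version line
        "-H", f"ldap://{host}:{port}",   # 2170 is the assumed BDII port
        "-b", base,                      # "o=grid" for GLUE 1, "o=glue" for GLUE 2
        ldap_filter,
        *attributes,                     # attributes to return; none means all
    ]


# Hypothetical host name: replace it with a real top BDII before running the command.
command = build_ldapsearch(
    "top-bdii.example.org",
    "o=glue",
    "(objectClass=GLUE2Endpoint)",
    ["GLUE2EndpointURL", "GLUE2EndpointInterfaceName"],
)

# shlex.join quotes the filter so the line can be pasted into a shell.
print(shlex.join(command))
```

The list form can be passed directly to `subprocess.run` if the query should be executed from Python rather than pasted into a terminal.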

GLUE 1 vs GLUE 2
- EGI sites publish GLUE 1 and GLUE 2
- OSG sites publish GLUE 1 only
- Information providers use the same logic to publish GLUE 1 and GLUE 2
- EGI planned to decommission GLUE 1 this year
  - This requires clients to start consuming GLUE 2 information
    - Including experiment frameworks and any application that consumes information from the BDII
  - It will be a long process that hasn’t started yet

THE PAST

Software Errors
- [...] matches in the LCG-ROLLOUT list for ‘BDII’ since [...]
- [...] matches only in 2013 and 2014

Performance issues
- OpenLDAP 2.3
- LDAP performance tuning for GLUE 1
- LDAP performance tuning for GLUE 2

Information volatility
- Cache implemented in top BDII
  - To be more robust against system instabilities
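The slide does not say how the cache works internally. As a rough illustration of the idea only, the sketch below keeps the last answer from each site and keeps publishing it for a limited time when the site stops responding. The function name, the data layout and the `max_age` parameter are assumptions made for this example.

```python
def merge_with_cache(cache, fresh, now, max_age):
    """Update the cache with fresh data and return what should be published.

    cache:   dict mapping site name -> (timestamp, entries) from earlier runs
    fresh:   dict mapping site name -> entries, only for sites that answered
    now:     current time in seconds
    max_age: how long (in seconds) cached data may still be served (assumed knob)
    """
    for site, entries in fresh.items():
        cache[site] = (now, entries)       # refresh every site that answered

    published = {}
    for site, (timestamp, entries) in list(cache.items()):
        if now - timestamp <= max_age:
            published[site] = entries      # fresh, or cached and still acceptable
        else:
            del cache[site]                # too old: stop publishing it
    return published


cache = {}
first = merge_with_cache(cache, {"SITE-A": ["a1"], "SITE-B": ["b1"]}, now=0, max_age=600)
# SITE-B does not answer in the next two runs.
second = merge_with_cache(cache, {"SITE-A": ["a2"]}, now=120, max_age=600)
third = merge_with_cache(cache, {"SITE-A": ["a3"]}, now=1000, max_age=600)
```

In the second run SITE-B is still published from the cache; in the third run its data has exceeded `max_age` and is dropped.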

Obsolete entries
- Obsolete information published in GLUE 2
  - Due to a bug in the bdii-update script
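One possible way to spot obsolete GLUE 2 entries, not described in the lecture, is to compare each object's creation time plus its validity period with the current time. The sketch assumes entries carry the GLUE2EntityCreationTime (UTC, ISO 8601) and GLUE2EntityValidity (seconds) attributes; the helper names are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def parse_glue_time(value):
    """Parse a UTC timestamp such as 2014-06-04T10:00:00Z."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def is_obsolete(entry, now):
    """Return True when the entry has outlived its declared validity."""
    created = parse_glue_time(entry["GLUE2EntityCreationTime"])
    validity = timedelta(seconds=int(entry["GLUE2EntityValidity"]))
    return now > created + validity


# Made-up entry for illustration: created at 10:00 UTC, valid for one hour.
entry = {
    "GLUE2EntityCreationTime": "2014-06-04T10:00:00Z",
    "GLUE2EntityValidity": "3600",
}
within_validity = datetime(2014, 6, 4, 10, 30, tzinfo=timezone.utc)
after_validity = datetime(2014, 6, 4, 12, 0, tzinfo=timezone.utc)
```

With these values, `is_obsolete(entry, within_validity)` is false and `is_obsolete(entry, after_validity)` is true.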

Site misconfigurations
- Information Providers rely on configuration files that need to be properly defined by the site
- Batch system configuration is not easy
- Storage systems can also be complex
- Smaller sites with less expertise sometimes have difficulty getting things right

THE PRESENT

Can we trust the information published in the information system?

GLUE validation
- There are 369 sites currently published in the BDII
  - It’s impossible to control what they publish right now!
  - But we need a mechanism to measure how ‘good’ the information they publish is
- Validation activities started in 2013
  - Checking the information providers and following up issues with the developers
  - Monitoring sites
    - Manually at first
    - Automatically using the Dashboard later
    - Nagios monitoring presently done by EGI
- Automation is key due to the high volume of information
- Validation actions targeting specific attributes have also proven to be more useful for experiments like LHCb
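Automated validation of specific attributes can be pictured as a set of per-attribute rules applied to every published entry. The sketch below is not the validation tool used in WLCG; the rule set, the entry layout and the placeholder value 444444 are assumptions chosen for illustration.

```python
PLACEHOLDER = 444444  # assumed "unknown" marker; adjust to the providers' convention


def check_job_count(value):
    """Return an error message for a suspicious job count, or None if it looks fine."""
    try:
        number = int(value)
    except (TypeError, ValueError):
        return "not an integer"
    if number < 0:
        return "negative value"
    if number == PLACEHOLDER:
        return "placeholder value"
    return None


# Attributes to check and the rule applied to each (illustrative selection).
RULES = {
    "GlueCEStateWaitingJobs": check_job_count,
    "GlueCEStateRunningJobs": check_job_count,
}


def validate(entries):
    """Return a list of (dn, attribute, message) tuples, one per problem found."""
    problems = []
    for entry in entries:
        for attribute, rule in RULES.items():
            if attribute not in entry:
                problems.append((entry["dn"], attribute, "missing"))
                continue
            message = rule(entry[attribute])
            if message is not None:
                problems.append((entry["dn"], attribute, message))
    return problems


# Made-up entries: ce1 is clean, ce2 has a placeholder and a missing attribute.
entries = [
    {"dn": "ce1", "GlueCEStateWaitingJobs": "12", "GlueCEStateRunningJobs": "3"},
    {"dn": "ce2", "GlueCEStateWaitingJobs": "444444"},
]
problems = validate(entries)
```

Keeping the rules in a dictionary makes it cheap to add a targeted check for a single attribute that an experiment cares about.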

WLCG Sites Validation Monitoring

Nagios Monitoring

Specific VO validation
- SRM vs BDII storage capacity numbers
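A comparison of this kind reduces to checking whether two capacity figures agree within some tolerance. The sketch below is a hypothetical helper; the 5% default tolerance is an arbitrary assumption, not a value from the lecture.

```python
def capacity_mismatch(srm_bytes, bdii_bytes, tolerance=0.05):
    """Return True when the two capacity figures differ by more than `tolerance`.

    The difference is measured relative to the larger of the two values.
    """
    reference = max(srm_bytes, bdii_bytes)
    if reference == 0:
        return False  # both values are zero: nothing to compare
    return abs(srm_bytes - bdii_bytes) / reference > tolerance
```

For example, figures of 100 and 96 units agree within the default tolerance, while 100 and 90 would be flagged for follow-up with the site.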

THE FUTURE

Common solution
- Experiments have implemented their own information systems
- AGIS has been evaluated for CMS
- It would be interesting to unify the effort and the work towards a common solution

Cloud resources
- GLUE schema is being extended to define cloud resources
  - A new version, GLUE 2.1, is expected by the end of the year
  - No impact on the current version, just an extension
  - It will require a BDII update that is transparent to users
- EGI is already publishing cloud resources in the BDII using the GLUE 2.0 data model

Open questions
- The BDII now seems to be a stable and mature service offering robustness and performance
- However, it’s not the best architecture for combining static and dynamic information
  - Move more static information to GOCDB/OIM
  - Use messaging for publishing dynamic information
- Information quality still requires an ongoing effort
  - Reduce the amount of information currently published by the BDII
  - Enforce good quality before publication and otherwise publish nothing
- Experiments have deployed their own information system solutions
  - Effort is duplicated
  - We could learn from each other’s experience and get a clearer idea of how information systems should evolve

QUESTIONS?