EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Site Monitoring for Grid Services WLCG Grid.
Advertisements

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Infrastructure overview Arnold Meijster &
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
OSG Middleware Roadmap Rob Gardner University of Chicago OSG / EGEE Operations Workshop CERN June 19-20, 2006.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite IPv6 compliance project tests Further.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
02/07/09 1 WLCG NAGIOS Kashif Mohammad Deputy Technical Co-ordinator (South Grid) University of Oxford.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PPS All sites Meeting: Introduction & Agenda.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GRNET SA3 Progress Report Ioannis Liabotis.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios for Grid Services E. Imamagic, SRCE.
INFSO-RI Enabling Grids for E-sciencE Experience with monitoring of Prague T2 site Tomáš Kouba NEC 2007, Varna, Bulgaria
INFSO-RI Enabling Grids for E-sciencE SA1 and gLite: Test, Certification and Pre-production Nick Thackray SA1, CERN.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid Monitoring Tools Alexandre Duarte CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Introduction to GILDA and gaining access.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Site Monitoring with Nagios E. Imamagic,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Operations Automation Team KoM, May ROC VIEW (SWE)‏ Javier Lopez Cacheiro/
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Communication tools between Grid Virtual.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Deliverable DSA1.4 Jules Wolfrat ARM-9 –
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
INFSO-RI Enabling Grids for E-sciencE Installing & configuring Joachim Flammer Integration Team, CERN EMBRACE Tutorial, Clermont-Ferrand.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite – UNICORE interoperability Daniel Mallmann.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
INFSO-RI Enabling Grids for E-sciencE Operations Parallel Session Summary Markus Schulz CERN IT/GD Joint OSG and EGEE Operations.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n
1 Grid Service Monitoring James Casey, CERN IT-GD WLCG/OSG Operations Meeting 14th June 2007.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Monitoring Working Group Update Grid Deployment Board 5 th December, CERN Ian Neilson.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks New WLCG Grid Service Monitoring Displays.
Enabling Grids for E-sciencE EGEE-II INFSO-RI ROC managers meeting at EGEE 2007 conference, Budapest, October 1, 2007 Admin Matters Vera Hanser.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
NGI and Site Nagios Monitoring
Use of Nagios in Central European ROC
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Monitoring in EGEE Automatisierung & Regionalisierung im Hinblick auf EGI Torsten Antoni (KIT), James Casey (CERN), Sabine Reißer (KIT)
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J. Casey, CERN E. Imamagic, SRCE ISGC 2008

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 2 Overview Nagios Nagios-based grid monitoring Site monitoring prototype Demo Current status Future work Conclusions

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 3 Nagios Open source monitoring framework –widely used & actively developed Host and service problems detection and recovery Provides wide set of basic sensors –easy to develop custom sensors Centralized vs. distributed deployment High configurability –service dependencies, fine-grained notification options Web interface –status view, administration

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 4 Nagios-based Grid Monitoring Monitoring CRO-GRID Infrastructure ( ) –Globus Toolkit Pre-WS & WS, UNICORE, other services –active recovery of services – Monitoring EGEE resources in Central Europe (CE) –core services since mid 2006 –all CE sites for 1st line support since September 2006 – Grid Services Monitoring (GSM) WG –site monitoring prototype, mid 2007 – (egee.srce.hr) – (CERN-PPS)

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 5 Site Monitoring Prototype … Site nodes Site BDII CESELFC MyProxy Refresh proxy Get VOMS proxy Service checks Get remote results Probe descriptions … Get site’s & nodes information Get nodes information Live node checks Get Nagios results Site admins Get site status Issue alarms Monitoring server

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 6 Grid Probes Provided by SRCE, CERN, OSG Security facilities & services –CA distribution, Certificate lifetime, MyProxy Monitoring & information services –R-GMA, BDII, MDS, GridICE Job management services –Globus Gatekeeper, RB, WMS, WMProxy, Job matching File management services –GridFTP, SRM, DPNS, LFC, FTS

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 7 Standard Components Specifications defined by GSM WG Probe wrapper –enables integration of standardized probes –Grid Monitoring Probes Specification – ificationhttps://twiki.cern.ch/twiki/bin/view/LCG/GridMonitoringProbeSpec ification Publisher & remote gatherers –integration with other tools –Grid Monitoring Data Exchange Standard – ngeStandardhttps://twiki.cern.ch/twiki/bin/view/LCG/GridMonitoringDataExcha ngeStandard

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 8 Nagios Config Generator Uses multiple information sources –SAM, BDII, active heuristic checks Modular approach –plugging in additional information sources –integration with other monitoring systems (e.g. LEMON) User-defined rules –configuration tuning for non-standard grid sites Standalone configuration –integration with existing Nagios server

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 9 Remote gLite UI Avoid installation of grid middleware on Nagios server –execute grid probes on existing gLite UI –use Nagios Remote Plugin Executor (NRPE) … Site nodes Site BDII CESELFC Service checks

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 10

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 11

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 12

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 13 SAM Standard probes NPM

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 14 Current Status Three sets of standard probes integrated –SRCE, CERN, OSG Two external monitoring systems –SAM, ENOC DownCollector Several deployments –CERN-PPS, SRCE, NIKHEF, PIC, IN2P3, ScotGrid RPMs in apt and yum repository Installation and configuration manual More info

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 15 Future Work NCG development –providing configuration for multiple sites (regional monitoring) –providing configuration for multiple VOs Integration with global monitoring systems –ActiveMQ messaging system –Operations Automation Team mandate Enabling “on-host” check via NRPE –process, logs, ports, files, etc Probe description & site topology databases definition

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 16 Conclusions Nagios –highly configurable monitoring framework with notifications, service dependencies, … –widely used by site admins Grid extensions –integration with existing infrastructure (user certificates, VOMS, GOCDB, SAM) –probes for key grid services Implementation of GSM WG specifications –probe wrapper, publisher & remote gatherers –easy integration with existing probes and monitoring systems

Enabling Grids for E-sciencE EGEE-II INFSO-RI ISGC 2008 / Simply monitor a grid site with Nagios 17 Thank You! Questions?