EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.





EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin - CNRS/IN2P3 Amsterdam, ROD team Workshop

Enabling Grids for E-sciencE EGEE-III INFSO-RI COD-16 Transition meeting
Context
The dashboard was developed during EGEE I, II and III at IN2P3-CC within the SA1 activity. The aim of the project was to gather on a single interface as much information as possible that is useful for daily operations, and to make it easier for operators to follow the procedures.
Initially the dashboard was hosted on the CIC Portal.
- This portal is being migrated step by step to the Operations Portal.
- The dashboard is the first module we have migrated.
Since the beginning of May we have been involved in the JRA1 activity of EGI.
Our objectives for the coming year:
- Migrate other features
- Improve the dashboard module (regional helpdesk, ergonomics, ...)
- Distribute these features in a package
- Propose programmatic interfaces (XML, JSON)

The dashboard is a tool designed to follow and track problems on sites. It is an integration platform that offers a synoptic view of different data sources:
- Gstat, a tool monitoring the information published by sites
- Nagios, a system and network monitoring application
- SAM, a job submission framework (used for VO-specific tests)
- GOC DB, the database of sites
- GGUS, the global ticketing system
- BDII, an LDAP repository with dynamic information published by sites
In summary, you track problems using the results from the monitoring tools (Nagios, Gstat), and you can open and update trouble tickets in GGUS directly from the dashboard. We also use GOC DB and BDII to consolidate the monitoring information with downtime information and dynamic statuses.
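
The consolidation of monitoring results with downtime information could look roughly like the sketch below. It is only an illustration: the function name, the record layout and the site names are assumptions, not the dashboard's actual data model, and the real downtime data would come from GOC DB rather than from an in-memory list.

```python
from datetime import datetime


def annotate_with_downtime(alarms, downtimes):
    """Return copies of the alarms, each flagged with whether it was
    raised while its site was in a declared downtime.

    alarms:    list of dicts with keys "site", "test", "raised_at"
               (hypothetical layout)
    downtimes: list of (site, start, end) tuples (hypothetical layout)
    """
    annotated = []
    for alarm in alarms:
        in_downtime = any(
            site == alarm["site"] and start <= alarm["raised_at"] <= end
            for site, start, end in downtimes
        )
        # Copy the alarm so the caller's records are left untouched.
        annotated.append({**alarm, "in_downtime": in_downtime})
    return annotated


alarms = [
    {"site": "SITE-A", "test": "example-test", "raised_at": datetime(2009, 6, 1, 12)},
    {"site": "SITE-B", "test": "example-test", "raised_at": datetime(2009, 6, 1, 12)},
]
downtimes = [("SITE-A", datetime(2009, 6, 1, 0), datetime(2009, 6, 2, 0))]
consolidated = annotate_with_downtime(alarms, downtimes)
```

With such a flag an operator can tell at a glance whether an alarm is expected because the site has announced a downtime.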

Central Instance: Architecture

The main page

1st Level: the synoptic view
- Site name + info
- Alarms
- Tickets
- Downtimes
- Global information
Actions:
- Open a ticket without “alarms”
- Send a notepad to the site
- See the graph of alarms or downtimes
- Refresh information

2nd Level: Access to details

Other Pages
- C-COD view (restricted access): a synoptic view of information related to problematic sites (alarms older than 72 h, expired tickets, tickets open for more than one month)
- Handover: a tool to report or share problems between regional teams or within the C-COD team
- User List: set up your own lists of sites to use in the dashboard
- Regional List: view regional information (contacts, responsible persons)
- GridMap: visualize the state of your grid with GridMaps
- How-to / user documentation
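
The selection criteria of the C-COD view can be sketched as a simple filter. This is a minimal illustration, assuming hypothetical tuple layouts for alarms and tickets and approximating "one month" as 30 days; it is not the portal's actual query.

```python
from datetime import datetime, timedelta

ALARM_MAX_AGE = timedelta(hours=72)
TICKET_MAX_AGE = timedelta(days=30)  # assumption: "one month" taken as 30 days


def problematic_sites(alarms, tickets, now):
    """Return the sorted names of sites matching a C-COD criterion.

    alarms:  iterable of (site, raised_at)              (hypothetical layout)
    tickets: iterable of (site, opened_at, expires_at)  (hypothetical layout)
    """
    sites = set()
    for site, raised_at in alarms:
        if now - raised_at > ALARM_MAX_AGE:  # alarm older than 72 h
            sites.add(site)
    for site, opened_at, expires_at in tickets:
        expired = expires_at < now
        too_old = now - opened_at > TICKET_MAX_AGE
        if expired or too_old:
            sites.add(site)
    return sorted(sites)


now = datetime(2009, 6, 10)
alarms = [
    ("SITE-A", now - timedelta(hours=80)),
    ("SITE-B", now - timedelta(hours=10)),
]
tickets = [
    ("SITE-C", now - timedelta(days=5), now - timedelta(days=1)),   # expired
    ("SITE-D", now - timedelta(days=40), now + timedelta(days=2)),  # open too long
    ("SITE-B", now - timedelta(days=2), now + timedelta(days=3)),   # fine
]
flagged = problematic_sites(alarms, tickets, now)
```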

Nagios Integration

The “notifications” workflow
The global workflow is based on the exchange of notifications between Nagios and the Lavoisier WS. The decision to send out notifications is made in the service check and host check logic:
* When a hard state change occurs. More information on state types and hard state changes can be found here:
  o tatetypes.html
  o otifications.html
* When a host or service remains in a hard non-OK state and the time specified by the notification_interval option in the host or service definition has passed since the last notification was sent out.
At this point Lavoisier is connected to the broker and to the topic that carries all notifications. We apply a filter to these notifications:
- on the role, the name of the ROC/NGI and the hostname, to distinguish the CERN Nagios from the Nagios box in the region
- on the test name and the status, to keep only tests defined as critical.
If a notification passes the filter, we send an acknowledgment notification to a specific broker and register the notification in the DB. If a notification is already registered in the DB for the given host and service, we just update its status. In case of a problem (in the notification system or in our Web Service), the acknowledgment mechanism makes it possible to send the notifications again.
These steps can explain differences between what you see on the Nagios interface and on the dashboard interface.
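
The filter, acknowledge and register steps described above can be sketched as follows. This is not the Lavoisier implementation: the field names, the `handle` function, the in-memory dictionary standing in for the DB, the list standing in for the acknowledgment broker and the set of tracked statuses are all assumptions made for illustration.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Notification:
    # Hypothetical field names; the real message format is not shown here.
    role: str         # role of the emitting Nagios instance
    region: str       # name of the ROC/NGI
    nagios_host: str  # Nagios box that sent the notification
    hostname: str     # monitored host
    service: str      # test name
    status: str       # e.g. "OK", "WARNING", "CRITICAL"


def passes_filter(n, allowed_sources, critical_tests, tracked_statuses):
    """Keep notifications from an expected Nagios box for a critical test."""
    source_ok = (n.role, n.region, n.nagios_host) in allowed_sources
    return source_ok and n.service in critical_tests and n.status in tracked_statuses


def handle(n, store, acks, allowed_sources, critical_tests,
           tracked_statuses=frozenset({"OK", "WARNING", "CRITICAL"})):
    """Filter, acknowledge, then insert or update the record for (host, service)."""
    if not passes_filter(n, allowed_sources, critical_tests, tracked_statuses):
        return False
    acks.append((n.hostname, n.service, n.status))  # stand-in for the ack message
    store[(n.hostname, n.service)] = n.status       # one record per host and test
    return True


store, acks = {}, []
sources = {("ROC", "France", "nagios.example.org")}
critical = {"example-critical-test"}
first = Notification("ROC", "France", "nagios.example.org",
                     "ce01.example.org", "example-critical-test", "CRITICAL")
handled_first = handle(first, store, acks, sources, critical)
handled_update = handle(replace(first, status="OK"), store, acks, sources, critical)
handled_other = handle(replace(first, service="other-test"), store, acks, sources, critical)
```

Keying the store on the pair (host, service) means a later notification for the same pair overwrites the earlier status instead of creating a second record.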

Next steps
Nagios filtering improvements:
- A dynamic filter is in place, based on an online configuration file provided by the Nagios team
- Add to this filter the list of critical tests per ROC
- The Problem ID is not taken into account as a primary key for the notifications. This means that only one record is active at any time for a given host and a given test.
- An acknowledgment mechanism is in place. It could be used in case of a problem between the notification system and Lavoisier.
Other improvements:
- On the main site view, your default site list will be loaded directly.
- Access will no longer be limited to people registered in GOC DB (a certificate is enough)
- A new alarm might be masked by an assigned one
- Optimize the DB to improve performance

Regional package
The application has been modified to cope fully with a regional context. A synchronization system is in place to exchange information with the future regional instances.
The package will be delivered as 2 modules:
- The Lavoisier Web Service: an RPM file (downloadable from SVN)
- The PHP part: direct checkout from SVN
The package will be released on June 8th:
- The Czech NGI and the Portuguese NGI will evaluate the package first
- After this, the package will be distributed more widely

Regional Package - Synchronization

Links
- Lavoisier Web Service:
- Operations Portal: Documentation, paper, posters; Tracking System (you can also use GGUS)
- Dashboard: URL; User documentation
- To contact us: