Download presentation
Presentation is loading. Please wait.
1
Regional Grid Monitoring - timeline
Nick Thackray CERN COD Meeting, 27 January 2010, Lyon
2
Time Line Jan Feb Mar Apr TODAY
Compare results in prod dashboard + SAM with test dashboard + Nagios Big-Bang: switch prod dashboard from using SAM to using Nagios results Clean up CERN-based “project” Nagios boxes Jan Feb Mar Apr TODAY ROCs install regional Nagios. Check that test results from regional Nagios are same as test results from “project” Nagios. Fix issues where found. ROC by ROC, switch from using “project” Nagios to regional Nagios for monitoring
3
Compare results in prod dashboard + SAM with test dashboard + Nagios
Big-Bang: switch prod dashboard from using SAM to using Nagios results Clean up CERN-based “project” Nagios boxes Jan Feb Mar Apr TODAY ROCs install regional Nagios. Check that test results from regional Nagios are same as test results from “project” Nagios. Fix issues where found. ROC by ROC, switch from using “project” Nagios to regional Nagios for monitoring
4
Now end of January Production system and test system in place.
Comparison carried out between the alarms raised in the production system and those raised in the test system. GGUS Production dashboard (Symphony) SAM Compare these for a while to check they are equivalent Test GGUS Test dashboard (Symphony) Project Nagios boxes CERN) Prod msg bus
5
Compare results in prod dashboard + SAM with test dashboard + Nagios
Big-Bang: switch prod dashboard from using SAM to using Nagios results Clean up CERN-based “project” Nagios boxes Jan Feb Mar Apr TODAY ROCs install regional Nagios. Check that test results from regional Nagios are same as test results from “project” Nagios. Fix issues where found. ROC by ROC, switch from using “project” Nagios to regional Nagios for monitoring
6
Production dashboard (Symphony) Test dashboard (Symphony)
Before 12th February Switch over from SAM to Nagios GGUS Production dashboard (Symphony) SAM Test GGUS Test dashboard (Symphony) Project Nagios boxes CERN) Prod msg bus Project Nagios boxes CERN) MyEGEE
7
Compare results in prod dashboard + SAM with test dashboard + Nagios
Big-Bang: switch prod dashboard from using SAM to using Nagios results Clean up CERN-based “project” Nagios boxes Jan Feb Mar Apr TODAY ROCs install regional Nagios. Check that test results from regional Nagios are same as test results from “project” Nagios. Fix issues where found. ROC by ROC, switch from using “project” Nagios to regional Nagios for monitoring
8
By 26th February at the latest
Check (using MyEGEE) that each regional Nagios gives same results as project Nagios Project Nagios boxes CERN) Regional ROC Nagios boxes Project Nagios boxes CERN) Project Nagios boxes CERN) Project Nagios boxes CERN) Project Nagios boxes CERN) MyEGEE Project Nagios boxes CERN) MyEGEE For each ROC, check that results match. If not, fix the problem.
9
Compare results in prod dashboard + SAM with test dashboard + Nagios
Big-Bang: switch prod dashboard from using SAM to using Nagios results Clean up CERN-based “project” Nagios boxes Jan Feb Mar Apr TODAY ROCs install regional Nagios. Check that test results from regional Nagios are same as test results from “project” Nagios. Fix issues where found. ROC by ROC, switch from using “project” Nagios to regional Nagios for monitoring
10
Production dashboard (Symphony)
Before 19th March Switch over from Project to Regional Nagios Production dashboard (Symphony) Project Nagios boxes CERN) Project Nagios boxes CERN) Regional Nagios boxes
11
Summary of timeline By 31 Jan : Make sure that Dashboard+Nagios gives equivalent monitoring results to Dashboard+SAM By 12 Feb : Switch central dashboard to use Nagios results instead of SAM results Between now and 26th Feb : Make sure (using MyEGEE) that each regional Nagios gives the same results as the project Nagios Before 19th March : In production, replaced all project Nagios boxes by the ROCs regional Nagios
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.