
EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,


1 Regional Nagios
Emir Imamagic / SRCE
EGEE’09, Barcelona, Spain
EGEE-III INFSO-RI-222667, Enabling Grids for E-sciencE, www.eu-egee.org
EGEE and gLite are registered trademarks

2 Overview
Introduction
Architecture
Nagios
Nagios Config Generator
Messaging System
Nagios & Messaging System Integration
Conclusion
Links

3 Introduction
Improve the reliability of the grid by giving grid administrators better tools
Rely on an existing, widely accepted solution
Provide a system that fits the current and future organizational model
Integrate components and automate operations to reduce manpower

4 Architecture

5 Nagios
What is Nagios?
  –open source monitoring framework
  –highly flexible with advanced features
  –widely used & actively developed
Why do we need it?
  –probes need to be executed
  –avoid development & maintenance of home-grown tools
  –provide a solution admins are familiar with

6 Nagios Config Generator
What is the Nagios Config Generator (NCG)?
  –automatic generation of Nagios configuration
  –based on multiple information sources
  –simple bootstrap of Nagios instances
Why do we need it?
  –configuring Nagios is hard
  –information is out there, why not use it?
  –consistent configuration of entities
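To illustrate what "automatic generation of Nagios configuration" can mean in practice, here is a minimal sketch in Python. It is not NCG itself and does not reflect NCG's actual code or output. The `Endpoint` class, the `generic-host` and `generic-service` template names, the check command names and the example host are all hypothetical; only the `define host{ ... }` / `define service{ ... }` object syntax is standard Nagios.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoint:
    hostname: str  # fully qualified name of the service node
    site: str      # site the node belongs to
    flavour: str   # service type, e.g. "CE" or "SRM"


def render_host(ep: Endpoint) -> str:
    """Render one Nagios host object (template name is an assumption)."""
    return (
        "define host{\n"
        "    use        generic-host\n"
        f"    host_name  {ep.hostname}\n"
        f"    alias      {ep.site}\n"
        f"    address    {ep.hostname}\n"
        "}\n"
    )


def render_service(ep: Endpoint, description: str, command: str) -> str:
    """Render one Nagios service object bound to the endpoint's host."""
    return (
        "define service{\n"
        "    use                  generic-service\n"
        f"    host_name            {ep.hostname}\n"
        f"    service_description  {description}\n"
        f"    check_command        {command}\n"
        "}\n"
    )


def generate(endpoints, checks):
    """Build a configuration from topology data.

    `checks` maps a service flavour to a list of
    (service_description, check_command) pairs.
    """
    blocks = []
    seen_hosts = set()
    for ep in endpoints:
        # A node hosting several services is defined as a host only once.
        if ep.hostname not in seen_hosts:
            seen_hosts.add(ep.hostname)
            blocks.append(render_host(ep))
        for description, command in checks.get(ep.flavour, []):
            blocks.append(render_service(ep, description, command))
    return "\n".join(blocks)


if __name__ == "__main__":
    topology = [Endpoint("ce01.example.org", "EXAMPLE-SITE", "CE")]
    probes = {"CE": [("hypothetical-job-submit", "check_job_submit")]}
    print(generate(topology, probes))
```

The point of the sketch is that hosts and checks are derived from data rather than written by hand, which is what keeps the configuration of many entities consistent.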

7 Nagios Config Generator - Information Sources
Database components
  –Aggregated Topology Provider (ATP)
  –Metric Description Database (MDDB)
Operations services
  –GOCDB, SAM, ENOC
Grid information services
  –BDII
Static files
  –https://twiki.cern.ch/twiki/bin/view/EGEE/GridMonitoringNcgOverview#Static_file_rules

8 Nagios Config Generator - Probes
Local probes
  –probes executed by Nagios
  –SAM probes (CE, WN and SRM)
  –WLCG probes (SRCE, CERN)
  –BDII & Gstat probes
  –Nagios native probes
  –lightweight service checks (ENOC Downcollector)
Contributions welcome
  –http://nagiosplug.sourceforge.net/developer-guidelines.html
  –https://twiki.cern.ch/twiki/bin/view/EGEE/EGEESA1BuildingPackages
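For readers considering a contribution, the sketch below shows the general shape of a local probe written as a Nagios plugin. It follows the plugin conventions of exit codes 0 (OK), 1 (WARNING), 2 (CRITICAL) and 3 (UNKNOWN) with a single line of status output. The TCP check and the function names are illustrative assumptions, not one of the SAM or WLCG probes listed above.

```python
import socket

# Exit codes defined by the Nagios plugin conventions.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
STATUS_NAMES = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def check_tcp(host, port, timeout=5.0):
    """Return (exit_code, message) for a simple TCP connect test."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return OK, f"port {port} on {host} is reachable"
    except OSError as exc:
        # Covers refused connections, timeouts and name resolution errors.
        return CRITICAL, f"cannot connect to {host}:{port} ({exc})"


def format_result(code, message):
    """Build the single status line Nagios shows for the check."""
    return f"{STATUS_NAMES[code]}: {message}"


def main(argv):
    """Entry point: prints one status line and returns the exit code."""
    if len(argv) != 3:
        print(format_result(UNKNOWN, f"usage: {argv[0]} HOST PORT"))
        return UNKNOWN
    try:
        port = int(argv[2])
    except ValueError:
        print(format_result(UNKNOWN, f"invalid port: {argv[2]}"))
        return UNKNOWN
    code, message = check_tcp(argv[1], port)
    print(format_result(code, message))
    return code
```

Used as a real plugin, the script would end with `sys.exit(main(sys.argv))` so that Nagios receives the exit code. Reporting bad arguments as UNKNOWN rather than CRITICAL keeps probe misconfiguration from being mistaken for a service failure.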

9 Nagios Config Generator - Probes
Remote probes
  –results imported from external systems
  –remote Nagios instances
  –classic SAM monitoring system
  –ENOC Downcollector
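The slides do not say how imported results are handed to Nagios. One standard Nagios mechanism for results produced elsewhere is the passive check, submitted through the external command `PROCESS_SERVICE_CHECK_RESULT`. The sketch below only formats such a command line, assuming that mechanism. The function name and the host and service names in the usage note are hypothetical.

```python
import time


def passive_result_command(host, service, return_code, output, timestamp=None):
    """Format a PROCESS_SERVICE_CHECK_RESULT external command line.

    The layout is:
    [time] PROCESS_SERVICE_CHECK_RESULT;host;service;return_code;output
    """
    if return_code not in (0, 1, 2, 3):
        raise ValueError("return_code must be 0 (OK), 1, 2 or 3 (UNKNOWN)")
    if ";" in host or ";" in service:
        # Semicolons are the field separator, so they cannot appear here.
        raise ValueError("host and service names must not contain ';'")
    if timestamp is None:
        timestamp = int(time.time())
    # The command occupies a single line, so flatten multi-line output.
    flat_output = " ".join(output.splitlines())
    return (
        f"[{timestamp}] PROCESS_SERVICE_CHECK_RESULT;"
        f"{host};{service};{return_code};{flat_output}"
    )
```

For example, `passive_result_command("ce01.example.org", "hypothetical-remote-check", 2, "job submission failed", timestamp=1254300000)` yields `[1254300000] PROCESS_SERVICE_CHECK_RESULT;ce01.example.org;hypothetical-remote-check;2;job submission failed`. In a running instance the line would be written to the Nagios external command file, whose location depends on the local configuration.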

10 Nagios Config Generator
Network topology information
  –distinguish service failure from network failure
Feedback from regional to site instance
  –via the messaging system
Feedback to operational tools
  –Dashboard, Metric Result Store
Multiple VO support
  –execute probes for multiple VOs
Packages for SL4 & SL5 available
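The first point, network topology information, can be sketched as follows. In Nagios a host definition can name its `parents`. When a host's check fails and its parent has failed too, Nagios reports the host as UNREACHABLE instead of DOWN. The function below is a simplified single-parent model of that rule, not Nagios code, and the host names in the usage note are hypothetical.

```python
def classify_failures(failed, parents):
    """Label each failed host as DOWN or UNREACHABLE.

    failed  -- set of host names whose check failed
    parents -- dict mapping a host name to its parent host name,
               or to None when the host has no parent
    """
    states = {}
    for host in failed:
        parent = parents.get(host)
        if parent is not None and parent in failed:
            # The path to the host is broken, so its own state is unknown.
            states[host] = "UNREACHABLE"
        else:
            # The path is fine, so the failure belongs to the host itself.
            states[host] = "DOWN"
    return states
```

With `parents = {"router": None, "ce01": "router"}`, a failure of both `router` and `ce01` classifies `router` as DOWN and `ce01` as UNREACHABLE, while a failure of `ce01` alone classifies it as DOWN. This is what lets an administrator tell a network problem from a service problem.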

11 Messaging System
What is the messaging system?
  –standardized, asynchronous and scalable communication between distributed entities
  –reliable network of brokers that provides guaranteed delivery of messages
  –https://twiki.cern.ch/twiki/bin/view/EGEE/MsgArchitecture
Why do we need it?
  –interaction between distributed monitoring components
  –standard interface enables integration of components

12 Messaging System
FUSE Message Broker
  –based on Apache ActiveMQ
  –industry support contract is being negotiated
  –training organized in July
Deployment
  –networked brokers at CERN and SRCE
  –https://twiki.cern.ch/twiki/bin/view/EGEE/MsgServerDetails
Packages for SL4 & SL5 available

13 Nagios & Messaging System Integration

14 Conclusion
Multilevel monitoring based on proven commodity software
System fits the organizational model of the grid
Provide the means for administrators to better monitor their services
Integration with existing components to automate operations of monitoring instances

15 Links
OAT web page
  –https://twiki.cern.ch/twiki/bin/view/EGEE/OAT_EGEE_III
OAT Multi-level monitoring architecture
  –https://twiki.cern.ch/twiki/bin/view/EGEE/MultiLevelMonitoringOverview
OAT Milestones
  –https://twiki.cern.ch/twiki/bin/view/EGEE/MultiLevelMonitoringMilestones
Operations Automation Strategy
  –https://edms.cern.ch/document/927171

16 Thank You! Questions?

