Introduction To Nagios A Linux-based Monitoring System.

Slides:



Advertisements
Similar presentations
NAGIOS AND CACTI NETWORK MANAGEMENT AND MONITORING SYSTEMS.
Advertisements

1 Network Monitoring with Nagios Asian Internet Interconnection Initiatives Project Yan Adikusuma Nara Institute of Science and Technology
Nagios: An introduction and Brief Tutorial
Nagios on Tier1 farm Jonathan Wheeler RAL Tier1 Fabric Team 20 th June 2008.
Nagios System monitoring, the easy way. What is Nagios Nagios watches your computers through user-defined commands It can be set to inform you when a.
Nagios and Kentix System Partners - Critical Monitoring Alerts in Your Hand Tanja Lewit
How to Monitor Ingres with Open Source Tools
Network & System Monitoring with Nagios & Cacti Kevin Mueller.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
1 CHEP 2000, Roberto Barbera Roberto Barbera (*) Grid monitoring with NAGIOS WP3-INFN Meeting, Naples, (*) Work in collaboration with.
Network Monitoring with Nagios and other tools Wednesday, 19 July 2006 Martin B. Smith
Network Monitoring with Nagios. Joint technical meeting #2 What is a network monitoring system? A combination of hardware and software used to administer.
M. Bechtel, S. Blümel, A. Quignon1 Linux Network Server Group: Nagios Marc Bechtel Sebastian Blümel Alexandre Quignon.
R. Lange, M. Giacchini: Monitoring a Control System Using Nagios Monitoring a Control System Using Nagios Ralph Lange, BESSY – Mauro Giacchini, LNL.
SEE-GRID-SCI Monitoring Tools
11 Distributed Monitoring and Cloud Scaling for Web Apps Fernando Hönig
Papeete, French Polynesia PacNOG 5 Papeete, French Polynesia 17 June 2009 Hervey Allen.
These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
Passive Monitoring with Nagios Jim Prins
Your university or experiment logo here Nagios: An introduction and Brief Tutorial Chris Brew SciTech/PPD.
WhatsUp Gold v15 – WhatsUp Companion 3.7 WhatsUp Companion Extended
Unix Linux Administration II Class 10: Shell prompts and functions.
Josh Riggs Utilizing Open Source Network Monitoring.
Inventory:OCSNG + GLPI Monitoring: Zenoss 3
workshop eugene, oregon Nagios Network Design and Operations 24 July 2009
1. A key measurement tool for actively monitoring availability of devices and services. Possible the most used open source network monitoring software.
2010 These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
ITIS 1210 Introduction to Web-Based Information Systems Chapter 23 How Web Host Servers Work.
Rational Configuration Design John Murphy To Prevent Irrational Problem Solving.
INFN-GRID Testbed Monitoring System Roberto Barbera Paolo Lo Re Giuseppe Sava Gennaro Tortone.
Network Monitoring System for the UNIX Lab Bradley Kita Capstone Project Mentor: Dr C. David Shaffer Fall 2004/Spring 2005.
Alerting With MySQL and Nagios Sheeri Cabral Senior DB Admin/Architect
11 Distributed Monitoring for Web Apps Fernando Hönig
Network Monitoring Manage your business without blowing your budget. Learn how the Calhoun ISD utilizes free “Open Source” tools for real-time monitoring.
Nagios Speaker: Yan-Shiang Wang Adviser: Quincy Wu Date: 2005/12/21.
Nagios The monitoring tool. Why ? Nagios is a powerful, modular network monitoring system that can be used to monitor many network services like smtp,
NAGIOS 1. Introduction A key measurement tool for actively monitoring availability of devices and services. Possible the most used open source network.
CS2204: Introduction to Unix January 19 th, 2004 Class Meeting 1 * Notes adapted by Christian Allgood from previous work by other members of the CS faculty.
Networking in Linux. ♦ Introduction A computer network is defined as a number of systems that are connected to each other and exchange information across.
2010 These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
Master thesis Analysis and implementation of monitoring systems of active network equipment. Scientific advisor: Univ. Prof., Dr. Hab., Pavel TOPALA Master.
Apache Web Server Architecture Chaitanya Kulkarni MSCS rd April /23/20081Apache Web Server Architecture.
1 Grid Monitoring with Nagios Aries Hung, Joanna Huang, Felix Lee, Min Tsai ASGC WLCG T2 Asia Workshop TIFR, Dec 2, 2006.
2010 NAGIOS APRICOT 2010 Kuala Lumpur, Malaysia.
Monitoring Alfresco with Nagios/Icinga Toni de la Fuente Alfresco Senior Solutions Engineer Blog: blyx.com
Queensland University of Technology Nagios – an Open Source monitoring solution and it’s deployment at QUT.
APACHE INSTALL Linux: CentOS 6.5 WHAT IS APACHE Apache is an open source application which is governed by the GNU licensing for use and alteration of.
Alerting With MySQL and Nagios Sheeri Cabral Senior DB Admin/Architect
ADMINISTRACIÓN DE SERVICIOS PARA LA WEB BRENDA GONZÁLEZ GÓMEZ
Nagios - introduction Dhruba Raj Bhandari ( CCNA ) p Additions by Phil Regnauld.
2009 Manila, Philippines Nagios APRICOT February 2009 Hervey Allen.
Nagios FTW TriLUG 8/10/06 Presented by: Jason Faulkner Ian Kilgore.
1 Hervey Allen Sebastian Buettrich PacNOG 6 Nadi, Fiji.
Monitoring Asterisk with Icinga2
Monitoring IBMi by Nagios Core & Nagiosgraph
Network Management Workshop March – Bangkok, Thailand
Advisor : Quincy Wu Speaker : Xang-Ting Date : 2010/06/08
Consulting Services JobScheduler Architecture Decision Template
Application or server monitoring
Use of Nagios in Central European ROC
What is nagios? Version 2 8/ M.A.Newhall.
INFNGRID Monitoring Group report
z/Ware 2.0 Technical Overview
Consulting Services JobScheduler Architecture Decision Template
Monitoring with Nagios
Objects Mike Weber
PHP / MySQL Introduction
Nagios – Our Open Source Network Management Solution
How to monitor the $H!T out of Hadoop
Introduction to JBoss application server
Presentation transcript:

Introduction To Nagios A Linux-based Monitoring System

What Is Nagios? Nagios is a system that monitors availability of network resources, such as hosts and services. Nagios is a system that monitors availability of network resources, such as hosts and services. It enables you to identify and resolve IT infrastructure problems before they affect critical processes. It enables you to identify and resolve IT infrastructure problems before they affect critical processes.

Brief History Originally created under the name NetSaint, it was written and is maintained by Ethan Galstad along with a group of plugin developers Originally created under the name NetSaint, it was written and is maintained by Ethan Galstad along with a group of plugin developers

History cont. Launched in March of 1999 under the GNU General Public License Launched in March of 1999 under the GNU General Public License March 2002 due to trademark issues with the name “NetSaint” Ethan decides to rename the project to Nagios, a recursive acronym that stands for “Nagios ain’t gonna insist on sainthood”. March 2002 due to trademark issues with the name “NetSaint” Ethan decides to rename the project to Nagios, a recursive acronym that stands for “Nagios ain’t gonna insist on sainthood”.

Requirements * A machine running Linux or Unix-variant * A machine running Linux or Unix-variant * C compiler, e.g. gcc * C compiler, e.g. gcc TCP/IP configured TCP/IP configured CGIs (optional) apache web server, Thomas Boutell’s gd library version or higher. Used by the statusmap and trends CGIs. CGIs (optional) apache web server, Thomas Boutell’s gd library version or higher. Used by the statusmap and trends CGIs. * Must have

What Can Nagios Monitor? Applications (tomcat servers) Applications (tomcat servers) Host Resources (cpu load, disk space) Host Resources (cpu load, disk space) Infrastructure components (routers, switches) Infrastructure components (routers, switches) Database servers (mySQL, Oracle) Database servers (mySQL, Oracle) Network services (http, ssh, ping) Network services (http, ssh, ping) Web servers Web servers Mail servers Mail servers

Nagios Configuration Nagios.cfg Nagios.cfg CGI.cfg CGI.cfg Resource.cfg Resource.cfg Object Definition Files Object Definition Files Commands Commands Hosts and Services Hosts and Services Contacts and contact groups Contacts and contact groups Plugins Plugins Homemade Plugins Homemade Plugins

Commands and Plugins A plugin is an executable or script that can be run from the command line and returns an exit code of 0=ok, 1=warning, 2=critical or 3=unknown A command consists of a plugin plus macros and is used to perform the host or service check. define command { command_name check_host-alive command_line $USER1$/check_ping $HOSTADDRESS$ -w ,80% -c ,100% -p 1 } define host { host_name glastlnx19.slac.stanford.edu check_command check_host_alive }

Homemade Plugins

Host and Service Definitions define host { use generic-host; Name of host template to use use generic-host; Name of host template to use host_name glastlnx19.slac.stanford.edu host_name glastlnx19.slac.stanford.edu alias glastlnx19 alias glastlnx19 address address check_command check-host-alive check_command check-host-alive max_check_attempts 10 max_check_attempts 10 check_period 24 x 7 check_period 24 x 7 notification_interval 120 notification_interval 120 notification_period 24 x 7 notification_period 24 x 7 contact_groups core contact_groups core}

define service { use generic-service use generic-service host_name glastlnx19.slac.stanford.edu host_name glastlnx19.slac.stanford.edu service_description Web App Telemetry Trending – tomcat 12 service_description Web App Telemetry Trending – tomcat 12 is_volatile 0 is_volatile 0 check_period 24 x 7 check_period 24 x 7 max_check_attempts 4 max_check_attempts 4 normal_check_interval 5 normal_check_interval 5 retry_check_interval 1 retry_check_interval 1 contact_groups core contact_groups core notification_options w,u,c,r notification_options w,u,c,r notification_interval 960 notification_interval 960 notification_period 24 x 7 notification_period 24 x 7 check_command check_jmx!-uservice:jmx:rmi://jndi/rmi://glast- tomcat12.slac.stanford.edu:8081/jmxrmi!- mCatalina:j2eeType=WebModule,name=//localhost/TelemetryTrending,J2E EApplication=none,J2EEServer=none!-astate!-e1 check_command check_jmx!-uservice:jmx:rmi://jndi/rmi://glast- tomcat12.slac.stanford.edu:8081/jmxrmi!- mCatalina:j2eeType=WebModule,name=//localhost/TelemetryTrending,J2E EApplication=none,J2EEServer=none!-astate!-e1}

Nagios File Structure For Fermi

Monitoring

Nagios Remote Plugin Executor

Chronological Progression Of Service State

Notifications

Nagios Web Interface

Contacts and Contact Groups define contact { contact_name Brian contact_name Brian alias Brian Van Klaveren alias Brian Van Klaveren service-_notification_options w,u,c,r service-_notification_options w,u,c,r service_notification_period 24 x 7 service_notification_period 24 x 7 service_notification_commands notify_by_ service_notification_commands notify_by_ host_notification_commands notify_by_ host_notification_commands notify_by_ define contactgroup { contactgroup_name oracle_load_group contactgroup_name oracle_load_group alias Oracle Load Group alias Oracle Load Group members Brian, Tony members Brian, Tony}

Host and Service Definition define host{ use generic-host ; Name of host template to use host_name glast-astro-db1.slac.stanford.edu host_name glast-astro-db1.slac.stanford.edu alias glast-astro-db1 alias glast-astro-db1 address address check_command check-host-alive check_command check-host-alive max_check_attempts 10 max_check_attempts 10 check_period 24x7 check_period 24x7 notification_interval 120 notification_interval 120 notification_period 24x7 notification_period 24x7 notification_options d,r notification_options d,r contact_groups oracle_load_group contact_groups oracle_load_group } define service{ use generic-service ; Name of service template to use host_name glast-astro-db1.slac.stanford.edu host_name glast-astro-db1.slac.stanford.edu service_description Oracle Astro Pass 7 service_description Oracle Astro Pass 7 is_volatile 0 is_volatile 0 check_period 24x7 check_period 24x7 max_check_attempts 4 max_check_attempts 4 normal_check_interval 5 normal_check_interval 5 retry_check_interval 1 retry_check_interval 1 contact_groups oracle_load_group contact_groups oracle_load_group notification_options w,u,c,r notification_options w,u,c,r notification_interval 1800 notification_interval 1800 notification_period 24x7 notification_period 24x7 check_command check_command }

Plugins Plugins are executables or scripts that can be run from a command line and return an exit code Plugins are executables or scripts that can be run from a command line and return an exit code homemade plugins (aka commands) are built from plugins and macros; Nagios can call external programs using these commands homemade plugins (aka commands) are built from plugins and macros; Nagios can call external programs using these commands

define service { use generic-service; Name of service template use generic-service; Name of service template host_name glastlnx19.slac.stanford.edu host_name glastlnx19.slac.stanford.edu service_description Ping service_description Ping is_volatile 0 is_volatile 0 check_period 24 x 7 check_period 24 x 7 max_check_attempts 4 max_check_attempts 4 normal_check_interval 5 normal_check_interval 5 retry_check_interval 1 retry_check_interval 1 contact_groups core contact_groups core notification_options w,u,c,r notification_options w,u,c,r notification_interval 960 notification_interval 960 notification_period 24 x 7 notification_period 24 x 7 check_command check_ping!100.0,20%!500.0,60% check_command check_ping!100.0,20%!500.0,60%}