1 Automating Monitoring with Puppet Chris Mague Moovweb May 23, 2012.

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

Nagios: An introduction and Brief Tutorial
About Me CTO, Individual Digital, Inc. (Startup) Author of ext/tidy, PHP 5 Unleashed, Zend Ent. PHP Patterns
WEB AND WIRELESS AUTOMATION connecting people and processes InduSoft Web Solution Welcome.
© 2012 Entrinsik, Inc. Informer Administration Exploring the system menu and functions PRESENTER: Jason Vorenkamp| Informer Software Engineer| March 2012.
Welcome to the Award Winning Easiest to Use & Most Advanced View, Manage, and Control Security, Access Control, Video, Energy & Lighting Systems, & Critical.
Visualization of Monitoring Data at the NASA Advanced Supercomputing Facility Janice Singh
Bangkok, Thailand An Introduction intERLab at AIT Network Management Workshop March – Bangkok, Thailand Hervey Allen & Phil Regnauld.
Week 6: Chapter 6 Agenda Automation of SQL Server tasks using: SQL Server Agent Scheduling Scripting Technologies.
Cisco Confidential 1 © 2010 Cisco and/or its affiliates. All rights reserved. Next Generation Monitoring in Cisco Security Cloud Leon De Jager and Nitin.
Microsoft Patterns and Practices SharePoint Guidance Robert L. Bogue MS MVP, MCSE, MCSA: Security
Document management New: SharePoint and SharePoint Online support Excel Add-in New: Office 365 support via export, filtering APIs, dimension descriptions,
BizTalk Deployment using Visual Studio Release Management
IIS 7: The Next Generation Web Application Server Platform Michael Volodarsky Program Manager Web Platform and Tools Team Microsoft Corporation.
MultiJob PanDA Pilot Oleynik Danila 28/05/2015. Overview Initial PanDA pilot concept & HPC Motivation PanDA Pilot workflow at nutshell MultiJob Pilot.
1 of 5 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
QA Automation Solution. Solution Architecture Test Management tool CI Tool Automation framework Testing Project BDD Tool Text of test to Testing Project.
Struts 2.0 an Overview ( )
Getting to Push Button Deploys Moovweb January 19, 2012.
Using Nagios XI To Empower Your Developers To Own Their Own Checks Nick Winn Twitter: technick Nagios Forums: technick.
DONE-10: Adminserver Survival Tips Brian Bowman Product Manager, Data Management Group.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Session 10 Windows Platform Eng. Dina Alkhoudari.
Your university or experiment logo here Nagios: An introduction and Brief Tutorial Chris Brew SciTech/PPD.
What’s new in Stack 3.2 Michael Youngstrom. Disclaimer This IS a presentation – So sit back and relax Please ask questions.
Puppetize It! An Introduction to Puppet Mike Seda CEO, Seda Systems, Inc.
Advanced Features of Nagios XI Sam Lansing -
111 EMC CONFIDENTIAL—INTERNAL USE ONLY NMC -- NW Administration NMC Team NetWorker 7.3 TOI July 28, 2005.
Empowering the User Custom Web Reporting M. Keener & R. Kolatalo | Thursday, March 1, 2012.
Microsoft SharePoint Server 2010 for the Microsoft ASP.NET Developer Yaroslav Pentsarskyy
The Art and Zen of Managing Nagios with Puppet Michael Merideth - VictorOps
CERN IT Department CH-1211 Genève 23 Switzerland t Experiences running a production Puppet Ben Jones HEPiX Bologna Spring.
11 Distributed Monitoring for Web Apps Fernando Hönig
Nagios Network Monitoring Andrew Hamilton TJ IT Technician.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Microsoft Management Seminar Series SMS 2003 Change Management.
WHAT IS PHP FRAMEWORK? Set of Classes and Functions(Methods) Design for the development of web applications Provide basic structure Rapid application development(RAD)
(ITI310) By Eng. BASSEM ALSAID SESSIONS 10: Internet Information Services (IIS)
Data Analysis w ith PROOF, PQ2, Condor Data Analysis w ith PROOF, PQ2, Condor Neng Xu, Wen Guan, Sau Lan Wu University of Wisconsin-Madison 30-October-09.
Session 11: Cookies, Sessions ans Security iNET Academy Open Source Web Development.
Overview of Basic 3D Experience (Enovia V6) Concepts
Monitoring Alfresco with Nagios/Icinga Toni de la Fuente Alfresco Senior Solutions Engineer Blog: blyx.com
Michael Mast Senior Architect Applications Technology Oracle Corporation.
Alfresco Monitoring with OpenSource Tools Miguel Rodriguez Technical Account Manager.
Overview of cluster management tools Marco Mambelli – August OSG Summer Workshop TTU - Lubbock, TX THE UNIVERSITY OF CHICAGO.
Automating operational procedures with Daniel Fernández Rodríguez - Akos Hencz -
2008 Taipei, Taiwan An Introduction APRICOT 2008 Network Management Workshop February – Taipei, Taiwan Hervey Allen & Phil.
USER ADOPTION TRACKER. FEATURES Monitor User Adoption of Dynamics CRM by tracking daily usage Support for tracking on OOB & Custom entities Configure.
Arizona SharePoint Professionals Group.
Nagios - introduction Dhruba Raj Bhandari ( CCNA ) p Additions by Phil Regnauld.
Ethan Galstad What Is Nagios? What Nagios Is IT Infrastructure Monitoring.
1 Policy Based Systems Management with Puppet Sean Dague
Nagios FTW TriLUG 8/10/06 Presented by: Jason Faulkner Ian Kilgore.
'08 Rabat An Introduction AfNOG 2008 Network Management Workshop June 1-2 – Rabat, Morocco Hervey Allen & Phil Regnauld.
Metrics data published Via different methods Monitoring Server
Network Management Workshop March – Bangkok, Thailand
SQL Database Management
ENOG13 Saint Petersburg Diego Luis Neto SW NL-ix.
UKNOF37 Manchester Diego Luis Neto SW NL-ix.
@ Bucharest DevOps Hacker Meetup
What is nagios? Version 2 8/ M.A.Newhall.
GWE Core Grid Wizard Enterprise (
Monitoring with Nagios
Objects Mike Weber
Get Amazon AWS-DevOps-Engineer-Professional Exam Real Questions - Amazon AWS-DevOps-Engineer-Professional Dumps Realexamdumps.com
Adding Objects To Nagios 3.0
Arrested by the CAP Handling Data in Distributed Systems
Exploring the Power of EPDM Tasks - Working with and Developing Tasks in EPDM By: Marc Young XLM Solutions
Agile testing for web API with Postman
Presentation transcript:

1 Automating Monitoring with Puppet Chris Mague Moovweb May 23, 2012

2 Where I Want to be

3 What I'll Settle For.....

4 Requirements Rock solid stability Automated node addition (discovery) Scales horizontally Service dependency models Easy to write plugins Promotes sane workflows Unified front end view Flexible configuration

5 Tool Stack

6 What???? but #monitoringsucks and #ihatenagios How could you?

7 In defense of Nagios Been around since 1996 Has Service dependencies Easy to write plugins Easy-ish to troubleshoot ROCK SOLID

8 Valid attacks on Nagios No automated discovery It's complicated to setup Text files – really? Front end won't win any beauty contests Development is slow Stats collection is a PITA

9 Solutions Use Icinga! Use Puppet to auto configure Stats – leave it to graphite. It's really good at that Big boys and girls learn their tools

10 Icinga Fork of Nagios Configurations are compatible More solid architecture ( core, API, Web, IDODB ) Nice front end, nice mobile front end Can use NRPE

11 High Level View

12 Configure Icinga Servers using Puppet Standard Types

13 Things to configure with Standard Types icinga.cfg (file) => icinga main config file Apache icinga.conf (file) => http access to each server cgiauth.cfg (file) => cgi access cgi.cfg (file) => options, users templates.cfg (file) got lazy => use for basic classes idomod.cfg (template) => template for hostname to DB

14 Configure Icinga using Nagios Types

15 Puppet Nagios Types nagios_command nagios_contact nagios_contactgroup nagios_host nagios_hostdependency nagios_hostescalation nagios_hostextinfo nagios_hostgroup nagios_service nagios_servicedependency nagios_serviceescalation nagios_serviceextinfo nagios_servicegroup nagios_timeperiod

16 Configuring Hosts

17 Overview

18 Detailed Overview

19 Store Configs Store puppet info in a DB Retrieve information from DB Share info across nodes Use thin_storeconfigs Set up on puppet master

20 Exporting Nagios_host Resources Export = Save to DB Use facter for dynamic data PRO TIP: use ENC PRO TIP: use targets PRO TIP: hostgroups PRO TIP: use tags

21 PRO TIP: Use your ENC

22 PRO TIP: use targets Use cfg_dir in icinga.cfg Create a unique file per host or service Addition and removal are now super easy Also default dirs are in a horrible place /etc/nagios

23 PRO TIP: hostgroups Add machines to a hostgroup Add services to a hostgroup New machines inherit all of the services associated with a hostgroup

24 PRO TIP: use tags Tags allow you to filter resources so that you only realize those resources that you need

25 Configuring Services/Commands

26 Icinga Services OR 'Stuff I want to monitor' Associate with a hostgroup Use a target

27 Icinga Commands OR 'What actually gets run' Use Macros to set paths in resource.cfg

28 Dependencies

29 PRO TIP: Dependencies Unreliable services Cut down on the number of alerts Tell me what's really wrong Route alerts accordingly

30 Nagios_servicedependencies

31 NRPE

32 NRPE Runs on client Secured via SSL Has ACLs Runs as nobody Can run commands Useful for other things...

33 Configuring NRPE

34 NRPE Checks

35 Plugins

36 exchange.nagios.org

37 Writing Plugins Write in any language Output 1 line to stdout NRPE/Icinga/Nagios all use exit codes to determine status Run by hand to check

38 Workflows

39 Watching Monitoring

40 Scheduling Downtime

41 Filtering

42 Alerting

43 #monitoringisawesome REMOVE unreliable checks Just MONITOR – don't bolt on - especially stats TIER your monitoring Use timeperiods for sanity Delegate responses Use dependencies to pin down problems quickly Work smart

44

45 Resources Icinga Puppet NRPE IRC ##infra-talk, #icinga, #puppet #gaijin (freenode), Thanks Yvonne Kong, Michael Catlin, Juan Ortega, Anthony Kong, Puppet Labs, Icinga Team