Monitoring Netezza database with Nagios Frank Pantaleo

Slides:



Advertisements
Similar presentations
Steve Lewis J.D. Edwards & Company
Advertisements

Performance Testing - Kanwalpreet Singh.
ONE STOP THE TOTAL SERVICE SOLUTION FOR REMOTE DEVICE MANAGMENT.
Cut Costs and Increase Productivity in your IT Organization with Effective Computer and Network Monitoring. Copyright © T3 Software Builders, Inc 2004.
The Premier Software Usage Analysis and Reporting Toolset CELUG Presentation – May 12, 2010 LT-Live : License Tracker’s License Server Monitor.
Week 6: Chapter 6 Agenda Automation of SQL Server tasks using: SQL Server Agent Scheduling Scripting Technologies.
ActiveXperts Network Monitor Monitors servers, workstations and devices for availability Alerts and corrects.
How to Monitor Ingres with Open Source Tools
Multi-DNC Data Collection/Monitoring
1 CHEP 2000, Roberto Barbera Roberto Barbera (*) Grid monitoring with NAGIOS WP3-INFN Meeting, Naples, (*) Work in collaboration with.
Computer Monitoring System for EE Faculty By Yaroslav Ross And Denis Zakrevsky Supervisor: Viktor Kulikov.
Cacti Workshop Tony Roman Agenda What is Cacti? The Origins of Cacti Large Installation Considerations Automation The Current.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
ManageEngine TM Applications Manager 8 Monitoring Custom Applications.
Maintaining and Updating Windows Server 2008
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Setting Up a Sandbox Presented by: Kevin Brunson Chief Technology Officer.
PC Manager Meeting January 25, Today Updates –Next Meeting –Meeting Maker Upgrade –Windows Policy –Training –Licensing –Security –Tool Of The Month.
1 Network Statistic and Monitoring System Wayne State University Division of Computing and Information Technology Information Technology.
Hands-On Microsoft Windows Server 2008
ZENworks for Servers Presenter Name Title Directory-Enabled Management Novell is a leader in Directory-Enabled Workstation management, and we are now.
Passive Monitoring with Nagios Jim Prins
Copyright ®xSpring Pte Ltd, All rights reserved Versions DateVersionDescriptionAuthor May First version. Modified from Enterprise edition.NBL.
Josh Riggs Utilizing Open Source Network Monitoring.
Enterprise PI - How do I manage all of this? Robert Raesemann J Jacksonville, FL.
Section 1: Introducing Group Policy What Is Group Policy? Group Policy Scenarios New Group Policy Features Introduced with Windows Server 2008 and Windows.
Module 7: Fundamentals of Administering Windows Server 2008.
Network Management Tool Amy Auburger. 2 Product Overview Made by Ipswitch Affordable alternative to expensive & complicated Network Management Systems.
1 Accelerated Web Development Course JavaScript and Client side programming Day 2 Rich Roth On The Net
© 2013 Cisco and/or its affiliates. All rights reserved. This document is Cisco Confidential. For Channel Partners only. Do not distribute. C
Time lag between discovering issue and resolving Difficult to find solutions and patches that can help resolve issue Service outages expensive and.
Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005.
INFN-GRID Testbed Monitoring System Roberto Barbera Paolo Lo Re Giuseppe Sava Gennaro Tortone.
Computer Emergency Notification System (CENS)
Network Monitoring System for the UNIX Lab Bradley Kita Capstone Project Mentor: Dr C. David Shaffer Fall 2004/Spring 2005.
The ProactiveWatch Monitoring Service. Are These Problems For You? Your business gets disrupted when your IT environment has issues Your employee and.
Templates, Databases and Frameworks. Databases: DBI Common database interface for perl Provides a functional,
11 Distributed Monitoring for Web Apps Fernando Hönig
What’s New in WatchGuard XCS v9.1 Update 1. WatchGuard XCS v9.1 Update 1  Enhancements that improve ease of use New Dashboard items  Mail Summary >
ASAP RDF SGP RDF 1.2 and 1.3 Transfer of Information
Module 15 Monitoring SQL Server 2008 R2 with Alerts and Notifications.
SQLRX – SQL Server Administration – Tips From the Trenches SQL Server Administration – Tips From the Trenches Troubleshooting Reports of Sudden Slowdowns.
Architecture and Design of Customer Support System using Microsoft.NET technologies Nikolay Pavlov Asen Rahnev.
A Brief Documentation.  Provides basic information about connection, server, and client.
Integrating and Troubleshooting Citrix Access Gateway.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
IT Priorities Minimize CAPEX Maximize employee productivity Grow the business Add new compute resources real- time to support growth Meet compliance requirements.
Master thesis Analysis and implementation of monitoring systems of active network equipment. Scientific advisor: Univ. Prof., Dr. Hab., Pavel TOPALA Master.
MISSION CRITICAL COMPUTING Siebel Database Considerations.
Jorke Odolphi Product Technology Specialist WebCentral Using Microsoft Operations Manager To Monitor And Maintain Your Farm.
Maintaining and Updating Windows Server 2008 Lesson 8.
SQL Advanced Monitoring Using DMV, Extended Events and Service Broker Javier Villegas – DBA | MCP | MCTS.
Queensland University of Technology Nagios – an Open Source monitoring solution and it’s deployment at QUT.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
2008 Taipei, Taiwan An Introduction APRICOT 2008 Network Management Workshop February – Taipei, Taiwan Hervey Allen & Phil.
Managing Microsoft SQL 2000 with MOM MOM Overview Why Monitor SMS 2003 with MOM 2005 The SMS 2003 Management Pack Inside The Management Pack Best.
Unit 3 Computer Systems. What is software? unlike hardware it can’t be physically touched it’s the missing link between the computer hardware and the.
Agenda for Today  DATABASE Definition What is DBMS? Types Of Database Most Popular Primary Database  SQL Definition What is SQL Server? Versions Of SQL.
Ethan Galstad What Is Nagios? What Nagios Is IT Infrastructure Monitoring.
I/Watch™ Weekly Sales Conference Call Presentation (See next slide for dial-in details) Andrew May Technical Product Manager Dax French Product Specialist.
Fundamental of Databases
Smarter Technology for Better Business
Application or server monitoring
World Wide Web policy.
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Maximum Availability Architecture Enterprise Technology Centre.
Robert Szuman – Poznań Supercomputing and Networking Center, Poland
NCAR-Developed Tools Bill Anderson and Marc Genty
The Client/Server Database Environment
How to monitor the $H!T out of Hadoop
Presentation transcript:

Monitoring Netezza database with Nagios Frank Pantaleo

Introduction & Agenda A couple of W’s State of monitoring Netezza Monitoring Netezza with Nagios Future direction

A couple of W’s - Why Why are we monitoring Netezza ? How much $ does your business lose when IT is down ? 7 million each year from IT downtime Gartner (2005) pegs the hourly cost of downtime for computer networks at $42,000 A data center outage by itself can cost an average of $5,600 per minute Outages damage their reputation Now take this and bring it to a Cloud level - For every hour it is not up and running, Amazon.com takes a hit of almost $5 million Allows you to be more proactive Allow upper management to plan for DB growth (includes secondary effects e.g. DR, tape, disk for backup)

A Couple of W’s - What What are we looking for in a monitor ? Universal monitoring Efficient Alert Notifications (also allows your IT staff to tell each other when something is being worked on) Web Dashboard (one stop shopping!) Issue Escalation (separate lists for warning, high) Distributed Monitoring and Scalability (high availability)

A couple of W’s - What What are we looking for in a monitor ? (cont) Reporting (how many times was this service down ?) External Application Integration (Can I enable my current applications to allow for early issue notification) Open source solution

State of Netezza monitoring Monitoring systems available for Netezza Netezza event monitor – comes stock with tool Netezza portal – comes stock with tool Commercial offerings – Brightlight Consulting Observation Deck

State of Netezza monitoring Netezza comes with 34 alerts Alerts actions have limited responses Script execution In Version 7.1 can auto create support ticket Configuration can be done through NPS client or command line interface on Netezza server

State of Netezza monitoring Examples of Netezza 7.1 stock sample alerts Disk Full SPU Full Hardware Failed Hardware needs attention Hardware restarted Hardware service requested Heat threshold exceeded History capture event History load event HwvoltageFaultAuto NPSNoLongerOnline RegenFault RunAwayQuery No custom events allowed

State of Netezza monitoring Netezza Portal Face on glass monitoring Custom queries can be added to the monitor All queries can be seen as numeric or graphic No alerting Tool can also be used for maintaining database objects, users, events, and sessions If you are using LDAP, portal can’t take advantage of it. Once you login to portal though you will be using your DB username/password

Netezza monitoring using Nagios What are we monitoring in Netezza ? Table Locks by non-EDW statements during EDW batch cycle User queries exceeding 1 hour (90% time poorly formed queries) User queries during EDW batch cycle (depends on SLA) Age of backup older than SLA LDAP server available for SSO

Netezza monitoring using Nagios What are we monitoring in Netezza ? (cont) SPU space unbalanced (generally a side effect of poor distribution) State of EDW e.g. loading files, file processing complete Late arrival of files preventing the EDW from meeting SLA’s

Netezza monitoring using Nagios Architecture options with Nagios Sensors live on Nagios monitoring server Sensors live on Database server and are controlled by NRPE. This is what we went with based on customer security rules. Scripting language is Perl. Really could be any language that allows ability to query the database and deal with responses. There are other options such as Bash, Java, Python, and C.

Netezza monitoring using Nagios Architecture options with Nagios (cont) Active – NRPE is a intermediary for running scripts and bringing results back to Nagios. Passive – SNMP is an option but current provided alerts need to be tied into a SNMP agent that reports status. Netezza doesn’t raise SNMP alerts OOB.

Netezza monitoring using Nagios Passive alerts require snmp trap software  Nagios server must be enabled to receive alerts – snmptt.html snmptt.html – configuration.html configuration.html  Once Nagios is enabled Netezza events must be changed to make Nagios aware there is a issue – event-manager-to-generate-snmp-traps event-manager-to-generate-snmp-traps

Netezza monitoring using Nagios Passive alerts architecture

Netezza monitoring using Nagios Active alerts require NRPE to be installed  Checking is done using shell script and Perl  Perl DBI ODBC  Downside is you have to have a exposed user/password. In this case it was against IT policy so I stopped using this option.  If we use this though all agents could live on Nagios server  Perl supplied package from Netezza  Downside is this is equivalent of admin so you can do anything  Upside is no username/password configuration  Agents must live on Database server

Netezza monitoring using Nagios Active Alert architecture

Netezza monitoring using Nagios Active Alert agent writing (interface requirements) MUST set a return code e.g. # 0 OK # 1 WARNING # 2 CRITICAL # 3 UNKNOWN Nagios dashboard displays associated text if (some logic here ) print "Ok\n"; else print "Error please look at tablexyz\n";

Netezza monitoring using Nagios Active alerts - NRPE configuration on Netezza server If using the Perl package commands must run as nz user so /etc/nagios/nrpe.cfg must use the following – nrpe_user=nz – nrpe_group=nz Once a sensor (perl script) is written and tested it must be added to nrpe.cfg file. command[check_nz_longqry]=/export/home/nz/scrip ts/check_nz_longqry.pl Best practice - Request /etc/nagios/nrpe.cfg be open to read/write from nz user

Netezza monitoring using Nagios Active alerts - How does NRPE work on Nagios server ? define command{ command_name check_nrpe command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -t 300 } define service{ use generic-service host_name proddb service_description NZSQL Long query check_command check_nrpe!check_nz_longqry! notifications_enabled 0 }

Netezza monitoring using Nagios Active Alerts - Perl programming using SQL.pm package Invocation use lib "/nz/kit/share/perl"; use nz::SQL; Package can only be used by the nz owner NO username & password my ($KITDIR, $DATADIR); $DATADIR = "/nz/data.1.0"; $KITDIR = "/nz/kit"; nz::SQL::config(KITDIR => $KITDIR, DATADIR => $DATADIR); Best practice - use alarm timers around SQL statements Handy variables after each SQL execution $qresp->{nrows}, ncols, colid, qtype;

Netezza monitoring using Nagios Perl programming using SQL.pm package (continued) Interface example … nz::SQL::query($dbname, $sql). Unlike DBI the database must be called out every time you query. Resultsets are not active in database (unlike DBI) they are in perl memory Resultset traversal is done using perl foreach e.g. foreach my $row { ($blocker_username,$blocker_sql,$blockee_username,$blockee_sql) Best practice: If you can avoid dealing with resultset and deal only with counts e.g (nrows). Most efficient use especially when dealing with a Nagios alert check that is going to occur several times a day.

Future direction Data graphing Expand areas that we are monitoring for in Netezza Integrate into a product offering (Observation Deck) from Brightlight that collects NZHIST for customer Predict when we are going to outgrow our current processing and database needs

Conclusion  Key takeaways are  Using Nagios can help your company have an extensible event monitor. Understanding Nagios architecture is important to a stable and working monitoring setup. Once you understand architecture setup writing an agent is trivial. If you can write SQL to detect an event then you can write an agent.  Other Reading materials or learning devices on this subject that you would like to share  URL’s provided in document have the recipe for how to setup Nagios, SNMP traps, and Netezza. Please visit those sites to get that info.

Questions? Any questions? Thanks!

Reference in-CentOS-and-Red-Hat/details 01.ibm.com/support/knowledgecenter/SSULQD_7.1.0/com.ibm.nz.portal.doc /c_portal_welcome.html management/how-to-quantify-downtime.html

The End Frank Pantaleo