2002 Called; They want their rrdtool shell scripts back

Slides:



Advertisements
Similar presentations
How We Manage SaaS Infrastructure Knowledge Track
Advertisements

26/05/2004HEPIX, Edinburgh, May Lemon Web Monitoring Miroslav Šiket CERN IT/FIO
DynaTrace Platform.
Real Time Power and Performance Monitoring of Supercomputer Application Shankar Prajapati BS in Computer Science Claflin University
Cacti Workshop Tony Roman Agenda What is Cacti? The Origins of Cacti Large Installation Considerations Automation The Current.
Modern Application Lifecycle Pla n Develop + Test Monitor + Learn Release.
Zebra Technologies: Link-OS Flexible. Smart. Scalable.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
How WebMD Maintains Operational Flexibility with NoSQL Rajeev Borborah, Sr. Director, Engineering Matt Wilson – Director, Production Engineering – Consumer.
BMC Software confidential. BMC Performance Manager Will Brown.
Nagios and Mod-Gearman In a Large-Scale Environment Jason Cook 8/28/2012.
Management Suite for Dell Servers The Power of Control! Kevin Winert, Dell OpenManage Product Marketing Bryan Rhodes, Dell Alliance Product Manager, Altiris.
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG MADES - A Multi-Layered, Adaptive, Distributed Event Store Tilmann Rabl Mohammad Sadoghi Kaiwen Zhang Hans-Arno.
An Overview of Using Computers
Inventory:OCSNG + GLPI Monitoring: Zenoss 3
Bright Cluster Manager Advanced cluster management made easy Dr Matthijs van Leeuwen CEO Bright Computing Mark Corcoran Director of Sales Bright Computing.
Technische Universität München Application Performance Monitoring of a scalable Java web-application in a cloud infrastructure Final Presentation August.
Graphing and statistics with Cacti AfNOG 11, Kigali/Rwanda.
Distributed monitoring system. Why Monitor? Solve them! Identify Problems Ensure conduct Requirements Manage many computers Spot trends in the system.
Ch 10 Monitoring NCNU CSIE 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia.
® BayTech The Power to Control Global Power Management.
1 Chapter Overview Defining Operators Creating Jobs Configuring Alerts Creating a Database Maintenance Plan Creating Multiserver Jobs.
DAM-Alarming Data Analytics from Monitoring, for alarming Summer Student Project 2015 A. Martin, C. Cristovao, G. Domenico thanks to Luca Magnoni IT-SDC-MI.
Page 1 Printing & Terminal Services Lecture 8 Hassan Shuja 11/16/2004.
Distributed Time Series Database
Performance Testing Test Complete. Performance testing and its sub categories Performance testing is performed, to determine how fast some aspect of a.
verifone HQtm Estate Management Solution
Nagios Fusion 2012 Mike Guthrie Twitter: mguthrie88 Projects:
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
CLOUDWATCH VERY SIRIUS MONITORING. SENDING METRICS Metrics can be sent in simply through additional handlers to common tools No handler yet for collectd.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
MISSION CRITICAL PLATFORMDEVELOPER & IT PRODUCTIVITYPERVASIVE INSIGHT Microsoft Confidential—Preliminary Information Subject to Change SQL Server.
2008 Taipei, Taiwan An Introduction APRICOT 2008 Network Management Workshop February – Taipei, Taiwan Hervey Allen & Phil.
Metrics at Mantas Klasavičius.
Distributed Monitoring with Nagios: Past, Present, Future Mike Guthrie
OpenNMS Case Studies SCALE 5x 2007 Feb 10. Agenda ● What the heck is OpenNMS? ● What can it do? ● Case Studies – New Edge Networks – Hospitality Services.
I/Watch™ Weekly Sales Conference Call Presentation (See next slide for dial-in details) Andrew May Technical Product Manager Dax French Product Specialist.
Monitoring Asterisk with Icinga2
Nagios Performance Tuning Nick Scott
Metrics data published Via different methods Monitoring Server
Network Management Workshop March – Bangkok, Thailand
Analytics and Value Creation
Network measurements with InfluxDB
Introduction ITEC 420.
Application or server monitoring
System Monitoring with Lemon
SURF Quality Guard (SURFopzichter)
Machine Learning Turbo-Charges the Ops Portion of DevOps
Use of Nagios in Central European ROC
Database System Concepts and Architecture
The Impact of Cloud Technology on Multi-Unit Operators
Smart Ethernet I/O P2P and GCL Introduction
Principles of Information Technology
Gather Valuable Customer Data
VirtualGL.
Using Grafana to show Postgres Statistics
How SCADA Systems Work?.
NCAR-Developed Tools Bill Anderson and Marc Genty
Monitoring HTCondor with Ganglia
Network Monitoring System
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Denys FOSDEM 2018 What's new in Graphite 1.1 Denys FOSDEM 2018.
Monitoring for large infrastructure
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Datacastle RED Delivers a Proven, Enterprise-Class Endpoint Data Protection Solution that Is Scalable to Millions of Devices on the Microsoft Azure Platform.
PROFITABLE INVENTORY CONTROL SYSTEMS
The ELK stack - get to know logs
5 Azure Services Every .NET Developer Needs to Know
Ready Pre-day Azure Monitoring Workshop
Presentation transcript:

2002 Called; They want their rrdtool shell scripts back Dave Josephsen dave@dbg.com

A Brief history of time-series data visualization architectectures 2002 Called; They want their rrdtool shell scripts back A Brief history of time-series data visualization architectectures Dave Josephsen dave@dbg.com

A Tale of 3 Sysadmin A Brief history of time-series data visualization 2002 Called; They want their rrdtool shell scripts back A Brief history of time-series data visualization Architectures A Tale of 3 Sysadmin Dave Josephsen dave@dbg.com

Jer, Per, and quitter (aka Dave) 2012

Jer, traditional needs for Fortune-500 Suitcorp >5000 hosts >20,000 services 1, 9-story office building Plenty of Budget Beefy Hardware 1.5m/1000 hosts 2012

Nagios + NG + Drraw (ho-hum) 2012

Per, near real-time data from Lots of hosts Singularity.gov 80,000 hosts in 80 clusters No budget Mad Scientists No measurable impact allowed 15 second polling interval (max) CPU, Mem, Disk, Net Needs to alert on performance thresholds 2012

Enter Ganglia 2012

That's all fine but what about Nagios? Awesome Nagios Integration Easily send data from Nagios to Ganglia with gmetric Monitor server metrics stored in Ganglia with Nagios with a series of included Nagios plug-ins Check host heartbeat Check single metric on a specific host Check multiple metrics on a specific host Check multiple metrics on a set of hosts Verify a single metric is the same on a set of hosts Display Ganglia graphs in Nagios via the Gweb URL interface Monitor Ganglia with Nagios (duh) 2012

Not just for mad scientists with supercomputers Ganglia is a great fit if You want to offload Performance data processing. You're worried about scale You want a super-lightweight metric gathering agent You need near-real time data You want a really great rrdtool FE Drag scaling, trend-lines, holt-winters forecasting, time-shifts Lots more 2012

Quitter.. er.. Dave: Graph everything always Massive Ginormic DevOps “paradise” (nightmare) Visualize datapoints on irregular intervals Code promotions Function calls LOTS of metrics (millions) Centralized time-series visualization for LOTS of very different data sources Nagios Application instrumentation Sales... thingies 2012

Enter Graphite Life after RRDTool Carbon Whisper Trivial, remote, updates Smart buffering/cacheing Horizontal scalability Whisper Automatic provisioning Interval-agnosticism Type agnosticism Graphite Functions! Typeglobs! Graphic Stolen from: http://www.aosabook.org/en/graphite.html 2012

Not just for billion dollar mega-giants Graphite works great if You want to combine data from multiple monitoring systems Nagios, Ganglia, Collectd etc.. You want to assimilate data from other groups or business units Dev, Sales, etc.. You want really flexible centralized visualization that scales You want to empower non-ops groups to explore their own data 2012

Functions! Rate is the derivative of the counter: Say you have counter data: &target=router1.bytes&target=router2.bytes &target=derive(router1.bytes) OR: &target=router[12].bytes But actually, the raw counter data is kind of interesting if We visualize it correctly: &target=router1.bytes&target=secondYAxis(router2.bytes) 2012

Moar functions! &target=user.registrations &target=summarize(user.registrations,”1h”) &target=summarize(user.registrations,”1h”) &target=summarize(user.registrations,”1h”)&target=threshold(400,”goal) &target=summarize(user.registrations,”1h”)&target=timeShift(summarize(user.registrations,”1h”),”30d”)&target=threshold(400,”goal) 2012

Nagios World Conference OK BYE! http://ganglia.sourceforge.net https://launchpad.net/graphite http://www.aosabook.org/en/graphite.html (and speaking of “buy”...) 2011 Nagios World Conference