Metrics data published Via different methods Monitoring Server

Presentation transcript:

Pattern-1 (passive?): An agent on the monitored machine publishes metrics data to the monitoring server via different methods.

Pattern-2 (active?): The monitoring server requests commands to be run on the monitored machine, which return the status of the machine. The checks are a bunch of scripts in any language. The agent executes those scripts. Results go to the central server or to other scripts that take action.

Pattern-3 (hybrid): The monitoring server is optional. It sends config, commands, or probes to the agent, and the agent returns metrics data.

Methods for publishing metrics data:
NRPE (Nagios Remote Plugin Executor): A Nagios-centric protocol to collect remote metrics (active checks).
NSCA (Nagios Service Check Acceptor): Another Nagios-centric protocol for submitting results (passive checks).
NRDP: A Nagios replacement for NSCA.
check_mk: A protocol used by the check_mk monitoring system.
Syslog: A protocol primarily designed for submitting log records to central servers.
Graphite: A graphing solution which allows you to do real-time graphing.
SMTP: Used for sending email (this is more of a toy currently).
CollectD: A protocol for collecting information.
REST: A web-based, easily firewalled protocol.

Collectd: internal mechanisms for checking the status of hosts and services.
Icinga, Nagios: external programs (called plugins) to do all the dirty work.

Plugins are compiled executables or scripts (Perl scripts, shell scripts, etc.) that can be run from a command line to check the status of a host or service. Icinga uses the results from plugins to determine the current status of hosts and services. Plugins act as an abstraction layer between the monitoring logic present in the Icinga daemon and the actual services and hosts that are being monitored.
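The plugin description above can be illustrated with a minimal sketch of how a daemon might run a plugin and interpret its result. It relies on the usual Nagios/Icinga convention that a plugin's exit code carries the state (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and its first output line carries the message. The names run_plugin and DEMO_PLUGIN are hypothetical and exist only for this illustration; they are not part of Nagios or Icinga.

```python
import subprocess
import sys

# Conventional Nagios/Icinga plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_plugin(argv, timeout=10.0):
    """Run one plugin command line and return (state, first output line).

    Hypothetical helper: a real daemon also handles scheduling,
    retries, and performance data, none of which is shown here.
    """
    try:
        proc = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # A plugin that cannot be started or does not finish is UNKNOWN.
        return "UNKNOWN", str(exc)
    state = STATES.get(proc.returncode, "UNKNOWN")
    lines = proc.stdout.strip().splitlines()
    return state, (lines[0] if lines else "")


# Stand-in for a real plugin script (assumption, for demonstration only):
# it prints one status line and exits with the WARNING code.
DEMO_PLUGIN = [
    sys.executable,
    "-c",
    "import sys; print('LOAD WARNING - load1=4.2'); sys.exit(1)",
]

if __name__ == "__main__":
    print(run_plugin(DEMO_PLUGIN))
```

Because the daemon only looks at the exit code and the output text, the plugin itself can be written in any language, which is the abstraction layer the slide refers to.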

Pattern-1 vs Pattern-2

Checking mechanism
Pattern-1: Internal mechanism for checking the status of the hosts/servers.
Pattern-2: Not mandatory to have such a built-in mechanism; can rely on an external mechanism.

Trigger
Pattern-1: No trigger from the server to begin monitoring.
Pattern-2: Trigger required from the monitoring server.

Processing
Pattern-1: Apart from simple thresholds and filters, complex operations are carried out at the monitoring server.
Pattern-2: Complex modifications can happen at the agent before sending the data to the server.

Monitoring style
Pattern-1: Passive?
Pattern-2: Active monitoring of systems.
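The difference in who triggers a measurement can be sketched in a few lines. This is an in-memory illustration only, with no networking; the class and function names (Server, PushAgent, CommandAgent, poll) are hypothetical and do not correspond to any of the tools discussed.

```python
class Server:
    """Stand-in for the monitoring server: it just stores what it receives."""

    def __init__(self):
        self.received = []

    def receive(self, host, metrics):
        self.received.append((host, metrics))


class PushAgent:
    """Pattern-1 sketch: the agent collects on its own and pushes results."""

    def __init__(self, host, collect, server, min_value=0.0):
        self.host = host
        self.collect = collect      # callable returning {metric name: value}
        self.server = server
        self.min_value = min_value  # simple threshold filter applied at the agent

    def tick(self):
        # Called from the agent's own timer; no trigger comes from the server.
        metrics = {k: v for k, v in self.collect().items() if v >= self.min_value}
        if metrics:
            self.server.receive(self.host, metrics)


class CommandAgent:
    """Pattern-2 sketch: the agent only runs checks when asked to."""

    def __init__(self, host, checks):
        self.host = host
        self.checks = checks        # {check name: callable returning a value}

    def execute(self, name):
        return self.checks[name]()


def poll(server, agent, check_name):
    # Pattern-2: the server triggers the check and records the answer.
    server.receive(agent.host, {check_name: agent.execute(check_name)})


if __name__ == "__main__":
    server = Server()
    PushAgent("vm1", lambda: {"cpu": 0.9, "idle": 0.0}, server, min_value=0.5).tick()
    poll(server, CommandAgent("vm2", {"disk_free_gb": lambda: 12}), "disk_free_gb")
    print(server.received)
```

A Pattern-3 (hybrid) agent would combine both: it pushes on its own schedule and also answers commands when a server is present.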

Agent and pattern
Agent / Pattern: Collectd 1 Monasca Snap 3* Node-exporter Telegraf Icinga 2 Sensu 3 Diamond Riemann Beats Ceilometer 1|2 Munin Nagios Centreon NSClient++ OpenNMS

Which is the most common Pattern? No clear winner!

Which agent supports the maximum number of metrics? Nagios. How? Through its number of plugins: 4000+. https://www.slideshare.net/superdupersheep/stop-using-nagios-so-it-can-die-peacefully/

Which agent supports dynamic configuration? All Pattern-2 and Pattern-3 agents. Custom solutions for Pattern-1.

Have any of the agents been used in large-scale real-world deployments? If so, please provide details on the performance.
Collectd: Many blogs by those who have deployed it. No performance data, or rather, minimal. Part of a complete solution.
Riemann, Telegraf: Same problem as Collectd.
Nagios: Many deployments. https://www.usenix.org/legacy/event/lisa07/tech/gaspar_talk.pdf There are performance issues. http://blogs.gartner.com/jonah-kowall/2013/02/22/got-nagios-get-rid-of-it/

Interoperability: Which agent is 'most interoperable'? (Works with the maximum number of 'servers', i.e. collection nodes.) Collectd, Monasca, Prometheus, Nagios, Sensu, Riemann, Telegraf

Which agent provides maximum 'freedom' w.r.t. licenses (core agent + plugins)? Sensu, Telegraf, Riemann, Diamond. The MIT license provides the maximum freedom.

Best for time series databases: direct compatibility, and publishes to the maximum number of databases. Collectd. Next: Snap.
InfluxDB, Elasticsearch, Cassandra, OpenTSDB, Prometheus, Graphite, Riak. Druid: missing! https://github.com/intelsdi-x/kubesnap/issues/85
Graphite (store and graph metrics): Collection, Forwarding, Visualization, Monitoring, Storage-Backends, etc. Collectd, Riemann, Sensu, Nagios, Icinga, Diamond, Graphite
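As one concrete case of an agent publishing to a time series backend, the sketch below formats metrics for Graphite's plaintext protocol, where each line is "metric.path value timestamp" and Carbon listens on TCP port 2003 by default. The function names graphite_line and send_to_graphite and the metric path are hypothetical, and the host and port are assumptions about a local test setup.

```python
import socket
import time


def graphite_line(path, value, timestamp=None):
    """Format one metric as a Graphite plaintext line: 'path value timestamp'."""
    ts = int(time.time() if timestamp is None else timestamp)
    return f"{path} {value} {ts}\n"


def send_to_graphite(lines, host="localhost", port=2003):
    """Send already formatted lines to a Carbon plaintext listener.

    host and port are assumptions (2003 is Carbon's usual plaintext port);
    this call needs a reachable Graphite instance.
    """
    payload = "".join(lines).encode("ascii")
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(payload)


if __name__ == "__main__":
    # Only formats the line; nothing is sent over the network here.
    print(graphite_line("servers.vm1.cpu.load", 0.42, 1500000000), end="")
```

Agents such as Collectd provide this kind of output through their own write plugins; the sketch only shows what ends up on the wire.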

Which agent is part of a solution that has analytics, alerts, and graphing? Almost all!

Which agent has the fewest dependencies (libraries, OS/kernel versions, etc.)? Mostly similar, but Collectd stood out. Its adaptability to the system configuration is also better. Python-based tools.

Which solution has “plugins” for “processing” the metrics, and what type?
Solution / Threshold / Statistics / Anomaly Detection / Tagging: Collectd Y N SNAP N? Telegraf Mostly realized by Kapacitor. Sensu Monasca Node Exporter Ceilometer Icinga Nagios & Variants N* Diamond Riemann Beats Handled by the other Elastic solutions Munin

Gaps: Are there any metrics/events that are not supported by any of the agents and that are relevant to NFV?
Metrics: NO.***
Single solution: NO.
Events: NO.***
Single solution: YES (no single solution addresses all the events).
*** When the collection point is the guest, there are challenges with those metrics that rely on the virtualization platform to expose access to the hardware (ex: PMU).
Sources and references considered: Barometer project (https://wiki.opnfv.org/display/fastpath/Metrics+and+Events+Requirements+High+Level+List), OpenStack monitoring projects, VNFM monitoring solutions, vendor blogs.

Which agents support metrics over a REST API? Collectd, Ceilometer, SNAP, Monasca*, Nagios*, Icinga*, Riemann*

Anybody supporting Redfish APIs? None. Almost all vendors (Dell, HP, SuperMicro, etc.) already provide support (via built-in controllers, agent-free) for both custom and generic clients, so there is no need for an additional entity in between. Maybe we can have an entity that gets the metrics via the Redfish REST APIs and shares them over the ‘common channel’ (message queue, TCP socket, etc.).
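The bridging entity suggested above could look roughly like the following sketch. It assumes the controller exposes the Redfish Thermal resource at /redfish/v1/Chassis/{id}/Thermal with a Temperatures array whose entries carry Name and ReadingCelsius; actual paths, chassis identifiers, and authentication vary by vendor and Redfish version. An in-process queue stands in for the 'common channel', and all function names are hypothetical.

```python
import json
import queue
import urllib.request


def fetch_json(url):
    """Plain HTTP GET returning parsed JSON.

    Real controllers need authentication and TLS settings, omitted here.
    """
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)


def redfish_temperatures(base_url, chassis_id, fetch=fetch_json):
    """Read temperature sensors from the (assumed) Thermal resource."""
    doc = fetch(f"{base_url}/redfish/v1/Chassis/{chassis_id}/Thermal")
    return {
        t.get("Name", "unknown"): t["ReadingCelsius"]
        for t in doc.get("Temperatures", [])
        if t.get("ReadingCelsius") is not None  # skip sensors without a reading
    }


def publish(channel, host, metrics):
    """Put one JSON message per metric on the shared channel."""
    for name, value in metrics.items():
        channel.put(json.dumps({"host": host, "metric": name, "value": value}))


if __name__ == "__main__":
    # Canned response used instead of a live controller (illustrative data).
    def fake_fetch(url):
        return {"Temperatures": [{"Name": "CPU1 Temp", "ReadingCelsius": 41}]}

    channel = queue.Queue()
    publish(channel, "bmc1", redfish_temperatures("https://bmc1", "1", fetch=fake_fetch))
    print(channel.get())
```

Replacing queue.Queue with a message-queue client or a TCP socket would give the shared channel described on the slide; the fetch parameter keeps the sketch testable without hardware.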