Distributed monitoring system. Why Monitor? Solve them! Identify Problems Ensure conduct Requirements Manage many computers Spot trends in the system.

Slides:



Advertisements
Similar presentations
NAGIOS AND CACTI NETWORK MANAGEMENT AND MONITORING SYSTEMS.
Advertisements

26/05/2004HEPIX, Edinburgh, May Lemon Web Monitoring Miroslav Šiket CERN IT/FIO
Performance Testing - Kanwalpreet Singh.
Real Time Power and Performance Monitoring of Supercomputer Application Shankar Prajapati BS in Computer Science Claflin University
1 Dynamic DNS. 2 Module - Dynamic DNS ♦ Overview The domain names and IP addresses of hosts and the devices may change for many reasons. This module focuses.
ActiveXperts Network Monitor Monitors servers, workstations and devices for availability Alerts and corrects.
ActiveXperts Network Monitor Monitors servers, workstations and devices for availability Alerts and corrects.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
1 CHEP 2000, Roberto Barbera Roberto Barbera (*) Grid monitoring with NAGIOS WP3-INFN Meeting, Naples, (*) Work in collaboration with.
PlanetLab Operating System support* *a work in progress.
TA : Eng.Hala O. Abu Radi.. Nslookup Command SYNOPSIS nslookup [-option... ] [host-to-find | -[server ] ] DESCRIPTION Nslookup is a program to query Internet.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 8: Implementing and Managing Printers.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Pro Exchange SPAM Filter An Exchange 2000 based spam filtering solution.
VMware vCenter Server Module 4.
Automating Linux Installations at CERN G. Cancio, L. Cons, P. Defert, M. Olive, I. Reguero, C. Rossi IT/PDP, CERN presented by G. Cancio.
Understanding and Managing WebSphere V5
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW Understand the difference between service.
11 MAINTAINING THE OPERATING SYSTEM Chapter 5. Chapter 5: MAINTAINING THE OPERATING SYSTEM2 CHAPTER OVERVIEW  Understand the difference between service.
Linux Operations and Administration
Course 6421A Module 7: Installing, Configuring, and Troubleshooting the Network Policy Server Role Service Presentation: 60 minutes Lab: 60 minutes Module.
Overview Print and Document Services Print Management console Printer properties Troubleshooting.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Acceleratio Ltd. is a software development company based in Zagreb, Croatia, founded in We create innovative software solutions for SharePoint,
1 Network Statistic and Monitoring System Wayne State University Division of Computing and Information Technology Information Technology.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Introduction to HP LoadRunner Getting Familiar with LoadRunner >>>>>>>>>>>>>>>>>>>>>>

Monitoring Your Data Center Using Apache and Ganglia Brad Nicholes Sr. Software Engineer, Novell Member Apache Software Foundation
Bright Cluster Manager Advanced cluster management made easy Dr Matthijs van Leeuwen CEO Bright Computing Mark Corcoran Director of Sales Bright Computing.
Module 7: Fundamentals of Administering Windows Server 2008.
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
INFN-GRID Testbed Monitoring System Roberto Barbera Paolo Lo Re Giuseppe Sava Gennaro Tortone.
Computer Emergency Notification System (CENS)
INDIANAUNIVERSITYINDIANAUNIVERSITY Grid Monitoring from a GOC perspective John Hicks HPCC Engineer Indiana University October 27, 2002 Internet2 Fall Members.
Optimizer Deployment Centralized Database module on Optimizer hub server Each monitored server has an instance of optimizer installed.
Graphing and statistics with Cacti AfNOG 11, Kigali/Rwanda.
Kurt Mueller San Diego Supercomputer Center NPACI HotPage Updates.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Ch 10 Monitoring NCNU CSIE 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia.
Server Performance, Scaling, Reliability and Configuration Norman White.
Software Architecture in Practice Practical Exercise in Performance Engineering.
Lemon Monitoring Miroslav Siket, German Cancio, David Front, Maciej Stepniewski CERN-IT/FIO-FS LCG Operations Workshop Bologna, May 2005.
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
General rules 1. Rule: 2. Rule: 3. Rule: 10. Rule: Ask questions ……………………. 11. Rule: I do not know your skill. If I tell you things you know, please stop.
Information Services Andrew Brown Jon Ludwig Elvis Montero grid:seminar1:lectures:seminar-grid-1-information-services.ppt.
Linux Operations and Administration
ECHO A System Monitoring and Management Tool Yitao Duan and Dawey Huang.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Operating Environment. Installation and Upgrade Options Solaris suninstall program Solaris Web Start Installation Custom Jumpstart procedure Standard.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
APACHE INSTALL Linux: CentOS 6.5 WHAT IS APACHE Apache is an open source application which is governed by the GNU licensing for use and alteration of.
Domain Name System: DNS To identify an entity, TCP/IP protocols use the IP address, which uniquely identifies the Connection of a host to the Internet.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
BY: SALMAN 1.
BY: SALMAN.
System Monitoring with Lemon
INFNGRID Monitoring Group report
Network Load Balancing
SUBMITTED BY: NAIMISHYA ATRI(7TH SEM) IT BRANCH
Monitoring HTCondor with Ganglia
Virtualization in the gLite Grid Middleware software process
Working at a Small-to-Medium Business or ISP – Chapter 7
Working at a Small-to-Medium Business or ISP – Chapter 7
Working at a Small-to-Medium Business or ISP – Chapter 7
Introduction to Apache
Computer Networks Primary, Secondary and Root Servers
Presentation transcript:

Distributed monitoring system

Why Monitor? Solve them! Identify Problems Ensure conduct Requirements Manage many computers Spot trends in the system Increase Performance Identify problems Applications

Monitoring Grids Grid Cluster Node Grid consists of – Nodes ( A single machine) – Clusters (Collection of Nodes) – Grids (Collection of Clusters) General Objective of Grid: To perform high performance computing. Solution: Monitor at levels.

Monitoring Nodes Nodes A terminal with single/multiple processors. Factors to monitor Temperatures CPU/Memory Usage Disk space Network Activity Jobs Provide vital statistics of each node. Grid Cluster Node

Monitoring Clusters & Grids Clusters & Grids Collection of Nodes Factors to monitor Load Processing power Uptime Availability Provides performance statistics

Ganglia Gangalia: is distributed monitoring system. Based on a hierarchical structure Lightweight :- low overhead and high concurrency. Prominent Features:- Visualization using graphs Selective statistics

Gangalia: Architecture [IBM (2008), 'Perormance Monitoring using Ganglia', IBM Manual - Wiki.]

Gangalia: Gmond Lightweight service Records and sends data via XDR CPU statistics Memory statistics Network statistics Job statistics Uses XML over TCP Gmond Node Gmond Node Gmond Node Gmtead Central Node

Gangalia: Gmtead Lightweight service Receives and sends data obtained from Gmond Gmtead Saves data on disk using RRD (round robin database) Supports multiple creation of monitoring domains Reason: Gangalia is very scalable

Gangalia: Web Server & GUI Tools GUI Tools PHP scripts which extract data from Gangalia Generates visualization using graphs. Web Server Apache + PHP support to hosts and execute scripts SSL and XML support is required.

Gangalia: gstat gstat Command line tool to extract gmond for information. Syntax: $gstat --help Usage: gstat [OPTIONS]... -h --help Print help and exit -V --version Print version and exit -a --all List all hosts. Not just hosts running gexec (default=off) -d --dead Print only the hosts which are dead (default=off) -m --mpifile Print a load-balanced mpifile (default=off) -1 --single_line Print host and information all on one line (default=off) -l --list Print ONLY the host list (default=off) -n --numeric Print numeric addresses instead of hostnames (default=off) -iSTRING --gmond_ip=STRING Specify the ip address of the gmond to query (default=' ') -pINT --gmond_port=INT Specify the gmond port to query (default=8649)

Gangalia:Using gstat

Gangalia:gmetric

Gagalia:Using gmetric

Building Monitoring Domains Using Gangalia

[IBM (2008), 'Performance Monitoring using Ganglia', IBM Manual - Wiki.]

Guidelines to Building Monitoring Domains ServiceFunctionSends ToReceives gmondCollects data from nodes gmond & gmteadgmond gmteadSaves data to diskgmteadgmond & gmtead gstatExtracts information. -gmond gmetricCreates custom metrics --

Prerequisites Finalize your IP Finalize you Domain Name Finalize your time zone Update the time zone of the machine using NTP Download following packages Gangalia [ ] PHP [ ] Apache [ ] rrdtools [ ]

Steps in Installing Gangalia Map Monitoring Domains Choose Central Nodes from Domains Install gmond on Nodes Install gmtead on Central Nodes Install Web Server on Central Nodes