Miroslav Siket, Dennis Waldron

Slides:



Advertisements
Similar presentations
GridPP7 – June 30 – July 2, 2003 – Fabric monitoring– n° 1 Fabric monitoring for LCG-1 in the CERN Computer Center Jan van Eldik CERN-IT/FIO/SM 7 th GridPP.
Advertisements

26/05/2004HEPIX, Edinburgh, May Lemon Web Monitoring Miroslav Šiket CERN IT/FIO
19/06/2002WP4 Workshop - CERN WP4 - Monitoring Progress report
Back-End Services Middle Tier Clients Front-End Clients General Services Architecture Services provide functionality to a host of clients RIA's require.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Post-C5 Lemon-web 2.0 Daniel Lenkes and Ivan Fedorko.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Lemon Tutorial Lemon Overview Miroslav Siket, Dennis Waldron CERN-IT/FIO-FD.
NGOP J.Fromm K.Genser T.Levshina M.Mengel V.Podstavkov.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Institute of Computer Science AGH Performance Monitoring of Java Web Service-based Applications Włodzimierz Funika, Piotr Handzlik Lechosław Trębacz Institute.
Enterprise Reporting with Reporting Services SQL Server 2005 Donald Farmer Group Program Manager Microsoft Corporation.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Fundamentals of Database Chapter 7 Database Technologies.
May PEM status report. O.Bärring 1 PEM status report Large-Scale Cluster Computing Workshop FNAL, May Olof Bärring, CERN.
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
Quattor-for-Castor Jan van Eldik Sept 7, Outline Overview of CERN –Central bits CDB template structure SWREP –Local bits Updating profiles.
RRDtool Miroslav Siket FIO-FS /
Lemon Monitoring Miroslav Siket, German Cancio, David Front, Maciej Stepniewski CERN-IT/FIO-FS LCG Operations Workshop Bologna, May 2005.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
Lemon Monitoring Presented by Bill Tomlin CERN-IT/FIO/FD WLCG-OSG-EGEE Operations Workshop CERN, June 2006.
EU 2nd Year Review – Feb – WP4 demo – n° 1 WP4 demonstration Fabric Monitoring and Fault Tolerance Sylvain Chapeland Lord Hess.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Lemon for Quattor I.Fedorko CERN CF/IT 16 March 2011.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Lemon Tutorial Sensor Exception Miroslav Siket, Dennis Waldron CERN-IT/FIO-FD.
3D Testing and Monitoring Lee Lueking LCG 3D Meeting Sept. 15, 2005.
Features Of SQL Server 2000: 1. Internet Integration: SQL Server 2000 works with other products to form a stable and secure data store for internet and.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CF Monitoring: Lemon, LAS, SLS I.Fedorko(IT/CF) IT-Monitoring.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Alarming with GNI VOC WG meeting 12 th September.
Lemon Tutorial Sensor How-To Miroslav Siket, Dennis Waldron CERN-IT/FIO-FD.
Lemon security. Previous security enhancements user lemon: lemon-db-admin-OraMon will create user lemon (Miro). - OraMon switches to user lemon at its.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Lemon monitoring and Lemon Alarm System (sensors, exception, alarm)
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CC Monitoring I.Fedorko on behalf of CF/ASI 18/02/2011 Overview.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Lemon Computer Monitoring at CERN Miroslav Siket, German Cancio, David Front, Maciej Stepniewski Presented by Harry Renshall CERN-IT/FIO-FS.
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
Lemon Tutorial Quattor and Non-Quattor Configuration of the lemon-agent Miroslav Siket, Dennis Waldron CERN-IT/FIO-FD.
The Holmes Platform and Applications
Chapter 13 Web Application Infrastructure
SmartCode Brad Argue INLS /19/2001.
WP4 meeting Heidelberg - Sept 26, 2003 Jan van Eldik - CERN IT/FIO
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
System Monitoring with Lemon
Monitoring and Fault Tolerance
Diskpool and cloud storage benchmarks used in IT-DSS
Status of Fabric Management at CERN
Amazon Storage- S3 and Glacier
LEMON – Monitoring in the CERN Computer Centre
Consulting Services JobScheduler Architecture Decision Template
Spark Presentation.
Configuration for gLite
Platform as a Service.
The ETICS Build and Test Service
Virtualization in the gLite Grid Middleware software process
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Overview Introduction VPS Understanding VPS Architecture
Database Services for CERN Deployment and Monitoring
Oracle Architecture Overview
HC Hyper-V Module GUI Portal VPS Templates Web Console
Module 01 ETICS Overview ETICS Online Tutorials
Tango in a Nutshell 31/12/2018.
LitwareHR v2: an S+S reference application
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
AIMS Equipment & Automation monitoring solution
Sending data to EUROSTAT using STATEL and STADIUM web client
Web Application Development Using PHP
SDMX IT Tools SDMX Registry
Presentation transcript:

Miroslav Siket, Dennis Waldron Lemon Tutorial Lemon Overview Miroslav Siket, Dennis Waldron http://cern.ch/lemon CERN-IT/FIO-FD

Tutorial Why? Number of services is expanding. More to monitor every day. For whom? Service managers to configure monitoring of their services Developers to simplify their life when writing sensors Site managers to setup their monitoring instances 09/10/2006 Lemon Tutorial

Tutorial Outline Architecture Writing sensors Running and configuring Agent Using lemon tools Running Lemon server(s) Running and configuring web interface Running alarm system 09/10/2006 Lemon Tutorial

Architecture 09/10/2006 Lemon Tutorial

Architecture II Three layers: Data producing/consuming Data manipulation Data Storage 09/10/2006 Lemon Tutorial

Client side Agent forks sensors and communicate with them using custom protocol over a bi-directional “pipes” configures metric instances of metric classes of a sensor and pulls for metrics checks on status of sensors agent sends data to servers using TCP or UDP monitors itself with internal MSA sensor caches data locally Default Linux client distribution comes with the agent, linux and file sensors. Footprint: agent - 5.5MB and 0.02% of CPU utilization* core sensors (Linux, file, exception) – 10MB, 0.2% of CPU* parseLog – 9.4MB Currently C++ and perl APIs available. * i386, SLC3/4, RHES3/4 – average over CERN CC 09/10/2006 Lemon Tutorial

Server side Two implementations: Oracle based – OraMon optimized for high performance and for large Computer Centers runs on Oracle 9i+ (with alarms system on 10g) validation of metric samples, metadata information Flat files based – FlatMon (edg-fmon-server) uses OS files for storing data for smaller sites (scalable to 1000 machines max.) General features: multithreaded UDP/TCP server built in authentication mechanism 09/10/2006 Lemon Tutorial

Server side - planning Space considerations CPU considerations About 400kB of data per machine/day (Oracle Enterprise edition with compression) – 700kB without compression (XE, Standard) About 1.2MB for FlatMon per machine per day CPU considerations Dual PIV, 3GHz, 4GB of memory with Oracle DB server + OraMon requires about 15% CPU for 4000 monitored machines Adding Alarm system on Oracle requires additional 5% of CPU FlatMon saturates the above machine with 1000 monitored hosts OraMon/FlatMon require about 105MB of memory Functionality considerations FlatMon does not provide metric checks and has no metadata concept Lemon Alarm System (LAS) runs on Oracle as PL/SQL procedures and requires Oracle 10g – integrated with OraMon schema in Oracle database For HA architecture, use Oracle RAC and multiple OraMon servers 09/10/2006 Lemon Tutorial

User/administration tools Lemon-cli Retrieving monitoring data from the local machine cache Allows retrieving data from the server Currently uses SOAP interface (to be retired soon) Lemon-host-check Checks status of the machine based on the values of exceptions Checks status of the monitoring agent and sensors Manages status of exceptions 09/10/2006 Lemon Tutorial

Configuration management At CERN we use Quattor Configuration Database Configuration is stored in hierarchical templates per domain/cluster/node NCM framework is used to download configuration XML profile to nodes NCM components are used: For agent/sensors configuration – using fmonagent component For server configuration (metadata) – using oramonserver component For smaller sites with homogeneous structures Use default agent and sensor rpms from Lemon Use rpms for custom sensors/settings 09/10/2006 Lemon Tutorial

Lemon RRD framework User front-end for visualization and caching monitoring data Two layers Pre-processing – consumes monitoring data and creates rrd files per machine/cluster/… (aging, averages) - lemonmrd Visualization – using rrd files for fast visualization or direct access to the monitoring repository – status web pages Different plugins/options available: Synoptic display of the Computer Center (XML driven) Lemon Alarm GUI Quattor .tpl file browser, … Requirements Web server with PHP (v5+ if want to use LAS) rrdtool rpm 500kB space per machine’s rrd file 09/10/2006 Lemon Tutorial

Automatic recovery actions and alarms Sensor exception For defined values of measured metrics an actuator is called with predefined action An example: ssh daemon dead – action /sbin/service sshd start Definition: metric X, field Y <op> reference value Z => call actuator <op> can be ==,<,>,regexp, range, +,-,*,/ etc.. Each occurrence is logged in the Monitoring Repository Already about 230 predefined exceptions with automatic recovery actions Exceptions are base for alarms in Lemon Alarm System Allow multi-valued metrics and on-behalf metrics Allow corrective actions (actuators) up to n-times or within given time window Allow distinguishing of the alarm state (failed actuator, silenced,…) Example: (10004:7 > 100 && (10005:3 – 34:5)>100:56) On behalf: (soap_srvx:302:1 > 10) 09/10/2006 Lemon Tutorial

Lemon Alarm System Features Newest addition to Lemon Build on top of the OraMon schema in Oracle database Comes in two pieces: PL/SQL stored procedures (requires Oracle 10g) to consume exceptions and to produce alarms GUI – web based interface based on AJAX – part of LRF Features Reduction of alarms (by type or by node/cluster) Possibility to hide/inhibit alarms Access control History tracking Future: notifications, RSS feeds 09/10/2006 Lemon Tutorial

Software distribution RPM direct download from http://lemon.web.cern.ch/lemon/downloads.shtml or at http://linuxsoft.cern.ch/lemon/ YUM setup with /etc/yum.repos.d/lemon.repo [lemon] name=Lemon baseurl=http://linuxsoft.cern.ch/lemon/linux/RPMS/i386/sl4/stable/ enabled=1 gpgcheck=1 gpgkey=http://linuxsoft/lemon/RPM-GPG-KEY-lemon APT setup with /etc/apt/sources.list.d/lemon.list # Lemon stable rpm http://linuxsoft.cern.ch/lemon linux/RPMS/i386/sl4 lemon_stable_sl4 Source code CVS CVSROOT=:pserver:anonymous@isscvs.cern.ch:/local/reps/elfms 09/10/2006 Lemon Tutorial

Future and additional information Things not covered/under development XML gateway with API to several languages (C++, perl, python, java,…) Python Sensor API LAS notification, RSS feeds Encryption of data between agent and server Authentication for user access Service views for LRF Check Web pages: http://cern.ch/lemon for additional information 09/10/2006 Lemon Tutorial