Performance of the Relational Grid Monitoring Architecture (R-GMA) CMS data challenges. The nature of the problem. What is GMA ? And what is R-GMA ? Performance.

Slides:



Advertisements
Similar presentations
21 Sep 2005LCG's R-GMA Applications R-GMA and LCG Steve Fisher & Antony Wilson.
Advertisements

IEEE NSS 2003 Performance of the Relational Grid Monitoring Architecture (R-GMA) CMS data challenges. The nature of the problem. What is GMA ? And what.
Network Weather Service Sathish Vadhiyar Sources / Credits: NWS web site: NWS papers.
1 CHEP 2000, Roberto Barbera Roberto Barbera (*) Grid monitoring with NAGIOS WP3-INFN Meeting, Naples, (*) Work in collaboration with.
Grid Monitoring Discussion Dantong Yu BNL. Overview Goal Concept Types of sensors User Scenarios Architecture Near term project Discuss topics.
The Network Weather Service A Distributed Resource Performance Forecasting Service for Metacomputing Rich Wolski, Neil T. Spring and Jim Hayes Presented.
ManageEngine TM Applications Manager 8 Monitoring Custom Applications.
Technical Architectures
Extensible Scalable Monitoring for Clusters of Computers Eric Anderson U.C. Berkeley Summer 1997 NOW Retreat.
October 2003 Iosif Legrand Iosif Legrand California Institute of Technology.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
COMP 410 & Sky.NET May 2 nd, What is COMP 410? Forming an independent company The customer The planning Learning teamwork.
Use of R-GMA in BOSS Henry Nebrensky (Brunel University) VRVS 26 April 2004 Some slides stolen from various talks at EDG 2 nd Review (
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
INFSO-RI Enabling Grids for E-sciencE GridICE: a monitoring service for Grid Systems Sergio Andreozzi INFN (Italy)
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE GridICE: a monitoring service for Grid Systems Giuseppe Misurelli INFN-CNAF (Italy) giuseppe.misurelli.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
1 Network Monitoring Mi-Jung Choi Dept. of Computer Science KNU
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
Fermilab Distributed Monitoring System (NGOP) Progress Report J.Fromm K.Genser T.Levshina M.Mengel V.Podstavkov.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
ALICE, ATLAS, CMS & LHCb joint workshop on
13 May 2004EB/TB Middleware meeting Use of R-GMA in BOSS for CMS Peter Hobson & Henry Nebrensky Brunel University, UK Some slides stolen from various talks.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
An information and monitoring system for static and dynamic information about grid resources, applications, networks … RDBMS Servlet aware of API during.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
GridICE: a monitoring service for Grid Systems. OUTLINE GridICE Server Installation – Brief Introduction – System Requirements – Core Packages & Dependencies.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
 CMS data challenges. The nature of the problem.  What is GMA ?  And what is R-GMA ?  Performance test description  Performance test results  Conclusions.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
Distributed Logging Facility Castor External Operation Workshop, CERN, November 14th 2006 Dennis Waldron CERN / IT.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
EGEE is a project funded by the European Union under contract IST Information and Monitoring Services within a Grid R-GMA (Relational Grid.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Network Weather Service. Introduction “NWS provides accurate forecasts of dynamically changing performance characteristics from a distributed set of metacomputing.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
CERN 21 January 2005Piotr Nyczyk, CERN1 R-GMA Basics and key concepts Monitoring framework for computing Grids – developed by EGEE-JRA1-UK, currently used.
DataTAG is a project funded by the European Union International School on Grid Computing, 23 Jul 2003 – n o 1 GridICE The eyes of the grid PART I. Introduction.
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
DataTAG is a project funded by the European Union CERN, 8 May 2003 – n o 1 / 10 Grid Monitoring A conceptual introduction to GridICE Sergio Andreozzi
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Author etc Alarm framework requirements Andrea Sciabà Tony Wildish.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
A Web Based Job Submission System for a Physics Computing Cluster David Jones IOP Particle Physics 2004 Birmingham 1.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Architecture Review 10/11/2004
Database Replication and Monitoring
GridICE: a monitoring service for Grid Systems
Performance of the Relational Grid Monitoring Architecture (R-GMA)
Sergio Fantinel, INFN LNL/PD
GridICE monitoring for the EGEE infrastructure
Scalability Tests With CMS, Boss and R-GMA
a VO-oriented perspective
#01 Client/Server Computing
Author: Laurence Field (CERN)
Technical Capabilities
Information Services Claudio Cherubino INFN Catania Bologna
#01 Client/Server Computing
Presentation transcript:

Performance of the Relational Grid Monitoring Architecture (R-GMA) CMS data challenges. The nature of the problem. What is GMA ? And what is R-GMA ? Performance test description Performance test results Conclusions

The Nature of the problem As part of the preparations for data taking CMS is performing DATA CHALLENGES. Large number of simulated events to optimise detectors and prepare software Enormous processing requirements BUT each event is independent of all the others each event can be generated on a machine without any interaction with any other

The local solution Work split between farms. How to handle the book-keeping ? a data-base automatically updated Implemented via a job wrapper BOSS Output to and is intercepted and the information is recorded in a mySQL production database. Event generation and job accounting decoupled

The local solution (schematic) Database Machine Submission Machine UI Worker Node (WN) WN

The grid solution (schematic) Database Machine Submission Machine UI

Grid Monitoring Architecture (GMA) of the GGF Producer Consumer Registry (Directory services) register producer locate producer address of producer data Ask for data

R-GMA (Relational GMA) Developed for E(uropean) D(ata) G(rid) Extends the GMA in two important ways 1.Introduces a time stamp on the data. 2.A relational implementation 3.Hides the registry behind the API Can be used for information and monitoring Each V irtual O rganisation appears to have one RDBMS

The syntax of R-GMA The user interface to R-GMA is via SQL statements (not all SQL statements and structures are supported) Information is advertised via a table create Information is published via insert Information is read via select … from table The first read request registers the consumer as interested in this data. Relational queries are supported NOTE : sql is the interface – it should not be supposed an actual database lies behind it.

Fit between R-GMA and BOSS R-GMA can be dropped into the framework with very little disruption 1.Set up calls for mySQL are replaced by those for R-GMA producers 2.An archiver (joint consumer/producer) runs on a single machine which collects the data from all the running jobs and writes it to a local database (and possible republishes it). The data can then be queried either by direct mySQL calls or via R-GMA consumer (a distributed database has been created)

Database BOSS LAN Connection R-GMA WAN Connection Fit between R-GMA and BOSS (i)

R-GMA Measurements The architecture of GMA clearly provides a putative solution to the wide area monitoring problem. BUT Does a specific implementation provide a practical solution Before entrusting CMS production to R-GMA, we must be confident that it will perform. What load will it fail at and why ?

Message time distribution from 44 jobs 35 chars.

Simulation of a CMS job Multi-threaded job each thread produces messages. Length 35 chars, suitable distribution. Threads starting time distribution can be altered. One machine delivers the R-GMA load of a farm. R-GMA servlet R-GMA consumer

Simulation of the CMS Grid One machine per grid cluster providing loads of greater than the cluster R-GMA consumer R-GMA servlet R-GMA servlet R-GMA servlet R-GMA servlet

Current status R-GMA can survive loads of around 20% of the current CMS requirements and does provides a grid method for monitoring. An overload of a factor 2 jobs causes problems after about five minutes running. We believe these instabilities are soluble. When production starts in earnest we will compare reality with our model.

GridICE Server Installation 16

Brief Introduction GridICE: –is a distributed monitoring tool for grid systems –integrates with local monitoring systems –offers a web interface for publishing monitoring data at the Grid level –fully integrated in the LCG-2 Middleware gridice-clients data collector installation and configuration for each site ralized by the Yaim scripts. 17

System Requirements Suggested Operating system is Scientific Linux with a minimal installation The GridICE server should be installed on a performant machine –PostgreSQL service - RAM intensive demand –Apache web server - RAM-CPU intensive demand 18

Core Packages & Dependencies The GridICE server software is composed by three core packages: 1.gridice-core (setup and maintenance scripts / discovery components) 2.gridice-www (web interface scripts and components) 3.gridice-plugins (monitoring scripts) Plus several dependencies: –Apache http web server –PostgreSQL database server –Nagios monitoring tool –... 19

The Four Main Phases of Monitoring 20 Generation Distributing Presenting Processing Sensors inquiring entities and encoding the measurements according to a schema Transmission of the events from the source to any interested parties (data delivery model: push vs. pull; periodic vs. aperiodic) Processing and abstract the number of received events in order to enable the consumer to draw conclusions about the operation of the monitored system e.g., filtering according to some predefined criteria, or summarising a group of events

The GridICE Approach 21

Generating Events Generation of events: –Sensors: typically perl scripts or c programs. –Schema: GLUE Schema v GridICE extension. –System related (e.g., CPU load, CPU Type, Memory size). –Grid service related (e.g., CE ID, queued jobs). –Network related (e.g., Packet loss). –Job usage (e.g., CPU Time, Wall Time). –All sensors are executed in a periodic fashion. 22

Distributing Events Distribution of events: – Hierarchical model. Intra-site: by means of the local monitoring service – default choice, LEMON ( Inter-site: by offering data through the Grid Information Service. Final Consumer: depending on the client application. – Mixed data delivery model. Intra-site: depending on the local monitoring service (push for lemon). Inter-site: depending on the GIS (current choice, MDS 2.x, pull). Final consumer: pull (browser/application), push (publish/subscribe notification service coming on the next release). 23

Presenting Events Data stored in a RDBMS used to build aggregated statistics. Data retrieved from the RDBMS are encoded in XML files. XSL to XHTML transformations to publish aggregated data in a Web context. 24

Monitoring a Grid 25

Challenges for Data Collection The distribution of monitoring data is strongly characterised by significant requirements (e.g., Scalability, Heterogeneity, Security, System Health) None of the existing tools satisfy all of these requirements Grid data collection should be customized depending on what are the needs of your Grid users selected 26

Challenges for Data Presentation Different Grid users are interested in different subset of Grid data and different aggregation levels Usability principles should be taken into account to help users finding relevant Grid monitoring information A sintetic data aggregation is crucial to permit a drill- down navigation (from the general to te detailed) of the Grid data 27

Grid Monitoring Architecture (GMA) of the GGF Producer Consumer Registry (Directory services) register producer locate producer address of producer data Ask for data

R-GMA (Relational GMA) Developed for E(uropean) D(ata) G(rid) Extends the GMA in two important ways 1.Introduces a time stamp on the data. 2.A relational implementation 3.Hides the registry behind the API Can be used for information and monitoring Each V irtual O rganisation appears to have one RDBMS

The user interface to R-GMA is via SQL statements (not all SQL statements and structures are supported) Information is advertised via a table create Information is published via insert Information is read via select … from table The first read request registers the consumer as interested in this data. Relational queries are supported NOTE : sql is the interface – it should not be supposed an actual database lies behind it. The syntax of R-GMA

R-GMA can be dropped into the framework with very little disruption 1.Set up calls for mySQL are replaced by those for R-GMA producers 2.An archiver (joint consumer/producer) runs on a single machine which collects the data from all the running jobs and writes it to a local database (and possible republishes it). The data can then be queried either by direct mySQL calls or via R-GMA consumer (a distributed database has been created) Fit between R-GMA and BOSS

Database BOSS LAN Connection R-GMA WAN Connection Fit between R-GMA and BOSS (i)

How is Ganglia different from Nagios Ganglia is architecturally designed to perform efficiently in very large monitoring environments: each Ganglia gmond performs its service checks locally, reporting in at a regular interval to the gmetad. Nagios performs its service checks by polling each device across a network connection and waiting for a response (known as "active checks"), which can be more resource and bandwidth intensive. Nagios uses the results of its active checks to determine state by comparing the metrics it polls to thresholds. These state changes can in turn be used to generate notifications and customizable corrective actions. Ganglia, by contrast, has no built-in thresholds, and so does not generate events or notifications. The general rule of thumb has been: if you need to monitor a limited number of aspects of a large number of identical devices, use Ganglia; if you want to monitor lots of aspects of a smaller number of different devices, use Nagios. But those distinctions are blurring as Ganglia supports more and more devices, and as Nagios' scalability improves. 10/7/2015T.R.LEKHAA/AP/IT/SNSCE33

How is Ganglia different from Nagios The problem with ganglia and all the other external web pages we have been looking at is that you have to look at them! If all is well with your system you don’t want to have to look. This is where Nagios comes in. It can be setup to alert you when something goes wrong, or a value passes a threshold. 10/7/2015T.R.LEKHAA/AP/IT/SNSCE34

Monitoring: What?

Monitoring: How(1)? Monitor Node Grid middleware Monitoring Architecture IperfER PingER UDPmon MiperfER bbcp/ftp Tools installed on dedicated & similar node at each centre MESH Publication service 30 mins

Monitoring: How(2)?

Network Weather Service

Introduction “NWS provides accurate forecasts of dynamically changing performance characteristics from a distributed set of metacomputing resources” What will be the future load (not current load) when a program is executed? Producing short-term performance forecasts based on historical performance measurements The forecasts can be used by dynamic scheduling agents

Introduction Resource allocation and scheduling decisions must be based on predictions of resource performance during a timeframe NWS takes periodic measurements of performance and using numerical models, forecasts resource performance

NWS Goals Components –Persistent state –Name server –Sensors Passive (CPU availability) Active (Network measurements) –Forecaster

Architecture

Performance measurements Using sensors CPU sensors –Measures CPU availability –Uses uptime vmstat Active probes Network sensors –Measures latency and bandwidth Each host maintains –Current data –One-step ahead predictions –Time series of data

Network Measurements

Issues with Network Sensors Appropriate transfer size for measuring throughput Collision of network probes Solutions –Tokens and hierarchical trees with cliques

Available CPU measurement

The formulae shown does not take into account job priorities Hence periodically an active probe is run to adjust the estimates

Predictions To generate a forecast, forecaster requests persistent state data When a forecast is requested, forecaster makes predictions for existing measurements using different forecast models Dynamic choice of forecast models based on the best Mean Absolute Error, Mean Square Prediction Error, Mean Percentage Prediction Error Forecasts requested by: –InitForecaster() –RequestForecasts() Forecasting methods –Mean-based –Median based –Autoregressive

Forecasting Methods Notations: Prediction Accuracy: Mean Absolute Error (MAE) is the average of the above Prediction Method:

Forecasting Methods – Mean-based

Forecasting Methods – Mean-based 4. 5.

Forecasting Methods – Median-based

Autoregression 1. a i found such that it minimizes the overall error. r i,j is the autocorellation function for the series of N measurements.

Forecasting Methodology

Forecast Results

Forecasting Complexity vs Accuracy Semi Non-parametric Time Series Analysis (SNP) – an accurate but complicated model Model fit using iterative search Calculation of conditional expected value using conditional probability density

Sensor Control Each sensor connects to other sensors and perform measurements O(N 2 ) To reduce the time complexity, sensors organized in hierarchy called cliques To avoid collisions, tokens are used Adaptive control using adaptive token timeouts Adaptive time-out discovery and distributed leader election protocol