Job monitoring and accounting data visualization

Slides:



Advertisements
Similar presentations
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Advertisements

08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
Enabling Grids for E-sciencE Grid Monitoring Workshop Monterey Bay, California, 25 June 2007 Antonio Pierro INFN-BARI (Italy) Antonio.pierro.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Performance Improvements to BDII - Grid Information.
Certification and test activity IT ROC/CIC Deployment Team LCG WorkShop on Operations, CERN 2-4 Nov
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
INFSO-RI Enabling Grids for E-sciencE GridICE: Grid and Fabric Monitoring Integrated for gLite-based Sites Sergio Fantinel INFN.
HLRmon accounting portal DGAS (Distributed Grid Accounting System) sensors collect accounting information at site level. Site data are sent to site or.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Recent improvements in HLRmon, an accounting portal suitable for national Grids Enrico Fattibene (speaker), Andrea Cristofori, Luciano Gaido, Paolo Veronesi.
Certification and test activity ROC/CIC Deployment Team EGEE-SA1 Conference, CNAF – Bologna 05 Oct
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Ops Portal New Requirements.
LCG WLCG Accounting: Update, Issues, and Plans John Gordon RAL Management Board, 19 December 2006.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
EGEE is a project funded by the European Union under contract INFSO-RI Grid accounting with GridICE Sergio Fantinel, INFN LNL/PD LCG Workshop November.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
HLRmon accounting portal The accounting layout A. Cristofori 1, E. Fattibene 1, L. Gaido 2, P. Veronesi 1 INFN-CNAF Bologna (Italy) 1, INFN-Torino Torino.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Mardi 8 mars 2016 Status of new features in CIC Portal Latest Release of 22/08/07 Osman Aidel, Hélène Cordier, Cyril L’Orphelin, Gilles Mathieu IN2P3/CNRS.
INFN GRID Production Infrastructure Status and operation organization Cristina Vistoli Cnaf GDB Bologna, 11/10/2005.
INFSO-RI Enabling Grids for E-sciencE DGAS, current status & plans Andrea Guarise EGEE JRA1 All Hands Meeting Plzen July 11th, 2006.
INFSO-RI Enabling Grids for E-sciencE GILDA t-Infrastructure Antonio Fuentes Bermejo
First South Africa Grid Training June 2008, Catania (Italy) GILDA t-Infrastructure Valeria Ardizzone INFN Catania.
II EGEE conference Den Haag November, ROC-CIC status in Italy
IST E-infrastructure shared between Europe and Latin America The GILDA t-Infrastructure and the GENIUS portal Christian Grunfeld,
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid is a Bazaar of Resource Providers and.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
HLRmon Enrico Fattibene INFN-CNAF 1EGI-TF Lyon, France19-23 September 2011.
Using HLRmon for advanced visualization of resource usage Enrico Fattibene INFN - CNAF ISCG 2010 – Taipei March 11 th, 2010.
Enabling Grids for E-sciencE INFN Workshop – May 7-11 Rimini 1 Grid Accounting Status at INFN Riccardo Brunetti INFN-TORINO.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Grid Monitoring and Diagnostic Tools: GridICE, GSTAT, SAM Giuseppe Misurelli INFN-CNAF giuseppe.misurelli cnaf.infn.it.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Enabling Grids for E-sciencE GridICE: overview and current status Guido Cuscela INFN – Bari Service Challenge Technical Meeting September.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Claudio Grandi INFN Bologna Workshop congiunto CCR e INFNGrid 13 maggio 2009 Le strategie per l’analisi nell’esperimento CMS Claudio Grandi (INFN Bologna)
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Regional Operations Centres Core infrastructure Centres
gLite Information System
Classic Storage Element
GILDA t-Infrastructure
Installation and configuration of a top BDII
Practical: The Information Systems
POW MND section.
Brief overview on GridICE and Ticketing System
Monitoring: problems, solutions, experiences
Accounting at the T1/T2 Sites of the Italian Grid
Grid2Win: Porting of gLite middleware to Windows XP platform
Sergio Fantinel, INFN LNL/PD
GridICE monitoring for the EGEE infrastructure
Giuseppe Patania Nov, Martina Franca (Ta)‏
GSAF Grid Storage Access Framework
a VO-oriented perspective
Danilo Dongiovanni INFN-CNAF
EGEE Middleware: gLite Information Systems (IS)
The GENIUS portal and the GILDA t-Infrastructure
HLRmon accounting portal
Site availability Dec. 19 th 2006
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

Job monitoring and accounting data visualization Enrico Fattibene INFN-CNAF enrico.fattibene<at>cnaf.infn.it Scuola per utenti INFN della Grid, Bologna, 28 Novembre 2007

Outline Grid Monitoring Grid Accounting GridICE HLRmon Overview and architecture Job activity analysis Grid Accounting HLRmon Personal Grid usage analysis

Grid resource awareness A large Grid system must provide its users with precise and reliable information about: Status and Usage of available resources The efficient distribution of this information enables VOs to: Optimize their utilization strategies How CPUs are distributed among sites What about the status of the Grid sites Complete the planned computations How long my jobs take for running in a site What about my jobs CPU/Wall time

What helps? Grid Monitoring tools help in the detection of: Faulty Situations Status of available resources VO activity Usage of the available resources

EUIndiaGrid BalticGrid INFNGrid EELA BeGrid GridICE: overview Distributed monitoring tool for Grid systems started in late 2002 (EU-DataTAG project) is evolving in the context of EU-EGEE and many other EU Grid projects 100% open source Fully integrated with the gLite Middleware Metering and publishing of data can be configured via gLite standard installation mechanisms Installed servers are monitoring Grid resources in the scope of: EGEE EGEE-SWE RDIG EGEE-SEE Grid.it GILDA CMS ATLAS EUMedGrid EUChinaGrid EUIndiaGrid BalticGrid INFNGrid EELA BeGrid

GridICE: overview Based on gLite Information System Periodic discovery of new GRISes (once a day) Periodic queries to the discovered GRISes (every 10-30 mins) Standard GRISes (CE, SE, Site BDII) Information published on Top BDII Extended GRIS (peculiar GridICE service) Hosts information (daemons monitoring) Job monitoring Summary info for computing resources from LRMS Information collected in a central DB on a server and shown in a Web interface Very useful help pages

GridICE: architecture

GridICE: added value Information provided include Grid summary info Computing/Storage resources VO activity Job submission Provided information are accessible from Web context drill-down navigation XML documents Data exchange with other applications

Monitoring for different users Users are required to have a valid CA certificate Users identification is done through the digital certificate installed in their browsers (DN retrieved) https secure protocol used on server side Two ways to access data: “Standard Users”: only the info of user’s own jobs are provided “High level Users” can ask to be registered to the GridICE web site with a specific role VO manager Site manager ROC (Regional Operation Centre) manager

Grid summary info Geographical composition Resources availability Geo view where sites are located with the actual job load Resources availability Site view to get downtime info Site view to spot possible problems on Grid Information Service Grid services running on host machines Resources inventory VO view where computing and storage resources are aggregated per-VO

End-user activity /1 Job section to track VO users activity in order to: Search among a huge number of jobs Inspect jobs resource consumption Personal jobs info (next release)

End-user activity /2

End-user activity /3

Ongoing and future work Data quality Recent data quality analysis (performed in production environment) confirmed the expected level of correctness Data access Cooperation with external applications Integration of GridICE data into Experiment dashboard Batch System data genereted by the job monitoring sensors Web redesign Next user experience will gain from Web 2.0 principles, guidelines and technologies

Accounting Need to know who used the resources and how many resource have been used ROC Managers: how the Grid resources are used and by whom? Site Managers: who used my resources? VO managers: how many resources my VO used? Users: how many resources I used? A good accounting system should provide answers to these questions taking care of all the security and privacy issues

HLRmon: overview Signed access Grid role based 4 different roles Authorization/authentication by user’s digital certificate Grid role based Proper information are provided conforming to the role scope 4 different roles ROC manager, VO manager, Site admin, VO user Local aggregation Daily aggregated activity is locally stored Graphical or textual Job CPU/WallTime usage per Site and VO

HLRmon: architecture

HLRmon: data presentation Report data aggregated per site/VO/day Charts created on user needs Possibility to enlarge graphs (next release) Interactive table with possibility to export excel format Information about Grid utilization An user can see jobs submitted by users in his visibility scope

4 different roles ROC Manager Site Manager VO Manager VO User Report on all sites and VO that used the Grid Information on resources usage by Grid users Site Manager Report on all VOs that used the site Information on all users jobs VO Manager Report on usage on all the sites accepting the VO Information on Grid usage by VO members VO User Report his own resources usage

VO-manager viewpoint Report on aggregated job activity in different formats Graphs JobsNum/Site CPUTime/Day WallTime/Day CPUTime/Site WallTime/Site Table End user jobs detailed info

End-user viewpoint Report on personal job activity in different formats and aggregation Graphs JobsNum/Site CPUTime/Site WallTime/Site CPUTime/Day WallTime/Day JobsNum?VO Table Jobs number, CPUTime and WallTime per Site and VO

Conclusions GridICE and HLRmon can show you the relevant info you are interested in With the authentication based on personal certificate, the data privacy is always guaranteed GridICE and HLRmon Web presentation can be accessed by VO end-users to obtain information related to Grid resources usage and availability

References GridICE dissemination Web Site http://grid.infn.it/gridice GridICE server for Italian Grid http://gridice4.cnaf.infn.it:50080/gridice HLRmon for Italian Grid https://dgas.cnaf.infn.it/hlrmon W3C Standards - evergreen hint http://www.w3.org/QA/2002/07/WebAgency-Requirements

Disclaimer This presentation is based on materials provided and authorized by the EGEE project and is freely available to download and use according to the terms of the following license: http://creativecommons.org/licenses/by-nc-sa/2.5/