EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,

Presentation transcript:

Regional Grid Monitoring
Introduction & database components
Wojciech Lapka, SAM Team, CERN
EGEE’09 Conference, September 2009, Barcelona
EGEE-III INFSO-RI. Enabling Grids for E-sciencE. EGEE and gLite are registered trademarks.

Outline
Introduction to the new Service Availability Monitoring System
Description of the Database Components
– Aggregated Topology Provider (ATP)
– Metric Description Database (MDDB)
– Metric Results Store (Metric Store)

SAM – existing architecture

SAM - enhanced architecture

Data Flow

MyEGEE portal & iGoogle

Databases - ATP
– What will be tested? Aggregated Topology Provider
– How will it be tested? ?
– What to do with test results? ?

Databases - ATP
What information is provided by the ATP?
– Topology information containing:
   – Projects (WLCG) and grid infrastructures (EGEE, OSG, NDGF)
   – Sites, Services, VOs and their groupings
   – Downtimes
   – A history of the above
Why do we need it?
– For availability re-calculations, a history of grid topology is needed
– We couldn’t name groups of arbitrary grid resources (e.g. ATLAS clouds)
– A single authoritative source of topology information
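The point about topology history can be illustrated with a minimal sketch. It is not the ATP schema: the record layout (`ServiceRecord`, `valid_from`, `valid_to`), the helper `topology_at`, and the site and host names are all hypothetical, chosen only to show why an availability re-calculation for a past date needs the topology as it was on that date.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass(frozen=True)
class ServiceRecord:
    """One version of a service entry (hypothetical layout, not the ATP schema)."""
    site: str
    hostname: str
    flavour: str
    valid_from: datetime
    valid_to: Optional[datetime] = None  # None means the record is still valid


def topology_at(history: List[ServiceRecord], when: datetime) -> List[ServiceRecord]:
    """Return the records that were valid at the given instant."""
    return [
        r for r in history
        if r.valid_from <= when and (r.valid_to is None or when < r.valid_to)
    ]


# Invented sample data: ce01 is replaced by ce02 on 1 June 2009.
history = [
    ServiceRecord("SITE-A", "ce01.example.org", "CE",
                  datetime(2009, 1, 1), datetime(2009, 6, 1)),
    ServiceRecord("SITE-A", "ce02.example.org", "CE", datetime(2009, 6, 1)),
    ServiceRecord("SITE-A", "se01.example.org", "SRM", datetime(2009, 1, 1)),
]

march = topology_at(history, datetime(2009, 3, 15))
september = topology_at(history, datetime(2009, 9, 15))
```

Without the validity intervals, a re-calculation for March would wrongly use the September topology and look for results from a host that did not yet exist.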

ATP - why do we need it?
Figure: current flow of Grid topology data across various monitoring tools.
Figure: streamlined grid topology data flow using the ATP.

ATP – data sources
Diagram labels: BDII; OSG IM; GOCDB; CIC Portal; ATP sync; OSG topology & downtimes; EGEE topology & downtimes; Installed capacity; VO cards; Aggregated Topology Provider; Gstat 2.0; VO / service mappings; Alice Voboxes; WLCG MOU Portal; Project feeds; VO feeds.

ATP – status
What do we have today?
– MySQL and Oracle versions
– Synchronizer
– A programmatic interface to retrieve ATP information (XML/JSON)
What needs to be added?
– History tables to record changes in topology information
– Programmatic Interface - parameterised queries (similar to SAM PI)

Databases - MDDB
– What will be tested? Aggregated Topology Provider
– How will it be tested? Metric Description Database
– What to do with test results? ?

Databases - MDDB
What information is provided by the MDDB?
– Metrics which are used to test the Grid infrastructure
– Profiles: combinations of metrics used to compute different availabilities and to configure Nagios installations
Why do we need it?
– More flexible availability calculations:
   – Example: CMS would like to test Tier-1 and Tier-2 sites differently
– Maintain a history of which metrics and calculations were valid at each point in time
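A minimal sketch of how profiles could make availability calculations flexible is given below. The profile names, the metric names, and the rule "available when every metric in the profile is OK" are assumptions made for illustration. The slides do not specify the actual calculation or the real metric names.

```python
from typing import Dict, FrozenSet

# Hypothetical profiles: each names the metrics that count for one availability view.
PROFILES: Dict[str, FrozenSet[str]] = {
    "cms_tier1": frozenset({"job-submit", "srm-put", "srm-get", "tape-recall"}),
    "cms_tier2": frozenset({"job-submit", "srm-put", "srm-get"}),
}


def is_available(profile: str, latest_status: Dict[str, str]) -> bool:
    """Assumed rule: available only if every metric in the profile is OK.

    A metric with no recorded result is treated as not OK.
    """
    return all(latest_status.get(metric) == "OK" for metric in PROFILES[profile])


# Invented latest results for one site.
site_status = {
    "job-submit": "OK",
    "srm-put": "OK",
    "srm-get": "OK",
    "tape-recall": "CRITICAL",
}

tier1_view = is_available("cms_tier1", site_status)
tier2_view = is_available("cms_tier2", site_status)
```

The same set of results yields different answers under the two profiles, which is the kind of per-tier difference the CMS example describes.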

MDDB - Architecture
Diagram labels: Central MDDB; Local Cache; MDDB Sync.

MDDB - Status
What do we have today?
– MySQL and Oracle versions
– Integration with ATP
– Web User Interface
– A programmatic interface to retrieve MDDB information (JSON)
What needs to be added?
– Synchronizer between Central DB and local (ROC) caches
– Interface for populating and querying profiles
– Profiles: Mapping with grid resources

Databases – Metric Store
– What will be tested? Aggregated Topology Provider
– How will it be tested? Metric Description Database
– What to do with test results? Metric Results Store

Databases – Metric Store
What information is provided by the Metric Store?
– Metric results for service end-points in the grid infrastructure
– Status changes for service end-points in the infrastructure
What do we have today?
– MySQL and Oracle versions:
   – Integration with MDDB and ATP
   – Per-service status change calculation for Profiles
   – Data loader
– Data from 11 ROCs is being loaded into the Central Metric Store:
   – Some records are rejected (mainly because service end-points are not defined correctly in GOCDB)
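The difference between metric results and status changes can be sketched as follows. This is only an illustration of the idea: the `(timestamp, status)` tuple layout and the function `status_changes` are hypothetical and do not describe the Metric Store tables or its actual calculation.

```python
from typing import Iterable, List, Optional, Tuple

Result = Tuple[int, str]  # (timestamp in seconds, status); hypothetical layout


def status_changes(results: Iterable[Result]) -> List[Result]:
    """Keep only the results whose status differs from the previous one."""
    changes: List[Result] = []
    previous: Optional[str] = None
    for timestamp, status in sorted(results):  # process in time order
        if status != previous:
            changes.append((timestamp, status))
            previous = status
    return changes


# Invented results for one service end-point, one every ten minutes.
results = [
    (0, "OK"), (600, "OK"), (1200, "CRITICAL"),
    (1800, "CRITICAL"), (2400, "OK"),
]
changes = status_changes(results)
```

Storing the transitions alongside the raw results means a consumer such as a portal can read a short list of intervals instead of scanning every individual result.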

Metric Store – status
What needs to be added:
– MySQL: tuning of the DB (e.g. table partitioning)
– Programmatic Interface - parameterised queries
– Purging mechanism
– Alerting mechanism integrated with Nagios (e.g. when not enough metric results are received in a given period of time)
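The alerting idea (too few metric results in a given period) could look roughly like the sketch below. The function name, the half-of-expected warning threshold, and the sample arrival times are assumptions. Only the numeric return codes follow the usual Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL).

```python
from typing import Iterable, Tuple

# Conventional Nagios plugin return codes.
OK, WARNING, CRITICAL = 0, 1, 2


def check_result_volume(timestamps: Iterable[int], now: int,
                        window: int, expected: int) -> Tuple[int, str]:
    """Compare the number of results received in the last `window` seconds
    with the expected number. The thresholds are illustrative assumptions."""
    received = sum(1 for t in timestamps if now - window < t <= now)
    if received >= expected:
        return OK, f"OK - {received} results in last {window}s"
    if received >= expected // 2:
        return WARNING, f"WARNING - only {received} of {expected} results"
    return CRITICAL, f"CRITICAL - only {received} of {expected} results"


# Invented arrival times (seconds); only the last two fall inside the window.
arrivals = [100, 700, 1300, 3100, 3400]
code, message = check_result_volume(arrivals, now=3600, window=1800, expected=4)
```

A real check would read the arrival times from the Metric Store and exit with `code`, so that Nagios can raise the alert through its normal notification path.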

Central Metric Store Population
Diagram labels: Active & Passive Checks Results; Metric & Profile Definition; Service Definition.

Publicity - Demo
Watch our demo and vote for it:
– Tuesday 16:30-17:00
– Wednesday lunch
– (YouTube)

Acknowledgments
Thanks to the following people for their contributions:
– James Casey (CERN)
– Emir Imamagic (SRCE)
– Pradyumna Joshi (BARC)
– Rajesh Kalmady (BARC)
– Vaibhav Kumar (BARC)
– Steve Traylen (CERN)
SAM Team at CERN:
– John Shade
– David Collados
– Karolis Eigelis
– Judit Novak
– Konstantin Skaburskas

Summary
The new enhanced SAM system, based on Nagios, a very popular and powerful open-source tool, will:
– Simplify the transition to the EGI era
– Help site administrators with fabric monitoring
ATP, acting as a single authoritative information aggregator, will simplify the job of assimilating grid resource information
MDDB will allow flexible availability calculations
Metric Results Store will help the MyEGEE portal display the test results
Demo:

Thank you!
Questions? egee3-operations-automation-