www.see-grid.eu
SEE-GRID-2
The SEE-GRID-2 initiative is co-funded by the European Commission under the FP6 Research Infrastructures contract no. 031775.


SEE-GRID operational tools and Grid services improvements
Antun Balaz, WP3 Leader, Institute of Physics, Belgrade
EGEE/WLCG Operations Workshop 2007, Stockholm, June 2007

Overview
- SEE-GRID WP3
- Infrastructure
- Operations
- Operational tools
  - HGSM, HGSM+SAM integration
  - WiatG
  - BBmSAM, BBmobileSAM
- WP3 ongoing work

SEE-GRID WP3
- Develop the next-generation SEE-GRID infrastructure
  - Next generation of EGEE middleware (gLite) and services
- Support in deployment and operations of the Resource Centres
  - Monitoring, helpdesk, overall upgrade of infrastructure
- Network resource provision and assurance
  - In close cooperation with the SEEREN2 project
  - Bandwidth-on-Demand requirements
- CA and RA guidelines and deployment
  - Catch-all Certification Authority (CA)
  - Per-country CA deployment and
- User portal deployment and operations
  - P-GRADE

Infrastructure

Infrastructure status (1)
SEE-GRID Core services
- Catch-all Certification Authority
  - Enables regional sites to obtain user and host certificates
- Virtual Organisation Management Service (VOMS)
  - Authorization system for the SEE-GRID Virtual Organisation (VO), supporting groups and roles
- Workload management service (lcg-RB and glite-WMSLB) and Information Services (BDII)
  - Several instances deployed for failover
- MyProxy is operational
  - Supports certificate renewal
- FTS deployed
  - Used in production

Infrastructure status (2)
- The SEE-GRID infrastructure currently contains the following resources:
  - 31 sites in SEE-GRID production
  - 5 sites in certification phase (AL + HR + 2 RO)
  - CPUs: ~950 total; Storage: TB
- gLite assessment done, results positive, upgrade done on all sites (GLITE-3_0_2)
  - glite-CE deployed at several sites; assessment results inconclusive, service probably not stable enough for production
- glite-WMSLB deployed at several sites; assessment results show that it is not as stable as lcg-RB, but it has various new features and is therefore actively used
- WN deployment closely follows the latest developments of gLite:

Operations

Operational procedures
- Distributed operations
- Pilot SLA established
- Monitoring and accounting tools
- Helpdesk ticket procedures
  - Generic support group for users
    - TPM-like (monitoring open tickets created by users, trying to solve the simple ones, routing tickets, etc.)
  - Country-level user support groups
    - Associated with country-level mailboxes
- GOOD shifts introduced, initial results positive
- Ticket handling: response times need to be improved!
- SEEGRID Wiki with detailed information for site administrators

SLA Conformance
- Improvements seen after the first quarter of pilot SLA enforcement

Operational & monitoring tools (1)
Operational & monitoring tools deployment status
- Hierarchical Grid Site Management (HGSM) – Turkey
- Service Availability Monitoring (SAM) (+ porting to MySQL) – Bosnia and Herzegovina with CERN support
- Helpdesk – Romania
- BBmSAM – Bosnia and Herzegovina
- GridICE – FYR of Macedonia
- SEE-GRID GoogleEarth – Turkey + Gidon Moont
- SEE-GRID GoogleMaps – Turkey
- Global Grid Information Monitoring System (GStat) – Min-Hong Tsai
- Relational Grid Monitoring Architecture (R-GMA) – Bulgaria
- Nagios – Bulgaria
- Real Time Monitor (RTM) – Gidon Moont and Turkey (HGSM)
- MONitoring Agents using a Large Integrated Services Architecture (MonALISA) – Romania
- What is at the Grid (WiatG) – CERN with support from Serbia

Operational & monitoring tools map
[Diagram: HGSM, Helpdesk, BDII, R-GMA, SAM, GStat (Taiwan), VOMS, RTM (UK), Google Maps, BBmSAM, GridICE, MonALISA, Nagios, WiatG]

Operational & monitoring tools (2)
Integration status
- HGSM+SAM, HGSM+BBmSAM
  - Automatic creation of the list of sites to be tested
- HGSM+BDII
  - Automatic creation of the list of sites in the infrastructure
- HGSM+GStat
  - Automatic creation of the list of sites to be monitored
- HGSM+RTM, HGSM+R-GMA
  - Automatic creation of the list of sites for monitoring and for accounting
- VOMS+Helpdesk
  - New user accounts are created automatically when users access the helpdesk
  - Certificate-based access to the helpdesk
[Diagram: HGSM, Helpdesk, BDII, R-GMA, SAM, GStat, VOMS, RTM, Google Maps, BBmSAM]

HGSM database
- SEE-GRID GOCDB
  - Introduced as a lightweight version of GOCDB
  - Allows us to easily change its format when necessary and to adapt it to regional needs
  - Allows us to provide custom exports on demand, depending on the needs of operational tools and application developers
- Contains static information about all sites
- Developed and maintained by TUBITAK-ULAKBIM, Turkey
  - Used by EUMedGRID; other regional projects have expressed interest

HGSM+SAM Integration (1)
- HGSM+SAM integration was done in collaboration between TUBITAK-ULAKBIM and the University of Banjaluka
- Periodic export of HGSM data to an XML file
  - The XML is a full dump of the database and represents all relevant tables
  - The generated data is universal and can be used for other purposes
- Periodic import of HGSM data, first into a local MySQL DB and then into the Oracle XE SAM DB
  - Only SAM-relevant data is imported into Oracle
  - Other data stays in the local MySQL DB, available for other uses without burdening the Oracle DB
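
The export/import chain described on this slide can be sketched roughly as follows. This is only an illustration under stated assumptions: the XML element and attribute names (`site`, `node`, `monitored`) are hypothetical because the slides do not show the real HGSM schema, and an in-memory SQLite database stands in for the local MySQL copy. The final query marks the point where only SAM-relevant rows would be forwarded to the Oracle SAM DB.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical shape of an HGSM XML dump; the real schema is not given in the slides.
SAMPLE_DUMP = """
<hgsm>
  <site name="BA-01-ETFBL" country="BA">
    <node host="c01.grid.etfbl.net" service="CE" monitored="1"/>
    <node host="c16.grid.etfbl.net" service="SE" monitored="0"/>
  </site>
</hgsm>
"""


def import_dump(xml_text, local_db):
    """Load every node from the dump into the local copy (full mirror)."""
    local_db.execute(
        "CREATE TABLE IF NOT EXISTS nodes "
        "(site TEXT, country TEXT, host TEXT, service TEXT, monitored INTEGER)"
    )
    root = ET.fromstring(xml_text)
    rows = [
        (site.get("name"), site.get("country"), node.get("host"),
         node.get("service"), int(node.get("monitored", "0")))
        for site in root.iter("site")
        for node in site.iter("node")
    ]
    local_db.executemany("INSERT INTO nodes VALUES (?, ?, ?, ?, ?)", rows)
    local_db.commit()
    return len(rows)


def sam_relevant(local_db):
    """Select only the rows that would be pushed on to the SAM database."""
    cur = local_db.execute(
        "SELECT site, host, service FROM nodes WHERE monitored = 1 ORDER BY host"
    )
    return cur.fetchall()


local = sqlite3.connect(":memory:")  # stand-in for the local MySQL copy
import_dump(SAMPLE_DUMP, local)
print(sam_relevant(local))
```

Keeping the full mirror locally and forwarding only a filtered subset matches the stated goal of not burdening the Oracle DB with data that SAM does not need.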

HGSM+SAM Integration (2)
[Diagram: HGSM (MySQL), XML (PHP), local copy of HGSM (MySQL), SAM DB (Oracle), SAM sync (PHP), SAM portal (Python), BBmSAM (PHP); hosts: c01.grid.etfbl.net, c16.grid.etfbl.net, hgsm.grid.org.tr]

HGSM+SAM Integration (3)
- HGSM and SAM: planned improvements
  - Currently SAM retrieves node/service data from a mix of different sources (the “official” way)
  - All the data is already present in HGSM
  - The intention is to communicate directly and only with HGSM, as it is considered the reference copy of the data
- Having a copy of the HGSM DB at the same place enables us to further develop the (BBm)SAM portal
  - Checking whether someone is a site administrator and allowing him/her to request out-of-order tests
  - Soft real-time tracking of test progress
  - Exporting data in any structured form; moving to XML and/or

WiatG: New BDII operations tool
- Web application for visualization of BDII information
  - Highly responsive tool because it uses AJAX
  - Partial refresh (the client receives the page part by part)
  - Asynchronous (server processing runs in the background, so one may send several requests)
- Current version searches for: CE, gCE, RB, gRB, SE, LFC, FTS and GridICE
- Used as an operational tool for site monitoring
- Documentation available:
- Supports several regional projects: EUMedGRID, EUChinaGrid, EELA, and BalticGrid, as well as LHC VOs and OPS
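
The idea of partial, asynchronous refresh can be illustrated with a small Python analogy. This is not WiatG's actual implementation, which the slides do not show. Here `query_bdii` is a hypothetical placeholder for one BDII lookup per service type, and the page fragments are collected in the order in which the lookups finish rather than the order in which they were requested.

```python
import asyncio

# Service types listed on the slide.
SERVICE_TYPES = ["CE", "gCE", "RB", "gRB", "SE", "LFC", "FTS", "GridICE"]


async def query_bdii(service_type):
    """Placeholder for one BDII lookup; a real tool would query the BDII here."""
    await asyncio.sleep(0)  # yield control, simulating non-blocking I/O
    return service_type, f"<div id='{service_type}'>...</div>"


async def render_partial(service_types):
    """Collect page fragments in completion order, not request order."""
    tasks = [asyncio.create_task(query_bdii(s)) for s in service_types]
    fragments = []
    for finished in asyncio.as_completed(tasks):
        # Each fragment could be sent to the client as soon as it is ready.
        fragments.append(await finished)
    return fragments


fragments = asyncio.run(render_partial(SERVICE_TYPES))
print([name for name, _ in fragments])
```

Because no request waits for the others, a slow lookup for one service type does not delay the parts of the page that are already available.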

WiatG Architecture

WiatG in action

Further development of WiatG
- Addition of new services (MyProxy, local LFC, VO software tags, …)
- Correctness check of site-BDII data
  - Alarms dashboard
  - Automatic creation of tickets
- Development of the new tool “What should be at the Grid” (WsbatG)
  - Based on the site configuration exported from HGSM (SEE-GRID GOCDB)
  - Visually identical tool, providing the expected status of BDII in WiatG
- Comparison of WiatG and WsbatG data
  - Alarms dashboard
  - Automatic creation of tickets
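
The planned comparison of WiatG and WsbatG data amounts to a difference between an expected and an observed set of services per site. The following is a minimal sketch of that idea, not the project's design: the dictionary layout and the alarm labels `missing` and `unexpected` are assumptions made for illustration.

```python
def compare(expected, observed):
    """Return alarms for services missing from, or unexpected in, the BDII view."""
    alarms = []
    for site in sorted(set(expected) | set(observed)):
        exp = expected.get(site, set())
        obs = observed.get(site, set())
        for svc in sorted(exp - obs):
            alarms.append((site, svc, "missing"))
        for svc in sorted(obs - exp):
            alarms.append((site, svc, "unexpected"))
    return alarms


# What HGSM says should be published (the WsbatG view); sample data only.
expected = {"BA-01-ETFBL": {"CE", "SE"}}
# What the BDII actually publishes (the WiatG view); sample data only.
observed = {"BA-01-ETFBL": {"CE", "LFC"}}

alarms = compare(expected, observed)
print(alarms)
```

Each alarm tuple could then feed an alarms dashboard or an automatically created ticket, as listed on the slide.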

BBmSAM
- BBmSAM portal
  - Created for SLA monitoring
  - Generates site availability statistics according to several criteria
  - Overview (HTML) and full dump (CSV) of data possible
- Extended into full SAM portal
  - Availability for the last 24h period for all sites/services
  - Latest results per service
  - History for nodes/services
- Currently being ported to MySQL
- Developed by the University of Banjaluka
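
As a rough illustration of how availability statistics and a CSV dump might be produced, the sketch below computes availability as the fraction of passed critical tests per site. That criterion is an assumption: the slides mention "several criteria" without defining them, and the site names `SITE-A` and `SITE-B` are placeholders.

```python
import csv
import io
from collections import defaultdict


def availability(results):
    """results: iterable of (site, passed) pairs for tests in the reporting window."""
    counts = defaultdict(lambda: [0, 0])  # site -> [passed, total]
    for site, passed in results:
        counts[site][1] += 1
        if passed:
            counts[site][0] += 1
    return {site: ok / total for site, (ok, total) in counts.items()}


def to_csv(stats):
    """Render the per-site statistics as CSV text, sorted by site name."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["site", "availability"])
    for site in sorted(stats):
        writer.writerow([site, f"{stats[site]:.2f}"])
    return buf.getvalue()


# Placeholder test results: three of four tests pass at SITE-A, one of one at SITE-B.
sample = [("SITE-A", True), ("SITE-A", False), ("SITE-B", True),
          ("SITE-A", True), ("SITE-A", True)]
stats = availability(sample)
print(to_csv(stats))
```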

BBmSAM as a SAM portal (1)

BBmSAM as a SAM portal (2)

BBmSAM and SLA (1)

BBmSAM and SLA (2)

BBmobileSAM
- Optimized for small-screen devices and low bandwidth
- Possible filtering of sites
  - For a single site (example: BA-01-ETFBL)
  - For all sites in a country (example: BA)
  - For all SEE-GRID sites
- Three possible levels of detail
  - Basic level (critical test status for all nodes and services)
  - Single test level (status of all tests for all nodes and services)
  - Single test level with timestamp
- Detail levels work independently of the site filter, so detailed results can also be produced for all sites in SEE-GRID
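
The independence of site filter and detail level can be sketched as two separate functions that compose freely. Everything below is illustrative: the record layout, the level names `basic`, `test` and `timestamp`, the test names, and the second site `XX-01-DEMO` are hypothetical, and deriving the country from the site-name prefix is an assumption based on the `BA-01-ETFBL` example.

```python
def filter_results(results, site=None, country=None):
    """Site filter takes precedence over country; no filter means all sites."""
    if site is not None:
        return [r for r in results if r["site"] == site]
    if country is not None:
        # Assumption: the country code is the prefix of the site name.
        return [r for r in results if r["site"].split("-")[0] == country]
    return list(results)


def render(results, level="basic"):
    """Shape rows for the requested detail level, independent of any filter."""
    rows = []
    for r in results:
        if level == "basic" and not r["critical"]:
            continue  # basic level shows critical tests only
        row = [r["site"], r["node"], r["test"], r["status"]]
        if level == "timestamp":
            row.append(r["time"])
        rows.append(tuple(row))
    return rows


# Sample records; test names and the second site are made up for illustration.
results = [
    {"site": "BA-01-ETFBL", "node": "c01.grid.etfbl.net", "test": "job-submit",
     "status": "ok", "critical": True, "time": "2007-06-01T10:00"},
    {"site": "BA-01-ETFBL", "node": "c01.grid.etfbl.net", "test": "version",
     "status": "warn", "critical": False, "time": "2007-06-01T10:05"},
    {"site": "XX-01-DEMO", "node": "ce.example.org", "test": "job-submit",
     "status": "error", "critical": True, "time": "2007-06-01T10:10"},
]

# Most detailed level with no site filter: detailed rows for every site.
print(render(filter_results(results), level="timestamp"))
```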

WP3 ongoing work
- Optimization of site/top-level BDIIs through indexing
- SAM porting to MySQL
- WiatG/WsbatG
- HGSM improvements
- gLite-WMSLB performance and stability assessment
- Proxy renewal on RB/WMS with full VOMS capabilities