Www.egi.eu EGI-InSPIRE RI-261323 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 How to integrate portals with the EGI monitoring system Dusan Vudragovic.

Slides:



Advertisements
Similar presentations
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Advertisements

Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Africa & Arabia ROC tutorial The GSTAT2 Grid Monitoring tool Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
SEE-GRID-SCI Regional Grid Infrastructure: Resource for e-Science Regional eInfrastructure development and results IT’10, Zabljak,
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
G RID M IDDLEWARE AND S ECURITY Suchandra Thapa Computation Institute University of Chicago.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
The ACGT Workflow Editing & Enactment Environment Giorgos Zacharioudakis Institute of Computer Science, Foundation for Research & Technology – Hellas (ICS-FORTH)
1 1 Service Composition for LHC Computing Grid Monitoring Beob Kyun Kim e-Science Division, KISTI
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
James Casey, CERN, IT-GT-TOM 1 st ROC LA Workshop, 6 th October 2010 Grid Infrastructure Monitoring.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Site Monitoring with Nagios E. Imamagic,
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Overview of Privilege Project at Fermilab (compilation of multiple talks and documents written by various authors) Tanya Levshina.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
ATP Future Directions Availability of historical information for grid resources: It is necessary to store the history of grid resources as these resources.
Julia Andreeva on behalf of the MND section MND review.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI User-centric monitoring of the analysis and production activities within.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Ops Portal New Requirements.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
Mardi 8 mars 2016 Status of new features in CIC Portal Latest Release of 22/08/07 Osman Aidel, Hélène Cordier, Cyril L’Orphelin, Gilles Mathieu IN2P3/CNRS.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Operations Portal Development Update on Requirements Cyril L'Orphelin IN2P3/CNRS.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Security Monitoring Daniel Kouřil EGI-TF 2011.
DataTAG is a project funded by the European Union CERN, 8 May 2003 – n o 1 / 10 Grid Monitoring A conceptual introduction to GridICE Sergio Andreozzi
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI /05/2011 SA1 & JRA1 - EGI-InSPIRE Review
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Update on Service Availability Monitoring (SAM) Marian Babik, David Collados,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI First Ops Tools Long Term Sustainability F2F David Collados 1First Ops Tools.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Operations Portal OTAG September, 21th 2011 Cyril L’Orphelin – CCIN2P3/CNRS.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Overview for ENVRI Gergely Sipos, Malgorzata Krakowian EGI.eu
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
NGI and Site Nagios Monitoring
Use of Nagios in Central European ROC
POW MND section.
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Lavoisier : a way to integrate heteregeneous monitoring systems.
Solutions for federated services management EGI
Monitoring of the infrastructure from the VO perspective
Kashif Mohammad Deputy Technical Co-ordinator (South Grid) Oxford
EGEE Operation Tools and Procedures
Presentation transcript:

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia 20/09/2012 EGI Technical Forum 2012, 20 Sep

EGI-InSPIRE RI Overview Introduction Overview and initial proposal for integration of Scientific Gateways (SG) into GOC DB GStat SAM Framework Real Time Monitoring GridView Google Earth GridMap Operations Portal

EGI-InSPIRE RI Introduction Scientific Gateways have become an essential tool for research Their operation and performance has to be monitored in order to ensure quality of service for end-users Such monitoring has to ensure an integrated overview of the global status of scientific gateways, but also detailed status of the individual scientific gateway layers and components In addition to this, the monitoring has to: enable sending of alerts to administrators when a particular problem is identified enable scheduling of downtimes during SG maintenance produce SG performance statistical reports

EGI-InSPIRE RI Overview of EGI monitoring Currently, several monitoring tools are used by EGI to detect and diagnose problems with sites GOC DB GStat (GIIS monitoring) SAM framework Real Time Monitoring GridView Google Earth GridMap On top of these, Operations Dashboard provides links and utilizes combined views to simplify monitoring tasks

EGI-InSPIRE RI GOC DB Central static information repository Stores information about NGIs, sites, nodes, services, users, etc. Used to declare maintenance for (un)scheduled events Consists of three parts: database, where all information is stored web portal, which interfaces with the database programmatic interface Exports initial configuration for the information system

EGI-InSPIRE RI SG integration into GOC DB Definition of a new object in GOC DB – Scientific Gateway Definition of the attributes of this object name of the portal portal URL type of the portal version of the portal contact persons (sysadmin, user support, security) available applications, etc. Definition of SG LDAP URL (in this way, SG can dynamically publish information to the Grid information system)

EGI-InSPIRE RI GStat Visualizes Grid infrastructures from an operational perspective, based on information found in the Grid information system Checks the health of Grid information system: detects faults in the information system verifies the validity of information This is done by directly querying site information systems and top-level information system It periodically takes a snapshot of the information system and maintains a cache of the main entities found in the infrastructure

EGI-InSPIRE RI SG integration into GStat GStat can provide statistical information on all properties available in the information system Number of jobs (total/running/waiting) Number of jobs per application Number of available job slots Number of users Available applications

EGI-InSPIRE RI SAM framework [1/2] Relies on existing technologies Nagios is used for scheduling and execution of the probes MSG messaging system (ActiveMQ) integrates other operational tools with Nagios instances SAM framework provides: Status and history of services and sites Visualization of services and sites’ availabilities Web services for data exports Nagios has a pluggable architecture that allows easy integration of SG probes

EGI-InSPIRE RI SAM framework [2/2] SAM uses three central databases: Aggregated Topology Provider (ATP) Metric Description Database (MDDB) Metric Results Store (MRS) Nagios Config Generator (NCG) enables automatic generation of Nagios configuration based on multiple information sources Nagios Probes Simple Probes (check of a service in a single run) Multitest Probes (single run performs multiple tests; mix of active and passive checks; file put > file get > file delete) Long-running Probes (submit > monitor > report state)

EGI-InSPIRE RI SG integration into SAM [1/2] SAM framework has to ensure monitoring of all SG layers and components Presentation Layer (scientific gateway portal) Presentation Layer (scientific gateway portal) Middle Layer (workflow engine, information system, application repository) Middle Layer (workflow engine, information system, application repository) Architecture Layer (job submission to different middlewares) Architecture Layer (job submission to different middlewares) SAM framework Generic SG architecture

EGI-InSPIRE RI SG integration into SAM [2/2] Probes for the Presentation Layer availability of portal and its components check of the authentication mechanism check of the input data management check of the workflow and data-flow tool application submission Probes for the Middle Layer application repository checks check of the workflow storage and interpreter check of the local file storage Probes for the Architecture Layer check of the submission to different DCIs (gLite, ARC, Unicore, Globus, LFS, PBS, BOINC, web service, local resource, Google App Engine, etc.)

EGI-InSPIRE RI SG integration into RTM Real time monitor overlays Grid activity onto a 3D globe Each Grid site is represented by a circle at the location of the resources (pulsing circle of magenta and green) Workload Management Systems (WMS) are represented as triangles Special symbol can be assigned for representation of SGs, while the number of jobs might be retrieved from the information system or through the RTM-dedicated service at SGs

EGI-InSPIRE RI SG integration into other tools GridView visualization tool provides a high-level view of various functional and performance aspects using GridFTP, WMS, FTS logs and SAM MSG Google Earth geographical location of sites GridMap gives graphical representation of site CPU power and its availability

EGI-InSPIRE RI Operations portal Entry point for all information and services related to EGI’s operations, where the community can manage, monitor, share and discuss information Architecture database – to store information related to the users and VOs web module – graphical user interface data aggregation and unification service

EGI-InSPIRE RI SG integration into Operations portal In addition to COD and VO dashboards, a new SG dashboard could be introduced dashboard with the overview of all detected problems related to SGs enabling operations staff to track problems using different results from various monitoring tools VO info feature could be also provided for SGs information on how to support (offer resources) a particular SG Broadcast feature contact several categories of stakeholders interested in notifications about identified problems, issues, downtimes announcement of a new SG version, or a new application