Pedro Andrade pedro.andrade@cern.ch ACE Status Update Pedro Andrade pedro.andrade@cern.ch.

Slides:



Advertisements
Similar presentations
Die Kooperation von Forschungszentrum Karlsruhe GmbH und Universität Karlsruhe (TH) Tools used for operations at GridKa Angela Poschlad, SCC.
Advertisements

ONE STOP THE TOTAL SERVICE SOLUTION FOR REMOTE DEVICE MANAGMENT.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EG recent developments T. Ferrari/EGI.eu ADC Weekly Meeting 15/05/
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
02/07/09 1 WLCG NAGIOS Kashif Mohammad Deputy Technical Co-ordinator (South Grid) University of Oxford.
Automatic Report Generation for WLCG/EGEE D. D. Sonvane (Gridview Team) B.A.R.C.
James Casey, CERN, IT-GT-TOM 1 st ROC LA Workshop, 6 th October 2010 Grid Infrastructure Monitoring.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid Monitoring Tools Alexandre Duarte CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
DDM Monitoring David Cameron Pedro Salgado Ricardo Rocha.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN IT Department CH-1211 Geneva 23 Switzerland t CCRC’08 Tools for measuring our progress CCRC’08 F2F 5 th February 2008 James Casey, IT-GS-MND.
HLRmon accounting portal DGAS (Distributed Grid Accounting System) sensors collect accounting information at site level. Site data are sent to site or.
Julia Andreeva on behalf of the MND section MND review.
Validation of SAM3 monitoring data (availability & reliability of services) Ivan Dzhunov, Pablo Saiz (CERN), Elena Tikhonenko (JINR, Dubna) April 11, 2014.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Ops Portal New Requirements.
GridView - A Monitoring & Visualization tool for LCG Rajesh Kalmady, Phool Chand, Kislay Bhatt, D. D. Sonvane, Kumar Vaibhav B.A.R.C. BARC-CERN/LCG Meeting.
SAM Database and relation with GridView Piotr Nyczyk SAM Review CERN, 2007.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
HLRmon accounting portal The accounting layout A. Cristofori 1, E. Fattibene 1, L. Gaido 2, P. Veronesi 1 INFN-CNAF Bologna (Italy) 1, INFN-Torino Torino.
New solutions for large scale functional tests in the WLCG infrastructure with SAM/Nagios: The experiments experience ES IT Department CERN J. Andreeva.
Computation of Service Availability Metrics in Gridview Digamber Sonvane, Rajesh Kalmady, Phool Chand, Kislay Bhatt, Kumar Vaibhav Computer Division, BARC,
GridView - Presentation of Work done at CERN by D. D. Sonvane B.A.R.C.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
INFSO-RI Enabling Grids for E-sciencE GOCDB Requirements John Gordon, STFC.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI 2 nd level support training Marian Babik, David Collados, Wojciech Lapka,
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Present and Future Pedro Andrade (CERN IT) 31 st August.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Update on Service Availability Monitoring (SAM) Marian Babik, David Collados,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI First Ops Tools Long Term Sustainability F2F David Collados 1First Ops Tools.
Flexible Availability Computation Engine for WLCG Rajesh Kalmady, Phool Chand, Vaibhav Kumar, Digamber Sonvane, Pradyumna Joshi, Vibhuti Duggal, Kislay.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
CERN IT Department CH-1211 Geneva 23 Switzerland t LHCOPN Meeting Madrid, 11 th March 2008 James Casey WLCG Monitoring – An overview.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Site Activity (Program Metrics) Training
Daniele Bonacorsi Andrea Sciabà
GridPP37, Ambleside Adrian Coveney (STFC)
Monitoring Evolution and IPv6
Gridpp37 – 31/08/2016 George Ryall David Meredith
Job monitoring and accounting data visualization
PPS All sites Meeting: - CODs and PPS - Monitoring Tools
NGI and Site Nagios Monitoring
OVirt Data Warehouse 02/11/11 Yaniv Dary BI Software Engineer, Red Hat.
Key Activities. MND sections
EMI Project Processes and Tools
POW MND section.
Evolution of SAM in an enhanced model for monitoring the WLCG grid
FTS Monitoring Ricardo Rocha
Experiment Dashboard overviw of the applications
DWQ Web Transformation
Advancements in Availability and Reliability computation Introduction and current status of the Comp Reports mini project C. Kanellopoulos GRNET.
March Availability Report for EGEE Sites based on Nagios
Maite Barroso, SA1 activity leader CERN 27th January 2009
TS4.10 Comp Reports A new approach to Computing Availability/Reliability reports for EGI Progress Report C. Kanellopoulos GRNET 9/14/2018.
Solutions for federated services management EGI
Monitoring of the infrastructure from the VO perspective
Cloud Management Mechanisms
EGI operations - news T. Ferrari/EGI.eu 12/9/2018.
Operational Tools & Middleware Versions Monitoring
Kashif Mohammad Deputy Technical Co-ordinator (South Grid) Oxford
Site availability Dec. 19 th 2006
Presentation transcript:

Pedro Andrade pedro.andrade@cern.ch ACE Status Update Pedro Andrade pedro.andrade@cern.ch

Outline Brief History Overview Status Computation Availability Computation Operation Status ACE Status Update

Brief History The development of ACE is part of a co-operation agreement between CERN and the Department of Atomic Energy (DAE) of the Government of India for the WLCG project ACE is an external product designed and developed by the Bhabha Atomic Research Centre (BARC) based in Mumbai ACE is released by the SAM team and made available in the central grid-monitoring service ACE Status Update

Overview ACE main task is to compute availability and reliability statistics for grid services and sites: Status is the status of service instances, overall service flavours, and sites at a given point in time Availability is the fraction of time a service was up during the period the service was known Reliability is the fraction of time a service was up during the period the service was scheduled to be up ACE Status Update

Overview ACE Profiles are a collection of metrics/services defined for VOs (multiple profiles per VO). Each profile defines its computation algorithm. Service t1 tn’’ CREAM-CE s1 sn t1’ tn’ SRMv2 s1 sn t1’’ tn’’ sBDII s1 sn Flavour Profile 1 Profile 2 Metric ACE Status Update

Overview Metric Results Status Computation Availability Computation Retrieved from Messaging Broker and stored in DB Status Computation Calculated continuously Based on Service Instances, Service Flavours, Sites Availability Computation Calculated per Hour, Day, Week, and Month Visualization Monthly Reports Web Interface ACE Status Update

Status Computation ACE computes status for services, service flavours, and sites based on metric results ORing At least one service up  up ANDing All services up  up Metric Results Service Status Service Flavour Status Site Status ORing At least one flavours up  up ANDing All flavours up  up ANDing All metrics in OK state  up ACE Status Update

Availability Computation Based on the concepts of hourly up fraction, scheduled down fraction, and unknown fraction From the status and downtimes information ACE computes hourly availability and reliability for services, overall service flavours, and sites From the hourly availability and reliability information ACE computes daily, weekly, monthly availabilities and reliabilities for services, overall service flavours, sites, and federations ACE Status Update

Visualization Monthly Reports: Web Interface: http://gvdev.cern.ch/GRIDVIEW/downloads/Reports/ PDFs reporting monthly availability and reliability numbers Created once per month and distributed to WLCG/EGI Created using iReport, OpenReports, and JasperReports Web Interface: https://grid-monitoring.cern.ch/myegi/sa Online tool showing availability and reliability graphs Queries can be based on: profiles, groups, sites, dates ACE Status Update

Operation Status Management of OPS VO profile and algorithm: (OR CEs) AND (OR of SEs) AND sBDII Operation of the service availability web interface provided by the central grid-monitoring MyEGI portal Generation of monthly availability and reliability reports for OPS VO. Including the maintenance of templates used to generate the reports ACE Status Update

Thanks! For more information please visit the SAM Documentation website https://tomtools.cern.ch/confluence/display/SAMDOC/Home ACE Status Update

Visualization Monthly Reports Web Interface iReport OpenReports JasperReports ACE Status Update