1 VO User Team Alarm Total ALICE 1 2 ATLAS CMS 4 LHCb 20

Slides:



Advertisements
Similar presentations
GGUS summary (5 weeks) VOUserTeamAlarmTotal ALICE2002 ATLAS CMS6208 LHCb Totals
Advertisements

WLCG Service Report (for the SCOD team) ~~~ WLCG Management Board, 22 nd January 2013 Thanks to Maria Dimou, Mike Kenyon, David.
WLCG Service Report ~~~ WLCG Management Board, 27 th January 2009.
WLCG Service Report ~~~ WLCG Management Board, 27 th October
GGUS summary (4 weeks) VOUserTeamAlarmTotal ALICE ATLAS CMS LHCb Totals
GGUS summary (7 weeks) VOUserTeamAlarmTotal ALICE ATLAS CMS LHCb Totals 1 To calculate the totals for this slide and copy/paste the usual graph please:
GGUS summary ( 4 weeks ) VOUserTeamAlarmTotal ALICE ATLAS CMS LHCb Totals 1.
WLCG Service Report ~~~ WLCG Management Board, 24 th November
WLCG Service Report ~~~ WLCG Management Board, 1 st September
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES GGUS Overview ROC_LA CERN
WLCG Service Report ~~~ WLCG Management Board, 9 th August
WLCG Service Report ~~~ WLCG Management Board, 16 th December 2008.
Handling ALARMs for Critical Services Maria Girone, IT-ES Maite Barroso IT-PES, Maria Dimou, IT-ES WLCG MB, 19 February 2013.
GGUS Slides for the 2012/07/24 MB Drills cover the period of 2012/06/18 (Monday) until 2012/07/12 given my holiday starting the following weekend. Remove.
GGUS summary (4 weeks) VOUserTeamAlarmTotal ALICE1102 ATLAS CMS LHCb Totals
WLCG Service Report ~~~ WLCG Management Board, 7 th September 2010 Updated 8 th September
GGUS summary (4 weeks) VOUserTeamAlarmTotal ALICE4015 ATLAS CMS LHCb Totals
4 March 2008CCRC'08 Feb run - preliminary WLCG report 1 CCRC’08 Feb Run Preliminary WLCG Report.
WLCG Service Report ~~~ WLCG Management Board, 16 th September 2008 Minutes from daily meetings.
WLCG Service Report ~~~ WLCG Management Board, 7 th June
WLCG Service Report ~~~ WLCG Management Board, 18 th September
WLCG Service Report ~~~ WLCG Management Board, 23 rd November
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE4004 ATLAS CMS LHCb Totals
WLCG Service Report ~~~ WLCG Management Board, 14 th February
WLCG Service Report Jean-Philippe Baud ~~~ WLCG Management Board, 24 th August
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
WLCG Service Report ~~~ WLCG Management Board, 17 th February 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Best Practices and Use cases David Bouvet,
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE7029 ATLAS CMS LHCb Totals
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GGUS Report Generator Günter Grein, KIT Helmut Dres, KIT Torsten Antoni,
GGUS summary (4 weeks) VOUserTeamAlarmTotal ALICE5016 ATLAS CMS6118 LHCb Totals
Scuola Grid - Martina Franca, Thursday 08 November Il Sistema di Supporto INFNGrid & GGUS ( Global Grid User.
Vendredi 27 avril 2007 Management of ATLAS CC-IN2P3 Specificities, issues and advice.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
EGEE is a project funded by the European Union under contract IST GGUS-ROCs Interface status update Marco Verlato INFN – Sezione di Padova.
GGUS summary ( 9 weeks ) VOUserTeamAlarmTotal ALICE2608 ATLAS CMS LHCb Totals
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
GGUS summary (2 weeks) VOUserTeamAlarmTotal ALICE2046 ATLAS CMS26210 LHCb Totals
WLCG Service Report ~~~ WLCG Management Board, 15 th December
Nordic NE ROC Face 2 Face Meeting
Servizi core INFN Grid presso il CNAF: setup attuale
WLCG IPv6 deployment strategy
Il Sistema di Supporto INFNGrid & GGUS (Global Grid User Support )
WLCG Workshop 2017 [Manchester] Operations Session Summary
1 VO User Team Alarm Total ALICE 12 ATLAS CMS
Il sistema di supporto di INFNGRID e GGUS
GGUS webportal – future plans
Key Activities. MND sections
PL-Grid – an example of NGI support structure Marcin Radecki
Flavia Donno CERN GSSD Storage Workshop 3 July 2007
1 VO User Team Alarm Total ALICE ATLAS CMS
1 VO User Team Alarm Total ALICE ATLAS CMS
Patricia Méndez Lorenzo ALICE Offline Week CERN, 13th July 2007
~~~ WLCG Management Board, 10th March 2009
CASTOR-SRM Status GridPP NeSC SRM workshop
LHCOPN Operations: Yearly review
WLCG Management Board, 16th July 2013
Castor services at the Tier-0
Jamie Shiers ~~~ WLCG MB, 19th February 2008
Olof Bärring LCG-LHCC Review, 22nd September 2008
1 VO User Team Alarm Total ALICE ATLAS CMS
ROD model assessment ROC FR
WLCG Service Report 5th – 18th July
1 VO User Team Alarm Total ALICE 2 ATLAS CMS LHCb 14
Take the summary from the table on
LCG Operations Workshop, e-IRG Workshop
Work flow changes after the end of EMI
Service Delivery and Support Program Update – April 25, 2018
WLCG Status – 1 Use remains consistently high
Dirk Duellmann ~~~ WLCG Management Board, 27th July 2010
Presentation transcript:

1 VO User Team Alarm Total ALICE 1 2 ATLAS 14 116 6 136 CMS 4 LHCb 20 GGUS summary (2 weeks) VO User Team Alarm Total ALICE 1 2 ATLAS 14 116 6 136 CMS 4 LHCb 20 22 Totals 137 9 166 1

Support-related events since last MB We need WLCG shifters, alarmers, management to give us meaningful values for the GGUS ‘Problem Type’ field, in order for periodic reporting to show better weak areas in support. There were 9 ALARM tickets since the last MB (2 weeks), 5 of which were real, all submitted by ATLAS. Details follow… 6/27/2018 WLCG MB Report WLCG Service Report

ATLAS ALARM->CERN-CNAF transFers What time UTC What happened 2010/10/05 9:13 GGUS ALARM ticket opened, automatic email notification to t1-alarms@cnaf.infn.it AND automatic assignment to ROC_Italy. 2010/10/05 10:23 Site acknowledges ticket and finds a StoRM backend problem. 2010/10/05 12:03 Service restored. Site puts the ticket to ‘solved’ and refers to GGUS:62745 for details. 2010/10/11 Submitter ‘verifies’ ticket GGUS:62745. Not sure how ‘symptomatic’ the solution was… https://gus.fzk.de/ws/ticket_info.php?ticket=62761 6/27/2018 WLCG MB Report WLCG Service Report

ATLAS ALARM->transFers to .fr cloud What time UTC What happened 2010/10/08 5:56 GGUS ALARM ticket opened, automatic email notification to lhc-alarm@cc.in2p3.fr AND automatic assignment to NGI_France. 2010/10/08 6:31 Site acknowledges ticket and finds a network problem preventing all DB server access. 2010/10/08 7:29 Service restored. 2010/10/08 10:41 Site puts ticket to status ‘solved’. https://gus.fzk.de/ws/ticket_info.php?ticket=62871 6/27/2018 WLCG MB Report WLCG Service Report

ATLAS ALARM-> CERN slow lsf What time UTC What happened 2010/09/27 15:34 GGUS ALARM ticket opened, automatic email notification to atlas-operator-alarm@cern.ch AND automatic assignment to ROC_CERN. 2010/09/27 16:01 Operator acknowledges ticket and contacts the expert. 2010/09/27 16:37 Expert’s 1st diagnosis. Too many queries. 2010/09/27 20:10 Service mgr kills a home-made robot by another experiment launching >> bjob queries and puts ticket to status ‘solved’. https://gus.fzk.de/ws/ticket_info.php?ticket=62467 6/27/2018 WLCG MB Report WLCG Service Report

ATLAS ALARM-> CERN slow AFS What time UTC What happened 2010/10/01 7:13 GGUS ALARM ticket opened, automatic email notification to atlas-operator-alarm@cern.ch AND automatic assignment to ROC_CERN. 2010/10/01 7:33 Operator acknowledges ticket and contacts the expert. 2010/10/01 9:37 IT Service manager re-classifies in CERN Remedy PRMS. 2010/10/11 15:33 Still ‘in progress’. Reminder sent during this drill. https://gus.fzk.de/ws/ticket_info.php?ticket=62662 6/27/2018 WLCG MB Report WLCG Service Report

ATLAS ALARM-> CERN CASTOR What time UTC What happened 2010/10/01 16:24 GGUS ALARM ticket opened, automatic email notification to atlas-operator-alarm@cern.ch AND automatic assignment to ROC_CERN. 2010/10/01 16:41 Operator acknowledges ticket and contacts the expert. 2010/10/01 16:42 Expert starts investigation. 2010/10/01 17:23 Solved. PutDONE in SRM not propagated to CASTOR. Done by hand. https://gus.fzk.de/ws/ticket_info.php?ticket=62688 6/27/2018 WLCG MB Report WLCG Service Report