EGEE/LCG Operation Workshop

Slides:



Advertisements
Similar presentations
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Torsten Antoni – LCG Operations Workshop, CERN 02-04/11/04 Global Grid User Support - GGUS -
Advertisements

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Wofgang Thöne, Institute For Scientific Computing – EGEE-Meeting August 2004 Welcome to the User.
EGEE is a project funded by the European Union under contract IST ROCs Interface and TPM partecipation Marco Verlato INFN – Sezione di Padova.
08/11/908 WP2 e-NMR Grid deployment and operations Technical Review in Brussels, 8 th of December 2008 Marco Verlato.
Operational Workshop, Abingdon Alistair Mills, CERN 27 September 2005 Global Grid User Support Alistair Mills Flavia Donno for the LCG/GGUS Executive Support.
INFSO-RI Enabling Grids for E-sciencE GLOBAL GRID USER SUPPORT THE MODEL AND EXPERIENCE IN LCG/EGEE Gilles Mathieu(1), Torsten Antoni(2),
EGEE is a project funded by the European Union under contract IST Plan for ROC verification Hélène Cordier - Alistair Mills IN2P3, CRNS, France.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE is a project funded by the European Union under contract IST User support in EGEE Alistair Mills Torsten Antoni EGEE-3 Conference 20 April.
INFSO-RI Enabling Grids for E-sciencE User Support in EGEE Torsten Antoni, FZK
EGEE is a project funded by the European Union under contract IST Support Operation Challenge – 1 SOC-1 Alistair Mills Torsten Antoni ARM-4,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
EGEE is a project funded by the European Union under contract IST Network Resources Provision Jean-Paul Gautier SA2 manager Cork meeting,
EGEE is a project funded by the European Union under contract IST Support in EGEE Ron Trompert SARA NEROC Meeting, 28 October
INFSO-RI Enabling Grids for E-sciencE An overview of EGEE operations & support procedures Jules Wolfrat SARA.
LCG GDB LCG User Support 8 February 2005 – n o 1 LCG/EGEE User Support Flavia Donno LCG/INFN-Pisa
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Alistair.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
INFSO-RI Enabling Grids for E-sciencE Database Services and Grid User Support Flavia Donno on behalf of GGUS/ESC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
EGEE is a project funded by the European Union under contract IST Service Activity 1 M.Cristina Vistoli ROC Coordinator All activity meeting,
LCG Workshop User Support Working Group 2-4 November 2004 – n o 1 Some thoughts on planning and organization of User Support in LCG/EGEE Flavia Donno LCG.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number GGUS Service Provider GGUS –
II EGEE conference Den Haag November, ROC-CIC status in Italy
1/3/2006 Grid operations: structure and organization Cristina Vistoli INFN CNAF – Bologna - Italy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
EGEE is a project funded by the European Union under contract IST ROC-IT User Support in the EGEE infrastructure Riccardo Brunetti INFN-Torino.
Scuola Grid - Martina Franca, Thursday 08 November Il Sistema di Supporto INFNGrid & GGUS ( Global Grid User.
CERN LCG Grid Deployment Board Flavia Donno, INFN 8 September 2005 Global Grid User Support Flavia Donno for the LCG/GGUS Executive Support Committee (ESC)
INFSO-RI Enabling Grids for E-sciencE Support Model for SC4 Pilot WLCG Service Flavia Donno CERN.
INFN-Grid WS, Bari, 2004/10/15 Andrea Caltroni, INFN-Padova Marco Verlato, INFN-Padova Andrea Ferraro, INFN-CNAF Bologna EGEE User Support Report.
EGEE is a project funded by the European Union under contract IST GGUS-ROCs Interface status update Marco Verlato INFN – Sezione di Padova.
Enabling Grids for E-sciencE EGEE-II INFSO-RI ROC managers meeting at EGEE 2007 conference, Budapest, October 1, 2007 Admin Matters Vera Hanser.
CERN WLCG Grid Storage Systems Deployment Flavia Donno, CERN 6 November 2007 Organization of Storage Support through GGUS Flavia Donno CERN/IT-GD CERN.
Bob Jones EGEE Technical Director
Il Sistema di Supporto INFNGrid & GGUS (Global Grid User Support )
Grid.It Grid Managers Tutorial
Regional Operations Centres Core infrastructure Centres
Il sistema di supporto di INFNGRID e GGUS
Operations Status Report
The Italian Regional Helpdesk System
EGEE is a project funded by the European Union
Support Operation Challenge – 1 SOC-1 Alistair Mills Torsten Antoni
EGEE Middleware Activities Overview
GGUS webportal – future plans
SA1 Execution Plan Status and Issues
User Support Workflow in EGEE
Ian Bird GDB Meeting CERN 9 September 2003
EGEE/LCG Operation Workshop
Brief overview on GridICE and Ticketing System
ATLAS support in LCG.
Report from ESC / GGUS / TPM
VOCE Peter Kaczuk, Dan Kouril, Miroslav Ruda, Jan Svec,
INFN – GRID status and activities
Infrastructure Support
The GILDA testbed Roberto Barbera University of Catania and INFN
The CCIN2P3 and its role in EGEE/LCG
Nordic ROC Organization
R-COD model readiness in FR
GGUS Partnership between FZK and ASCC
NE-ROC Nordics Operations
LCG Operations Workshop, e-IRG Workshop
Leigh Grundhoefer Indiana University
Presentation transcript:

EGEE/LCG Operation Workshop 24th-26th May 2005 A report on operation support, open issues and statistics Marco Verlato INFN – Sezione di Padova www.eu-egee.org EGEE is a project funded by the European Union under contract IST-2003-508833

Outline History since last Operation Workshop EGEE User Support Overview Grid.it helpdesk usage report ROC Integration Open issues EGEE/LCG Operation Workshop – May 24-26, 2005 - 2

History Nov. 04: Outcome from User Support Task Force, Grid.it support infrastructure and pilot interface to/from GGUS presented at 1st EGEE/LCG Operation Workshop Nov. 04: Pilot GGUS-Grid.it helpdesk interface live demo at EGEE-2 Conference Dec. 04: Grid.it helpdesk code and interface documentation made available to other ROCs Jan. 05: E(gee, or xecutive) Support Commettee kick off at FZK, WP definition and mandate Mar. 05: Support on Duty start, GGUS/ Grid.it / Cic-on-duty helpdesks fully interfaced May 05: GGUS enhanced, SE , SW helpdesks interfaced EGEE/LCG Operation Workshop – May 24-26, 2005 - 3

EGEE User Support: requirements Support requests range from: Grid Services and Sites faults Problems with installation/configuration “How do I …?” Problems with applications Bugs Requirements for extra features Users may be Site Admins, VO application users, VO managers, … they all prefer a single point of contact for Grid problems User Support / Operation Support / VO Support are different but with a lot of overlap Different sets of experts and levels of support EGEE/LCG Operation Workshop – May 24-26, 2005 - 4

EGEE User Support: infrastructure The ROCs, VOs and the other project wide groups such as the Core Infrastructure Center (CIC), middleware groups (JRA), and network groups (NA), will be connected via a central integration platform provided by GGUS. This central helpdesk keeps track of all service requests and assigns them to the appropriate support groups. In this way, formal communication between all support groups is possible. To enable this, each group has to build only one interface between its internal support structure and the central GGUS application. EGEE/LCG Operation Workshop – May 24-26, 2005 - 5

EGEE User Support: interfaces Using the local Helpdesk Systems in conjunction with a central integration platform at GGUS Resource Center 1(RC) ... Resource Center N(RC) Local User Support Application Regional Operations Center (ROC) Third level support: Generic deployment Grid Middleware Report Problem The User Interface VO support Use the Webview Report Problem Central GGUS Application Interface CIC EGEE/LCG Operation Workshop – May 24-26, 2005 - 6

EGEE User Support: Responsible Units First Level Support GGUS team SOD (ROC experts rotation) Second Level Support CIC-on-duty ROC_Asia/Pacific ROC_CE ROC_CERN ROC_France ROC_GER/CH ROC_Italy ROC_North ROC_Russia ROC_SE ROC_SW ROC_UK/Ireland VOSupport (atlas,magic,biomed,compass,babar,cdf,alice,lhcb,cms,d0) Third Level Support (filled with experts provided by ROCs) Grid Deployment Castor Generic Deployment Manual Installation Pre-production system VO management/VOMS Grid Middleware d-Cache Data Management GLUE GridICE Information System/GIP/BDII R-GMA Security Management Workload Management ROC Helpdesks EGEE/LCG Operation Workshop – May 24-26, 2005 - 7

The Grid-It portal 31 RCs ~1400 CPUs ~120 TB 21 VOs +DAG+MPI +DGAS http://grid-it.cnaf.infn.it 31 RCs ~1400 CPUs ~120 TB 21 VOs +DAG+MPI +DGAS EGEE/LCG Operation Workshop – May 24-26, 2005 - 8

Deployment Status EGEE/LCG Operation Workshop – May 24-26, 2005 - 9

Services and Sites Monitoring EGEE/LCG Operation Workshop – May 24-26, 2005 - 10

Grid-it Helpdesk EGEE/LCG Operation Workshop – May 24-26, 2005 - 11

Trouble Ticketing System The trouble ticketing system is based on OneOrZero Helpdesk tool (www.oneorzero.com), coded in PHP, using MySQL, customizable, free Replaced with Xoops / xHelp tool soon Access allowed to registered members approved by administrators: End-users: they create the tickets describing problems or suggestions Supporters: fix the problems, or redirect somewhere else Site Managers: act as supporters for a given RC, and exchange tickets with Operatives for operational issues Operatives: people of ROC/CIC Central Management Team, Release & Deployment Team and Ticketing System Team itself, exchange tickets with Site Managers and Supporters EGEE/LCG Operation Workshop – May 24-26, 2005 - 12

ROC Support Units ~ 40 people + site managers EGEE/LCG Operation Workshop – May 24-26, 2005 - 13

Weekly shifts 4 people a day weekly rotating 8.30-19.30 working hours 11x5 coverage Mainly busy with Operations EGEE/LCG Operation Workshop – May 24-26, 2005 - 14

Usage Report Statistics for last 6 months of operations ~25 tickets a week on average EGEE/LCG Operation Workshop – May 24-26, 2005 - 15

Usage Report Grid services Operative teams Grid sites VO applications EGEE/LCG Operation Workshop – May 24-26, 2005 - 16

Interface to GGUS http://infnforge.cnaf.infn.it/eticketimp/ First Interface between Grid.it Helpdesk and GGUS ready since November 04 and in ‘production’ since March 05 Based on Web Services at GGUS side, several advantages: sample code available for PHP / Perl and other computing languages very fast: 600-1000 service requests/sec on the GGUS Servers easy to adapt Based on e-mail at Grid.it side (importing tool) XML exchange format http://infnforge.cnaf.infn.it/eticketimp/ EGEE/LCG Operation Workshop – May 24-26, 2005 - 17

Interface to GGUS EGEE/LCG Operation Workshop – May 24-26, 2005 - 18

GGUSROC Basic Workflow GGUS System ROC Helpdesk XML Mail GGUS/SOD Web Portal SUPP Unit CMT Ticket assignment CIC-on-duty SUPP Unit X CIC Interface Ticket solved notification SUPP Unit Y Web services EGEE/LCG Operation Workshop – May 24-26, 2005 - 19

ROC Integration All ROCs were asked to create/enable their Support Structure to be integrated with GGUS: providing a contact to their helpdesk system providing a well defined structure behind their helpdesk system providing a list of experts committed to VO support and 3th level support filling the corresponding GGUS Responsible Units Some ROCs set up an helpdesk system interfaced to GGUS following the Grid.it example using OneOrZero SE: ready in production since April 25th SW: ready in production since May RU: work started in April, plan to be in production by end of July EGEE/LCG Operation Workshop – May 24-26, 2005 - 20

ROC Integration: SE EGEE/LCG Operation Workshop – May 24-26, 2005 - 21

ROC Integration: SE EGEE/LCG Operation Workshop – May 24-26, 2005 - 22

ROC Integration: SW Links to : Home FAQ Ticket Documents Repositories Training EGEE/LCG Operation Workshop – May 24-26, 2005 - 23

ROC Integration: SW username/ password needed EGEE/LCG Operation Workshop – May 24-26, 2005 - 24

ROC Integration: Russia EGEE/LCG Operation Workshop – May 24-26, 2005 - 25

ROC Integration: Russia EGEE/LCG Operation Workshop – May 24-26, 2005 - 26

ROC Integration Some ROCs had different helpdesks inside their federation: CE & NE: helpdesk based on RT open to local users since April, plan to be interfaced to GGUS by end of May, support structure and responsibilities defined within their ROC, tickets expected to be answered in a reasonable time FR: home developed helpdesk, interface to GGUS by end of May D-CH: helpdesk based on Remedy, interface to GGUS ready by June 8th UK-I: helpdesk based on Footprint, plan to be interfaced to GGUS by end of July All ROCs will have their Support System ready and interfaced to GGUS by end of July EGEE/LCG Operation Workshop – May 24-26, 2005 - 27

ROC Integration: some numbers Even if most ROC helpdesks not yet interfaced to GGUS, ROC supports units are reached with mailing lists: ROC # tickets # open oldest CE 29 2 1 day France 3 1 month GER-CH 33 Italy 54 5 days NE 10 5 Russia 13 2 months SE 36 4 SW 31 12 UK-I 58 15 TOTAL 309 60 Statistic available since half March More than 90% coming from CIC-on-duty CIC-on-duty rate: ~ # 50/week 1st Level rates: GGUS ~ # 20/week SOD ~ # 4/week EGEE/LCG Operation Workshop – May 24-26, 2005 - 28

Open issues Distributed EGEE User/Operation Support Infrastructure is progressing, but: tickets must be solved within an acceptable timeframe, otherwise we’ll not attract users real responsive people has to be behind the Support Units urgent to define and control the Service Level Agreements (SLAs) escalate problems if they are not solved within the defined SLAs measuring and reporting regarding the SLAs document Processes and Workflows within the whole infrastructure enhance 1st Level Support / SOD: most of people at ROCs involved in deploying / troubleshooting the Grid can more easily solve tickets without addressing them to other RUs, shortening time … Integration effort useless if at the end we are not able to provide a reliable service EGEE/LCG Operation Workshop – May 24-26, 2005 - 29