EGI Process Assessment and Improvement Plan – EGI core services – Tiziana Ferrari FedSM project 1EGI Process Assessment and Improvement Plan (Core Services)

Slides:



Advertisements
Similar presentations
The importance of the service catalogue to the service desk
Advertisements

Page 2 Agenda Page 3 History –Blue Print, 2000 –GIS Process 1.2, 2001 (training only) –GIS Process 2.0, (ITIL based - not implemented) –Supply/Demand.
Chapter 7: Key Process Areas for Level 2: Repeatable - Arvind Kabir Yateesh.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Services and Operations in Polish NGI M. Radecki,
By Saurabh Sardesai October 2014.
The Key Process Areas for Level 2: Repeatable Ralph Covington David Wang.
UK NGI Operations John Gordon 10 th January 2012.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Unified Middleware Distribution (UMD): SW provisioning to EGI Mario David.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Future support of EGI services Tiziana Ferrari/EGI.eu Future support of EGI.
Service Management Processes
Roles and Responsibilities
ITIL Process Management An Overview of Service Management Processes Thanks to Jerree Catlin, Sue Silkey & Thelma Simons University of Kansas.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EG recent developments T. Ferrari/EGI.eu ADC Weekly Meeting 15/05/
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
Service Transition & Planning Service Validation & Testing
The EGI Blueprint: Grid Operations and Security Migration to the next grid operations era Tiziana Ferrari (Istituto Nazionale di Fisica Nucleare)
Georgia Institute of Technology CS 4320 Fall 2003.
UK NGI Operations John Gordon 15 th May Helpdesk Ticket Workflow NGI Core Services.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
RI EGI-InSPIRE RI EGI Future activities Peter Solagna – EGI.eu.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
Your university or experiment logo here The European Landscape John Gordon GridPP24 RHUL 15 th April 2010.
EGEE-III INFSO-RI Enabling Grids for E-sciencE COD June 2009 COD-20 Parallel sessions Hélène Cordier COD-20, CNRS-IN2P3,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI (Present and) Future of the EGI Services for WLCG Peter Solagna – EGI.eu.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Operational Architecture of PL-Grid project M.Radecki,
State of Georgia Release Management Training
EGI-InSPIRE RI EGI EGI-InSPIRE RI Service Operations Security Policy the new generalised site operations security policy.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Ops Portal New Requirements.
The Service Monitoring and Control Toolkit 1 Protect your business with an effective alert management system and high service availability.
Introduction to ITIL and ITIS. CONFIDENTIAL Agenda ITIL Introduction  What is ITIL?  ITIL History  ITIL Phases  ITIL Certification Introduction to.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Grid Oversight in Service Level Agreement environment Małgorzata Krakowian,
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Requirements Status EGI.eu UCB
Why a Commercial Provider should Join the Academic Cloud Federation David Blundell Managing Director 100 Percent IT Ltd Simple, Flexible, Reliable.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
ICS Area Managers Training 2010 ITIL V3 Overview April 1, 2010.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Resource allocation Małgorzata Krakowian 1.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regionalisation summary Prague 1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Federated Service Management Sy Holsinger Senior Strategy and Policy Officer,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
Service Design.
Business models for federated infrastructures Owen Appleton Emergence Tech Limited e-FISCAL & FedSM projects.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI.eu Service Portfolio - EGI CF’13 - Apr 2013 EGI.eu Service Portfolio.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI COD activity in EGI-InSPIRE Marcin Radecki CYFRONET, Poland & COD Team 9/29/2016.
George Fergadis, Dimitris Zilaskos, AUTH/GRNET
Integrated Management System and Certification
Maite Barroso, SA1 activity leader CERN 27th January 2009
Solutions for federated services management EGI
Interaction with resource providers: selection, SLA, support
Operations Management Board April 30
EGI operations - news T. Ferrari/EGI.eu 12/9/2018.
Operations Management Board January 29
OLA/SLA framework update
FitSM Self-Study Workbook
Sergio Andreozzi, Sy Holsinger, Malgorzata Krakowian, Matthew Viljoen
Core Activities re-assessment
EOSC Service management system
Presentation transcript:

EGI Process Assessment and Improvement Plan – EGI core services – Tiziana Ferrari FedSM project 1EGI Process Assessment and Improvement Plan (Core Services)

EGI Core Services Technical tools that support EGI daily operations Customers: NGIs and end-users Providers: NGIs on behalf of EGI.eu – Accounting Portal and APEL central Repository – GOCDB – Incident Management Tool (EGI Helpdesk - GGUS) – Message Brokers – Metrics Portal – Service Availability Monitoring – Central SAM (operation tool monitoring) and Nagios instances (security monitoring, middleware versions, etc.) – Operations Portal – Security Monitoring Tools – Catch all Grid Services, operations support tools (DTEAM, OPS), site certification EGI Process Assessment and Improvement Plan (Core Services) 2

(I) Service Level Management of Core Services 3EGI Process Assessment and Improvement Plan (Core Services)

What is it about? Negotiation of the EGI.eu Operational Level Agreement – Definition of measurable targets per service – Documentation of EGI.eu OLA in Document DB, release notes – Periodic review of EGI.eu OLA – Periodic review of achieved service level targets – Improvement plan in case of underperformance No Service Level Agreements currently in place EGI.eu Operational Level Agreement currently including a subset of core services – – 4 EGI Process Assessment and Improvement Plan (Core Services)

Who is responsible? Service Level Manager – EGI Chief Operations Officer Customers – NGIs represented by the NGI Operations Managers – OMB – End-user communities represented by Virtual Research Communities – UCB, where applicable Grid Oversight team – Monthly performance analysis (to be established) 5 EGI Process Assessment and Improvement Plan (Core Services)

What are the requirements (according to FitSM)? Services to be delivered shall be agreed with customers. SLAs shall include agreed service targets. A service catalogue shall be maintained. Services and SLAs shall be reviewed at planned intervals. Service performance shall be monitored against service targets. For supporting services or service components provided by Federation members, OLAs shall be agreed. 6 EGI Process Assessment and Improvement Plan (Core Services)

What is the target capability level? Level 5Level 4Level 3Level 2Level 1Level 0 7 EGI Process Assessment and Improvement Plan (Core Services) Defined (standardized procedures to deliver defined outputs): Clearly defined and documented procedures, roles and responsibilities, but process effectiveness and efficiency not measured and reported

What is the target capability level? There is a defined and documented procedure for negotiating SLAs, which in particular specifies the required scope of the negotiated SLA, the output format in which the SLA must be stored, and how the SLA and service targets are defined. There is a defined and documented procedure for maintaining a service catalogue clearly specifying differentiated service offerings. There is a defined and documented procedure for reviewing services and SLAs at planned intervals. There is a defined and documented procedure for service performance monitoring and reporting the results to relevant parties. There is a defined and documented procedure for establishing OLAs, well-aligned to specific services offered to customers. 8 EGI Process Assessment and Improvement Plan (Core Services)

What is the baseline (current) capability level? Level 5Level 4Level 3Level 2Level 1Level 0 9 EGI Process Assessment and Improvement Plan (Core Services) Repeatable (intuitive activities to deliver results): Repeatable results/outputs most of the time, solid, "intuitive" understanding of activities to be performed, but poor and inconsistent documentation and recording

What is the baseline (current) capability level? There is a defined and documented procedure for negotiating SLAs, which in particular specifies the required scope of the negotiated SLA, the output format in which the SLA must be stored, and how the SLA and service targets are defined. There is a defined and documented procedure for maintaining a service catalogue clearly specifying differentiated service offerings. There is a defined and documented procedure for reviewing services and SLAs at planned intervals  only OLA There is a defined and documented procedure for service performance monitoring and reporting the results to relevant parties  only OLA There is a defined and documented procedure for establishing OLAs, well-aligned to specific services offered to customers. 10 EGI Process Assessment and Improvement Plan (Core Services)

Major gaps to be addressed Lack of a service catalogue  PQ12 PQ13end of Aug 2013 – Assign responsibility of performance review (need reports first) PQ14, end of Oct 2013 – OLA vs SLA  do we need SLAs? undefined relation between SLAs and OLAs, include all core services PQ15, end of Jan 2013 – Many quality procedures tied to EGI-Inspire quality assurance  define new procedures, start periodic review process + review targets 11 EGI Process Assessment and Improvement Plan (Core Services)

(II) Service Reporting 12EGI Process Assessment and Improvement Plan (Core Services)

Service What is it about? Establish infrastructure for gathering of measurements – SAM. SAM for core operations tools in place as of Oct 2012 (not covering all core services right now) Establish infrastructure for report generation – Raw data currently available per tool from MyEGI (central instance) – Operations portal developments due in May 2013 (PQ13) 13 EGI Process Assessment and Improvement Plan (Core Services)

Who is responsible? Service Reporting Manager – To be established  AUTH when reports will be available from the operations portal (PQ13, June 2013) Customers – NGIs represented by the NGI Operations Managers – OMB – End-user communities represented by Virtual Research Communities – UCB, for end-user facing tools Grid Oversight team – Monthly performance analysis, to be established (PQ13, June 2013) 14 EGI Process Assessment and Improvement Plan (Core Services)

SR: Requirements  Service reports shall be specified and agreed with interested parties.  The description of each service report shall include its identity, purpose, audience, frequency and content.  Service reports shall be produced. Service reporting shall include performance against service targets, relevant information about significant events, workload characteristics and detected nonconformities. EGI Process Assessment and Improvement Plan (Core Services) 15

Major gaps to be addressed [Average capability 1.33] PQ13, June 2013 – Customer view of report (need to combine some of the services into one report)? OLA report? SLA report? – Establish process for reporting and performance analysis on a monthly basis – Possibly automate the generation of tickets as already done with sites 16 EGI Process Assessment and Improvement Plan (Core Services)

(III) Incident and Service Request Management 17EGI Process Assessment and Improvement Plan (Core Services)

Incident/service Req. What is it about? Process for registering incidents affecting the tools Manage customer requests (e.g. access to tools, configuration changes, requirements) Manage major incidents which bypass some normal procedures Establish escalation procedures in case of incidents/service requests 18 EGI Process Assessment and Improvement Plan (Core Services)

Who is responsible? Incident and Service Request Manager – Incident Manager: R. Trompert (SA1.7) – Service Request: Chief EGI Operations Officer Incident and Service Request Case Manager – Specific team responsible of supporting and operating the service 19 EGI Process Assessment and Improvement Plan (Core Services)

ISRM Requirements All incidents shall be recorded according to a defined procedure Personnel involved in the incident and service request management process shall have access to relevant information including configuration and release information. Customers shall be kept informed of the progress of their reported incidents and service requests (GGUS and RT)  All incidents shall be classified according to a defined procedure.  All incidents shall be prioritized according to a defined procedure, taking into account impact and urgency of the incidents  Closure of incidents shall follow a defined procedure.  Escalation of incidents shall follow a defined procedure  no operations escalation, no alarms in the operations dashboard  There shall be a definition of a major incident. Major incidents shall be classified and managed according to a documented procedure. EGI Process Assessment and Improvement Plan (Core Services) 20

Major gaps to be addressed [Average capability 2.50 ??] PQ13 – Define escalation (turn tests into operations) – Make sure that all core services are supported in GGUS PQ14 – Lack of procedures for incident classification, prioritization and closure – No definition of major incident – No procedure to handle major incidents  GGUS notification mechanism being discussed for critical services (?) Unclear availability of configuration information 21 EGI Process Assessment and Improvement Plan (Core Services)