Maite Barroso, SA1 activity leader CERN 27th January 2009

Slides:



Advertisements
Similar presentations
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PoW for the second year Transition to EGI.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
02/07/09 1 WLCG NAGIOS Kashif Mohammad Deputy Technical Co-ordinator (South Grid) University of Oxford.
The EGI Blueprint: Grid Operations and Security Migration to the next grid operations era Tiziana Ferrari (Istituto Nazionale di Fisica Nucleare)
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
WLCG Laura Perini1 EGI Operation Scenarios Introduction to panel discussion.
Julia Andreeva on behalf of the MND section MND review.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Grid Oversight in Service Level Agreement environment Małgorzata Krakowian,
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
INFSO-RI Enabling Grids for E-sciencE Operations Parallel Session Summary Markus Schulz CERN IT/GD Joint OSG and EGEE Operations.
Components Selection Validation Integration Deployment What it could mean inside EGI
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
NGI_TR Emrah Akkoyun TR-Grid Operational Center EGI-InSPIRE – SA1 Kickoff Meeting1.
EGI Process Assessment and Improvement Plan – EGI core services – Tiziana Ferrari FedSM project 1EGI Process Assessment and Improvement Plan (Core Services)
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Update on Service Availability Monitoring (SAM) Marian Babik, David Collados,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IT ROC: Vision for EGEE III Tiziana Ferrari.
TSA1.4 Infrastructure for Grid Management Tiziana Ferrari, EGI.eu EGI-InSPIRE – SA1 Kickoff Meeting1.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Operations Portal OTAG September, 21th 2011 Cyril L’Orphelin – CCIN2P3/CNRS.
Polish NGI: PL-Grid Marcin Radecki EGI-InSPIRE – SA1 Kickoff Meeting 1.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI.eu Service Portfolio - EGI CF’13 - Apr 2013 EGI.eu Service Portfolio.
May pre GDB WLCG services post EGEE Josva Kleist Michael Grønager Software Coord NDGF Director CERN, May 12 th 2009.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operational Tools M2 Update James Casey.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Transition to EGI PSC-06 Istanbul Ioannis Liabotis Greece GRNET
Bob Jones EGEE Technical Director
JRA2: Quality Assurance
Regional Operations Centres Core infrastructure Centres
EGEE is a project funded by the European Union
SA1 Status Report EGEE Grid Operations & Management
Ian Bird GDB Meeting CERN 9 September 2003
POW MND section.
Introduction to OAT presentations
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Security Monitoring in a Nagios world
Report on SLA progress Ioannis Liabotis <ilaboti at grnet.gr>
EGI Community Forum 2012 Munich, 29 March 2012
Networking support (SA2) tasks for EGI
Advancements in Availability and Reliability computation Introduction and current status of the Comp Reports mini project C. Kanellopoulos GRNET.
Cyril L’Orphelin (CC-IN2P3) COD-19, Bologna, March 30th 2009
The CCIN2P3 and its role in EGEE/LCG
Nordic ROC Organization
Monitoring in EGEE Automatisierung & Regionalisierung im Hinblick auf EGI Torsten Antoni (KIT), James Casey (CERN), Sabine Reißer (KIT)
Solutions for federated services management EGI
Leigh Grundhoefer Indiana University
EGI operations - news T. Ferrari/EGI.eu 12/9/2018.
Core Activities re-assessment
Presentation transcript:

Maite Barroso, SA1 activity leader CERN 27th January 2009 SA1 tasks in EGI Maite Barroso, SA1 activity leader CERN 27th January 2009

Introduction 5 tasks defined in the Blueprint, each with several subtasks, and associated manpower at the EGI.org level: Operation of tools (and services) User support Security Other international tasks Development and tool maintenance Three types of tasks considered in the analysis: EGI.org task NGI.org task (list not complete) Central task: task undertaken by an NGI on behalf of EGI.org To change: View -> Header and Footer

Operation of tools Sub-task Blueprint? EGI.org Central NGI Operations portal Yes X GOCDB Monitoring central repository Grid accounting repository and portal Messaging system No Test and availability description DB Regional/NGI grid monitoring Regional operations dashboard To change: View -> Header and Footer

Operation of tools FACTS: CONSEQUENCES: Most are home-developed by individual institutes without enforcement of common coding practises, repositories, packaging standards, etc Many only ever deployed at the institute developing them Many developed to conform to our custom operational model, no industry standard model or procedures (e.g. ITIL) CONSEQUENCES: Difficult to re-resign Costly to maintain (even to new people from the same institute) Basic documentation, testing, etc Difficult to split development/maintenance/service management between different partners Not easily deployable at EGI.org level if the current partners do not continue to run them To change: View -> Header and Footer

EGEE SA1 Architecture of the regional solution Use Nagios to probe sites from ROC/NGI Have a self-contained set of components inside the region/NGI for: Storing topology of regional grid Storing metrics results from probes Raising alarms Raising tickets Viewing metric history and details for debugging Central data stores and components for project-level systems Project level metric store Availability calculation and metrics DB GOCDB Use messaging to exchange information between site/region/project To change: View -> Header and Footer

Operation of tools: summary Two EGI.org services/tools missing in the blueprint: Messaging system Test and availability description DB Operations model at the end of EGEE III will move responsibility to the regions; this does not mean that ALL tools will have independent regional instances Resources estimated to run these tools are underestimated (~ 3 FTE more) For the tools to be run regionally (some) and centrally in a given location (others) we need packaging, testing, documentation… The alternative for the central ones is to keep them where they are (e.g. GOCDB, SAM, CIC portal, GGUS) For the regional ones this is a MUST To change: View -> Header and Footer

User support Sub-task Blueprint? EGI.org Central NGI SUMMARY: Helpdesk (central and regional) Yes X Triage, monitoring and escalation of user support tickets (TPM role) SUMMARY: Tasks are the same Description of the same model we target in SA1 GGUS will define a common interface and it is up to the NGIs to adapt their regional helpdesks to it (being prototyped) To change: View -> Header and Footer

Security Sub-task Blueprint? EGI.org Central NGI SUMMARY: Development and maintenance of security policies and procedures Yes X Coordination of all aspects of operational security, including responding to security incidents SUMMARY: looks fine, both resources and tasks The scope of the role and authority of the "EGI Security Officer" is currently unknown To change: View -> Header and Footer

International tasks Sub-task Blueprint? EGI.org Central NGI Coordination of mw deployment and support (roll out of releases) Yes X Operation of pilot/pre-production services (?) No Resource allocation and brokering support for VOs from NGIs Interoperations between NGIs and with other grids Definition of best practices, operations procedures, operations requirements Operation of catch-all services Escalation of regional/NGI operation issues Monitoring and enforcement of SLAs Operations coordination with international VOs Coordination of operations tool development and maintenance To change: View -> Header and Footer

International tasks: summary Four EGI.org tasks missing in the blueprint: PPS/pilot services, needed? Escalation of regional/NGI operation issues Monitoring and enforcement of SLAs Operations coordination with international VOs Number of FTEs depending on the structure/responsibilities of other EGI.org areas: mw little experience with rolling out mw not produced/certified/supported inside the project, and with diff mw stacks Work to do in the SLA area Automation of SLA monitoring, not existing today Little experience, only based on site service monitoring and site suspension by the grid operator on duty No candidates to work on this To change: View -> Header and Footer

Summary and conclusions Operations tasks listed in the blueprint are a subset of what we have today Some are missing (and the associated FTEs) More detail needed for the benefit of the migration Central, regional, EGI.org? Details, responsibilities SA1 priorities for next year: Keep the production infrastructure running Operations model at the end of EGEE III will move responsibility to the regions Implement and deploy the EGEE SA1 Architecture of the regional solution Make the operations tools easy to deploy and maintain To change: View -> Header and Footer