Ian Bird GDB Meeting CERN 9 September 2003

Slides:



Advertisements
Similar presentations
Last update 01/06/ :23 LCG 1Maria Dimou- cern-it-gd Maria Dimou IT/GD Site Registration policy & procedures
Advertisements

LCSC October The EGEE project: building a grid infrastructure for Europe Bob Jones EGEE Technical Director 4 th Annual Workshop on Linux.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
EGEE is a project funded by the European Union under contract IST SA1 and NA3 Alistair Mills Grid Deployment Group +41.
INFSO-RI Enabling Grids for E-sciencE Incident Response Policies and Procedures Carlos Fuentes
EGEE is a project funded by the European Union under contract IST The way ahead Alistair Mills Grid Deployment Group
EGEE-II INFSO-RI Enabling Grids for E-sciencE AP ROC Min-Hong Tsai ASGC SA1 Transition Meeting May 8 th, 2008
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
EGEE ARM-2 – 5 Oct LCG Security Coordination Ian Neilson LCG Security Officer Grid Deployment Group CERN.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
LCG and HEPiX Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002.
GGF12 – 20 Sept LCG Incident Response Ian Neilson LCG Security Officer Grid Deployment Group CERN.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
INFSO-RI Enabling Grids for E-sciencE Plan until the end of the project and beyond, sustainability plans Dieter Kranzlmüller Deputy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks David Kelsey RAL/STFC,
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
SA1/SA2 meeting 28 November The status of EGEE project and next steps Bob Jones EGEE Technical Director EGEE is proposed as.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America EELA Infrastructure (WP2) Roberto Barbera.
15-Dec-04D.P.Kelsey, LCG-GDB-Security1 LCG/GDB Security Update (Report from the Joint Security Policy Group) CERN 15 December 2004 David Kelsey CCLRC/RAL,
EGEE is a project funded by the European Union under contract IST EGEE Services Ian Bird SA1 Manager Cork Meeting, April
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
EGEE MiddlewareLCG Internal review18 November EGEE Middleware Activities Overview Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as.
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE is a project funded by the European Union under contract IST Network Resources Provision Jean-Paul Gautier SA2 manager Cork meeting,
EGEE is a project funded by the European Union under contract IST Support in EGEE Ron Trompert SARA NEROC Meeting, 28 October
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
INFSO-RI Enabling Grids for E-sciencE An overview of EGEE operations & support procedures Jules Wolfrat SARA.
Guy Wormser IN2P3/CNRS, EGEE Applications Manager September 2003 EGEE is proposed as a project funded by the European Union under contract IST
CERN LCG Deployment Overview Ian Bird CERN IT/GD LCG Internal Review November 2003.
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
SA2 : Network Resource Provision All Activity Meeting – 17 March SA2 Execution Plan for the first year Jean-Paul Gautier SA2 Manager CNRS/UREC.
EGEE ARM-2 – 5 Oct LCG/EGEE Security Coordination Ian Neilson Grid Deployment Group CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGEE is a project funded by the European Union under contract IST New VO Integration Fabio Hernandez ROC Managers Workshop,
EGEE is a project funded by the European Union under contract IST Service Activity 1 M.Cristina Vistoli ROC Coordinator All activity meeting,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security aspects (based on Romain Wartel’s.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid is a Bazaar of Resource Providers and.
Grid Deployment Technical Working Groups: Middleware selection AAA,security Resource scheduling Operations User Support GDB Grid Deployment Resource planning,
INFSO-RI Enabling Grids for E-sciencE EGEE general project update Fotis Karayannis EGEE South East Europe Project Management Board.
2007/07/04 Organisation and tasks of ROC France Pierre Girard Visit of Japanese grid site managers.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
JRA1 Middleware re-engineering
Bob Jones EGEE Technical Director
JRA2: Quality Assurance
Status of Task Forces Ian Bird GDB 8 May 2003.
Regional Operations Centres Core infrastructure Centres
EGI – Round table discussion
EGEE is a project funded by the European Union
David Kelsey CCLRC/RAL, UK
JRA3 Introduction Åke Edlund EGEE Security Head
SA1 Execution Plan Status and Issues
Integrated Management System and Certification
LCG Security Status and Issues
Long-term Grid Sustainability
Bob Jones EGEE Technical Director
Nordic ROC Organization
LCG Operations Centres
LCG Operations Workshop, e-IRG Workshop
Description of Revision
Connecting the European Grid Infrastructure to Research Communities
Leigh Grundhoefer Indiana University
Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002
Building Statistical Capacity UNSD perspective
Portfolio, Programme and Project
{Project Name} Organizational Chart, Roles and Responsibilities
Presentation transcript:

Ian Bird GDB Meeting CERN 9 September 2003 EGEE Operations Ian Bird GDB Meeting CERN 9 September 2003

EGEE Operations – key objectives Core Infrastructure services: operate essential grid services Grid monitoring and control: proactively monitor the operational state and performance, initiate corrective action Middleware deployment and resource induction: to validate and deploy middleware releases Set up operational procedures for new resources Resource provider and user support: coordinate the resolution of problems with Grid operations from both Resource Centres and users; filter and aggregate problems, providing or obtain solutions Grid management: Coordinate Regional Operations Centres (ROC) and Core Infrastructure Centres (CIC), manage the relationships with resource providers, via negotiation of service-level agreements. International collaboration: drive collaboration with peer organisations in the U.S. and in Asia-Pacific ensure the interoperability of grid infrastructures and services for cross-domain VO’s Participate in liaison and standards bodies in wider grid community

Operations Structure Implement the objectives to provide: Access to resources Operation of EGEE as a reliable service Deploy new middleware and resources Support resource providers and users With a clear layered structure: Operations Management Centre (1) Core Infrastructure Centres (4 + 1 in Russia) Regional Operations Centres (10) Resource Centres

Operations infrastructure

Operations Management Centre - OMC Manager + deputy Coordinator for CICs (at CERN) Coordinator for ROCs (Italy) Team to oversee operations – problems resolved, performance targets, etc. OAG (like GDB) to advise on policy issues, etc. Responsibilities include: resource management delivery of the operational service and for its improvement and development; Enable cooperation and access agreements with user communities, virtual organisations and existing national and regional Grid infrastructures; Approve the service level agreements negotiated between the Resource Centres and the ROCs. Approve connection of new Resource Centres once they have correctly installed the necessary middleware and operational tools; Promote the development of cross-trust agreements between the various existing Certification Authorities (CAs) operating within the EGEE Grid community and encourage the establishment of new CAs where necessary; Liaise with user communities and virtual organisations to monitor their developing requirements; Interface to international grid efforts: Standards, interoperability, collaborative projects

Core Infrastructure Centres - CIC Originally 4 (5 with Russia) Operate core grid services Function as a single distributed entity Each may have specialist expertise Day-to-day operation – implement operational policies defined by OMC Monitor state – initiate corrective actions Eventual 24x7 operation of grid infrastructure Does not imply that RCs must be 24x7 – specify in SLAs with ROCs Provide resource and usage accounting Provide security incident response coordination Ensure recovery procedures Operations management and performance tuning tools – build or commission

Regional Operations Centres – ROC Provide front-line support to users and resource centres Support new resource centres joining EGEE in the regions Support deployment to the resource centres Responsibilities include: Middleware validation: User and administrator Support: Operate call centres Refer operational problems to the layer II Core Infrastructure Centres; Refer middleware problems to the middleware activity; Distributed problem tracking db Provide Grid Operations training for staff at Resource Centres; Middleware and service deployment Develop deployment procedures and documentation Distribute approved middleware releases to Resource Centres Assist Resource Centres to deploy Grid middleware and to develop the technical and operational procedures to become part of the Grid.; Distribute operational monitoring and authorisation and accounting tools to Resource Centres; General: Collaborate in producing release notes for the services and middleware Collaborate in producing the cook-books to be used by new participants in EGEE (resource centres, new ROCs, new VOs) as part of a strategy of building a long-lasting infrastructure Work with CICs and Operations Management to make recommendations for improvement of the Grid infrastructure.

User Support Initial filtering by VO support experts Essential – VO specific knowledge, diverse applications and grid usage Report problems to ROC May escalate to CIC CIC coordinates reporting to external sources Middleware developers, other projects, other grid operators, network operators OMC together with CIC, ROC, VOs Develop procedures and policies including response targets, etc Support coordinator (oversees problem resolution) from CICs

Implementation plans Initial service will be based on the LCG-1 infrastructure This will be the production service, most resources allocated here In parallel must deploy as soon as possible a development service Based on EGEE m/w – even a basic framework This is where functionality is validated before going to production, apps do β-testing, etc. Must be treated as an operational service Needs enough resources – runs at sub-set of production sites, additional resources for scaling tests on request Also would need a testbed system Parallel to production system to debug and resolve problems, Requires sufficient support and resources

Roles and staffing Federation Services provided FTE Requested Unfunded Financing CERN OMC, CIC, Resource Centre 10 2000 UK+Ireland CIC, 2 ROCs, 5 Resource Centres 10.5 2100 France CIC, ROC, 3 Resource Centres 9.55 11 1850 Italy CIC, ROC, ROC Coordinator, 4 Resource Centres Northern Europe 2 ROCs, 7 Resource Centres 6 7 1200 Germany + Switzerland ROC, Support centres, 4 Resource Centres 4.5 7.5 South East Europe distributed ROC, 5 Resource Centres Central Europe South West Europe 8.85 Russia CIC, distributed ROC, 8 Resource Centres 7.15 22.75 560 Totals 79.05 100.1 14610 k€

Management structure OAG Includes: VOs, RC’s

LCG and EGEE Operations The core infrastructure of the LCG and EGEE grids will be operated as a single service, will grow out of LCG service LCG includes US and Asia, EGEE includes other sciences Substantial part of infrastructure common to both The ROCs provide local support for Resource Centres and applications. Similar to LCG primary sites Some ROCs and LCG primary sites will be merged LCG Deployment Manager will be the EGEE Operations Manager Will be member of PEBs of both ROCs will be coordinated outside of CERN (which has no ROC)

Milestones MSA1.1 M6 Initial pilot Grid infrastructure operational. MSA1.2 M12 First review MSA1.3 M14 Full production Grid infrastructure (20 Resource Centres) operational. MSA1.4 M24 Second review and expanded production Grid infrastructure (50 Resource Centres) operational.

Deliverables DSA1.1 M3 Detailed execution plan for first 14 months of infrastructure operation. DSA1.2 M6 Release notes corresponding to MSA1.1 DSA1.3 M9 Accounting and reporting web site publicly available DSA1.4 M12 Assessment of initial infrastructure operation and plan for next 12 months. DSA1.5 M14 First release of EGEE Infrastructure Planning Guide (“cook-book”), and release notes corresponding to MSA1.3 DSA1.6 M24 Assessment of production infrastructure operation and outline of how sustained operation of EGEE might be addressed. Updated EGEE Infrastructure Planning Guide and release notes corresponding to MSA1.4 DSA1.1 – execution plan – this must be started now, based on use-cases, scenarios, etc. The CIC, ROC managers must contribute to this.

Summary EGEE Operations 14.6 M€ for ~80 FTE funded and ~100 unfunded Many issues to understand – need to start work on a detailed implementation plan now Initial service will be based on LCG-1 infrastructure and experience