Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n CookBook:

Slides:



Advertisements
Similar presentations
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Services and Operations in Polish NGI M. Radecki,
Advertisements

EGI-Engage Recent Experiences in Operational Security: Incident prevention and incident handling in the EGI and WLCG infrastructure.
Dr. Ognjen Prnjat European and Regional eInfrastructure management Greek Research and Technology Network Shared Computing Infrastructures:
EGI: A European Distributed Computing Infrastructure Steven Newhouse Interim EGI.eu Director.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Co-funded.
UK NGI Operations John Gordon 15 th May NGS continuation NGI Security Monitoring VOMS Helpdesk I am reacting to some issues highlighted by Jeremy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks David Kelsey RAL/STFC,
RI EGI-InSPIRE RI EGI Future activities Peter Solagna – EGI.eu.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
Your university or experiment logo here The European Landscape John Gordon GridPP24 RHUL 15 th April 2010.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
WLCG Laura Perini1 EGI Operation Scenarios Introduction to panel discussion.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
AEGIS Academic and Educational Grid Initiative of Serbia Antun Balaz (NGI_AEGIS Technical Manager) Dusan Vudragovic (NGI_AEGIS Deputy.
Dr. Isabel Campos Plasencia (IFCA-CSIC) Spanish NGI Coordinator ES-GRID The Spanish National Grid Initiative.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
Why a Commercial Provider should Join the Academic Cloud Federation David Blundell Managing Director 100 Percent IT Ltd Simple, Flexible, Reliable.
3rd Helix Nebula Workshop on Interoperability among e-Infrastructures and Commercial Clouds Carmela ASERO, EGI.eu 17 September 2013, Madrid
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n e-Infrastructures.
WP2: Consolidation of existing state of the art Dr. Ognjen Prnjat, GRNET.
EGI Process Assessment and Improvement Plan – EGI core services – Tiziana Ferrari FedSM project 1EGI Process Assessment and Improvement Plan (Core Services)
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
EGI-Engage EGI-Engage WP3 e-Infrastructure Commons Diego Scardaci EGI.eu/INFN 6/18/2016 EGI-Engage – First.
Co-ordination & Harmonisation of Advanced e-Infrastructures Research Infrastructures – Grant Agreement n The CHAIN Project Federico Ruggieri, INFN.
Co-ordination & Harmonisation of Advanced e-Infrastructures Research Infrastructures – Grant Agreement n CHAIN sustainability guidelines Dr. Ognjen.
Co-ordination & Harmonisation of Advanced e-Infrastructures Research Infrastructures – Grant Agreement n CHAIN sustainability guidelines Dr. Ognjen.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Overview for ENVRI Gergely Sipos, Malgorzata Krakowian EGI.eu
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Role and Challenges of the Resource Centre in the EGI Ecosystem Tiziana Ferrari,
Piotr Bała, Marcin Radecki, Krzysztof Benedyczak
GGUS New features and roadmap
Bob Jones EGEE Technical Director
EMI and GISELA Collaboration
Regional Operations Centres Core infrastructure Centres
Steve Brewer Stichting European Grid Initiaitive
Integrated Management System and Certification
Ian Bird GDB Meeting CERN 9 September 2003
EMI Interoperability Activities
Service Level Agreement/Description between CE ROC and Sites
Report on SLA progress Ioannis Liabotis <ilaboti at grnet.gr>
EGI Community Forum 2012 Munich, 29 March 2012
NA3: User Community Support Team
Maite Barroso, SA1 activity leader CERN 27th January 2009
Nordic ROC Organization
Javier Magnin Brazilian Center for Research in Physics & ROC-LA
Status report of the LToS platform
Connecting the European Grid Infrastructure to Research Communities
Solutions for federated services management EGI
Input on Sustainability
Leigh Grundhoefer Indiana University
Interaction with resource providers: selection, SLA, support
Klopotek is transitioning to a Global Organization
Operations Management Board April 30
EGI operations - news T. Ferrari/EGI.eu 12/9/2018.
The EGI.eu Organisation
CHAIN KoM – Rome, 13 December 2010
Introdicution to EGI.eu
User Support in EGI Reactive and proactive services
Presentation transcript:

Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n CookBook: Guidelines on how to kick-start and ROC/NGI Kostas Koumantaros, GRNET

Chain-Reds Project. Objectives  Support the stability of existing and emerging Regional Operation Centres (ROCs) so as to ensure interoperability of DCIs, with focus on Grids.  Maintain a set of guidelines for standard grid interfacing, customised for the type of region.  Investigate the emerging cloud solutions and propose the relevant interoperation approaches.  Analyse the existing HPC interoperation approaches and propose potential solutions.

What is an Operations Centre  “Operations Centre (OC): A centre offering operations services on behalf of the Resource Infrastructure Provider.”  “Resource Infrastructure Provider (RP): The legal organisation responsible for any matter that concerns the respective Resource Infrastructure. It provides, manages and operates (directly or indirectly) all the operational services required to an agreed level of quality as required by the Resource Centres and their user community. It holds the responsibility of integrating these operational services into EGI in order to enable uniform resource access and sharing for the benefit of their Users. The Resource infrastructure Provider liaises locally with the Resource Centre Operations Managers, and represents the Resource Centres at an international level. Examples of a Resource infrastructure Provider are the European Intergovernmental Research Organisations(EIRO) and the National Grid Initiatives (NGIs)”.

Operations Architecture Copyright T. Ferrari EGI.EU Chief Operations officer 4

EGI Infrastructure providers  Integrated infrastructure providers  sharing policies, procedures, tools, QoS agreements and part of the same operations structure  Members of the EGI collaboration (EGI Council/EGI-InSPIRE)  External providers  Latin America, AfricaROC, ChinaROC  Peer providers  own operations tools and procedures, compatible policies, loose operations collaboration with EGI  CNCGRID (Under the wing of ChinaROC), Garuda GRID Copyright T. Ferrari EGI.EU Chief Operations officer 5

Steps to become an Integrated Resource Infrastructure Provider 1. Sign an MoU & SLA with EGI.eu 2. Set-up your Operations Center that provides  Accounting/Monitoring Systems  a Helpdesk System Integrated with GGUS  Core Services as needed 3. Register your Sites to GOCDB 4. Addhere to EGIs Best Practices and Policies 1. Respond to tickets 2. Maintain you site availability and reliability high 3. Always run the recommended versions of middleware and OS.

Sign an MoU & SLA with EGI.eu  MoU Memorandum of understanding  defines what each party offers to this collaboration  Defines what are the obligations of each participant  2 nd level Signed by each member of a federation (e.g institutes that offer sites) and and one Legal entity/representative of the federation  1 st level signed between EGI.EU and the Legal Representative for the federation  SLA/OLA: Service/Operation Level Agreements  Defines the minimum level of Availability and Reliability of each service/site/roc  1st level of SLAs are signed between  2 nd level of SLAs are signed between the RP and each Site

Organise your Operations Center  adhere to the Grid Security and Operational Policies and Procedures  Setup a Heldesk service integrated with a dedicated GGUS Support Unit  Organise teams for 1 st and 2 nd level of support  Setup Accounting and Monitoring services compatible with the EGI services.(e.g SAM/APEL)

Register your Sites to GOCDB  GOCDB is the central contact service of EGI.EU and is used to:  Collect Resource providers/Operations center management contacts  Collect Site contact points  Register Services offered by each site (visible or not to EGI)  Declare downtimes

Site Lifecycle  Candidate  1 st step when a site is being set-up  Uncertified  When ready the site is switched to uncertified and starts to be monitored if stable enough it is declared as certified  Certified  This status signifies that a site is part of the production infrastructure  Suspended  Scheduled Downtime  Site is still being monitored but no alarms are raised if something fails. This is used for updates/upgrades and other technical actions that affect a sites availability.  Visible to EGI (On/OFF) Switch  ON to be part of the EGI infrastructure  OFF to be part of a regional infrastructure only.

ROC Africa&Arabia Contacts ✔ Bruce Becker Status ✖ 7 Sites (to be updated with all regional sites). The ROC is operational. Not registered in GOCDB. Helpdesk ✔ This is an XGUS instance Accounting ✖ Accounting records are not published. Monitoring ✖ Monitoring information is not published. The ROC runs SAM-NAGIOS but there is no data in it. Website ✔ Action Points AP-AAROC-1: Sign MoU with EGI.eu as an Integrated Resource Infrastructure Provider AP-AAROC-2: Provide IGTF Accredited Certificate Services that will cover the whole AA ROC AP-AAROC-3: Create a new Operations Center in the EGi.eu GOCDB and register Resource Centers AP-AAROC-4: Setup and Operate a Grid Monitoring Service AP-AAROC-5: Publish accounting records to the EGI.eu Accounting System from all certified Resource Centers AP-AAROC-6: Adopt and employ Operational Policies and Procedures AP-AAROC-7: Set up dedicated Support Unit in GGUS

SEAsia ROC Contacts ✔ Eric Yen Status ✔ 65 Sites. The ROC is operational Helpdesk ✔ Accounting ✔ Accounting records are published. Monitoring ✔ Monitoring information is published. Website ✔ Action Points None defined.

LA ROC Contacts ✔ Andres Holguin & Renato Santana & Luciano Diaz Status ✔ 4 Sites. The ROC is operational (a lot more currently uncertified) Helpdesk ✔ Accounting ✔ Accounting records are published Monitoring ✔ Monitoring information is published Website ✖ (?) Site appears to be down Action Points AP-LAROC-1: Bring up / create the roc-la.org web site AP-LAROC-2: Write and publish an AUP AP-LAROC-3: Certify the rest of their sites.

China ROC Contacts ✔ Shi, Jingyan & Yan, Xiaofei Status ✖ 1 Site. The ROC is not operational and the 1 site belongs to ROC Canada Helpdesk ✔ The helpdesk is an XGUS instance. A person has the task to monitor the helpdesk for incoming tickets and provide responses. The help desk is not used with the ROC Accounting ✔ Monitoring ✔ Nagios & Ganglia are used internally. Beijing-LCG2 is being monitored by ROC_Canada which published the results Website ✖ The site publishes information about ROC Africa Action Points AP-CHINAROC-1: Sign MoU with EGI.eu as an Integrated Resource Infrastructure Provider AP-CHINAROC-2: Update the information on the CHINA ROC website AP-CHINAROC-3: Setup and Operate a Grid Monitoring Service AP-CHINAROC-4: Create a new Operations Center in the EGi.eu GOCDB and transfer Resource Centers from ROC Canada to the newly established Operations Center AP-CHINAROC-5: Adopt and employ Operational Policies and Procedures AP-CHINAROC-6: Set up dedicated Support Unit in GGUS

CNGRID Contacts ✔ 1.Prof. Chi Xuebin Status ✔ 14 Sites running GOS. ROC is operational. Sites are registered internally within GOS Helpdesk ✖ No helpdesk Is operational (Needs clarification) Accounting ✖ Does not publish accounting information. CNGridEye is used for accounting. ( Monitoring ✖ Does not publish monitoring information. CNGridEye is use for monitoring ( Website ✔ Action Points AP-CNGRID-1:Investigate the compatibility with the EGI policies AP-CNGRID-2:Register with EGI.eu as a Peer Resource Infrastructure Provider AP-CNGRID-3:Setup a specific Support Unit in the CHINA ROC Heldpesk AP-CNGRID-4:Set Up a dedicated Science Gateway through which European users can run jobs on the CNGrid infrastructure and CNGrid Users can run jobs on the EGI infrastructure AP-CNGRID-5Investigate integration with the EGI Accounting System AP-CNGRID-6Investigate the publishing of Service Information using Glue 1.3 or Glue 2.0 AP-CNGRID-7Investigate integration with the EGI Monitoring Framework

Garuda ROC Contacts ✔ 1.Ms. M. Divya Status ✔ 8 Sites running Globus Toolkit 4.0.7/ ROC is operational. Helpdesk ✖ The helpdesk is an RT instance and it is not integrated with GGUS Accounting ✖ Does not publish accounting information. Using and in-house developed tool. Monitoring ✖ Does not publish monitoring information. Nagios is used internally for monitoring. Website ✔ Action Points AP-GARUDA-1: Investigate the compatibility with the EGI policies AP-GARUDA-2: Register with EGI.eu as a Peer Resource Infrastructure Provider AP-GARUDA-3: Create dedicated Support Unit in GGUS and Integrate GARUDA Request Tracker with GGUS AP-GARUDA-4: Set Up a dedicated Science Gateway through which European users can run jobs on the GARUDA infrastructure and GARUDA Users can run jobs on the EGI infrastructure AP-GARUDA-5 Investigate integration with the EGI Accounting System AP-GARUDA-6 Investigate the publishing of Service Information using Glue 1.3 or Glue 2.0 AP-GARUDA-7 Investigate integration with the EGI Monitoring Framework

References  EGI Resource Providers: providers/index.htmlhttps:// providers/index.html  EGI Procedures:  ISGC Presentation on EGI Procedures and Best Practices: 75&contribId=270&confId= &contribId=270&confId=370  Chain-Reds website:  The cookbook attached to the agenda Id=20&resId=0&materialId=paper&confId=1222