Collaboration Board topics
Ian Bird
CB Meeting; New York, 19th May 2012


Overview
- New Tier 1s: follow-up from last CB
- Helix Nebula (not strictly WLCG)
- WLCG and wider collaboration

New Tier 1s

Background
- In 2011 the first suggestions of potential new Tier 1 sites were made
- The procedure was initially discussed in the WLCG Collaboration Board in July 2011
- The process is now documented (WLCG-OB )
  - The OB approved this process on 9th March 2012

Process
- Prerequisite: any such proposal must be supported by the experiment(s)
- Balance between encouragement of new sites/resources and reaching the high standards of existing Tier 1 services
- Process:
  - Prepare with MB a detailed plan that shows how the site will demonstrate required functionality, performance, reliability; timeline and milestones
  - Present plan to OB: OB recommends acceptance (or not)
  - Site can sign MoU as an Associate Tier 1
  - MB monitors progress on ramp-up, reports to OB
  - When milestones achieved as agreed by MB, final report to OB to recommend full Tier 1 status
  - This should normally take ~1 year
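The sequence above can be read as a small linear state machine. The following is a minimal sketch under that reading, not an official WLCG tool: the stage names and the `advance` helper are hypothetical labels chosen for illustration, and a rejection by the OB is not modelled.

```python
from enum import Enum, auto


class Stage(Enum):
    """Hypothetical labels for the steps listed on the Process slide."""
    PROPOSED = auto()         # experiment-supported proposal exists
    PLAN_WITH_MB = auto()     # detailed plan prepared with the MB
    OB_ACCEPTED = auto()      # plan presented; OB recommends acceptance
    ASSOCIATE_TIER1 = auto()  # MoU signed as an Associate Tier 1
    MILESTONES_MET = auto()   # MB agrees the milestones are achieved
    FULL_TIER1 = auto()       # final report to OB recommends full status


# Enum members iterate in definition order, which is the process order.
ORDER = list(Stage)


def advance(stage: Stage, experiment_support: bool = True) -> Stage:
    """Return the next stage; the final stage is returned unchanged."""
    if not experiment_support:
        # Prerequisite from the slide: the experiment(s) must back the proposal.
        raise ValueError("proposal must be supported by the experiment(s)")
    if stage is Stage.FULL_TIER1:
        return stage
    return ORDER[ORDER.index(stage) + 1]
```

Calling `advance` five times from `Stage.PROPOSED` reaches `Stage.FULL_TIER1`, which mirrors the roughly one-year path from proposal to full Tier 1 status.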

Requirements
- Most elements are described in the MoU addenda
- Candidate site must achieve MoU requirements in terms of:
  - Level and performance of resources (see next)
  - Quality and reliability of services:
    - Set of services agreed with the experiments
    - Provide agreed levels of support, as in MoU. Typically on-call support year round
    - Availability and reliability: install agreed sensors, publish to WLCG monthly (as all other sites)
    - Interface to WLCG accounting, provide accounting data to be published monthly
    - Support for Tier 2s, in agreement with experiments. Data source and technical support for Tier 2s

Resources - 1
- Networking:
  - Eventually 10 Gb/s (+ alternate) as part of OPN for T0-T1 or T1-T1
  - Proposal should describe how connectivity is provided during prototyping, and the plan to achieve connection to the OPN
  - Tier 2 connectivity via academic networks. Tier 1 normally expected to have good connectivity. Practical needs to be agreed with experiment(s) as part of the usage model. Plan should describe this.

Resources - 2
- Must provide tape archive service:
  - Capacity needed for share of raw + other data
  - Must guarantee archive for life of the experiment
  - Specified in MoU
- Must show capability of accepting agreed share of raw data and writing it to tape at agreed rate
  - Plan should detail what this is
- Disk & CPU of significant fraction of experiment requirement
  - Typically ~10% (minimum 5%) of requirements expressed to RRB
  - Must be balanced with adequate internal networking to support expected workloads
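As a rough illustration of the disk and CPU guideline above (typically ~10%, minimum 5% of the requirement expressed to the RRB), the check below classifies a candidate site's offer. It is a sketch only: the function name, the return labels, and the idea of applying the thresholds as hard cut-offs are assumptions made for illustration, and the units are whatever the experiment uses for its requirement.

```python
MINIMUM_FRACTION = 0.05  # "minimum 5%" from the slide
TYPICAL_FRACTION = 0.10  # "typically ~10%" from the slide


def classify_share(site_offer: float, experiment_requirement: float) -> str:
    """Classify a site's offer as a fraction of the experiment requirement."""
    if experiment_requirement <= 0:
        raise ValueError("experiment requirement must be positive")
    fraction = site_offer / experiment_requirement
    if fraction < MINIMUM_FRACTION:
        return "below minimum"
    if fraction < TYPICAL_FRACTION:
        return "meets minimum, below typical"
    return "typical or above"
```

For example, a site offering 7 units against a requirement of 100 would be classified as "meets minimum, below typical".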

New Tier 1s
- At the March OB, KISTI (S. Korea) presented an initial proposal as a Tier 1 for ALICE; the OB accepted KISTI as the first “Associate Tier 1”
  - A full plan is now being prepared
- Also anticipated:
  - Russia has proposed providing Tier 1 for all 4 experiments
  - Discussions with Mexico for ALICE; and India for ALICE and CMS
  - All t.b.c.

HELIX NEBULA HEPiX, Prague,

Terminology: EIROForum
- EIROForum
  - Partnership between 8 of Europe’s largest inter-governmental scientific research organisations that are responsible for infrastructures and laboratories: CERN, EFDA-JET, EMBL, ESA, ESO, ESRF, European XFEL, and ILL
  - Mission to combine resources, facilities, & expertise of member organisations to support European Science
  - Represents a very large number of European Scientists, and has significant influence on (e.g.) European-level policy and strategy
  - Has a number of collaborative activities/common fields of expertise of which Information Technology is one

Helix Nebula Project Context

Problem?
- For scientific data processing:
  - HEP data is not restricted in where it can be sent; but other scientific data is (IP, etc.)
  - Concern for ESA, EMBL, etc.
- For other cloud services:
  - Data privacy is a major concern: should not use US companies because of Patriot Act and other concerns about exposure of personnel, financial, and other private data
  - Procurement concerns: CERN & others should procure from European (or Member State) sources preferentially
- Thus, cannot just trivially use Amazon, Microsoft, Gmail, DropBox, etc.
  - (the fact that individuals do is already a problem!)

Origin of the initiative
- Conceived by ESA as a prospective for providing cloud services to space sector in Europe
- Presented to the IT working group of the EIROforum where other members (CERN, EMBL) joined
- Two workshops held during 2011
  - June: hosted by ESA in Frascati
  - October: hosted by EMBL in Heidelberg

The initiative
- A strategic plan for a scientific cloud computing infrastructure for Europe
  - Establish a sustainable multi-tenant cloud computing infrastructure in Europe
  - Initially based on the needs for the European Research Area & Space Agencies
  - Based on commercial services from multiple IT industry providers
  - Adhere to internationally recognised policies and quality standards
  - Governance structure involving all stakeholders

Objectives of the initiative
- Set up a cloud computing infrastructure for European Research Area
- Identify and adopt policies for trust, security and privacy on a European-level
- Create a light-weight governance structure involving all stakeholders
- Define a short and medium term funding scheme

Timeline
- Set-up (2011): select flagship use cases, identify service providers, define governance model
- Pilot phase ( ): deploy flagships; analysis of functionality, performance & financial model
- Full-scale cloud service market (2014 …): more applications, more services, more users, more service providers

Pilot Phase
- Through the pilot phase we expect to explore/push a series of perceived barriers to Cloud adoption:
  - Security: Unknown or low compliance and security standards
  - Reliability: Availability of service for business critical tasks
  - Data privacy: Moving sensitive data to the Cloud
  - Scalability/Elasticity: Will the Cloud scale up to our needs
  - Network performance: Data transfer bottleneck; QoS
  - Integration: Hybrid systems with in-house/legacy systems
  - Vendor lock-in: Dependency on vendors once data & applications have been transferred to the Cloud
  - Legal concerns: Such as who has legal liability
  - Transparency: Clarity of conditions, terms and pricing

Data Protection
- Concerns related to access to data by third parties
  - Case study: How the USA PATRIOT Act can be used to access EU data, Zack Whittaker, ZDNet
  - “These subsidiary companies and their U.S.-parent corporations cannot provide the assurances that data is safe in the UK or the EEA, because the USA PATRIOT Act not only affects the U.S.-based corporations but also their worldwide wholly-owned subsidiary companies based within and outside the European Union.”
- Foreseen Helix Nebula use cases include data that is personal, of economic value or of national importance

Service Procurement
- Assuming the pilot phase proves successful, the provision of commercial Cloud services would need to be integrated into the ICT procurement process of the demand-side organisations
- For the initial flagships this implies:
  - Inter-governmental organisations
    - Jurisdiction (governing laws & arbitration), tax-free status, etc.
    - Return on Investment: preference for procurement from each organisation’s member-states
  - Pool of commercial service providers that can respond to calls for tender
  - Cannot integrate procurement processes of all demand-side organisations but can converge:
    - Technical specifications & standards
    - Terms and conditions
- EC published “Guide for the procurement of standards based ICT: Elements of Good Practice” (21 Dec 2011)

Flagship use cases
- Proposed by demand-side user organisations addressing scientific challenges with societal impact
  - High-profile applications that catch the public imagination and encourage others to use the services
  - Show need for significant scale of resources, federation/aggregation of data sets, long-term archiving and on-demand processing
- Sponsored by user organisations with a long-term objective of outsourcing a portion of their computing requirements
  - Must be prepared to contribute their own resources during the pilot phase to port application and contribute to the cost of procuring required services from the supply-side
  - Must participate in a costing exercise where the total cost of deploying and operating the flagship application in-house can be compared to the cost of procuring the services via the cloud computing initiative

Initial flagship use cases
- Call for proposals
  - Proposals received in format following template agreed by demand and supply side
- Eligibility review of collected proposals (user-side) resulted in 3 recommended flagships
  - CERN: ATLAS High Energy Physics Cloud Use
  - EMBL: Genomic Assembly in the Cloud
  - ESA / CNES / DLR: SuperSites Exploitation Platform
- Flagships sent to supply-side for analysis

Flagship deployments
- Proof of Concept stage within the Pilot Phase started January 2012
- Each flagship will be deployed with a series of providers independently
- Sequence:
  - CERN-ATLAS
  - EMBL
  - ESA
- Expect to have completed initial proof of concept by summer

Flagship use cases: Participating Suppliers

Helix Nebula EC project proposal
- Coordination action submitted to INFRA in November 2011
  - Requested total EC funding €2M

No. | Organisation name | Short name | Country
1 (coord) | European Organization for Nuclear Research | CERN | CH
2 | Stichting European Grid Initiative | EGI.eu | NL
3 | European Molecular Biology Laboratory | EMBL | DE
4 | Atos Origin Nederland | Atos | NL
5 | T-Systems International GmbH | T-Systems | DE
6 | CloudSigma AG | CloudSigma | CH
7 | SAP AG | SAP | DE
8 | Logica Deutschland GmbH & Co KG | Logica | DE
9 | Consiglio Nazionale delle Ricerche | CNR | IT
10 | Cloud Security Alliance EMEA | CSA | UK

Role of Helix Nebula: The Science Cloud
- Vision of a unified cloud-based infrastructure for the ERA based on Public/Private Partnership, 4 goals building on the collective experience of all involved.
- Goal One: Establish HELIX NEBULA, the Science Cloud, as a cloud computing infrastructure addressing the needs of the ERA and capable of serving as a platform for innovation and evolution of the overall e-infrastructure.
- Goal Two: Identify and adopt suitable policies for trust, security and privacy on a European level.
- Goal Three: Create a light-weight governance structure that involves all the stakeholders and which can evolve over time as the infrastructure, services and user base grow.
- Goal Four: Define a funding scheme involving all the stakeholder groups (service suppliers, users, EC and national funding agencies) for PPP to implement a Cloud Computing Infrastructure that delivers a sustainable and profitable business environment adhering to European-level policies.

Specific outcomes
- Develop strategies for extremely large or highly distributed and heterogeneous scientific data (including service architectures, applications and standardisation) in order to manage the upcoming data deluge
- Analyse and promote trust building towards open scientific data e-Infrastructures covering organisational, operational, legal and technological aspects, including authentication, authorisation and accounting (AAA)
- Develop strategies and establish structures aiming at co-ordination between e-infrastructure operators
- Create frameworks, including business models for supporting Open Science and cloud infrastructures based on PPP, useful for procurement of computing services suitable for e-Science

Scientific Flagships
- CERN LHC (ATLAS):
  - High Throughput Computing and large scale data movement
- EMBL:
  - Novel de novo genomic assembly techniques
- ESA:
  - Integrated access to data held in existing Earth Observation “Super Sites”
- Each flagship brings out very different features and requirements and exercises different aspects of a cloud offering

ATLAS use case
- Simulations (~no input) with stage out to:
  - Traditional grid storage vs
  - Long term cloud storage
- Data processing (== “Tier 1”)
  - This implies large scale data import and export to/from the cloud resource
- Distributed analysis (== “Tier 2”)
  - Data accessed remotely (located at grid sites), or
  - Data located at the cloud resource (or another?)
- Bursting for urgent tasks
  - Centrally managed: urgent processing
  - Regionally managed: urgent local analysis needs
- All experiences immediately transferable to other LHC (& HEP) experiments

Immediate and longer term goals for CERN
- Determine costs of commercial cloud resources from various sources
  - Compute resources
  - Network transfers into and out of cloud
  - Short and long term data storage in the cloud
- Develop understanding of appropriate SLAs
  - How can they be broadly applicable to LHC or HEP
- Understand policy and legal constraints; e.g. in moving scientific data to commercial resources
- Performance and reliability, compared to WLCG baseline
- Use of standards (interfaces, etc.) & interoperability between providers
- Can CERN transparently offload work to a cloud resource
  - Which type of work makes sense?
- Long term: Can we use commercial services as a significant fraction of overall resources available to CERN experiments?
  - At which point is it economic/practical to rely on 3rd party providers?
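The three cost components named above (compute, network transfers, storage) suggest a simple additive cost model that could be compared against an in-house figure. The sketch below is purely illustrative: the `CloudPrices` fields, the linear pricing assumption, and all parameter names are hypothetical, and no real provider prices are implied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CloudPrices:
    """Hypothetical unit prices; real offers may not be linear like this."""
    per_cpu_hour: float          # compute resources
    per_gb_transferred: float    # network transfers into and out of the cloud
    per_gb_month_stored: float   # short and long term data storage


def cloud_cost(prices: CloudPrices, cpu_hours: float,
               gb_transferred: float, gb_stored: float, months: float) -> float:
    """Sum the three cost components for one workload."""
    compute = prices.per_cpu_hour * cpu_hours
    network = prices.per_gb_transferred * gb_transferred
    storage = prices.per_gb_month_stored * gb_stored * months
    return compute + network + storage


def cloud_is_cheaper(cloud_total: float, in_house_total: float) -> bool:
    """Compare against an in-house total obtained from a separate costing."""
    return cloud_total < in_house_total
```

The in-house total is deliberately an input rather than something computed here, since the slides leave the method for costing in-house deployment and operation open.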

Summary
- The objective of this initiative is to establish a sustainable cloud computing infrastructure for the European Research Area based on commercially provided services
- It is a collaborative initiative bringing together all the stakeholders to establish a public-private partnership
- Interoperability with existing e-infrastructures is a goal of the initiative
- It has commitments from the IT industry and user organisations
  - Flagship deployment started Jan
  - Initial flagship use cases identified
- Framework collaboration EC project to start summer

WLCG & HEP
- “WLCG” (and OSG, EGI, NDGF, etc.) used already by other HEP experiments and others
- Suggestions by SuperBelle, and others, that they could “join” WLCG and benefit from not just the infrastructure and technology but also support, operations, etc.
- Other initiatives looking at broader use of e-infrastructures
- How should WLCG collaboration position itself?
  - We are only mandated for LHC today
  - How is it different from EGI, OSG, etc.? It is worldwide, collaborates across infrastructures

HEPiX, Prague, 27/04/2012