Data Infrastructure foundation of the European Cloud Initiative 20 January 2016 Carlos Morais Pires European Commission - DG CNECT
Digital Single Market Strategy
A Juncker Commission priority, adopted 6 May 2015
Andrus Ansip: Digital Single Market
Günther Oettinger: Digital Economy and Society
Element of DSM: building a data economy, including through open science
Digital Single Market: extracting value from data
The exponential growth of data will help address societal challenges and drive scientific advances and productivity gains across the economy. Digital technologies support more efficient production processes. Europe has to invest in its capacity to be a global player in extracting value from data (science, industry, public sector).
Responding to the challenges
European businesses, research communities and public bodies have yet to tap the full potential of data and its potentially transformative effect on the economy.
- Lack of interoperability of infrastructures. Many datasets are unavailable to industry, governments and scientists in the same area, and rely upon incompatible processing infrastructures.
- Fragmentation of deployment. Digital infrastructures are now split by scientific and economic domain (e.g. physics, health, agriculture, space, climate) and by country.
- Cost-efficiency. Scientific data producers and users must be able to re-use scientific data and make the most of advanced data analytics techniques.
Europe risks lagging behind in the world-class High Performance Computing (HPC) infrastructure needed to process data. Europe needs deployment of new architectures (exascale) and research (post-exascale/quantum).
European Data Infrastructure will combine:
- processing data (HPC infrastructure)
- storing data (Data infrastructure)
- moving data (Network infrastructure)
Just as Europe needs an efficient and sustainable transport infrastructure for the mobility of people and goods, it needs a data infrastructure for the digital economy. A data-driven economy needs an infrastructure that combines world-class supercomputing (HPC) capability, high-speed connectivity and leading-edge data storage capacity. Investments should be driven not only by technology but by a "User Driven and Service Oriented approach" which matches the Cloud Paradigm.
European exascale capacity
High Performance Computing:
- develop European capacity on components (processors, memory and interconnect)
- prototype an EU HPC pre-exascale Tier 0 (T0) node, followed by installation of a fully operational HPC exascale machine
- connect and upgrade HPC Tier 0 nodes and link them with the data infrastructure
- support centres of excellence (CoE) for software and platforms adaptation to exascale
The European ambition is that world-class HPC technology will be based on a new generation of EU technology (low-power high performance computing chips). The Pan-European High Performance Computing infrastructure and services (PRACE) will be involved in this process to complement the contractual Public-Private Partnership on HPC.
Data and software:
- connect/upgrade Data and Software Centres of Excellence (CoE) for data interoperation
- install the EU Data/CoE exascale storage node for Earth Observation data (e.g. Copernicus) that can be repurposed for other areas
- long term storage (mixture of Tier-0 and Tier-1, Long Tail… seamlessly connected)
- support core services across domains and disciplines (store, preserve, access)
The European Data Infrastructure will work in combination with the national and regional scientific and public data centres (OpenAIRE, EGI, EUDAT and others). It will develop and implement best practices based on certification schemes and on common European and global standards and specifications (RDA), to tackle the current lack of interoperability between national and disciplinary data centres.
Networking:
- Rooted European platform for innovative big data-driven services
- extend geographic coverage
- upgrade network capacity from backbone to the campus and researcher desk
Connectivity is essential to make sure that HPC is accessible everywhere in the EU. The trans-European high-speed network (GÉANT) and the National Research and Education Networks (NREN) already connect 50 million researchers and students. These infrastructures will be upgraded to match the expected exponential growth in the data volumes to be transferred and the extension of the user base.
A seamlessly connected data infrastructure
[Diagram: HPC Tier-0 (T0) nodes and HPC CoE, seamlessly connected. Big Data processing: data is co-located with T0 HPC centres. Accessed by SMEs, industry and universities.]
Widening Access and Building Trust
[Diagram: the Data Infrastructure (HPC, Data and Software, Networks) serves science users (Big Science, Research Facilities, Labs and Universities) and a widened user base (Hospitals, Smart Municipalities, Government, SMEs, Industry).]
Data Infrastructure service delivery is empowered to serve all European users: scientists, but also users from industry (including SMEs) and the public sector. The data infrastructure will help unlock the value of Big Data and of digital by default.
[Diagram: cloud and Data-Infrastructure transparency. Layers shown: domains (Earth Obs … life …); catalogue of services; dynamic, adaptive service interfaces; network infrastructure (GÉANT), HPC/distributed computing/software infrastructure, scientific data infrastructure …]
Call for coordinated European action
A first promising step in this direction: a group of Member States presented plans to launch an Important Project of Common European Interest (IPCEI) on HPC and Big Data enabled applications. Its objective is to support the development of new usages of HPC by industry and to guarantee access to world-class HPC facilities for public and private research.
Blog post by Günther H. Oettinger, 8 January 2016: http://bit.ly/1ZLRPJ5
The proposed IPCEI offers the possibility to underpin large-scale pilots (e.g. Smart Energy, Smart Cities, Smart Mobility,…) by progressively opening the European Data Infrastructure to users from industry and the public sector, with a view to tackling the full European dimension.
No one can go it alone in a connected economy. It has to be a European Ambition!
Carlos Morais Pires carlos.morais-pires (at) ec.europa.eu Thank You!