EUDAT Towards a European Collaborative Data Infrastructure Damien Lecarpentier – CSC, IT Center for Science, Finland ISC’11, Hamburg, 20 June 2011.

Outline of the talk
- EUDAT concept
- EUDAT consortium
- EUDAT service approach
- Expected benefits and challenges of a CDI

EUDAT Key facts and objectives
Initiative funded through FP7 e-Infrastructure Call 9 (WP11): INFRA : Data infrastructure for e-Science (November 2010)
- Call 9 objective: “Establish a persistent and robust service infrastructure for scientific data in Europe that responds to the need of data-intensive Science of 2020”
- Budget: 43 M€
EUDAT selected for funding (three-year project)
- Official starting date: 1 October 2011
- Biggest budget of the call: 9.3 M€ EC grant
- Total budget: 16.3 M€
Consortium
- 23 partners representing 13 countries
- 15 user communities from a wide range of disciplines (Biomed, Earth Science, Climate, SSH, etc.)
Targets
- EUDAT objective: “To deliver a Collaborative Data Infrastructure (CDI) with the capacity and capability for meeting researchers’ needs in a flexible and sustainable way, across geographical and disciplinary boundaries.”
- The infrastructure must be collaborative
- The infrastructure must be driven by researchers’ needs
- The infrastructure must be sustainable yet flexible
- The infrastructure must be pan-European
- The infrastructure must be multi-disciplinary

The current data infrastructure landscape: challenges and opportunities
- Long history of data management in Europe: several existing data infrastructures serve established and growing user communities (e.g., ESO, ESA, EBI, CERN)
- New Research Infrastructures are emerging and are also trying to build data infrastructure solutions to meet their needs (CLARIN, EPOS, ELIXIR, ESS, etc.)
- A large number of projects provide excellent data services (EURO-VO, GENESI-DR, Geo-Seas, HELIO, IMPACT, METAFOR, PESI, SEALS, etc.)
- However, most of these infrastructures and initiatives primarily address the needs of a specific discipline and user community
Challenges
- Compatibility, interoperability, and cross-disciplinary research
- Data growth in volume and complexity (the so-called “data tsunami”), with a strong impact on costs that threatens the sustainability of the infrastructure
Opportunities
- Potential synergies do exist: although disciplines have different ambitions, they have common basic needs and requirements that could be matched with generic pan-European services supporting multiple communities and ensuring greater interoperability
- A strategy is needed at pan-European level

Towards a Collaborative Data Infrastructure
Source: HLEG report, p. 31
- EUDAT will focus on building this generic data infrastructure layer and offer a trusted domain for long-term data preservation, accompanied by related services to store, identify, authenticate and mine these data.
- This needs to be done in close collaboration with the communities
- Core services must match the requirements of the communities
- Community services can also be incorporated into the common data service infrastructure when they are of use to other communities.

The EUDAT Consortium

The EUDAT Communities

The EUDAT Communities (by field)
EUDAT targets all scientific disciplines (discipline neutral):
- To enable the capture and identification of cross-discipline requirements
- To involve the scientists of all the communities in the shaping of the infrastructure and its services
Communities by field:
- Biological and Medical Science: VPH, ELIXIR, BBMRI, ECRIN
- Environmental Science: ENES, EPOS, Lifewatch, EMSO, IAGOS-ERI, ICOS
- Social Sciences and Humanities: CLARIN
- Physical Sciences and Engineering: WLCG, ISIS
- Material Science: ESS…
- Energy: EUFORIA…

EUDAT Services Activities – Iterative Design
EUDAT’s Services activity is concerned with identifying the types of data services needed by the European research communities, delivering them through a federated data infrastructure, and supporting their users.
1. Capturing Communities’ Requirements (WP4)
   - Services to be deployed must be based on user communities’ needs
   - Strong engagement and collaboration with user communities (EUDAT communities and beyond) to capture requirements
2. Building the services (WP5)
   - User requirements must be matched with available technologies
   - Need to identify:
     - available technologies and tools to develop the required services (technology appraisal)
     - gaps and market failures that should be addressed by EUDAT research activities
   - Services must be designed, built and tested in a pre-production test bed environment and made available to WP4 for evaluation by their users
3. Deploying the services and operating the federated infrastructure (WP6)
   - Services must be deployed on the EUDAT infrastructure and made available to users, with interfaces for cross-site, cross-community operation
   - Reliability, 24h/7d availability and accessibility of the shared services, with operational security, data integrity and compliance with stakeholder requirements and policies

EUDAT core services
Core services are the building blocks of EUDAT’s Common Data Infrastructure, mainly included on the bottom layer of data services.
Fundamental Core Services
- Long-term preservation
- Persistent identifier service
- Data access and upload
- Workspaces
- Web execution and workflow services
- Single Sign On (federated AAI)
- Monitoring and accounting services
- Network services
Extended Core Services (community-supported)
- Joint metadata service
- Joint data mining service
There is no need to match the needs of all communities at the same time; addressing a group of communities can be very valuable, too.

Service Model Approach and Generic Collaboration
Generic Service Model
- Fundamental Core Services meet strongly overlapping service requirements
- Extended Core Services are mainly community-supported; community requirements typically overlap between some disciplines
Collaboration between Teams
- Fundamental Core Services are operated and supported by an Operations Team which collaborates across the participating centres
- Extended Core Services and other joint multi-disciplinary services must be community-supported; their requirements overlap between a specific subset of disciplines

EUDAT Timeline
[Figure: project timeline. Phases: user requirements, service design, service deployment. Milestones: EUDAT kick-off, 1st to 4th User Forums, first services available, cross-community services, full core services deployed, sustainability plan.]

Expected benefits of a Collaborative Data Infrastructure
- Enabling multi-disciplinary data-intensive research and collaboration
- Development of common services supporting research communities
- Support to existing scientific communities’ infrastructures
- Support to smaller communities through access to sophisticated services
- Inter-disciplinary collaboration and exploitation of synergies between communities
- Communities from different disciplines working together to build services
- Data sharing between disciplines
- Collaboration with other large-scale infrastructure
- European e-Infrastructures: Géant, PRACE, EGI, etc.
- Global initiatives in the US, Japan, Australia, etc.
- Ensuring wide access to and preservation of data in a sustainable way
- A robust generic infrastructure capable of handling the scale and complexity of data that will be generated over the next years
- Greater access to existing data and better management of data for the future
- Increased security by managing multiple copies in geographically distant locations
- Put Europe in a competitive position for important data repositories of world-wide relevance
- Economies of scale and cost-efficiency
- Shared resources and work are less costly

Challenges and Opportunities
- Delivering high-level multi-disciplinary data services
- Achieving a high level of interoperability in the context of diversity of data, research disciplines and practices
- Need to strongly involve the different communities in the design and evaluation of services
- EUDAT as a platform to discuss interoperability issues (along with other initiatives, e.g., DAITF)
- Building trust among stakeholders
- Trust between service providers and users but also between the researchers and disciplines themselves
- Trust in the EUDAT infrastructure, the data deposited and collected, data integrity
- Ensuring the sustainability of the infrastructure
- Providing a framework and a plan to ensure the continuity of services beyond the immediate funding window, through the setting up of a sustainable entity
- Funding and business models
- Partnerships (new communities, industry, etc.) and governance models

The beginning of a long journey…
“Do the difficult things while they are easy and do the great things while they are small. A journey of a thousand miles must begin with a single step.” Lao Tzu

How to get in touch with EUDAT?
- Kimmo Koski, CSC - IT Center for Science, EUDAT Project Coordinator
- Peter Wittenburg, Max Planck Institute for Psycholinguistics at Nijmegen (MPI-PL), EUDAT Scientific Coordinator
- Damien Lecarpentier, CSC - IT Center for Science, EUDAT Project Manager
- BoF session on “e-Infrastructure for science in Europe”, on Tuesday 21 June, 14:30-15:15, Hall B
- Partners’ booths at ISC: CSC #146, BSC #114, DKRZ #140, EPCC #152
THANK YOU!