Data Sharing and Interoperability Francoise Genova RDA TAB and RDA/Europe member WDS Scientific Committee.

Slides:



Advertisements
Similar presentations
The Five Working Groups Faculty Development Scaling-Up Post-Graduate programmes and 1.Research & Development 2.Innovation 3.Industry - Institute Interaction.
Advertisements

F. Genova, Strategy, CDS SC meeting, 2011/06/29-30 Strategy.
© Cambridge International Examinations 2013 Component/Paper 1.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Welcome to the Conference !! Juan Bicarregui Chair, APA Executive.
DARE: building a networked academic repository in the Netherlands ICOLC October 25 Ronald Dekker Delft University of Technology Library.
Data-intensive Research Policy In Ireland A brief overview By J.-C. Desplat.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
THE ODIN PROJECT Sergio Ruiz – DataCite Laura Paglione – ORCID ORCID and DataCite Interoperability Network: Connecting Identifiers This project has received.
Standards and Guidelines for Quality Assurance in the European
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
QS 702 Phase II: Encouraging the Integration of Technology Into Higher Education.
London February, 28 European Commission Information Society and Media Annual Meeting Annual Meeting GaLA Game and Learning Alliance The European Network.
Dr. Jūratė Kuprienė Director for innovations and infrastructure development Workshop: Information services for research process , Rīga Research.
Franklin Consulting Programme X The Innovation Base The e-Framework: What do they mean for programme management? Tom Franklin Franklin Consulting Richard.
Data Infrastructures Opportunities for the European Scientific Information Space Carlos Morais Pires European Commission Paris, 5 March 2012 "The views.
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
ICSTI Workshop, 2012/03/05, F. Genova The astronomical Virtual Observatory.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
Research Quality Assessment following the RAE David Sweeney Director, Research, Innovation, Skills.
Save time. Reduce costs. Find and reuse interoperability solutions on Joinup for developing European public services Nikolaos Loutas
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
THE ROAD TO OPEN ACCESS A guide to the implementation of the Berlin Declaration Frederick J. Friend OSI Open Access Advocate JISC Consultant Honorary Director.
THOMSON SCIENTIFIC Patricia Brennan Thomson Scientific January 10, 2008.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
F. Genova, Challenges, CDS SC meeting, 2011/06/29-30 Challenges.
F. Genova, Berlin 7, Paris, 2 December 2009 The astronomical information network.
Scholarly communications Discussion group Linked Data Workshop May 2010.
HIT Policy Committee NHIN Workgroup Recommendations Phase 2 David Lansky, Chair Pacific Business Group on Health Danny Weitzner, Co-Chair Department of.
E-Science for the SKA WF4Ever: Supporting Reuse and Reproducibility in Experimental Science Lourdes Verdes-Montenegro* AMIGA and Wf4Ever teams Instituto.
‘intelligent openness’ The common objective of an RCUK data policy Gregor McDonagh
The Research Excellence Framework Expert Advisory Groups round 1 meetings February 2009 Paul Hubbard Head of Research Policy.
Suggestions for the GEOSS Workplan C. Waldmann and R. Huber MARUM, Germany.
Summary of RDA Outputs so far dr. Ir. Herman Stehouwer 22 September 2015.
Data at Work: Supporting Sharing in Science and Engineering (Birnholtz & Bietz, 2003) Adam Worrall LIS 6269 Seminar in Information Science 3/30/2010.
Date, location Open Access policy guidelines for research funders Name Logo area.
A GEO Label: Scope of Review Hans-Peter Plag IEEE University of Nevada, Reno, Nevada, USA;
WP2 – Strategy, co-operation and dissemination EuroVO-AIDA – First periodic review – 24 April 2009 Françoise Genova, Project Coordinator WP2 – NA2 Strategy,
Sharing scientific data: astronomy as a case study for a change in paradigm Présenté par Françoise Genova.
Overview and status of the project EuroVO-AIDA – Final review – 5 October 2010 Françoise Genova, Project Coordinator, 1 Overview and status of the project.
WP2: Strategy EuroVO-DCA Review, 10 January 2007 Françoise Genova, CNRS WP2 Definition of European Data Centre Alliance strategy.
Date, location Open Access policy guidelines for research institutions Name Logo area.
Why RDA? A domain repository perspective George Alter ICPSR University of Michigan.
WP2 – Definition of European DCA strategy EuroVO-DCA Final Review, 29 April 2009 Françoise Genova, Coordinator, WP2 lead WP2 Definition of European Data.
EGI-InSPIRE RI EGI-InSPIRE Open Science Open Data Open Access Sergio Andreozzi Strategy & Policy Manager, EGI.eu
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
CERTIFICATE IN ASSESSING VOCATIONAL ACHIEVEMENT (CAVA) Unit 1: Understanding the principles and practices of assessment.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
Agency Briefing - Overview
COST Action and European GBIF Nodes Anne-Sophie Archambeau.
The European Transport Research Alliance - ETRA Prof. G. A. Giannopoulos Chairman, ETRA.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Paola Gargiulo, Ilaria Fava CASPUR – Italy OpenAIRE National Open Access Desk Open Access to research data and publications: OpenAIREplus.
LIVING LAB OF GLOBAL CHANGE RESEARCH
Overview of WGs, IGs and BoFs
Priorities for International Development of e-Infrastructure and Data Management in Global Change Research Presentation by Robert Gurney, University of.
Current and Upcoming RDA Recommendations Dr. ir. Herman Stehouwer
EOSC MODEL Pasquale Pagano CNR - ISTI
Donatella Castelli CNR-ISTI
Summit 2017 Breakout Group 2: Data Management (DM)
Working with your archive organization Broadening your user community
EOSCpilot Skills Landscape & Framework
EOSC Governance Development Forum
Scientific Data as Research Infrastructure
Archives and Records Professionals for Research Data IG
Three Uses for a Technology Roadmap
Bird of Feather Session
IG Data Curation & Preservation
FAIR Across – Implementation of FAIR into research practice
Presentation transcript:

Data Sharing and Interoperability Francoise Genova RDA TAB and RDA/Europe member WDS Scientific Committee

The general context in which DataCite works is a complex system with many elements A researcher’s point of view on requirements DataCite 2014, F. Genova, Keynote2

How do astronomers do science? DataCite 2014, F. Genova, Keynote3

And also … DataCite 2014, F. Genova, Keynote4

We use research infrastructures « Research infrastructures are facilities, resources and services that are used by the research communities to conduct research and foster innovation in their fields. Where relevant, they may be used beyond research, e.g. for education or public services. » Definition from HORIZON 2020 Work programme 2014 – 2015 Data is, or should be, among the research infrastructures. From the 2030 Vision of the « Riding the Wave » report to the European Commission (2010): All stakeholders, from scientists to national authorities to the general public, are aware of the critical importance of conserving and sharing reliable data produced during the scientific process. Impact if achieved Data form an infrastructure, and are an asset for future science and the economy. DataCite 2014, F. Genova, Keynote5

This is true for all sciences, Social Sciences and Humanities as well as for sciences which use « physical » infrastructure Sharing of result/project data becomes a requirement on projects from many funding agencies « Open science » is currently a buzz word, even for the G8 Ministers of Science DataCite 2014, F. Genova, Keynote6

« Open Science » in the G8 Ministers’ statement DataCite 2014, F. Genova, Keynote7

What we learnt in astronomy It is worth doing the job! In astronomy, the change of paradigm is done and astronomers use remote data in their everyday work – they probably could not live without it any more. – Key for main scientific subjects: multi-wavelength data to understand the physical phenomena at work in the objects, variable phenomena, etc… – Many more papers from data retrieved from observatory archives than from the original observations (HST, …) – Also huge usage of the value-added services, eg ~ queries/day on the CDS services which are only one of the on-line data resource It is worth but it requires lots of work and takes a fraction of the community resources. It is not enough to set rules about data sharing, scientists have to be convinced to participate themselves and to give time and resources. How to build this willingness to extend data sharing to more/all disciplines? DataCite 2014, F. Genova, Keynote8

Convince Data Producers/Users Data producers include – « Physical » Research Infrastructures Although it is more and more often mandatory for Research Infrastructures to share their data, it is not always done and some have a hard work convincing their users who sometimes prefer to keep their data for themselves indefinitely or for a long time to avoid competition with others in the data usage. Fairness on the provider and user sides: – allow a ‘short’ proprietary period e.g. when the data is obtained from a highly competitive process (one year in general in astronomy), also to continue to feed the system with ‘the best’ data – make sure that the data is not kept hidden forever or for too long They are also a target in addition to research tems/researchers (cf. ARGO talk) – Individual scientists/research teams Keywords: trust, usability of the data sharing process, usefulness of the data infrastructure, incentive and rewards, support to the data sharing process DataCite 2014, F. Genova, Keynote9

The elements of trust The data system fulfils the user needs – a user-centered system, not a system driven by technology or by data preservation. Data must be usable: it must come with the metadata and in a format allowing reuse. Metadata, data formats, include elements which depend on the discipline. ─Disciplines have to care about their data to set up the disciplinary components of data sharing (metadata et al.) Data providers must trust the data repository and more generally the data framework. – The data will be preserved on the long term > sustainability – Other people will be able to find and use the data – They are confident that data will be used, i.e. the data repository and the data framework more generally have the capacity to fulfill the research needs – Usefulness of certification Certification criteria are a good topic for discussion to include the different possible points of view (cf the RDA-WDS WG)! Not only technical aspects, all the components of trust have to be identified. – E.g., do users want the original data or data corrected from errors? Preservation vs usage DataCite 2014, F. Genova, Keynote10

Incentives and rewards Make it easy: support researchers in the process of sharing their data – Data repositories work at facilitating the deposit process – Support also needed to prepare metadata – Librarians in institutional repositories – Disciplinary data repositories (librarians, researchers, s/w engineers) gather the expertise and can play a major role Make it worth – When a ‘good’ data system is set up, a virtuous loop: when researchers use data from the system for their own research they are more eager to share their own one (although not always, in which case rules can be very useful) – Increase trust in one’s own results by providing access to data allowing to check them – Increase one’s research impact by allowing data reuse by others – Include data sharing among researchers’ evaluation criteria DataCite 2014, F. Genova, Keynote11

Evaluation… The fact that researchers’ evaluation criteria have to evolve appears prominently every time « open data » is discussed, at all levels, e.g. – One of the first comments from the audience in any talk on data sharing – « To ensure successful adoption by scientific communities, open scientific research data principles will need to be underpinned by an appropriate policy environment, including recognition of researchers fulfilling these principles » G8 Science Ministers Statement, 2013 Unfortunately nothing changed until now in the « bibliometry dictatorship » although most people understand the problems (take into account all types of research activities is only one of them) Recognize data sharing can also include measure data value « If we are to encourage broader use, and re-use, of scientific data we need more, better ways to measure its impact and quality » “Riding the Wave” report DataCite 2014, F. Genova, Keynote12

Data citation, data usage Of course here the capacity to cite data plays a major role – Ensure fairness by referring to the data used in a paper (like one expects researchers to cite the articles they use) – Enable data citation metrics to include in the evaluation criteria (but…) – Are they other criteria to measure data impact? This is also an important incentive for the Research Infrastructures to share their data – One aim is to increase their impact by allowing data reuse by others – It is necessary then to be able to identify publications using their data even if they are not from the original infrastructure user (author’s name not known a priori) DataCite 2014, F. Genova, Keynote13

To get the full benefit of data sharing Discoverable, accessible, assessable, intelligible, useable and wherever possible interoperable data (G8 Science Minister Statement) Interoperability among disciplinary resources VizieR usage through the VO protocols Increases the impact of data sharing, eg J/ApJ/594/1 – Paper cited 1357 times since 2003, ~100 times in 2013 (ADS) – 1850 queries in VizieR in 2013, 93% through VO protocol from a VO tool DataCite 2014, F. Genova, Keynote14

Interoperability Cross-discipline interoperability essential to tackle the grand challenges Generic building blocks can help many For each building block of the data infrastructure, there will likely be more than one solution, e.g. data citation Some disciplines already have well established systems and cannot be required to break something which works Interoperability is also the key here DataCite 2014, F. Genova, Keynote15

The Research Data Alliance Breaking barriers to data sharing – also many scientific disciplines represented Bottom-up work to provide building blocks, best practices, etc. Working Groups such as – Data Citation WG Making dynamic data citeable – Data Type Registry WG – RDA/WDS Publishing Data: Bibliometrics – RDA/WDS Publishing Data: Services – RDA/WDS Publishing Data: Workflows Interest Groups such as – Education and training on handling of research data IG – Long Tail of Research Data IG – PID IG – RDA/WDS Publishing Data IG – Plenary 4 September in Amsterdam, co-located ODIN ORCID and DataCite event DataCite 2014, F. Genova, Keynote16