Nikhef RDM Policy – first experiences

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

Configuration management
OPEN ACCESS PUBLICATION ISSUES FOR NSF OPP Advisory Committee May 30, /24/111 |
DATA PRESERVATION IN ALICE FEDERICO CARMINATI. MOTIVATION ALICE is a 150 M CHF investment by a large scientific community The ALICE data is unique and.
Research Data Management Philip Tarrant Global Institute of Sustainability.
This chapter is extracted from Sommerville’s slides. Text book chapter
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
 Have some paper and pen or a mind mapping tool available for brainstorming.
Aalto Research Data Management Policy Ella Bingham 8 April 2016 This work is licensed under the Creative Commons Attribution 4.0 International License.
Creation of the Archiving Component of a Memorandum of Understanding (MOU) Template for International Missions IPDA MOU project members.
School on Grid & Cloud Computing International Collaboration for Data Preservation and Long Term Analysis in High Energy Physics.
Service on Common Ground: Creating Effective Partnerships/Collaboratives Linda Frizzell, Ph.D.
Data Publication (in H2020)
Jeff Moon Data Librarian &
Stuart J. Chalk, Department of Chemistry University of North Florida
Data Sharing entails shared responsibilities
2nd DPHEP Collaboration Workshop
Frameworks for Sensitive Data in the Research Lifecycle
European Union Law Law 326.
DSA and FAIR: a perfect couple
ELIXIR Core Data Resources and Deposition Databases
Objectives of WHO's collaboration with NGOs
Chapter 4 – Requirements Engineering
Auditing & Investigations II
FAIR Metadata RDA 10 Luiz Olavo Bonino – - September 21, 2017.
Libraries as Data-Centers for the Arts and Humanities
IMPLEMENTATION OF MEAs IN NATIONAL LAW
FAIR Sample and Data Access
Donatella Castelli CNR-ISTI
Ways to upgrade the FAIRness of your data repository.
Summit 2017 Breakout Group 2: Data Management (DM)
FAIR Metrics RDA 10 Luiz Bonino – - September 21, 2017.
Fitness for use: Users of the U. S
Active Data Management in Space 20m DG
General Data Protection Regulation
FAIR Data Management, Trustworthy Digital Repositories and Business Continuity / Disaster Preparedness
General Finnish DMP Guidance
Interpretation of policy language
Identifiers Answer Questions
CMMI – Staged Representation
Making Annotations FAIR
Reading Research Papers-A Basic Guide to Critical Analysis
FAIR Sample and Data Access
OPEN DATA – F.A.I.R. PRINCIPLES
EOSCpilot Skills Landscape & Framework
G.D.P.R General Data Protection Regulations
Introduction to Research Data Management
Metadata for research outputs management Part 2
Metadata for research outputs management
METHOD VALIDATION: AN ESSENTIAL COMPONENT OF THE MEASUREMENT PROCESS
General Data Protection Regulation
Topic Principles and Theories in Curriculum Development
Research Data Management
What does DPHEP do? DPHEP has become a Collaboration with signatures from the main HEP laboratories and some funding agencies worldwide. It has established.
Interoperability – GO FAIR - RDA
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
How to Implement the FAIR Data Principles? Elly Dijk
The new RDA: resource description in libraries and beyond
Automatic evaluation of fairness
Introduction to the CESSDA Data Management Expert Guide
Dept. of Computation, UMIST
eScience - FAIR Science
Ethics & Data Management
Introduction to SOA Part II: SOA in the enterprise
Research data lifecycle²
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Supporting Open Research
Mary Miller Director of Collection Management & Preservation
Presentation transcript:

Nikhef RDM Policy – first experiences Contactgroep DM NWO-I

Nikhef RDM Development Process A long journey – and for long at a sufficiently abstract level prominently present in H2020/ERC (and in NWO now) participated in the FOM DM pilot for Projectruimte proposals (2016) schemes were adaptable to domain-specific needs August 2016 “NWO Institute Data Management Policy Framework” became binding development of institute policy now must fit within this framework flexibility needs to be found within those constraints (keeping cost in mind) Summer 2017 decide on a hierarchical approach, separating policy and practice statements with a sensible default for ‘small’ activities (but that is still a challenge)

DPHEP (2008 – ICFA-DPHEP Study Group, 2013+ DPHEP MoU) RDM is not new last big e+e- machine closed years ago, and e.g. LEP, or HERA, data is not re-measurable today … DPHEP (2008 – ICFA-DPHEP Study Group, 2013+ DPHEP MoU) preserve experimental data and its software environment study group, repository development (Zenodo), software curation http://dphep.org/

opendata.cern.ch

We have a diverse tradition … DPHEP HEPForge (and many sub-repositories) HepData CERN OC3 archive INSPIRE-HEP (‘SPIRES’) and we ‘spread the word’ through general-purpose services Zenodo https://www.nikhef.nl/grid/nikhef/dmp/

Re-use as much as possible from partners – we re-used STFC/RAL  Nikhef RDMP ToR must implement the NWO-I DM Framework (and NWO Protocol) leaves us freedom of implementation, but we signed it, and it does require specific elements like the Replication Package not undermine any past or future data management customs already in use leverage as much as possible international efforts (such as DPHEP) minimal impact on PhD students and their time limit impact also for staff members as much as possible not incur unnecessary costs or liabilities Re-use as much as possible from partners – we re-used STFC/RAL 

The Document ... Uses RFC2119 terminology MUST This word, or the terms "REQUIRED" or "SHALL", mean that the definition is an absolute requirement of the specification. SHOULD This word, or the adjective "RECOMMENDED", mean that there may exist valid reasons in particular circumstances to ignore a particular item, but the full implications must be understood and carefully weighed before choosing a different course.

Scope defines Research Activities and re-usable data non-reusable data (e.g. collected when building tools/components) need not be subject to the policy but most data usually is

RDM Policy Principles Encompasses all kinds of Data: raw, derived, published, logs & settings Implement NWO Protocol & NWO-I DM Framework Be legal, yet use of released data not our concern (beware of law & NWO) Each Activity SHOULD have a Data Management Plan (DMP) (already required for NWO, H2020, ERC) or at least implement the ‘default’ guidance in the Policy Supply outline of DMP in proposal phase - already now for NWO/H2020 DMP SHOULD follow current best practice ‘for our domain’

but there are exclusions specific agreements & treaties take precedence (e.g. CERN) we exclude contract work – the responsibility in that case is with the third party software as a product in itself (like control software) physical detectors & components (but archive the design drawing) personal “GDPR” data – you really ought to think why you need it @Nikhef administrativa you can obviously still be following the policy, but it will not be as absolutely mandatory

Standard DMPs Many global experiments have a DM policy already in place: LHC experiments, Auger, LVC, … To answer the NWO DM paragraph, we developed standard templates https://www.nikhef.nl/grid/nikhef/dmp/

Otherwise, leverage the community The aim for all non-standard DMP is to leverage existing community services and initiatives (DPHEP and more) just your own DOI assignment costs €8500/yr for DataCite membership – or €4.50 per DOI @mEDRA

Repositories

Early feed-back from the staff Standard DMP templates are much appreciated Need to take balanced care about RDM in MoU negotiations ‘seek to ensure’ – not require Wider diversity in practices for ‘smaller’ collaborations many concerns about feasibility and cost/benefit analysis ‘depth’ of the replication package is a big concern the “I” premise of FAIR does not quite work in our domain Policy implementation will be a phased approach Training is an essential component both on RI and log keeping (e.g. Jupyter notebooks) and on proper use of existing repository facilities (storage QoS classes)

FAIR The “FAIR” Principles pick a name for the effort that is ‘hard to disagree with’ … FAIR Findable - data & metadata are easy to find by humans & computers Accessible – standard download method & ‘protocols’ for access explicit Interoperable - they can be automatically combined with other data Reusable - sufficiently well described to be replicated or combined but this is easier said than done, and implementation domain specific Wilkinson et al. The FAIR Guiding Principles for scientific data management and stewardship doi:10.1038/sdata.2016.18

FAIR – a ‘slight’ LS bias To be Findable: F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (defined by R1 below) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource To be Accessible: A1. (meta)data are retrievable by their identifier using a standardized communications protocol A1.1 the protocol is open, free, and universally implementable A1.2 the protocol allows for an authentication and authorization procedure, where necessary A2. metadata are accessible, even when the data are no longer available

FAIR – more microstatements To be Interoperable: I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. I2. (meta)data use vocabularies that follow FAIR principles I3. (meta)data include qualified references to other (meta)data To be Reusable: R1. meta(data) are richly described with a plurality of accurate and relevant attributes R1.1. (meta)data are released with a clear and accessible data usage license R1.2. (meta)data are associated with detailed provenance R1.3. (meta)data meet domain-relevant community standards ‘let’s really think that you can replace researchers with automated systems’