DSA and FAIR: a perfect couple

Slides:

Advertisements

Similar presentations

DSA and the Certification Framework Ingrid Dillo Data Archiving and Networked Services DSA Conference, Florence 10 December 2012.

Advertisements

Selecting a Data Sharing Repository. 2 Why Share Data? Enabling others to replicate and verify results as part of the scientific process Allows researchers.

Data Seal of Approval Overview Lightning Talk RDA Plenary 5 – San Diego March 11, 2015 Mary Vardigan University of Michigan Inter-university Consortium.

Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.

Evolving Roles in Scholarly Communications Susan Reilly, APA, Frascati, 7th Nov, 2012.

Data Archiving and Networked Services DANS is an institute of KNAW en NWO Trusted Digital Archives and the Data Seal of Approval Peter Doorn Data Archiving.

Data Archiving and Networked Services DANS is an institute of KNAW en NWO and the Peter Doorn Data Archiving and Networked Services EUDAT Conference Trust.

Repository Audit and Certification DSA–WDS Partnership WG RDA Working Groups Meeting at NIST November 13-14, 2014.

Data Archiving and Networked Services DANS is an institute of KNAW en NWO Data Archiving and Networked Services Introduction to Data Management Planning.

Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.

4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.

Data Seal of Approval (DSA) SEEDS Kick-off meeting May 5, Lausanne Renate Kunz.

Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.

GEO Data Management Principles Implementation : World Data System–Data Seal of Approval (WDS-DSA) Core Certification of Digital Repositories Dr Mustapha.

SciDataCon 2014, WDS Forum, Dehli WDS Certification Objective: building trust in the usage of data & data services Michael Diepenbroek Rorie Edmunds Mustapha.

DSA & WDS WG Certification RDA Outputs: Munich 20 February 2015.

Information Structures: Standards Week 7 Lecture notes INF 380E: Perspectives on Information 1.

PhD-course Research Data Management (RDM) Expert Centre Research Data.

1 Using DLESE: Finding Resources to Enhance Teaching Shelley Olds Holly Devaul 11 July 2004.

Core Certification for Trustworthy Data Repositories

WP3: Common policies and implementation strategies

CESSDA SaW Training on Trust, Identifying Demand & Networking

FAIR Data in Trustworthy Data Repositories:

Towards a FAIR Assessment Tool for Datasets

2nd DPHEP Collaboration Workshop

Digital Repository Certification Schema A Pathway for Implementing the GEO Data Sharing and Data Management Principles Robert R. Downs, PhD Sr. Digital.

Does it make sense to apply the FAIR Data Principles to Software?

Auditing of Trustworthy Data Repositories – Speakers

ELIXIR Core Data Resources and Deposition Databases

M25 Group Open Library Data A British Library Perspective

Developing Criteria to Establish Trusted Digital Repositories

Certification of Trusted Repositories

RDA/WDS IG Certification of Digital Repositories The new 'Core Trustworthy Data Repository Requirements' hands-on RDA Plenary 9, Barcelona,

D33.1B PEER REVIEW OF DIGITAL REPOSITORIES

DANS Certification Efforts Use Case

FAIR Metadata RDA 10 Luiz Olavo Bonino – - September 21, 2017.

From the old to the new… Towards better resource discoverability

Trustworthiness of Preservation Systems

Libraries as Data-Centers for the Arts and Humanities

FAIR Sample and Data Access

Ways to upgrade the FAIRness of your data repository.

FAIR Metrics RDA 10 Luiz Bonino – - September 21, 2017.

GFBio – Education module

knowledge organization for a food secure world

FAIR Data Management, Trustworthy Digital Repositories and Business Continuity / Disaster Preparedness

General Finnish DMP Guidance

Identifiers Answer Questions

Sophia Lafferty-hess | research data manager

OPEN DATA – F.A.I.R. PRINCIPLES

Experiences of the Digital Repository of Ireland

Metadata for research outputs management Part 2

OpenML Workshop Eindhoven TU/e,

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Creating a Culture of Open Data in Academia

Statistical Organization Giovanni Savio and Majed Skaini, SD, UN-ESCWA

An Open Archival Repository System for UT Austin

How to Implement the FAIR Data Principles? Elly Dijk

The WDS/RDA Assessment of Data Fitness for Use Working Group

From FAIRy tale to FAIR enough

Jisc Research Data Shared Service (RDSS)

Bird of Feather Session

Automatic evaluation of fairness

eScience - FAIR Science

A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case

Research data lifecycle²

Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek

One Step Forward, Two Steps Back:

One Step Forward, Two Steps Back:

Australian and New Zealand Metadata Working Group

Cultivating Semantics for Data in Agriculture and Nutrition

Presentation transcript:

DSA and FAIR: a perfect couple Rob Hooft (DTL, ELIXIR NL), Peter Doorn and Ingrid Dillo (DANS) SciDataCon, session Data Fitness for Reuse Denver, 12 September 2016

DANS and the Data Seal of Approval 2005: DANS to promote and provide permanent access to digital research information Formulate quality guidelines for digital repositories including DANS (TRAC, Nestor) 2006: 5 basic principles as basis for 16 DSA guidelines 2009: international DSA Board Today: over 62 seals acquired and many in progress

DSA principles The DSA is intended to ensure that: The data can be found on the internet The data are accessible (clear rights and licenses) The data are in a usable format The data are reliable The data are identified in a unique and persistent way so that they can be referred to (guidelines: metadata, ltp, integrity and authenticity, persistency, technical infrastrcuture, workflows and processes..)

FAIR principles Leiden 2014: minimal set of community agreed guiding principles to make data more easily discoverable, accessible, appropriately integrated and re-usable, and adequately citable. FAIR Principles: Findable Accessible Interoperable Re-usable (all four both for machines and for people)

Resemblance DSA Principles FAIR Principles data can be found on the internet findable data are accessible accessible data are in a usable format interoperable data are reliable reusable data can be referred to (citable) usable format (DSA) is just an aspect of interoperability (FAIR) reliability (DSA) is a condition for reuse (FAIR) FAIR explicitly addresses machine readability citability is in FAIR an aspect of findability

From the FAIR perspective: Findable DSA does Well known location on the internet (meta)Data gets persistent identifier FAIR also requires Well described what it is (not only what was done with it) Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Accessible DSA does Define protocol, require reliability FAIR also requires Standardized authentication where needed Metadata are kept even if data are deleted Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Interoperable DSA does Require usable format FAIR also requires Standardized vocabulary (mapped?) FAIR vocabularies Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Reusable DSA does Require license FAIR also requires Rich metadata Rich provenance Adherence to community standards Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

Data must be stored in a DSA certified repository in order to become FAIR * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the DSA perspective: Combine and operationalize Growing demand for quality criteria for research datasets Combine the ideas of DSA and FAIR Focus of the principles as quality criteria: DSA – digital repositories FAIR – research data(sets) Operationalize the principles to make them easily implementable in any trustworthy digital repository

From the DSA perspective: How could it work? Each principle a separate dimension of data quality Score data on each dimension, e.g. for Findable - defined by metadata, documentation (and identifier for citation): 0 = No URI or PID and no documentation 1 = PID without or with insufficient metadata 2 = PID with limited metadata present to understand the data 3 = PID with extensive metadata and rich additional documentation available Total score of FAIRness as an indicator of data quality Scoring by humans and machines Scoring: scoring at ingest by data archivists of TDR after reuse by data users (community review)

DSA and FAIR: a perfect couple To sum up: DSA and FAIR: a perfect couple Data must be stored in a DSA certified repository in order to become FAIR DSA and FAIR together offer great possibilities for quality assessment of research data

Thank you for listening! rob.hooft@dtls.nl peter.doorn@dans.knaw.nl ingrid.dillo@dans.knaw.nl http://www.datasealofapproval.org/en/ https://rd-alliance.org/group/repository-audit-and-certification-dsa–wds- partnership-wg/outcomes/dsa-wds-partership http://www.nature.com/articles/sdata201618