DSA and FAIR: a perfect couple

Slides:



Advertisements
Similar presentations
DSA and the Certification Framework Ingrid Dillo Data Archiving and Networked Services DSA Conference, Florence 10 December 2012.
Advertisements

Selecting a Data Sharing Repository. 2 Why Share Data? Enabling others to replicate and verify results as part of the scientific process Allows researchers.
Data Seal of Approval Overview Lightning Talk RDA Plenary 5 – San Diego March 11, 2015 Mary Vardigan University of Michigan Inter-university Consortium.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
Evolving Roles in Scholarly Communications Susan Reilly, APA, Frascati, 7th Nov, 2012.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Trusted Digital Archives and the Data Seal of Approval Peter Doorn Data Archiving.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO and the Peter Doorn Data Archiving and Networked Services EUDAT Conference Trust.
Repository Audit and Certification DSA–WDS Partnership WG RDA Working Groups Meeting at NIST November 13-14, 2014.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Data Archiving and Networked Services Introduction to Data Management Planning.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
Data Seal of Approval (DSA) SEEDS Kick-off meeting May 5, Lausanne Renate Kunz.
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
GEO Data Management Principles Implementation : World Data System–Data Seal of Approval (WDS-DSA) Core Certification of Digital Repositories Dr Mustapha.
SciDataCon 2014, WDS Forum, Dehli WDS Certification Objective: building trust in the usage of data & data services Michael Diepenbroek Rorie Edmunds Mustapha.
DSA & WDS WG Certification RDA Outputs: Munich 20 February 2015.
Information Structures: Standards Week 7 Lecture notes INF 380E: Perspectives on Information 1.
PhD-course Research Data Management (RDM) Expert Centre Research Data.
1 Using DLESE: Finding Resources to Enhance Teaching Shelley Olds Holly Devaul 11 July 2004.
Core Certification for Trustworthy Data Repositories
WP3: Common policies and implementation strategies
CESSDA SaW Training on Trust, Identifying Demand & Networking
FAIR Data in Trustworthy Data Repositories:
Towards a FAIR Assessment Tool for Datasets
2nd DPHEP Collaboration Workshop
Digital Repository Certification Schema A Pathway for Implementing the GEO Data Sharing and Data Management Principles Robert R. Downs, PhD Sr. Digital.
Does it make sense to apply the FAIR Data Principles to Software?
Auditing of Trustworthy Data Repositories – Speakers
ELIXIR Core Data Resources and Deposition Databases
M25 Group Open Library Data A British Library Perspective
Developing Criteria to Establish Trusted Digital Repositories
Certification of Trusted Repositories
RDA/WDS IG Certification of Digital Repositories The new 'Core Trustworthy Data Repository Requirements' hands-on RDA Plenary 9, Barcelona,
D33.1B PEER REVIEW OF DIGITAL REPOSITORIES
DANS Certification Efforts Use Case
FAIR Metadata RDA 10 Luiz Olavo Bonino – - September 21, 2017.
From the old to the new… Towards better resource discoverability
Trustworthiness of Preservation Systems
Libraries as Data-Centers for the Arts and Humanities
FAIR Sample and Data Access
Ways to upgrade the FAIRness of your data repository.
FAIR Metrics RDA 10 Luiz Bonino – - September 21, 2017.
GFBio – Education module
knowledge organization for a food secure world
FAIR Data Management, Trustworthy Digital Repositories and Business Continuity / Disaster Preparedness
General Finnish DMP Guidance
Identifiers Answer Questions
Sophia Lafferty-hess | research data manager
OPEN DATA – F.A.I.R. PRINCIPLES
Experiences of the Digital Repository of Ireland
Metadata for research outputs management Part 2
OpenML Workshop Eindhoven TU/e,
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Creating a Culture of Open Data in Academia
Statistical Organization Giovanni Savio and Majed Skaini, SD, UN-ESCWA
An Open Archival Repository System for UT Austin
How to Implement the FAIR Data Principles? Elly Dijk
The WDS/RDA Assessment of Data Fitness for Use Working Group
From FAIRy tale to FAIR enough
Jisc Research Data Shared Service (RDSS)
Bird of Feather Session
Automatic evaluation of fairness
eScience - FAIR Science
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Research data lifecycle²
Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek
One Step Forward, Two Steps Back:
One Step Forward, Two Steps Back:
Australian and New Zealand Metadata Working Group
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

DSA and FAIR: a perfect couple Rob Hooft (DTL, ELIXIR NL), Peter Doorn and Ingrid Dillo (DANS) SciDataCon, session Data Fitness for Reuse Denver, 12 September 2016

DANS and the Data Seal of Approval 2005: DANS to promote and provide permanent access to digital research information Formulate quality guidelines for digital repositories including DANS (TRAC, Nestor) 2006: 5 basic principles as basis for 16 DSA guidelines 2009: international DSA Board Today: over 62 seals acquired and many in progress

DSA principles The DSA is intended to ensure that: The data can be found on the internet The data are accessible (clear rights and licenses) The data are in a usable format The data are reliable The data are identified in a unique and persistent way so that they can be referred to (guidelines: metadata, ltp, integrity and authenticity, persistency, technical infrastrcuture, workflows and processes..)

FAIR principles Leiden 2014: minimal set of community agreed guiding principles to make data more easily discoverable, accessible, appropriately integrated and re-usable, and adequately citable. FAIR Principles: Findable Accessible Interoperable Re-usable (all four both for machines and for people)

Resemblance DSA Principles FAIR Principles data can be found on the internet findable data are accessible accessible data are in a usable format interoperable data are reliable reusable data can be referred to (citable) usable format (DSA) is just an aspect of interoperability (FAIR) reliability (DSA) is a condition for reuse (FAIR) FAIR explicitly addresses machine readability citability is in FAIR an aspect of findability

From the FAIR perspective: Findable DSA does Well known location on the internet (meta)Data gets persistent identifier FAIR also requires Well described what it is (not only what was done with it) Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Accessible DSA does Define protocol, require reliability FAIR also requires Standardized authentication where needed Metadata are kept even if data are deleted Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Interoperable DSA does Require usable format FAIR also requires Standardized vocabulary (mapped?) FAIR vocabularies Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the FAIR perspective: Reusable DSA does Require license FAIR also requires Rich metadata Rich provenance Adherence to community standards Also computer readable * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

Data must be stored in a DSA certified repository in order to become FAIR * Especially institutional or special: catalog it! * Keywords for re-use. What is in there, not why you collected * Keywords for subsets? * Librarians/archivists can help! Go look for them! * Talk to the catalog people early to find out what they need. ==== Especially for data that is deposited to institutional repositories, it is very important that they will also be listed in a catalog: some place where researchers like yourself would be looking for existing data sets. Keywords under which your data can be found should especially be triggered by possibilities for re-use: they should indicate what is in the data and what could be done with it, it is not enough to mention why you collected the data. Subsets of your data may be usable for researchers of different fields, and it is a good idea to think of keywords and descriptions that make this kind of subsets findable by themselves. Librarians and archival specialists can often help you to find suitable keywords and make good descriptions. If cataloging is applicable to your data, you should select the catalogs that you want to list your data, and check out as early as possible what you need to do to end up in there.

From the DSA perspective: Combine and operationalize Growing demand for quality criteria for research datasets Combine the ideas of DSA and FAIR Focus of the principles as quality criteria: DSA – digital repositories FAIR – research data(sets) Operationalize the principles to make them easily implementable in any trustworthy digital repository

From the DSA perspective: How could it work? Each principle a separate dimension of data quality Score data on each dimension, e.g. for Findable - defined by metadata, documentation (and identifier for citation): 0 = No URI or PID and no documentation 1 = PID without or with insufficient metadata 2 = PID with limited metadata present to understand the data 3 = PID with extensive metadata and rich additional documentation available Total score of FAIRness as an indicator of data quality Scoring by humans and machines Scoring: scoring at ingest by data archivists of TDR after reuse by data users (community review)

DSA and FAIR: a perfect couple To sum up: DSA and FAIR: a perfect couple Data must be stored in a DSA certified repository in order to become FAIR DSA and FAIR together offer great possibilities for quality assessment of research data

Thank you for listening! rob.hooft@dtls.nl peter.doorn@dans.knaw.nl ingrid.dillo@dans.knaw.nl http://www.datasealofapproval.org/en/ https://rd-alliance.org/group/repository-audit-and-certification-dsa–wds- partnership-wg/outcomes/dsa-wds-partership http://www.nature.com/articles/sdata201618