Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek

Slides:



Advertisements
Similar presentations
Michael Maune Carl von Ossietzky University, Oldenburg and Institute for Science Networking Oldenburg Distributed Open Access Reference Citations Service.
Advertisements

Product Quality and Documentation – Recent Developments H. K. Ramapriyan Assistant Project Manager ESDIS Project, Code 423, NASA GFSC
Metadata: An Introduction By Wendy Duff October 13, 2001 ECURE.
Vivien Bonazzi Ph.D. Program Director: Computational Biology (NHGRI) Co Chair Software Methods & Systems (BD2K) Biomedical Big Data Initiative (BD2K)
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Trusted Digital Archives and the Data Seal of Approval Peter Doorn Data Archiving.
Publishing Data: Scientific Data as Integral Part of Scholarly Publishing Theodora Blum (BMJ) Adrian Burton (ANDS) Sarah Callaghan (BADC) Sünje Dallmeier-Thiessen.
Repository Audit and Certification DSA–WDS Partnership WG RDA Working Groups Meeting at NIST November 13-14, 2014.
1 Data Description Registry Interoperability (DDRI) Working Group Dimitris Gavrilis, Amir Aryani.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
Session on Disasters Management: Overview Karen Moe NASA Earth Science Technology Office WGISS-37 Meeting April 14-18, 2014.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
NIH BioCADDIE / Force11 Data Citation Pilot Kickoff Meeting Nine Zero Hotel, Boston MA, 3 February 2016 Introduction: Tim Clark, Maryann Martone and Joan.
RDA, 5th Plenary, San Diego WDS Certification Objective: building trust in the usage of data & data services Michael Diepenbroek Rorie Edmunds Mustapha.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
GEO Data Management Principles Implementation : World Data System–Data Seal of Approval (WDS-DSA) Core Certification of Digital Repositories Dr Mustapha.
SciDataCon 2014, WDS Forum, Dehli WDS Certification Objective: building trust in the usage of data & data services Michael Diepenbroek Rorie Edmunds Mustapha.
DSA & WDS WG Certification RDA Outputs: Munich 20 February 2015.
WP3: Common policies and implementation strategies
CESSDA SaW Training on Trust, Identifying Demand & Networking
FAIR Data in Trustworthy Data Repositories:
2nd DPHEP Collaboration Workshop
Legacy and future of the World Data System (WDS) certification of data services and networks Dr Mustapha Mokrane, Executive Director, WDS International.
Digital Repository Certification Schema A Pathway for Implementing the GEO Data Sharing and Data Management Principles Robert R. Downs, PhD Sr. Digital.
Auditing of Trustworthy Data Repositories – Speakers
User Characterization in Search Personalization
Preparing a Trustworthy Domain Repository for ISO Certification
DSA and FAIR: a perfect couple
ELIXIR Core Data Resources and Deposition Databases
Implementing the Data Management Principles Opportunities and Advantages Robert R. Downs, PhD Sr. Digital Archivist, CIESIN, Columbia University.
Metadata Catalogue and Knowledge Network
Toward Best Practice for Language Resource Conversion
Paolo Budroni, University of Vienna
Certification of Trusted Repositories
DSA–WDS Partnership: Streamlining the landscape of data repository certification Lesley Rickards, Mary Vardigan, Ingrid Dillo, Françoise Genova, Hervé.
Presentation Practices
An Overview of Data-PASS Shared Catalog
Donatella Castelli CNR-ISTI
FAIR Metrics RDA 10 Luiz Bonino – - September 21, 2017.
The Challenge.
Fitness for use: Users of the U. S
W. Christopher Lenhardt
Introduction Helena Cousijn, Claire Austin & Michael Diepenbroek
EOSCpilot Skills Landscape & Framework
OpenML Workshop Eindhoven TU/e,
EOSCpilot All Hands Meeting 9 March 2018, Pisa
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
From Observational Data to Information (OD2I IG )
Criteria for Data Fitness
eContentplus Programme (2005 – 2008)
Introduction to the MIABIS SOP Working Group
WDS/RDA Assessment of Data Fitness for Use Claire Austin, Helena Cousijn, Michael Diepenbroek, Jon Petters.
EOSCpilot All Hands Meeting 9 March 2018, Pisa
The WDS/RDA Assessment of Data Fitness for Use Working Group
From FAIRy tale to FAIR enough
Bird of Feather Session
Automatic evaluation of fairness
eScience - FAIR Science
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Assessing FAIRness within the Enabling FAIR Data project WG WDS/RDA Assessment of Data Fitness for Use RDA 11th Plenary 22 March 2018 Shelley Stall,
One Step Forward, Two Steps Back:
Supporting Open Research
One Step Forward, Two Steps Back:
Cultivating Semantics for Data in Agriculture and Nutrition
Presentation transcript:

Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek WDS/RDA Publishing Data IG WDS/RDA Certification of Digital Repositories IG Assessment of Data Fitness for Use Introduction Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek

Initiatives F A I R 5 ★ Open Data (Tim Berners Lee) FAIR principles A design framework & exemplar metrics for FAIRness GEO label facets ESIP Information Quality Cluster Enabling FAIR Data Across the Earth & Space Sciences Certification of data centers/repositories F A I R 2 User Reviews 1 Archivist Assessment 24 Downloads

Criteria I Inherent properties Non-inherent properties objectively verifiable or even measurable e.g. validity of used methodologies, completeness of metadata Non-inherent properties ~subjective descriptions assigned to data e.g. social tagging, downloads as indicator to data quality

Criteria II properties directly related to data objects E.g. PIDs, citation, precision of data values properties related to data findability & accessibility E.g. quality of services for data discovery and interoperability properties characterizing data management processes E.g. curational workflows, tools, human resources! Not transparent to users

Metrics Dimensions should be independant Evaluation and ranking should be practical Automatic versus manual evaluation Direct versus indirect evaluation (proxies)

Agenda Data Fitness for Use as part of the CoreTrustSeal: Mustapha Mokrane (ICSU WDS) – Chair of the CoreTrustSeal Board Assessing FAIRness within the Enabling FAIR Data project: Shelley Stall - Director of the AGU Data Program A design framework and exemplar metrics for FAIRness: Peter Doorn – Director of Data Archiving and Networked Services (DANS) Proposed criteria Data Fitness for Use WG: Michael Diepenbroek (PANGAEA) – Co-Chair of the Data Fitness for Use WG Discussion on governance (30 minutes)

Dimensions & evidence required Completeness & Quality of Content Evidence: Metadata & data Findability Evidence: Services exposed, metadata & data Accessibility Evidence: Services exposed, metadata, documentation of system & services Interoperability Evidence: Services exposed, metadata Curation Evidence: Services exposed, metadata & data, documentation of system & services

Completeness & Quality of Content Evidence: metadata Metadata completeness Citation (authorship, year, comprehensive title, PID) Content description (listing of measurement & obvervation types incl. used methods) Coverage (spatial, temporal) Provenance authorship (PIs, institutions, labs) data collection/generation (sampling events, processing steps, experimental setup) references to related work (literature) Terms of usage: licenses, other conditions, protection (ethical issues) Persistent identifier (for the data set, others for literature, authors, projects, terms etc.) Metadata adequate to science domain (domain expertise needed)

Completeness & Quality of Content Evidence: metadata & data Data completeness difficult to evaluate. Minimum: content description should match data content (for data matrices comparison of column headers with content description)

Completeness & Quality of Content Evidence: metadata & data Metadata & data correctness Content description matches data content Validity of used methods (needs domain expertise)

Completeness & Quality of Content Evidence: metadata & data Machine readibility of data & metadata data & metadata consistently structured (consistent, standard formatting) data & metadata harmonized (consistent use of metadata elements (possibly needs complementary information from other requirements, e.g. usage of RDBMS & standard terminologies (ontologies), type of curation)

Completeness & Quality of Content Evidence: metadata & data Machine readibility of data & metadata data & metadata consistently structured (consistent, standard formatting) data & metadata harmonized (consistent use of metadata elements (possibly needs complementary information from other requirements, e.g. usage of RDBMS & standard terminologies (ontologies), type of curation)

All other dimensions Do supplied services match results from certification? Findability: Sufficient discovery metadata - usually metadata are enriched with related terms, e.g. using terminologies/ontologies. Such terms might not be visible in the metadata or data Curation: Curation level claimed by the repository matches the completeness, correctness, structuring, and harmonization of metadata & data

Service