4-way comparison of Data Citation Principles: Amsterdam Manifesto, CODATA, DataCite, Digital Curation Centre
FORCE11 Data Citation Synthesis Group


DataCite: DataCite was founded in 2009 to “increase acceptance of data as legitimate, citable contributions to the scientific record,” among other objectives.
DC: 1 and C-I are similar in intent, but C-I goes a bit further and emphasizes the need to elevate the status of data citations to that already accorded citations to other types of objects that make up the scholarly record.
Recommendation: General agreement; modify to capture the sentiments of all.

F-2 and C-III both mention persistence, but differ in that F-2 specifies the mechanism of “public” repositories (which is partly about access), while C-III is agnostic about the nature of the repository and the mode of access (open or not). There is a legitimate debate to be had over the value of open access, but citation practices need to be applicable to data stored in repositories that are open or subscription-based, publicly or privately owned (and even to data not stored in a repository).
DataCite: “Research data for which identifiers will be assigned must be located in data centres or repositories committed to persistence and maintenance.”
Recommendation: Do not get into the specifics of persistence of data and what should be kept for how long; do make it clear that not all data may be fully public; do recommend that data be stored in places committed to maintenance and persistence rather than on an individual’s machine. Make it clear that the citation may outlive the data.

We can infer that the reasons for using the mechanism specified in F- are those referred to in C-I, C-II, C-V, and perhaps C-VI.
The DataCite Metadata Schema supports the ability to describe a rich set of relationships between the article and data, including IsCitedBy, IsSupplementTo, IsReferencedBy, and IsDocumentedBy, suggesting that a registered dataset could “provide” metadata to assist with its placement in relationship to the article.
DCC 4b: In particular, there need to be services that use the citations in metrics to support the academic reward system, and services that can generate complete citations.
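The relationship types named above can be recorded as structured metadata attached to a dataset. The following is a minimal sketch in Python, assuming a simplified dictionary layout loosely modeled on the related-identifier element of the DataCite Metadata Schema. The helper `related_identifier`, the exact field layout, and the DOI `10.1234/example-article` are hypothetical and serve only as illustration.

```python
# Relationship types listed in the text above.
RELATION_TYPES = {"IsCitedBy", "IsSupplementTo", "IsReferencedBy", "IsDocumentedBy"}


def related_identifier(identifier, relation_type, identifier_type="DOI"):
    """Return one entry linking a dataset to a related object (hypothetical helper)."""
    if relation_type not in RELATION_TYPES:
        raise ValueError(f"unsupported relation type: {relation_type}")
    return {
        "relatedIdentifier": identifier,
        "relatedIdentifierType": identifier_type,
        "relationType": relation_type,
    }


# Hypothetical dataset record: the dataset supplements one article and is cited by it.
dataset_links = [
    related_identifier("10.1234/example-article", "IsSupplementTo"),
    related_identifier("10.1234/example-article", "IsCitedBy"),
]
```

A service of the kind DCC 4b calls for could read such entries to place a dataset in relation to the article without parsing the article text.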

DataCite recommends a bibliographic citation style, but makes no statement about its location in the publication.
F-4 is about the means rather than the purpose. We can infer that this is a means to enforce the purpose mentioned in C-I. The degree to which a data citation should resemble a bibliographic citation is debatable. It might be better to specify the purposes and functions that the citation should fulfill or facilitate, and leave the details of implementation to the communities who will need to implement them.
DCC 4a: It must be usable not only by humans but also by software tools, so that additional services may be built using these citations.
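One way to read DCC 4a is that the same citation record should yield both a human-readable string and a machine-readable form. The sketch below illustrates that idea in Python. The class `DataCitation`, its field names, the "Creator (Year): Title. Publisher. Identifier" layout, and all sample values are assumptions made for illustration, not a format prescribed by any of the four sets of principles.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class DataCitation:
    """Hypothetical citation record with one field per citation element."""
    creator: str
    year: int
    title: str
    publisher: str
    identifier: str

    def for_humans(self):
        # Assumed bibliographic-style layout: Creator (Year): Title. Publisher. Identifier
        return (f"{self.creator} ({self.year}): {self.title}. "
                f"{self.publisher}. {self.identifier}")

    def for_software(self):
        # The same fields serialized as JSON so tools can parse them.
        return json.dumps(asdict(self), sort_keys=True)


# Sample values are invented for the example.
citation = DataCitation(
    creator="Doe, J.",
    year=2013,
    title="Example survey data",
    publisher="Example Data Archive",
    identifier="doi:10.1234/example-dataset",
)
```

Keeping the fields separate and deriving the printed form from them leaves the choice of citation style to each community, in line with the suggestion above to specify purposes rather than implementation details.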

DataCite: “a persistent approach to access, identification, sharing, and re-use of datasets” is offered by DataCite, which uses DOIs to achieve this aim.
DCC 1: The citation itself must be able to identify uniquely the object cited, though different citations might use different methods or schemes to do so.
F-5 goes beyond C-IX in recommending a DOI as the specific type of persistent identifier, which is a means rather than a purpose. The purpose of using registries of persistent identifiers such as DOIs, ARKs, or other handles is to provide persistence of findability if the location of a digital object changes. (Purpose stated in C-III.) Many communities of practice have already developed systems of persistent identifiers, some of which pre-date the existence of DOIs. Some people argue that proper use of URIs (and the redirect mechanism that is already a part of HTTP) could accomplish persistence without the intermediate step of a PID registry. While use of widely accepted metadata standards (see C-IX) helps to ensure interoperability, it seems presumptuous to specify DOIs over other PID systems already in use.
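The purpose of a PID registry described above, keeping an object findable when its location changes, can be sketched with a small in-memory lookup table. This is an illustrative Python model only: the class `PidRegistry`, its method names, the identifier, and the URLs are all hypothetical, and a real registry (DOI, ARK, or another handle system) involves far more than a dictionary.

```python
class PidRegistry:
    """Toy model of a persistent-identifier registry (hypothetical, in memory)."""

    def __init__(self):
        self._locations = {}

    def register(self, pid, url):
        # A PID is assigned once and never reused for a different object.
        if pid in self._locations:
            raise ValueError(f"identifier already registered: {pid}")
        self._locations[pid] = url

    def relocate(self, pid, new_url):
        # Only the location changes; the identifier that citations use stays the same.
        if pid not in self._locations:
            raise KeyError(pid)
        self._locations[pid] = new_url

    def resolve(self, pid):
        return self._locations[pid]


registry = PidRegistry()
registry.register("10.1234/example-dataset", "https://old.example.org/data/42")
registry.relocate("10.1234/example-dataset", "https://new.example.org/datasets/42")
current_location = registry.resolve("10.1234/example-dataset")
```

The identifier scheme plays no role in the logic, which is consistent with the point that the purpose (persistence of findability) can be met by DOIs, ARKs, other handles, or well-managed URIs with HTTP redirects.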

F-6 specifies both a purpose (actionability by both humans and machines) and a particular means (a landing page). C-IV specifies only the purpose. The best means for accomplishing this purpose may evolve over time.
DCC 3a: It must provide the reader with enough information to access the dataset; 3b: indeed, when expressed digitally it should provide a mechanism for accessing the dataset through the Web infrastructure.
DataCite: “Clients will ensure that the URL assigned to the identifier provides users with the necessary information for making meaningful use of the data. Often this will be in the form of a landing page…It is best practice to have a landing page for all registered data…”

F-7 refers generally to the need to identify the specific version of the data being referenced. C-VI, C-VII, and C-VIII refer to distinct aspects of version: Provenance, Granularity, and Verifiability. The distinction among these aspects is useful.
DataCite supports content negotiation: dex.html
DCC 1: The citation itself must be able to identify uniquely the object cited, though different citations might use different methods or schemes to do so. DCC 2: It must be able to identify subsets of the data as well as the whole dataset.
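Content negotiation means that a client asks for a particular representation of the identified object through the HTTP `Accept` header, so the same identifier can serve a landing page to a person and metadata to a program. The sketch below builds such a request in Python without sending it. The function `negotiation_request` and the DOI are hypothetical, and whether a given resolver honors the `application/x-bibtex` media type for a particular identifier is an assumption to check against the resolver's own documentation.

```python
from urllib.request import Request


def negotiation_request(doi, media_type):
    """Build (but do not send) a request asking a DOI resolver for one representation."""
    return Request(f"https://doi.org/{doi}", headers={"Accept": media_type})


# Hypothetical DOI; the Accept header asks for a BibTeX citation instead of a landing page.
request = negotiation_request("10.1234/example-dataset", "application/x-bibtex")
```

Passing the request to `urllib.request.urlopen` would perform the lookup; it is left out here so the example runs without network access.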

F-8 and C-II both address the function of attribution. C-II draws the subtle distinction between legal attribution and the scholarly norm of giving credit to others for work they have performed. These are similar concepts, but there are some important differences between them, and in what is necessary to accomplish them.
The DataCite Metadata Schema supports the ability to supply metadata for unlimited contributors, including name, role (or type), and identifier information.
DCC 4b: In particular, there need to be services that use the citations in metrics to support the academic reward system, and services that can generate complete citations.