Identifiers and Citation

Presentation transcript:

Identifiers and Citation: an introduction. Joan Starr, California Digital Library

Agenda
Introducing identifiers: ARKs and DOIs; using EZID to mint and assign them; using DataCite to track citations
Citation practices and issues: granularity; versioning; dynamic data
Q&A

Why identify data? To get credit, ensure transparency, measure impact, track use (even over a long project), and aid reproducibility.

Which identifier? Choose one that is machine (and human) actionable, globally unique, widely used by a community, and backed by a commitment to persistence. Add your own requirements to this list. Credit: Joint Declaration of Data Citation Principles

Introducing EZID (ezid.cdlib.org). One option for creating and managing long-term, globally unique identifiers is EZID, from California Digital Library. At present, EZID offers two identifier types that meet the above requirements: ARKs and DOIs. Other identifiers meet these requirements as well, but I am not qualified to speak at length about all of them, so today I’m going to talk about these two.

DOIs compared with ARKs
DOIs: strict metadata requirements; from the scholarly communication community; established “brand name”; use case: data citation.
ARKs: flexible metadata guidelines; from the archives and museums community; option-rich, open source; use case: data documentation.

ARKs and DOIs in data management plans Image credit: http://www.flickr.com/photos/52041471@N04/6053042920

ARKs: Data Documentation
What it looks like: At the top-level directory/folder: project title, unique identifier, date (yyyy or yyyy.mm.dd). At sub-directories: optional identifiers at granular levels.
Sample plan language: [Team] follows the recommended best practice for good data management by assigning unique identifiers (ARKs) to the data as part of the data documentation.
Why it’s important: Researcher benefits: data documentation helps you keep track of (and remember) aspects of your data throughout the research project.
Reference for “documentation and metadata”: http://libraries.mit.edu/guides/subjects/data-management/metadata.html

DOIs: Data Citation
What it looks like: Aguilée R, Lambert A, Claessen D (2011) Data from: Ecological speciation in dynamic landscapes. Journal of Evolutionary Biology doi:10.5061/dryad.74024
Sample plan language: Publication of data shall occur during the project, if appropriate, or at the end of the project, consistent with normal scientific practices. [Team] follows a standardized data product citation, including DOI, that indicates the version and how to obtain a copy of that product.
Why it’s important: The OSTP mandate is to identify and provide “appropriate attribution to scientific data sets.” Researcher benefits: credit, increased citations, increased productivity.
OSTP Public Access Memo: https://web.archive.org/web/20161218061244/https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_public_access_memo_2013.pdf
Increased citations: Piwowar’s 2007 study: http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0000308
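A minimal sketch of how a citation in the shape shown above could be assembled from its parts. The function name, its parameters, and the placement of the optional version are assumptions made for illustration; they are not a format prescribed by the slides or by DataCite.

```python
def format_data_citation(authors, year, title, source, doi, version=None):
    """Assemble a dataset citation string that ends with the DOI.

    Hypothetical helper: the field order mirrors the Dryad example on the
    slide; the optional version placement is an assumption.
    """
    author_part = ", ".join(authors)
    version_part = f" Version {version}." if version else ""
    return f"{author_part} ({year}) {title}.{version_part} {source} doi:{doi}"


citation = format_data_citation(
    authors=["Aguilée R", "Lambert A", "Claessen D"],
    year=2011,
    title="Data from: Ecological speciation in dynamic landscapes",
    source="Journal of Evolutionary Biology",
    doi="10.5061/dryad.74024",
)
print(citation)
```

Keeping the DOI as a separate field makes it easy to render the same record as plain text or as a clickable link.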

Working with DOIs in EZID Image credit: http://www.flickr.com/photos/52041471@N04/6053042920

IDENTIFY image ©2014 University of California 1. Assign a DOI.

ezid.cdlib.org

CITE image ©2014 University of California 2. Use the DOI.

TRACK image ©2014 University of California 3. Track the DOI’s use.

EZID is a founding member of DataCite, so all EZID DOIs are DataCite DOIs, and EZID clients can use all DataCite tools and services. Shown on the slide: profiles.datacite.org, orcid.org, search.datacite.org, auto-update.

AUTOMATE image ©2014 University of California 4. Make assigning DOIs automatic.

To try it: username: apitest, password: apitest
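As a rough sketch of what automating DOI assignment could look like, the snippet below only builds an HTTP request and does not send it. Several details are assumptions to verify against the current EZID API documentation before use: the `/shoulder/` mint path, the test shoulder `doi:10.5072/FK2`, the ANVL-style `name: value` request body, and the `datacite.*` and `_target` element names. The helper names are hypothetical.

```python
import base64
import urllib.request

EZID_BASE = "https://ezid.cdlib.org"   # service host named on the slides
TEST_SHOULDER = "doi:10.5072/FK2"      # assumption: a test DOI shoulder


def anvl_escape(text):
    """Percent-encode characters that are structural in the request body."""
    return text.replace("%", "%25").replace("\n", "%0A").replace("\r", "%0D")


def to_anvl(metadata):
    """Serialize a dict as one 'name: value' line per element."""
    lines = []
    for name, value in metadata.items():
        # A colon inside a name would be read as the name/value separator.
        safe_name = anvl_escape(name).replace(":", "%3A")
        lines.append(f"{safe_name}: {anvl_escape(value)}")
    return "\n".join(lines)


def build_mint_request(shoulder, metadata, username, password):
    """Build (but do not send) a POST request asking for a new identifier."""
    request = urllib.request.Request(
        f"{EZID_BASE}/shoulder/{shoulder}",   # assumed mint endpoint
        data=to_anvl(metadata).encode("utf-8"),
        method="POST",
    )
    request.add_header("Content-Type", "text/plain; charset=UTF-8")
    token = base64.b64encode(f"{username}:{password}".encode("utf-8"))
    request.add_header("Authorization", "Basic " + token.decode("ascii"))
    return request


request = build_mint_request(
    TEST_SHOULDER,
    {
        "_target": "https://example.org/datasets/42",  # hypothetical landing page
        "datacite.title": "Example dataset",
        "datacite.creator": "Example, Author",
        "datacite.publisher": "Example Repository",
        "datacite.publicationyear": "2016",
    },
    username="apitest",   # test credentials given on the slide
    password="apitest",
)
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would actually contact the service; the sketch stops short of that so it can be inspected safely first.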

GRANULARITY Image: https://www.flickr.com/photos/39495096@N04/4656925337 by velmc. Granularity describes the degree of aggregation of the object to be registered. Different levels of granularity can be useful depending on discipline or resource. An object can be identified at any desired level of granularity (element of a file, a file, a file collection, etc.) as the purpose dictates. When deciding which level of granularity to use when registering an object, there are several factors to keep in mind.

Granularity considerations
Citation: What is likely to be cited?
Use cases: How will funders, publishers, administrators, etc. use the data?
Complexity: What type of resource or structure is being registered? Complex objects may require a more granular identifier structure than a document or image file.
Sustainability: PID owners must be able to maintain target URLs and citation metadata for PIDs. DOIs have strict contractual requirements for maintenance.
Relationships: The DataCite Metadata Schema includes a flexible mechanism to specify relationships between objects.
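The relationship mechanism mentioned above can be sketched as plain data. The snippet assumes a collection-level DOI with separately registered files, and expresses the links with the DataCite `relatedIdentifier` properties and the `HasPart` / `IsPartOf` relation types. The DOIs are hypothetical, and the dictionary layout is an illustration rather than a required serialization.

```python
def relate_collection(collection_doi, file_dois):
    """Return relatedIdentifier entries for both directions of the link."""
    # Entries to attach to the collection's metadata record.
    collection_links = [
        {
            "relatedIdentifier": doi,
            "relatedIdentifierType": "DOI",
            "relationType": "HasPart",
        }
        for doi in file_dois
    ]
    # Entries to attach to each file's metadata record, keyed by file DOI.
    file_links = {
        doi: [
            {
                "relatedIdentifier": collection_doi,
                "relatedIdentifierType": "DOI",
                "relationType": "IsPartOf",
            }
        ]
        for doi in file_dois
    }
    return collection_links, file_links


# Hypothetical identifiers, for illustration only.
collection_links, file_links = relate_collection(
    "10.5072/FK2COLLECTION",
    ["10.5072/FK2FILE1", "10.5072/FK2FILE2"],
)
print(len(collection_links), sorted(file_links))
```

Recording the link in both directions lets a reader move from a cited file to its collection and back, which matters more as the chosen granularity gets finer.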

VERSIONING Image: https://www.flickr.com/photos/sweetascandybooth/13104215864 by sweet as candy photo. I’ll base my versioning practice suggestions on two sources: the DataCite Metadata Schema documentation and the Earth Science Information Partners (ESIP).

Versioning considerations
ESIP: Track major_version.minor_version. For ESIP, see: http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines#Note_on_Versioning_and_Locators
DataCite: Register a new DOI for a major version change. Individual stewards need to determine what counts as major versus minor.
DataCite: Use AlternateIdentifier and RelatedIdentifier to indicate various information updates, and Description to indicate the nature and file/record range of the version. For DataCite, see: http://schema.datacite.org/
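One way to read these two recommendations together is sketched below: versions are tracked as `major.minor`, and a new DOI is registered only when the major number increases. Treating both parts as integers is an assumption, as are the helper names and the example DOI; deciding what counts as a major change remains the steward's call.

```python
def parse_version(version):
    """Split a 'major.minor' string into a pair of integers."""
    major, minor = version.split(".")
    return int(major), int(minor)


def needs_new_doi(old_version, new_version):
    """True when the major version number has increased."""
    return parse_version(new_version)[0] > parse_version(old_version)[0]


def new_version_link(previous_doi):
    """relatedIdentifier entry pointing a new version at its predecessor."""
    return {
        "relatedIdentifier": previous_doi,
        "relatedIdentifierType": "DOI",
        "relationType": "IsNewVersionOf",
    }


print(needs_new_doi("1.4", "2.0"))  # True: major change, register a new DOI
print(needs_new_doi("1.4", "1.5"))  # False: minor change, keep the DOI
print(new_version_link("10.5072/FK2V1"))  # hypothetical previous DOI
```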

DYNAMIC DATA Image: https://www.flickr.com/photos/adstream/2723201167 by Jon Anderson. Citation of dynamic data is an evolving topic. There is no single best practice, because the best approach depends on the characteristics of the data, as well as the capabilities of the repository storing the data. I’m going to present the DataCite recommendations and then point you to some sources for more information and developments.

Citing dynamic data
For datasets that are continuously and rapidly updated, there are special challenges in both citation and preservation. Four approaches are possible:
1. Cite a specific slice (the set of updates to the dataset made during a particular period of time or to a particular area of the dataset).
2. Cite a specific snap-shot (a copy of the entire dataset made at a specific time).
3. Cite the continuously updated dataset, and add an Access Date and Time to the citation.
4. Cite a query, time-stamped for re-execution against a versioned database (Research Data Alliance proposal).
Note that a “slice” and a “snap-shot” are versions of the dataset and require unique identifiers. The third and fourth options are controversial: the third because it necessarily means that following the citation does not return the resource as it was when cited; the fourth because it shifts a significant burden onto repositories, which must store database versions for all the queries.
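The third and fourth approaches can be illustrated with a small sketch. The first helper appends an access date and time to an existing citation; the second records a query with its execution timestamp and a hash, so a repository with a versioned database could in principle re-run it later. The record layout, the whitespace-only normalization, and the function names are assumptions for illustration and are not the RDA specification.

```python
import hashlib
from datetime import datetime, timezone


def cite_with_access_time(citation, accessed):
    """Approach 3: add the access date and time (UTC) to a citation."""
    stamp = accessed.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%S")
    return f"{citation} Accessed {stamp} UTC."


def query_citation_record(query, executed):
    """Approach 4: record a query, its hash, and its execution time."""
    # Collapse runs of whitespace so trivially different spellings of the
    # same query share one hash (a simplifying assumption).
    normalized = " ".join(query.split())
    return {
        "query": normalized,
        "query_hash": hashlib.sha256(normalized.encode("utf-8")).hexdigest(),
        "executed_at": executed.astimezone(timezone.utc).isoformat(),
    }


when = datetime(2016, 5, 16, 12, 0, 0, tzinfo=timezone.utc)
print(cite_with_access_time("Example Author (2016) Example stream.", when))
print(query_citation_record("SELECT *  FROM obs\nWHERE year = 2015", when))
```

The timestamp only helps if the repository can reconstruct the state of the data at that moment, which is the storage burden noted above.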

More about dynamic data Ball, A. & Duke, M. (2015). ‘How to Cite Datasets and Link to Publications’. DCC How-to Guides. Edinburgh: Digital Curation Centre. Available online: http://www.dcc.ac.uk/resources/how-guides/cite-datasets#sec:versions For more information about the RDA approach, see this post on DataCite’s blog: https://blog.labs.datacite.org/dynamic-data-citation-webinar/ Another place to watch is Force11: https://www.force11.org/group/dcip

CONTACT: ezid@ucop.edu TWEET: @ezidCDL VISIT: ezid.cdlib.org