U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 3.2: Data Citation CC image by adesigna on Flickr,

Slides:



Advertisements
Similar presentations
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Advertisements

VO Sandpit, November 2009 Data Citation, Principles and Practice Sarah DataCite Annual Conference, 2014.
Good practice in Research Data Management Module 5: Deposit and long-term preservation.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
Digital Object Identifiers for EOSDIS data HDF Workshop April 17, 2012 John Moses, ESDIS
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
USGS Data Release ESIP 2015 Winter Meeting Viv Hutchison US Geological Survey U.S. Department of the Interior U.S. Geological Survey.
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
EPSRC expectations on research data: What researchers need to know 12/03/2015 Masud Khokhar and Hardy Schwamm.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
Agenda: DMWG SM policy status ESIP meeting recap Reminder - DM Webinar Series New and updated web pages on DM website Metadata Training Sessions CDI meeting.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
The Digital Curation Lifecycle Model Joy Davidson and Sarah Jones
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Responsible Data Use (or what should you do if you find yourself re-using someone else’s data) Ruth Duerr National Snow and Ice Data Center.
1 Why should “WE” CARE about data?. International initiatives OECD principles and guidelines for access to research data from public funding 2007 “Access.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Agency Requirements: NSF Data Management Plans Ruth Duerr National Snow and Ice Data Center Version 1.0 October 2012 Section: The Case for Data Stewardship.
Providing Access to Your Data Matthew Mayernik National Center for Atmospheric Research Copyright 2012 Matthew Mayernik. Version 1.0 October 2012 Section:
Dataset Metadata Joan Starr California Digital Library January, Tools and Approaches for Access and Preservation.
Because good research needs good data The DCC lifecycle model, Exeter Uni, May 2011 Funded by: The Digital Curation Lifecycle Model Joy Davidson.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Choosing Between Data Sharing Repositories for Engineering Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Responsible Data Use: Copyright and Data Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review Date.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
TOWARDS A DATA CITATION STANDARD FOR GEOSS I. McCallum, H.-P. Plag & S. Fritz.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Funded by GFBio – Education module Publish Lesson in Data Publishing.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Identifiers and Citation
NRF Open Access Statement
Tutorials on Data Management
Copyright 2013 Matthew Mayernik.
EPSRC research data expectations and research software management
ACS 2016 Moving research forward with persistent identifiers
Identifiers and Citation
Institutional role in supporting open access, open science, open data
Linking persistent identifiers at the British Library
CNI Spring 2010 Membership Meeting
A step-by-step guide to DOI registration
Metadata for research outputs management
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research Data Management
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Citation.
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
OPEN ACCESS POLICY Larshan Naicker Rhodes University Library
Fundamental Science Practices (FSP) of the U.S. Geological Survey
Presentation transcript:

U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 3.2: Data Citation CC image by adesigna on Flickr,

Data Citation Provided by DataONE  Data Citation in the Data Lifecycle  Definitions: What is Data Citation?  Benefits of Data Citation  Collaborating to Support Data Citation  How to Cite Data  How to Obtain a Persistent Identifier for a Data Set  Best Practices to Support Data Citation Lesson Topics CC image by cybrarian77 on Flickr,

Data Citation Provided by DataONE  After completing this lesson, the participant will be able to:  Define data citation  Describe benefits of data citation  Identify roles of data authors/managers, data publishers, and journal publishers in supporting data citation  Recognize metadata elements useful for data citation  Recognize common persistent data locators and describe the process for obtaining one  Summarize best practices for supporting data citation Learning Objectives

Data Citation Provided by DataONE The Data Lifecycle

Data Citation Provided by DataONE  Data citation  “The practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to printed resources” 1  “A key practice underpinning the recognition of data as a primary research output rather than as a by-product of research” 1  Data author  “Individual involved in research, education, or other activities that generate digital data that are subsequently deposited in a data collection” 2  Persistent identifier  A unique web-compatible alphanumeric code that points to a resource (e.g., data set) that will be preserved for the long term (i.e., over several hardware and software generations)  Should direct to latest available version of resource or metadata which enable acquisition of desired version or format Definitions

Data Citation Provided by DataONE  Short term  Facilitates discovery of relationships between data and publications, therefore making it easier to validate and build upon previous work  Ensures that proper credit can be given when others use your work  For researchers searching for data, link to publication provides information about related methodology and allows data to be put into context Benefits of Data Citation CC image by futureatlas.com on Flickr,as “Citation Needed”

Data Citation Provided by DataONE  Short term  Allows an ability to estimate the impact of a particular data set based on number of publications that cite the data  Links to other publications that cite a particular data set helps researchers re- using data to find other ways the data have been used and how it is regarded. Benefits of Data Citation CC image by futureatlas.com on Flickr,as “Citation Needed”

Data Citation Provided by DataONE  Long term  Adoption of practices that rely on publishing infrastructure ensure data will be available in the future  Less danger of ‘stealing’ because citable data must be cited or treated as plagiarism  Easier to discover existing data relevant to a particular question  Can be used as the basis for measuring impact of data sets or data contributors Benefits of Data Citation CC image by gruntzooki on Flickr

Data Citation Provided by DataONE  Long term  Enables recognition of scholarly effort in disciplines and organizations that want to recognize such effort  Increase in the quantity and quality of data published  Increased transparency of scientific research  Increased rate of scientific research Benefits of Data Citation CC image by gruntzooki on Flickr

Data Citation Provided by DataONE  Requires participation of a variety of individuals and institutions  Journal publishers  Data publishers/repositories  Data authors  Data managers  Data users  Professional organizations Collaborative effort CC image via Wikimedia Commons (Masur, derivative of Al Maghi)

Data Citation Provided by DataONE  Data authors and managers should  Choose appropriate repository for data publication  Use common standards for data and metadata  Work data up to publication standard  Obtain citation information from data publisher and include it in associated paper in accordance with journal’s citation standards  Include citations for any prior data sets used in research  Notify data publisher (e.g., data center or repository) about associated paper(s) Collaborating with publishers

Data Citation Provided by DataONE  Data centers or distributors should  Ensure that data and metadata remain static over time  Digital preservation policy  Versioning strategy  Assign persistent and unique identifiers so that data are discoverable  Ensure published data remain accessible  Restricted access in the case of sensitive, commercial, or obsolete data  Provide citation information that can be included with papers associated with the data  Include a link to related publications as part of metadata Collaborating with publishers Image courtesy of

Data Citation Provided by DataONE  Journal publishers should  Provide clear guidance on how and where data sets should be cited  Be sensitive to wishes of funding agencies when suggesting or mandating specific data repositories  Alert data publisher when publishing a paper that cites a data set Collaborating with publishers CC image by the.Firebottle on Flickr

Data Citation Provided by DataONE  Similar to citing a published article or book  Provide information necessary to identify and locate the work cited  Broadly-applicable data citation standards have not yet been established; use standards adopted by relevant academic journal, data repository, or professional organization How to Cite Data CC image by walknboston on Flickr

Data Citation Provided by DataONE  Author/Principal Investigator/Data Creator  Release Date/Year of Publication – year of release, for a completed data set  Title of Data Source – formal title of the data set  Version/Edition Number – the version of the data set used in the study  Format of the Data – physical format of the data  3 rd Party Data Producer – refers to data accessed from a 3 rd party repository  Archive and/or Distributor – the location that holds the data set Examples of information needed in a citation

Data Citation Provided by DataONE  Locator or Identifier – includes Digital Object Identifiers (DOI), Handles, Archival Resource Key (ARK), etc.  Access Date and Time – when data was accessed online  Subset of Data Used – description based on organization of the larger data set  Editor or Contributor – reference to a person who compiled data, or performed value-added functions  Publication Place – city and state and country of the distributor of the data  Data within a Larger Work – refers to the use of data in a compilation or a data supplement (such as published in a peer-reviewed paper) Examples of information needed in a citation, con’t

Data Citation Provided by DataONE Examples of Data Citation Formats  DataCite: Creator (Publication Year): Title. Publisher. Identifier  Dryad: Author (Date of Article Publication) Data from: Article name. Dryad Digital Repository. doi: DOI number CC image by Paxsimius on Flickr

Data Citation Provided by DataONE  Earth Science Information Partners (ESIP):  Required citation elements: Author. Release date. Title. Version. Archive/Distributor. Locator/Identifier. Access date and time.  Optional citation elements: Subset Used; Editor, Compiler, or other important role; Distributor, Associate Archive, or other Institutional Role; Data Within a Larger Work  Example citation:  Zwally, H.J., R. Schutz, C. Bentley, J. Bufton, T. Herring, J. Minster, J. Spinhirne, and R. Thomas GLAS/ICESat L1A Global Altimetry Data V018, 15 October to 18 November National Snow and Ice Data Center. Data set accessed at doi: /NSIDC/gla01. Examples of Data Citation Formats, con’t

Data Citation Provided by DataONE  A persistent identifier should be included in the citation:  DOI (Digital Object Identifier)  Globally unique, alphanumeric string assigned by a registration agency to identify content and provide a persistent link to its location.  May be assigned to any item of intellectual property that is defined by structured metadata  Examples: /NP5678, /ISBN ; / ISO-DOI  ARK (Archival Resource Key)  URL designed to support long-term access to information objects  Can refer to digital, physical, or intangible objects or living beings and groups  Example: What are examples of Persistent Identifiers?

Data Citation Provided by DataONE More persistent identifiers:  UUID (Universally Unique Identifier) o ‘practically unique’ identifiers that can be generated by distributed systems but later combined into a single database without needing to resolve identifier (ID) conflicts o 32 hexadecimal digits, displayed in five groups separated by hyphens, in the form for a total of 36 characters o Example: 550e8400-e29b-41d4-a  Researcher identifier: ORCID (Open Researcher & Contributor ID) o Central registry of unique identifiers for individual researchers to address author name ambiguity o Transparent linking mechanism between ORCID and other author ID schemes What are examples of Persistent Identifiers?

Data Citation Provided by DataONE  Contact organization/institution that can create DOIs  Organization hierarchy  20+ services registered through DataCite include  California Digital Library’s EZID  ANDS Cite My Data Service  DataCite Canada DOI Foundation DataCitexx California Digital Library (EZID) UCSB LibrariesxxNCEASxXXx How to obtain a DOI for a data set

Data Citation Provided by DataONE  Provide data and necessary citation profile (metadata)  DataCite: creator, title, publisher, publication year  Dublin Core: creator, title, publisher, date  ERC: who, what, when  Additional information: owner, owner group, co- owners  Receive DOI to be used for citation purposes How to obtain a DOI for a data set con’t

Data Citation Provided by DataONE  To support access to your data:  Use application that supports metadata creation for environmental data sets  Morpho  Metavist (USDA Forest Service)  Mermaid (NOAA)  Use standardized keywords to describe your data  Biocomplexity Thesaurus (USGS)  Global Change Master Directory (NASA)  Use a persistent identifier such as DOI or ARK Best Practices to Support Data Citation

Data Citation Provided by DataONE  Work with journal publishers and data repositories to archive data during the publication process  Allows information about how to access your data set to be published with your article  Note: Data repositories typically offer to embargo archived data for a pre-determined time period after publication  Allows article citation/DOI to be included with archived data set  Encourage other data authors to cite data and to make their own data available for reuse:  Provide full citation information for data whenever you publish work that makes use of another author’s data  Archive your own data in a repository that supports data discovery and reuse  Update your archived data sets when newer versions are available Best Practices to Support Data Citation, con’t

Data Citation Provided by DataONE 1. Australian National Data Service (ANDS), Data Citation Awareness Guide. Accessed May 10, 2012 at awareness.html. awareness.html 2. National Science Board (2005). Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century. Accessed May 10, 2012 at 3. Hakala, J. Persistent identifiers – an overview. Accessed May 10, 2012 at overview/. overview/ 4. Ball, A. & Duke, M. (2011). How to Cite Datasets and Link to Publications. DCC How-to Guides. Edinburgh: Digital Curation Centre. Available online: guides. guides 5. Costello, M. J. (2009). Motivating Online Publication of Data. BioScience, 59(5): Accessed May 10, 2012 at ESIP Federation Interagency Data Stewardship/Citations/Provider Guidelines. Accessed May 10, 2012 at tions/provider_guidelines. tions/provider_guidelines References