Data Citation Proposal Based on work by: Mark A. Parsons and the ESIP Preservation and Stewardship Cluster, esp. Ruth Duerr, Curt Tilmes, and Bruce Barkstrom.

Slides:



Advertisements
Similar presentations
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Advertisements

Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
VO Sandpit, November 2009 Data Citation, Principles and Practice Sarah DataCite Annual Conference, 2014.
J.B. Minster on behalf of ….  Mark Parsons, Ruth Duerr  Michael Diepenbroek, Michael Zgurovsky  Kari Raivio, Brian McMahon  AGU Data Policy Panel.
Digital Object Identifiers for EOSDIS data HDF Workshop April 17, 2012 John Moses, ESDIS
Agency Requirements: NASA Data Management Plans Ronald Weaver National Snow and Ice Data Center W. Christopher Lenhardt Renaissance Computing Institute.
ORNL DAAC Experience With Digital Object Identifiers (DOIs) Bruce Wilson, ORNL DAAC Manager for NASA Data Center Managers telecon 22 Feb 2010.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
Data Publishing Workflows: Strategies and Standards
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
Elements of a Data Management Plan
Tracking and Managing Citations: Data Centers and Best Practices W. Christopher Lenhardt CIESIN – Columbia University 25 October 2006 – CODATA 2006 W.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
Data Management Plans Bill Michener University Libraries and Biology Dept. University of New Mexico.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Providing Access to Your Data: Rights Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International Earth Science.
Data Citation: the next big thing… ?!?! 1 Victoria University 20 Nov
Providing Access to Your Data: Tracking Data Usage Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Why Create a Data Management Plan? Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date Section: Data Management Plans.
Elements of a Data Management Plan: Identifying the materials to be created Ruth Duerr National Snow and Ice Data Center Version Review Date Section:
Citing Data Sets in the Literature: ORNL DAAC Practices Robert Cook, Suresh SanthanaVannan, and Daine Wright Environmental Sciences Division Oak Ridge.
References: [1] [2] [3] Acknowledgments:
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
Responsible Data Use (or what should you do if you find yourself re-using someone else’s data) Ruth Duerr National Snow and Ice Data Center.
THOMSON SCIENTIFIC Patricia Brennan Thomson Scientific January 10, 2008.
CC&E Best Data Management Practices, April 19, 2015 Please take the Workshop Survey 1.
Where are the rewards? University of Melbourne 28 January
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Where are the rewards? Building a culture of data citation workshop Edith Cowan University, Perth March
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
Agency Requirements: NSF Data Management Plans Ruth Duerr National Snow and Ice Data Center Version 1.0 October 2012 Section: The Case for Data Stewardship.
Creating Documentation and Metadata: Introduction to Metadata and Metadata Standards Lynn Yarmey National Snow and Ice Data Center Version 1.0 February.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Data Matthew Mayernik National Center for Atmospheric Research Version.
1. 2 Rewards are real … but few (yet) 3 The citation benefit intensified over time... ...with publications from 2004 and 2005 cited 30 per cent more.
Making Data Accessible Yolanda Gil USC/ISI February 20, 2015 "To deposit or not to deposit, that is the question - journal.pbio g001"
Advertising your data: Agency requirements for submitting metadata Nancy J. Hoebelheinrich Version 1.0 September 2012 Section: Local Data Management Copyright.
Providing Access to Your Data: Rights Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International Earth Science.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
Why Create a Data Management Plan? Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
GigaScience ( is an online, open-access journal that includes, as part of its publishing activities, the database GigaDB.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Elements of a Data Management Plan Ruth Duerr National Snow and Ice Data Center Version 1.0 February 2013 Data Management Plans Copyright 2013 Ruth Duerr.
8 January 2016 ESIP Winter Meeting
The Case for Data Stewardship: Enhancing Your Reputation Matthew Mayernik National Center for Atmospheric Research Version 1.0 September 2012 Section:
Creating Documentation and Metadata: Creating a Citation for Your Data Robert Cook Oak Ridge National Laboratory Section: Local Data Management Copyright.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Advertising your data Alecia Aleman 1, Ruth Duerr 2 1 National Aeronautics and Space Administration (NASA) 2 National Snow and Ice Data Center, University.
Data Citation Implementation Pilot Workshop
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Webinar on increasing openness and reproducibility April Clyburne-Sherin Reproducible Research Evangelist
Updating image To update the background image: Go to ‘View’ Select ‘Slide Master’ Select the page with the image Right click on the image and select ‘Change.
Development and Management of e-Repositories April 2013 IODE Project Office Oostende, Belgium Future Repository Trends: Repositories and Published.
Data Citation and You: The new AGU guidelines for data citation
Copyright 2013 Matthew Mayernik.
Persistent Identifiers Implementation in EOSDIS
Copyright 2012 Lola Olsen & Tyler Stevens.
Presented April 7, 2005 at the 2005 AAG meeting, Denver, CO
AGU Paper Number: IN43B-1697 Evolving a NASA Digital Object Identifiers System with Community Engagement Lalit Wanchoo1 and Nathan.
Publishing software and data
W. Christopher Lenhardt
OpenML Workshop Eindhoven TU/e,
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Citation.
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Presentation transcript:

Data Citation Proposal Based on work by: Mark A. Parsons and the ESIP Preservation and Stewardship Cluster, esp. Ruth Duerr, Curt Tilmes, and Bruce Barkstrom.

2 Purpose of Data Citation Credit for data creators and stewards Allow data creators to see how researchers are using their data Track impact of data set Provides accountability for creators and stewards Aids reproducibility through unambiguous connection to the precise data used From Parsons, modified by Lynnes

3 How “data citation” is currently done 1.Not mentioned, just used, e.g., in tables or figures 2.Reference to name or source of data in text 3.URL in text (with variable degrees of specificity) 4.Citation of related paper (e.g. CRU Temp. records recommend citing two old journal articles which do not contain the actual data or full description of methods) 5.Citation of actual data set typically using recommended citation given by data center 6.Citation of data set including a persistent identifier/locator, typically a DOI From Parsons, et al.

4 Current GES DISC Policy CITING OUR DATA GES DISC Data Use Acknowledgment Distribution of GES DISC data sets is funded by NASA's Science Mission Directorate (SMD). The data are not copyrighted and are open to all for both commercial and non-commercial uses. If you used GES DISC data for a publication (research or otherwise), or for any other purpose, we request that you include the following acknowledgment: "The data used in this effort were acquired as part of the activities of NASA's Science Mission Directorate, and are archived and distributed by the Goddard Earth Sciences (GES) Data and Information Services Center (DISC)." We would appreciate receiving a copy of your publication, which can be be forwarded to...

5 Basic data citation form and content Author(s). Year. Title, [version]. [editor(s)]. Publisher. Location. [date accessed]. [subset used]. From: Parsons, Mark A., Ruth Duerr, and Jean-Bernard Minster Data citation and peer-review. Eos, Trans. AGU 91 (34): doi: /2010EO

An Example Citation Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at Authors: intellectual effort going into the dataset: i.e., algorithm developers Year: year data were produced Title: Data Set Long Name Editor(s): People that have added significant value to the dataset City and Publisher: Greenbelt, MD: Goddard Earth Sciences Data and Information Services Center Data access date and location From Parsons, et al.

Implementation Store information in GCMD entry, under “Data Set Citation” Requested “Dataset Editor” field from GCMD Generate stable, toplevel locations for each dataset, e.g., Generate individualized citations for each dataset, e.g.,: Chung-Lin Shie, Long Chiu, Robert Adler, I-I Lin, Eric J. Nelkin, and Joe Ardizzone, Surface Turbulent Fluxes, 1x1 deg Monthly Grid, Set1 and Set2. Edited by A. Savtchenko. Greenbelt, MD: Goddard Earth Sciences Data and Information Services Center, Accessed at Add to READMEs at the top OR add a special file to URL set for download Present within Mirador at Checkout stage 7

Backup Slides 8

9 “We found that few policies recommend robust data citation practices: in our preliminary evaluation, only one-third of repositories (n=26), 6% of journals (n=307), and 1 of 53 funders suggested a best practice for data citation. We manually reviewed 500 papers published between 2000 and 2010 across six journals; of the 198 papers that reused datasets, only 14% reported a unique dataset identifier in their dataset attribution, and a partially-overlapping 12% mentioned the author name and repository name. Few citations to datasets themselves were made in the article references section.” “Data Citation in the Wild” Valerie Enriquez, Sarah Walker Judson, Nicholas M. Weber, Suzie Allard, Robert B. Cook, Heather A. Piwowar, Robert J. Sandusky, Todd J. Vision, Bruce Wilson From Parsons, et al.

11 Tracking citation “Tracking Dataset Citations Using Common Citation Tracking Tools Doesn’t Work” —Heather Pinowar, DataONE Traditional fields such as author and date too imprecise Web of Science, Scopus, and other tools don’t handle identifiers From Parsons, et al.

12 Accountability A new standard of accountability in a post-climategate world Data “publication” needs to be tied to promotion, tenure, etc. Implies peer review— See AGU Position Statement on Data What is peer-review? An assertion of accuracy or validity? An audit of complete documentation and sound practice? Related to but different than QA. How does it overlap with curation and stewardship? Earth System Science Data one approach, but not universally applicable. Open or informal review or usage comments within the metadata Versioning and transparency are essential From Parsons, et al.

Author Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Year Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Title Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Editor Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Publisher Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Location Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at From Parsons, et al.

Location Gary King; Langche Zeng, 2006, "Replication Data Set for 'When Can History be Our Guide? The Pitfalls of Counterfactual Inference'" hdl:1902.1/DXRXCFAWPK UNF:3:DaYlT6QSX9r0D50ye+tXpA== Murray Research Archive [distributor] From Parsons, et al.

Location König-Langlo, Gert and Hatwig Gernandt Compilation of radiosonde data from the Antarctic Georg- Forster station of the German Democratic Republic from 1985 to Bremerhaven, Germany: Alfred Wegener Institute for Polar and Marine Research Data set accessed doi: /PANGAEA From Parsons, et al.

21 Doing it as best we can... Hall, Dorothy K., George A. Riggs, and Vincent V. Salomonson. 2007, updated daily. MODIS/Aqua Snow Cover Daily L3 Global 500m Grid V005.3, Oct Sep. 2008, 84°N, 75°W; 44°N, 10°W. Boulder, Colorado USA: National Snow and Ice Data Center. Data set accessed at doi: /xxx. Hall, Dorothy K., George A. Riggs, and Vincent V. Salomonson. 2007, updated daily. MODIS/Aqua Snow Cover Daily L3 Global 500m Grid V005.3, Oct Sep. 2008, Tiles (15,2; 16,0;16,1;16,2;17,0;17,1). Boulder, Colorado USA: National Snow and Ice Data Center. Data set accessed at doi: /xxx. Cline, D., R. Armstrong, R. Davis, K. Elder, and G. Liston. 2002, Updated CLPX-Ground: ISA snow depth transects and related measurements, Version 2.0, shapefiles. Edited by M. Parsons and M. J. Brodzik. Boulder, CO: National Snow and Ice Data Center. Data set accessed at doi: /xxx. From Parsons, et al.

Thank You Much of this talk comes from: Parsons, Mark A., Ruth Duerr, and Jean-Bernard Minster Data citation and peer-review. Eos, Trans. AGU 91 (34): doi: /2010EO Duerr, Ruth E., Robert R. Downs, Curt Tilmes, Bruce Barkstrom, W. Christopher Lenhardt, Joe Glassy, Luis E. Bermudez, and Peter Slaughter (submitted). On the utility of identification schemes for digital Earth science data: An assessment and recommendations. Earth Science Informatics. A lot of discussion at: photo courtesy NOAA From Parsons, et al.