Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum 26 - 30 September 2011, Oostende, Belgium Citation.

Slides:



Advertisements
Similar presentations
The Corporation for National Research Initiatives The Handle System Persistent, Secure, Reliable Identifier Resolution.
Advertisements

doi> Digital Object Identifier: overview
A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Digital Object Identifiers for EOSDIS data HDF Workshop April 17, 2012 John Moses, ESDIS
ORNL DAAC Experience With Digital Object Identifiers (DOIs) Bruce Wilson, ORNL DAAC Manager for NASA Data Center Managers telecon 22 Feb 2010.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
Digital Object Identifiers for EOSDIS data ESIP Winter Meeting Jan 6, 2011 John Moses, ESDIS
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
U.S. Department of the Interior U.S. Geological Survey Tutorials on Data Management Lesson 3.2: Data Citation CC image by adesigna on Flickr,
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
Responsible Data Use (or what should you do if you find yourself re-using someone else’s data) Ruth Duerr National Snow and Ice Data Center.
The DOI Standard Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards and Best Practices March.
CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting1.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
1 Not So Strange Bedfellows: Information Standards For Librarians AND Publishers November 6, 2015.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Breakout Session 2.2: A sustainable GEO Information System of Systems Chair: Lorenzo Bigagli Rapporteur: Greg Yetman.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
Course on persistent identifiers, Madrid (Spain) Information architecture and the benefits of persistent identifiers Greg Riccardi Director Institute for.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Networked Information Resources Federated search, link server, e-books.
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Development and Management of e-Repositories April 2013 IODE Project Office Oostende, Belgium Future Repository Trends: Repositories and Published.
Acknowledgments Funding provided by the Jewett Foundation Introduction Data collected in ocean sciences, whether generated from research or operational.
Identifiers and Citation
Data Citation and You: The new AGU guidelines for data citation
NRF Open Access Statement
Advertising your data: Using data portals and metadata registries
Copyright 2013 Matthew Mayernik.
Making Sense of the Alphabet Soup of Standards
Chapter Eight Interoperability How to Build a Digital Library
ACS 2016 Moving research forward with persistent identifiers
Identifiers and Citation
Linking persistent identifiers at the British Library
CNI Spring 2010 Membership Meeting
Access  Discovery  Compliance  Identification  Preservation
Persistent identifiers in VI-SEEM
A step-by-step guide to DOI registration
Choosing the Discovery Model Martin Forsberg
Data Management: Documentation & Metadata
Open Access to your Research Papers and Data
Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum September 2011, Oostende, Belgium Concepts.
Metadata for research outputs management
Standards For Collection Management ALCTS Webinar – October 9, 2014
Name authority control in an evolving landscape
2. An overview of SDMX (What is SDMX? Part I)
Tech introduction.
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research Data Management
Research Infrastructures: Ensuring trust and quality of data
Introduction to the MIABIS SOP Working Group
Bird of Feather Session
Dataverse for citing and sharing research data
Presentation transcript:

Training Course on Data Management for Information Professionals and In-Depth Digitization Practicum 26 - 30 September 2011, Oostende, Belgium Citation Linking and other Access Models Lisa Raymond

Data Citation Data citation is an evolving practice. It is still rarely done in publications, but researchers are starting to see the benefit of being able to get credit for their data. Additionally as more data is being made available online and re-used, it needs to be cited.

Purpose To provide fair credit for data creators or authors, data stewards, and other critical people in the data production and curation process.

Purpose To ensure scientific transparency and reasonable accountability for authors and stewards.

Purpose To aid in tracking the impact of data set through reference in scientific literature.

Purpose To help data authors verify how their data are being used. Download Stuff Now This is very important to scientists. An obstacle to data deposit in open access repositories has been concern over misuse of data. Proper citation and attribution will help researchers verify other work being done with their data.

Core Required Elements Author(s)--the people or organizations responsible for the intellectual work to develop the data set. The data creators. Release Date--when the particular version of the data set was first made available for use (and potential citation) by others. Title--the formal title of the data set Version--the precise version of the data used. Careful version tracking is critical to accurate citation. Archive and/or Distributor--the organization distributing or caring for the data, ideally over the long term. Locator/Identifier--this could be a URL but ideally it should be a persistant service that resolves to the current location of the data in question. The Digital Object Identifier is currently the most broadly adopted service for persistently identifying and locating whole data collections (as opposed to individual files or granules), although other identifier/locator services, such as ARKs and Handles, could be used. Time, date accessed--because data can be dynamic and changeable in ways that are not always reflected in release dates and versions, it is important to indicate when on-line data were accessed.

Example Additional fields can be added as necessary to credit other people and institutions, etc. Additionally, it is important to provide a scheme for users to indicate the precise subset of data that were used. This could be the temporal and spatial range of the data, the types of files used, a specific query id, or other ways of describing how the data were subsetted. An example citation: Zwally, H.J., R. Schutz, C. Bentley, J. Bufton, T. Herring, J. Minster, J. Spinhirne, and R. Thomas. 2003. GLAS/ICESat L1A Global Altimetry Data V018, 15 October to 18 November 2003. Data set accessed 2011-07-21 at doi:10.3334/NSIDC/gla01.

Identifiers vs. Locators URL is an identifier Digital Object Identifier (DOI) is a locator Consider a human example. A name such as “Lisa Raymond" (Associate Library Director) is an identifier. An address such as “260 Woods Hole Road, Woods Hole, MA, USA" is a locator. The locator might work as an identifier, because you might find Lisa in her office, but she may also have retired and there is a new Associate Director who plays the same role but is not the same person. Similarly, you may be able to locate Lisa based on her name and title, but what happens if she is telecommuting this week and is in Oostende not Massachusetts? It is similar with digital objects. One might be able to identify a data set by its URL, for example, but there is no guarantee that what is at that URL today is the same as what was there yesterday.

DOI, Handle, URI/URL DOI - The DOI System provides a framework for persistent identification, managing intellectual content, managing metadata, linking customers with content suppliers, facilitating electronic commerce, and enabling automated management of media. DOI names can be used for any form of management of any data, whether commercial or non-commercial. The DOI System is an ISO International Standard. The system is managed by the International DOI Foundation, an open membership consortium including both commercial and non-commercial partners. Over 50 million DOI names have been assigned by DOI System Registration Agencies in the US, Australasia, and Europe. http://www.doi.org/ A DOI … is a handle system manged by the International DOI Foundation through registry agents such as Cross Ref and DataCite. At this time Woods Hole is paying 6 cents per dataset for DOIs, we pay $1.00 for current technical reports, articles, etc. We decided to assign DOIs in addition to handles because scientists were familiar with the term, the cost is minimal, and it is a widely used international standard.

DOI, Handle, URI/URL Handle - The Handle System is a technology specification for assigning, managing, and resolving persistent identifiers for digital objects and other resources on the Internet. The protocols specified enable a distributed computer system to store identifiers (names, or handles), of digital resources and resolve those handles into the information necessary to locate, access, and otherwise make use of the resources. That information can be changed as needed to reflect the current state and/or location of the identified resource without changing the handle. http://en.wikipedia.org/wiki/Handle_System A handle is a technology that allows the item to be moved, but the system allows the handle to stay the same and resolve to the correct place. DSpace uses the Handle system – as in Ocean Docs and Published Ocean Data. DSpace generates the handle and there is no cost.

DOI, Handle, URI/URL URI/URL - In computing, a Uniform Resource Locator or Universal Resource Locator (URL) is a character string that specifies where a known resource is available on the Internet and the mechanism for retrieving it. A URL is technically a type of Uniform Resource Identifier (URI) but in many technical documents and verbal discussions URL is often used as a synonym for URI.[1] In computing, a Uniform Resource Identifier (URI) is a string of characters used to identify a name or a resource on the Internet. Such identification enables interaction with representations of the resource over a network (typically the World Wide Web) using specific protocols. Schemes specifying a concrete syntax and associated protocols define each URI. One can classify URIs as locators (URLs), or as names (URNs), or as both. A Uniform Resource Name (URN) functions like a person's name, while a Uniform Resource Locator (URL) resembles that person's street address. In other words: the URN defines an item's identity, while the URL provides a method for finding it. Wikipedia URI and URL specifies where a known resource is available on the internet. Think of it as a naming convention, not a locator.

DOI, Handle, URI/URL Examples URI – ftp://example.org/resource.txt URL - http://en.wikipedia.org/wiki/Uniform_Resource_Identifier DOI - The combination of a unique prefix element (assigned to a particular DOI registrant) and a unique suffix element (provided by that registrant) is unique, and so allows the Decentralized allocation of DOI numbers. The 4199 is the item number in our WH system. Handle – lacks the suffix, has the handle resolver information and then the 1912 is for WHOAS and 4199 the item number URI / URL as you can see names the items and points to it on the internet

We must rely on location information combined with other information such as author, title, and version to uniquely identify data. The key to making registered locators, such as DOIs or Handles, work to identify and locate data sets is through careful tracking and documentation of versions.

Location alone is not enough It is important to remember when creating citations that location alone is not enough. DOIs are now being assigned to datasets that are still being added to – such as daily temperature data and also to datasets that are updated because of corrections. Note, it is agreed that major version changes to datasets should get a new DOI, but minor may not. So author, version and date accessed are very important in the citation.

More Examples Doe, J. and R. Roe. 2001. The FOO Data Set. The FOO Data Center. doi:10.xxxx/notfoo.547983. Accessed 1 May 2011. Doe, J. and R. Roe. 2001, updated 2005. The FOO Occasionally Updated Data Set. The FOO Data Center. doi:10.xxxx/notfoo.547983. Accessed 1 May 2011. Doe, J. and R. Roe. 2001, updated daily. The FOO Time Series Data Set. The FOO Data Center. doi:10.xxxx/notfoo.547983. Accessed 1 May 2011. Doe, J. and R. Roe. 2001. The FOO Data Set. Version 2.3. The FOO Data Center. doi:10.xxxx/notfoo.547983. Accessed 1 May 2011.

Data linked to articles Deposit Data … Where? Some Existing Models

Data linked to published articles There are several projects that are working in this area. SCOR/IODE and the MBLWHOI Library have been collaborating on data publications and looking at two use cases – data from a data center and data associated with published articles.

Citation for WHOAS Data Seigel, David A., 2006. VERTIGO project Niskin bottle sample data from KM0414 and RR_K2 cruises. Bottle KM0414.csv. doi: 10.1575/1912/4199. Accessed 3 August 2011.

Links to Linked Data Sources http://www.pangaea.de/about/   http://thedata.org/ http://datadryad.org/ http://www.mblwhoilibrary.org/services/whoas-repository-services http://publishedoceandata.net/

Acknowledgments Mark Parson (NSIDC) and the ESIP Federation Resources Interagency Data Stewardship/Citations/provider guidelines http://wiki.esipfed.org/index.php/Interagency_Data_Stewardship/Citations/provider_guidelines Data Citation presentation. GeoData ,2011 Broomfield, CO, 3 March 2011 http://tw.rpi.edu/media/latest/ParsonsDataCitation.pdf A Proposed Standard for the Scholarly Citation of Quantitative Data http://www.dlib.org/dlib/march07/altman/03altman.html