DataCite – Persistent links to scientific data Jan Brase, DataCite – TIB 1st PRELIDA workshop PISA, June 26th.

Slides:



Advertisements
Similar presentations
The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising.
Advertisements

DataCite Jan Brase, DataCite 5 minute madness Nordbib 2012 Copenhagen.
DataCiteMaking Datasets Citable Jan Brase DataCite.
The German National Library of Science and Technology as a DOI RA 2007.
Access to non-textual information 2008 Jan Brase IDF Open Meeting: Resource Access for a Digital World June 17th, 2008, Brussels.
Introduction to DataCite Adam Farquhar PhD Head of Digital Library Technology, The British Library President, DataCite June 2010.
DataCite Metadata. Science Paradigms Thousand years ago: science was empirical describing natural phenomena Last few hundred years: theoretical branch.
Technical Highlights 25th August 2011 Sebastian Peters German National Library of Science and Technology.
Preservation, access and re-use of research data A Publishers perspective……and how we can help Joep Verheggen, Elsevier PARSE.insight workshop, Darmstadt,
Creating Institutional Repositories Stephen Pinfield.
Introduction to DataCite Adam Farquhar, PhD Head of Digital Library Technology, The British Library President, DataCite June, 2010.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Frauke Ziedorn IATUL Workshop 2013 Research Data Management: Finding our Role 6. December 2013 PIDs and DOI Registration with DataCite.
Selecting a Data Sharing Repository. 2 Why Share Data? Enabling others to replicate and verify results as part of the scientific process Allows researchers.
1 Quality Control in Scholarly Publishing. What are the Alternatives to Peer Review? William Y. Arms Cornell University.
Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of Pretoria.
Data Publishing Workflows: Strategies and Standards
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional research repository for the University of Pretoria.
Implementing Digital Object Identifiers at the GESIS Data Archive for the Social Sciences Workshop “Persistent Identifiers for the Social Sciences” Bonn,
Future of Library and Information services Jan Brase, DataCite - TIB ICSTI-ITOC meeting March 17th Hannover.
DOI Registration for Social and Economic Data da|ra Brigitte Hausstein GESIS Leibniz-Institute for the Social Sciences, Berlin.
Digital Object Identifiers for EOSDIS data ESIP Winter Meeting Jan 6, 2011 John Moses, ESDIS
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
ICPSR’s Approach to Data Citation and Persistent Identifiers Mary Vardigan Assistant Director, ICPSR Workshop on Persistent Identifiers in the Social Sciences.
DataCite Canada Cyndie Found, CISTI Background : Who is CISTI, Definition of Data Research Data Management(RDM) – Benefits, Challenges Addressing.
DataCite and the CODATA task group on data citation Jan Brase, DataCite ICSTI workshop “Delivering data in science” March 5 th 2012 Paris.
ORCID and me: DataCite ORCID Outreach Meeting Jan Brase, Managing agent DataCite September 17th, 2011 CERN.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
DOI and DataCite Establishing information infrastructures Dr. Irina Sens 14. Conference „Consortia Library Systems: Technologies and Innovation“ 23. Juni.
World Data Center for Marine Environmental Sciences.
The DOI Standard Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards and Best Practices March.
DataCite CODATA Symposium Jan Brase, Managing agent DataCite August 22nd, 2011 Berkeley.
DOI uses cases for data Jan Brase DOI outreach meeting November 21 st Milano.
CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting1.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Data Citation & Digital Object Identifiers DOIs. 2 DOIs for articles mints DOIs for Journal articles and some datasets.
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
Dataset Metadata Joan Starr California Digital Library January, Tools and Approaches for Access and Preservation.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
ODIN – ORCID and DATACITE Interoperability Network Presentation to S&C Open House January 2013 John Kaye – British Library Funded by The European Union.
DataCite – Bridging the gap and helping to find, access and reuse data Herbert Gruttemeier INIST-CNRS Paris, IPSL, 11/7/2013.
Libraries and data – the DataCite consortium Jan Brase, DataCite February 2nd, 2011 Workshop: Persistent Identifiers for the Social Sciences Bonn, Germany.
The Many Facets of Metadata Exchange Between Publishers and the Research Community: The Role that A&I Services and DOIs Play in Providing Access to Electronic.
Data Citation & Digital Object Identifiers DOIs. 2 Digital Object Identifiers 101 Persistent identifier Identifies intellectual property in the digital.
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
DataCite Adam Farquhar DataCite President ODIN Conference, CERN,
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
British Library Datasets Programme JISC RSP Winter School February 2011 Max Wilkinson.
Paradigm shifts in Information Access - beyond classical scholarly publication Jan Brase, DataCite - TIB GL14 Conference November 29th Rome.
NRF Open Access Statement
First Light for DOIs at ESO
ACS 2016 Moving research forward with persistent identifiers
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
Non-profit DOI registration agency for Scientific primary data
OpenML Workshop Eindhoven TU/e,
DataCite - A global registration agency for research data
Tech introduction.
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research data in library catalogues and the joint initiative of European technical libraries for data registration Jan Brase Workshop Primary data for.
Persistent identifiers for instruments (PIDINST) working group
Jez Cope, Data Services Lead, The British Library
Presentation transcript:

DataCite – Persistent links to scientific data Jan Brase, DataCite – TIB 1st PRELIDA workshop PISA, June 26th

High visability of the content Easy re-use and verification. Scientific reputation for the collection and documentation of content (Citation Index) Encouraging the Brussels declaration on STM publishing Avoiding duplications Motivation for new research What if any kind of scientific content would be citable?

Digital Object Identifiers (DOI names) offer a solution Mostly widely used identifier for scientific articles Researchers, authors, publishers know how to use them Put datasets on the same playing field as articles Dataset Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA. doi: /PANGAEA URLs are not persistent (e.g. Wren JD: URL decay in MEDLINE- a 4-year follow-up study. Bioinformatics. 2008, Jun 1;24(11):1381-5).   DOI names for citations

How to achieve this? Science is global it needs global standards Global workflows Cooperation of global players Science is carried out locally By local scientist Beeing part of local infrastrucures Having local funders

Global consortium carried by local institutions focused on improving the scholarly infrastructure around datasets and other non-textual information focused on working with data centres and organisations that hold content Providing standards, workflows and best-practice Initially, but not exclusivly based on the DOI system Founded December 1st 2009 in London DataCite

1.Technische Informationsbibliothek (TIB) 2.Canada Institute for Scientific and Technical Information (CISTI), 3.California Digital Library, USA 4.Purdue University, USA 5.Office of Scientific and Technical Information (OSTI), USA 6.Library of TU Delft, The Netherlands 7.Technical Information Center of Denmark 8.The British Library 9.ZB Med, Germany 10.ZBW, Germany 11.Gesis, Germany 12.Library of ETH Zürich 13.L’Institut de l’Information Scientifique et Technique (INIST), France 14.Swedish National Data Service (SND) 15.Australian National Data Service (ANDS) 16.Conferenza dei Rettori delle Università Italiane (CRUI) 17.National Research Council of Thailand (NRCT) DataCite members Affiliated members: 1. Digital Curation Center (UK) 2. Microsoft Research 3. Interuniversity Consortium for Political and Social Research (ICPSR) 4. Korea Institute of Science and Technology Information (KISTI) 5. Bejiing Genomic Institute (BGI) 6. IEEE 7. Harvard University Library

Earth quake events => doi: /GFZ.GEOFON.gfz2009kciu doi: /GFZ.GEOFON.gfz2009kciu Climate models => doi: /WDCC/dphase_mpepsdoi: /WDCC/dphase_mpeps Sea bed photos => doi: /PANGAEA doi: /PANGAEA Distributes samples => doi: /PANGAEA.51749doi: /PANGAEA Medical case studies => doi: /eaacinet2007/CR/ doi: /eaacinet2007/CR/ Computational model => doi: /02/4E9F69C011BC8doi: /02/4E9F69C011BC8 Audio record => doi: /PANGAEA doi: /PANGAEA Grey Literature => doi: /GBV: doi: /GBV: Videos => doi: / doi: / What type of data are we talking about? Anything that is the foundation of further reserach is research data Data is evidence

Over 1,700,000 DOI names registered so far DataCite Metadata schema published (in cooperation with all members) DataCite MetadataStore DataCite in 2013

DataCite search Searchterm: * Searchterm: uploaded:[NOW-7DAY TO NOW] Searchterm: relatedIdentifier:* Searchterm: relatedIdentifier:issupplementto\: * Searchterm:relatedIdentifier:*\: *

OAI and Statistics OAI Harvester DataCite statistics (resolution and registration)

DataCite Content Service Service for displaying DataCite metadata Different formats (BibTeX, RIS, RDF, etc.) Content Negotation (through MIME-Typ) Access through DOI proxy ( First implemented by CNRI and CrossRef: Documentation:

Content negotiation Optimized for m2m communication using the accept header of the http protocol curl -L -H "Accept: MIME_TYPE" Try a shortcut out in any webbrowser:

Resolving to the citation datacite+text/ / Li, j; Zhang, G; Lambert, D; Wang, J (2011): Genomic data from Emperor penguin. GigaScience.

Resolving to the RDF metadata / Li, J Zhang, G Wang, J doi: / info:doi/ / GigaScience Lambert, D 2011 Genomic data from the Emperor penguin (Aptenodytes forsteri)

Example of use This allows persistent identification of RDF statements! Implemented for all over 45 million CrossRef and DataCite DOI names Example of use: DOI Citation Formatter

2012: STM, CrossRef and DataCite Joint Statement 1.To improve the availability and findability of research data, the signers encourage authors of research papers to deposit researcher validated data in trustworthy and reliable Data Archives. 2.The Signers encourage Data Archives to enable bi- directional linking between datasets and publications by using established and community endorsed unique persistent identifiers such as database accession codes and DOI's. 3. The Signers encourage publishers and data archives to make visible or increase visibility of these links from publications to datasets and vice versa 23

Example The dataset: Storz, D et al. (2009): Planktic foraminiferal flux and faunal composition of sediment trap L1_K276 in the northeastern Atlantic. Is supplement to the article: Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull, Detlef; Kucera, Michal (2009): Seasonal and interannual variability of the planktic foraminiferal flux in the vicinity of the Azores Current. Deep-Sea Research Part I-Oceanographic Research Papers, 56(1), ,

Thank you! See you September 2013 in Washington DC