DataCite – Bridging the gap and helping to find, access and reuse data Herbert Gruttemeier INIST-CNRS Paris, IPSL, 11/7/2013.

Slides:



Advertisements
Similar presentations
DataCite Jan Brase, DataCite 5 minute madness Nordbib 2012 Copenhagen.
Advertisements

DataCiteMaking Datasets Citable Jan Brase DataCite.
1 IDF Annual Members Meeting June 23, 2004 IDF – Annual Members Meeting Implementation Update.
The German National Library of Science and Technology as a DOI RA 2007.
Access to non-textual information 2008 Jan Brase IDF Open Meeting: Resource Access for a Digital World June 17th, 2008, Brussels.
Introduction to DataCite Adam Farquhar PhD Head of Digital Library Technology, The British Library President, DataCite June 2010.
Technical Highlights 25th August 2011 Sebastian Peters German National Library of Science and Technology.
Introduction to DataCite Adam Farquhar, PhD Head of Digital Library Technology, The British Library President, DataCite June, 2010.
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Frauke Ziedorn IATUL Workshop 2013 Research Data Management: Finding our Role 6. December 2013 PIDs and DOI Registration with DataCite.
Digital Object Identifiers for EOSDIS data ESDSWG TIWG November 2, 2011 John Moses, ESDIS
THE ODIN PROJECT Sergio Ruiz – DataCite Laura Paglione – ORCID ORCID and DataCite Interoperability Network: Connecting Identifiers This project has received.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of Pretoria.
Data Publishing Workflows: Strategies and Standards
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Future of Library and Information services Jan Brase, DataCite - TIB ICSTI-ITOC meeting March 17th Hannover.
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
Digital Object Identifiers for EOSDIS data ESIP Winter Meeting Jan 6, 2011 John Moses, ESDIS
1 Chuck Koscher, CrossRef New Developments Relating to Linking Metadata Metadata Practices on the Cutting Edge May 20, 2004 Chuck Koscher Technology Director,
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
ICPSR’s Approach to Data Citation and Persistent Identifiers Mary Vardigan Assistant Director, ICPSR Workshop on Persistent Identifiers in the Social Sciences.
DataCite Canada Cyndie Found, CISTI Background : Who is CISTI, Definition of Data Research Data Management(RDM) – Benefits, Challenges Addressing.
ODIN – ORCID and DataCite Interoperability Network ODIN Event October 2013 Jude England– British Library Funded by The European Union Seventh Framework.
DataCite and the CODATA task group on data citation Jan Brase, DataCite ICSTI workshop “Delivering data in science” March 5 th 2012 Paris.
ORCID and me: DataCite ORCID Outreach Meeting Jan Brase, Managing agent DataCite September 17th, 2011 CERN.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
DataCite – Persistent links to scientific data Jan Brase, DataCite – TIB 1st PRELIDA workshop PISA, June 26th.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
DOI and DataCite Establishing information infrastructures Dr. Irina Sens 14. Conference „Consortia Library Systems: Technologies and Innovation“ 23. Juni.
World Data Center for Marine Environmental Sciences.
The DOI Standard Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards and Best Practices March.
DataCite CODATA Symposium Jan Brase, Managing agent DataCite August 22nd, 2011 Berkeley.
DOI uses cases for data Jan Brase DOI outreach meeting November 21 st Milano.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Data Citation & Digital Object Identifiers DOIs. 2 DOIs for articles mints DOIs for Journal articles and some datasets.
Dataset Metadata Joan Starr California Digital Library January, Tools and Approaches for Access and Preservation.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
ODIN – ORCID and DATACITE Interoperability Network Presentation to S&C Open House January 2013 John Kaye – British Library Funded by The European Union.
Libraries and data – the DataCite consortium Jan Brase, DataCite February 2nd, 2011 Workshop: Persistent Identifiers for the Social Sciences Bonn, Germany.
The Many Facets of Metadata Exchange Between Publishers and the Research Community: The Role that A&I Services and DOIs Play in Providing Access to Electronic.
Data Citation & Digital Object Identifiers DOIs. 2 Digital Object Identifiers 101 Persistent identifier Identifies intellectual property in the digital.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
|| Barbara Hirschmann1 Establishing a DOI service for Switzerland’s university and research sector.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group.
1 Introducing the Australian National Data Service (ANDS) Research data as a scholarly output Options for data publishing and data discovery Make your.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
DataCite Adam Farquhar DataCite President ODIN Conference, CERN,
Data Citation Implementation Pilot Workshop
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
ICSU-WDS & RDA Data Publication Services WG. 2 Linking Research Data and the Literature: why? Why link? 1.Increase visibility & discoverability of research.
Networked Information Resources Federated search, link server, e-books.
| 1 Anita de Waard, VP Research Data Collaborations Elsevier RDM Services May 20, 2016 Publishing The Full Research Cycle To Support.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
Paradigm shifts in Information Access - beyond classical scholarly publication Jan Brase, DataCite - TIB GL14 Conference November 29th Rome.
NRF Open Access Statement
Norman Paskin International DOI Foundation
ACS 2016 Moving research forward with persistent identifiers
A step-by-step guide to DOI registration
ORCID y la comunidad global
DataCite - A global registration agency for research data
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research data in library catalogues and the joint initiative of European technical libraries for data registration Jan Brase Workshop Primary data for.
Bird of Feather Session
Persistent identifiers for instruments (PIDINST) working group
Jez Cope, Data Services Lead, The British Library
Presentation transcript:

DataCite – Bridging the gap and helping to find, access and reuse data Herbert Gruttemeier INIST-CNRS Paris, IPSL, 11/7/2013

Digital Object Identifiers (DOI names) offer a solution Mostly widely used identifier for scientific articles Researchers, authors, publishers know how to use them Put datasets on the same playing field as articles Dataset Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA. doi: /PANGAEA URLs are not persistent (e.g. Wren JD: URL decay in MEDLINE- a 4-year follow-up study. Bioinformatics. 2008, Jun 1;24(11):1381-5).   DOI names for citations

Publishers’ data policies

H. GRUTTEMEIER Publishers’ data policies extract from Nature Publishing Group, Editorial Policies, Availability of data and materials

H. GRUTTEMEIER9 Data journals

At the infrastructure level, DOI names are handles.

From KE workshop presentation, The Hague, June 2011 (L. Lannom)

From KE workshop presentation, The Hague, June 2011 (N. Paskin)

plutôt: identifiant numérique d’objet « The objects identified by DOI names may be of any form - digital, physical, or abstract - as all these forms may be necessary parts of a content management system. The DOI system is an abstract framework which does not specify a particular context of its application, but is designed with the aim of working over the Internet. » Norman Paskin, « Digital Object Identifier (DOI®) System »

DataCite Global consortium carried by local institutions Focused on improving the scholarly infrastructure around datasets and other non-textual information Focused on working with data centres and organisations that hold data Providing standards, workflows and best-practice Initially, but not exclusively based on the DOI system Memorandum of Understanding, Paris, February 2009 Officially founded December 1st 2009 in London

DataCite Members Technische Informationsbibliothek (TIB), Germany Canada Institute for Scientific and Technical Information (CISTI) California Digital Library, USA Purdue University, USA Office of Scientific and Technical Information (OSTI), USA The British Library Technical Information Center of Denmark (DTU) Library of TU Delft, The Netherlands ZBMed, Germany ZBW, Germany GESIS, Germany Library of ETH Zürich, Switzerland Institut de l’Information Scientifiqueet Technique (INIST-CNRS), France Swedish National Data Service (SND) Australian National Data Service (ANDS) Conferenza dei Rettori delle Università Italiane (CRUI) National Research Council of Thailand (NRCT) Affiliated members: Digital Curation Center, UK Microsoft Research Interuniversity Consortium for Political and Social Research (ICPSR), USA Institute of Electrical and Electronics Engineers (IEEE), USA Korea Institute of Science and Technology Information (KISTI) Bejiing Genomic Institute (BGI) Harvard University Library, USA

DataCite The DataCite registration agency –Maintains the resolution infrastructure –Maintains a searchable database of metadata –Manages the identifiers over the long term –Establishes and shares best practice Publishing agents (data centres, research institutes, data publishers) are responsible for –Quality assurance –Content storage and access –Creating the identifiers –Creating and updating metadata

Earth quake events => doi: /GFZ.GEOFON.gfz2009kciu doi: /GFZ.GEOFON.gfz2009kciu Climate models => doi: /WDCC/dphase_mpepsdoi: /WDCC/dphase_mpeps Sea bed photos => doi: /PANGAEA doi: /PANGAEA Distributes samples => doi: /PANGAEA.51749doi: /PANGAEA Medical case studies => doi: /eaacinet2007/CR/ doi: /eaacinet2007/CR/ Computational model => doi: /02/4E9F69C011BC8doi: /02/4E9F69C011BC8 Audio record => doi: /PANGAEA doi: /PANGAEA Grey Literature => doi: /GBV: doi: /GBV: Videos => doi: / doi: / What type of data are we talking about? Anything that is the foundation of further research is research data Data is evidence

DataCite Structure Carries International DOI Foundation DataCite Member Institution Data Centre Member Institution Data Centre … Works with Managing Agent (TIB) Member Associate Stakeholder

Bridging the gap PublishersData centres DOIs in Use: DataCite CrossRef has registered more than 51 million DOIs on behalf of scholarly publishers. But CrossRef DOIs are not the only DOIs available in the scholarly community. DOIs for datasets associated with scholarly research are being registered by institutions in the DataCite network. DataCite and CrossRef have committed to the interoperability of their DOIs. Ideally, scholarly content like journals will cite related data by the appropriate DataCite DOI, and in return, the data record will cite the relevant article’s CrossRef DOI. (from CrossRef Quarterly, January 2012)

Bridging the gap

Connecting article and underlying data via DOI: The dataset: Storz, D et al. (2009): Planktic foraminiferal flux and faunal composition of sediment trap L1_K276 in the northeastern Atlantic. Is supplement to the article: Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull, Detlef; Kucera, Michal (2009): Seasonal and interannual variability of the planktic foraminiferal flux in the vicinity of the Azores Current. Deep-Sea Research Part I-Oceanographic Research Papers, 56(1), , Data citation

Bridging the gap DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence DataCite supports data centres by providing workflows and standards for data publication DataCite supports publishers by enabling linking from articles to the underlying data

Working Groups Business Practices Criteria for Data Centers Identifier Syntax Metadata Services Special Datasets Technical Infrastructure

MDS: Central portal allowing access to the metadata from all registered objects (OAI)

Service for displaying DataCite metadata Different formats (BibTeX, RIS, RDF, etc.) Content Negotation (through MIME-Typ) –Access through DOI proxy ( –First implemented by CNRI and CrossRef: Documentation: Service for displaying DataCite metadata in different formats (BibTeX, RIS, RDF, etc.) A particular representation of the metadata can be requested via content negotiation Documentation:

Resolution - Current Status Persistent Identifier (DOI, URN, …) Resolver (DataCite, …) Mapping Table PID - URL Landing Page with catalog metadata (human-readable) Data Client (Web-Browser) requesting PID Details on Data (Rich Metadata) (human-readable) Details on Data (Rich Structured Metadata) (machine- actionable) Problem Not machine- actionable

Content Negotiation - Based on the Solution of CrossRef/DataCite Persistent Identifier (DOI, URN, …) Resolver (DataCite, …) Mapping Table PID - URL Web Page on Data with catalog metadata (human-readable) Data Client requesting PID Details on Data (Rich Metadata) (human-readable) Details on Data (Rich Structured Metadata) (machine- actionable) Different Accept Headers in addition to URL requesting different representations of PID

List of repositories for research data

Some recent related developments Thomson-Reuters Data Citation Index ORCID official launch ODIN European project CODATA/ICSTI Working Group on Data Citation Creation of the Research Data Alliance

ORCID and DataCite Interoperability Network « ODIN will build on the ORCID and DataCite initiatives to uniquely identify scientists and data sets and connect this information across multiple services and infrastructures for scholarly communication. It will address some of the critical open questions in the area: Referencing a data object; Tracking of use and re- use; Links between a data object, subsets, articles, rights statements and every person involved in its life-cycle. »

Thank you