Research data in library catalogues and the joint initiative of European technical libraries for data registration Jan Brase Workshop Primary data for libraries ICSTI winter meeting, march 2nd 2009 2008
The TIB as a DOI Registration agency
Problem: The research trajectory Data … is lost! are analysed synthesised interpreted become Information … is traceable is published Knowledge, as published through scientific literature, is the last step in a process originating from primary scientific data. (Click) These data are analysed, synthesised, interpreted (Click) and we gain information. (Click) The outcome of this process is published (Click) as a scientific article, which is the classical shape of knowledge.(Click) We can access the article in an easy way (Click) We can trace the information in the article (Click) But in today’s practice data are primarily stored in private files, not in secure institutional repositories, and effectively are lost. becomes Knowledge … is accessible Publication
Approach The German Research Foundation (DFG) has started the project Publication and Citation of Scientific Primary Data to increase the accessibility of scientific research data, starting with the field of earth science. TIB is now established as a “non-commercial” DOI-registration agency for scientific research data as a member of the International DOI Foundation (IDF).
Data and article The DOI system offers an easy way to connect the article with the underlying data: The dataset: G.Yancheva, . R Nowaczyk et al (2007) Rock magnetism and X-ray flourescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA doi:10.1594/PANGAEA.587840 Is supplement to the article: G. Yancheva, N. R. Nowaczyk et al (2007) Influence of the intertropical convergence zone on the East Asian monsoon Nature 445, 74-77 doi:10.1038/nature05431
Results Citability of research data High visability of the data Easy re-use and verification of the data sets. Scientific reputation for the collection and documentation of data (Citation Index) Encouraging the Brussels declaration on STM publishing Avoiding duplications Motivation for new research
Scope The TIB registers research data worldwide from a scientific, technical or medical background. So far over 600,000 information objects have received a DOI name. The storage, maintaining and evaluation of the contents lies with the content providers (institutions, data centers, …) the data centers
Content providers, Data centers, who are responsible for: Quality assurance Storage of the content and accessibility Creation of metadata The TIB stores the metadata and keeps it searchable.
Research data in library catalogues
Examples Catalogue: http://tws.gbv.de/CHARSET=ISO-8859-1/DB=2.63/LNG=DU/SID=445afc80-6b/CMD?ACT=SRCHA&IKT=1016&SRT=YOP&TRM=exk+primaerdaten Dataset1: 10.1594/GFZ.SDDB.1128 Dataset2: 10.1594/PANGAEA.605212 Dataset3: 10.1594/PANGAEA.667386
Building an infrastructure
An ideal cycle of research Data analysed synthesised interpreted Accumulation Publication (DOI) Research Catalogue Publication(DOI) Data archive Publishers Information Experiment Publication(DOI) linking Peer-Review Inspiration Publication
3 layered Data Infrastructure Libraries Search results, DOI-Registriation Costs: 0,01 – 1 Euro per dataset Data centers Storage, Quality assurance , Metadata Costs: 50 – 500 Euro per dataset (usually around 1% of data creation costs) Scientists Data harvesting, Data production Costs: 5.000 – 5.000.000 Euro per dataset, (unlimited)
? Problems Missing Infrastructure in many disciplines Missing funding possibilities Missing policies for data contribution ! ? ?
Cooperations In 2008 TIB has started a pilot project with Thieme Chemistry. Chemical research data as supplement to articels from chemical journalss SYNLETT and SYNTHESIS will be registered with DOI names from TIB. The data sets will be stored at Fachinformationszentrum Karlsruhe. Access will be free of charge.
Joint initiative of European technical libraries for data registration
Re-organisation So far the TIB is a DOI registration agency for primary data (and other non-commercial scientific information) with other local institutions as its customers. In 1999 the publishers funded their independent DOI agency CrossRef In 2008/2009 TIB will transit the DOI registration to a new worldwide agency, carried by local information institutes and libraries.
Opportunities Easy access to the DOI system for non-commercial information institutes and libraries worldwide. Access to Research Data is part of the national responsibilities. As science is international we aim at creating a global cooperation with national representatives.
Partners The core group of partners are: ETH Zürich Library, Switzerland L’Institut de l’Information Scientifique et Technique (INIST), France TU Delft library, The Netherlands Technical Information Center of Denmark British Library Always open for other institutions