Pilot Implementation: Publication and Citation of Scientific Primary Data Result of CODATA WG, supported by DFG Jan Brase Learning Lab Lower Saxony, Uni.

Slides:



Advertisements
Similar presentations
The Benefits of Cross- Linking The International Continental Scientific Drilling Program (ICDP) Jens Klump et al. Knowledge by Networking - Digitising.
Advertisements

1 IDF Annual Members Meeting June 23, 2004 IDF – Annual Members Meeting Implementation Update.
doi> Digital Object Identifier: overview
Pilot Implementation: Publication and Citation of Scientific Primary Data Result of CODATA WG, supported by DFG Jan Brase Learning Lab Lower Saxony, Uni.
Access to non-textual information 2008 Jan Brase IDF Open Meeting: Resource Access for a Digital World June 17th, 2008, Brussels.
Std-doi Publication of Climate Data at WDCC DataCite Summer Meeting 7./8. June 2010 Publication of climate data Heinke Höck World Data Center for Climate.
Introduction to DataCite Adam Farquhar PhD Head of Digital Library Technology, The British Library President, DataCite June 2010.
Introduction to DataCite Adam Farquhar, PhD Head of Digital Library Technology, The British Library President, DataCite June, 2010.
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
DOIs for Tracking and Citing Scientific Data J. Klump, J. Wächter and M. Lautenschlager CODATA Conference 2006 Beijing, PR China.
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Data.
M.Lautenschlager (WDCC/MPI-M) / / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO.
M.Lautenschlager (WDCC, Hamburg) / / 1 Conception of Citing Scientific Primary Data (Result of CODATA WG, supported by DFG) Michael Lautenschlager.
New DFG Information Infrastructure Projects Dr. Stefan Winkler-Nees; Birmingham, 28. March 2011 New DFG Information Infrastructure Projects.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
M. Stockhause et al. Martina Stockhause, Michael Lautenschlager, Frank Toussaint Deutsches Klimarechenzentrum (DKRZ) World Data Centre for Climate (WDCC)
German Cluster of WDCs for Earth System Research - Entwurf - Michael Lautenschlager 1, Michael Diepenbroek 2, Hannes Grobe 2, Michael Bittner 3, Jens Klump.
M. Diepenbroek (MARUM), M. Lautenschlager (MPI-M), E. Paliouras (DLR), H. Grobe (AWI) CODATA General Assembly, Berlin World Data Center Cluster.
Review on 5 Years DataCite and 10 Years DOI Registration for Data DataCite Annual Conference 2014 Nancy, August 25th – 26th Michael Lautenschlager (DKRZ.
M.Lautenschlager (WDCC / MPI-M) / / 1 GO-ESSP at LLNL Livermore, June 19th – 21st, 2006 World Data Center Climate: Status and Portal Integration.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
M.Lautenschlager (WDCC / MPI-M) / / 1 AGU Fall Meeting, San Francisco, December 2005 Michael Lautenschlager - WDC Climate (Max-Planck-Institut.
M. Lautenschlager (M&D/MPIM)1 The CERA Database Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie Workshop "Definition.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
China’s Scientific Data Sharing Initiatives and Future Perspective Pro. Peng, Jie Dr. Liu, Runda 5 March 2012,
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
CrossRef, DOIs and Data: A Perfect Combination Ed Pentz, Executive Director, CrossRef CODATA ’06 Session K4 October 25, 2006.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
F. Toussaint (WDCC, Hamburg) / / 1 CERA : Data Structure and User Interface Frank Toussaint Michael Lautenschlager World Data Center for Climate.
World Data Center for Marine Environmental Sciences.
The DOI Standard Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards and Best Practices March.
ICSTI Annual Members’ Meeting & Workshop Dr. Stefan Winkler-Nees; Paris, 5. March 2012 The Alliance of German Science Organisations - Recommendations on.
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
Publication and Citation of Scientific Primary Data at WDC Climate (WDCC ) Michael Lautenschlager (WDCC) Heinke Höck (WDCC) Jan Brase (TIB) Susanne Waszkewitz.
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Wolfgang.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
Semantic linking of data and journal publications in the STD-DOI project Jens Klump and STD-DOI Team European GeoInformatics Workshop Edinburgh, 7 March.
International Data Exchange Workshop, Kiel, PANGAEA Publishing Network for Geoscientific & Environmental Data.
The Many Facets of Metadata Exchange Between Publishers and the Research Community: The Role that A&I Services and DOIs Play in Providing Access to Electronic.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
The Repository of the World Data Centre for Climate Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie Repositories in Research.
Every bit counts Data management and data publication in the earth sciences Jens Klump et al. International Data Exchange Workshop Kiel, 10 May 2007.
Lautenschlager + Thiemann (M&D/MPI-M) / / 1 Introduction Course 2006 Services and Facilities of DKRZ and M&D Integrating Model and Data Infrastructure.
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Copyright and Data Matthew Mayernik National Center for Atmospheric Research Section: Responsible Data Use Version 1.0 October 2012 Copyright 2012 Matthew.
CNR – National Research Council, Rome (IT) Central Library ‘G. Marconi’ National Centre for Grey Literature and National ISSN Centre CNR – National Centre.
TOWARDS A DATA CITATION STANDARD FOR GEOSS I. McCallum, H.-P. Plag & S. Fritz.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
Hannes Thiemann Michael Lautenschlager Deutsches Klimarechenzentrum GmbH, Germany EGU 2010.
Data Citation Implementation Pilot Workshop
M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut.
Open Access data at VLIZ Experience in retrieving data from EMODnet “Data ingestion, archiving, citation and DOI” June 26, 2014.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Click to edit Master title style Click to edit Master text styles Second level Third level Fourth level Fifth level 1 SI O S Svalbard Integrated Arctic.
Reference Management Module I: Introduction By Rehema Chande-Mallya(PhD)
Russian Academy of Sciences
A step-by-step guide to DOI registration
Non-profit DOI registration agency for Scientific primary data
Tech introduction.
Research data in library catalogues and the joint initiative of European technical libraries for data registration Jan Brase Workshop Primary data for.
Presentation transcript:

Pilot Implementation: Publication and Citation of Scientific Primary Data Result of CODATA WG, supported by DFG Jan Brase Learning Lab Lower Saxony, Uni. Hannover Michael Lautenschlager WDC for Climate Model and Data / Max-Planck-Institute for Meteorology ERPANET WS, Cork, Ireland, IDF Member's Meeting, London,

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 2 Roots CODATA 1) National Committee initiated WG, grant-aided by DFG Working Period September 2001 to May 2002 Result Final Report "Konzept zur Zitierfähigkeit wissenschaftlicher Primärdaten" or "Conception of Citing Scientifc Primary Data", Hannover, Continuation Two year project for pilot implementation funded by DFG starting in October 2003 ( 1) CODATA - Committee on Data for Science and Technology)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 3 Northern Hemisphere temperature response for scenario IS92a NH mean temperature anomaly relative to 1961 – 1990 mean of the IPCC DDC greenhouse gas only experiments ECHAM4 / 1 :  T = 0.7°C ECHAM4 / 2 :  T = 2.5°C ECHAM4 / 3 :  T = 4.3°C Each curve is connected with appr. 1TB data (numbers)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 4 ECHAM4 / 1: Temperature °C to -12°C ECHAM4/OPYC greenhouse gas only according to IS92a Corresponding to point 1 in NH temperature anomaly CO2 = 370 ppmv

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 5 ECHAM4 / 2: Temperature 2050 ECHAM4/OPYC greenhouse gas only according to IS92a Corresponding to point 2 in NH temperature anomaly: CO2 = 500 ppmv -4°C to -8°C

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 6 ECHAM4 / 3: temperature anomaly 2099 ECHAM4/OPYC greenhouse gas only according to IS92a Corresponding to point 3 in NH temperature anomaly: CO2 = 690 ppmv 0°C to -4°C

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 7 Problem and Solution Shortcomings in data provision and interdisciplinary use Rules of good scientific practise are not taken into account in all cases. Data sources are widely unknown. Data are achived without context. Data cannot be cited as independent entities Method of solution: publication of primary data as independent entities Persitent Identifier with global resolving mechanism for data archive and context referencing (scientifc datamodel at archive level) Integration into library catalogues in order to find data together with articles STD-DOI application profile: meta data kernel + items for electronic publication (interface between scientific data archives and libraries)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 8 Credits in Science "Citation Index": Scientific efficiency is "measured" by publications. Extra work for data publication is currently not acknowledged. Data processing, context documentation, quality assurance. Recommendation: Data publications should be included in the standard scientific "Citation Index". Motivation of the individual scientist. Connection between person and primary dataset. Citable Data publications support the rules of good scientific practise. encourage inter-disciplinary data utilisation. Make data searchable in library catalogues together with articles Closes the gap between scientifc literature and related data sources

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 9 Metadata for primary data 1 AttributeExample 1. DOI /WDCC/IPCC_EH4_OPYC_SRES_B2_MM 2. identifierURN:TIB: /WDCC/IPCC_EH4_OPYC_SRES_B 2_MM 3. creatorMonika Esch (Author) 4. publisherWDCC, World Data Center for Climate 5. titleClimate Projection for the next Century calculated by the Global Climate Model ECHAM4- OPYC using the SRES B2 IPCC Scenario 6. languageen 7. StructuralTypeDigital 8. modeAbstract 9. resourceTypeDataset

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 10 Metadata for primary data 2 AttributeExample registration information (RA) / 1 (issue no.) / (issue date) 13. creationDate publicationDate descriptionThese data represent results from the ECHAM4/OPYC climate model running the SRES- B2 sceanrio. The data base tables contain monthly mean time series of …… 16. publicationPlaceHamburg 17. size Bytes 18. formatGRIB 19. edition1 20. relatedDOIs(none)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 11 Criteria for Persistent Identifier Allocation Critical points are securing of data quality and stable connection between identifier and data entity Allocation is restricted to syntax control and completeness, i.e. expert data description and long-term archiving Scientific quality assurance is expected by the author and will be reviewed during the allocation process. Published primary data cannot be changed like published articles. Stable connection between identifier reference and data entity as well as long-term availability of the primary data are essential and must be ensured (e.g. ICSU WDC's)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 12 DOI and URN DOI (Digital Object Identifier)URN (Uniform Ressource Name) Non profit, but membership feePresently cost free Extended metadata supportBasic technical metadata System of registration agencies infrastructure Anybody can register URN namespaces Global resolving mechanismResolving at community level

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 13 GFZ Geophysics International DOI Foundation TIB Hannover Registr.Agency M&D/MPIM Climate Models Marum/AWI Observations Data Storage Long-term Archiving In WDC Data Storage Long-term Archiving In WDC Data Storage Long-term Archiving Global Handle System DDB URN-Knot DFG Project "Publication and Citation of Scientific Primary Data" TIB-ORDER Library Catalogue

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 14

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 15 More Details of Pilot Implementation Application Example

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 16 Primary data publication During her research for the World Data Center Climate (WDCC) the scientist Mrs. Weather gains primary data about the weather in Hannover in the year As usual the primary data is tested, evaluated, stored and administrated at the WDCC. In addition Mrs. Weather registers the primary data at the TIB (Primary data publication by STD-DOI/URN assignment)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 17 Registration of primary data After quality assurance WDCC transmits to the TIB the URL where the data can be accessed, together with a XML-file containing all relevant metadata (generated from scientific data model) Including all information obligatory for the citing of electronic media (ISO 690-2) language publisher publishing date publishing place author title size edition

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 18 Identifier The TIB is saving this information about the primary data and awards the primary data with a unique identifier for registration:a DOI DOI (Digital Object Identifier) is a system for persistent and actionable identification and interoperable exchange of intellectual property on digital networks Coordinated by the International DOI foundation (IDF)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 19 Citing primary data In her publications, Mrs. Weather is now citing this primary data with its unique DOI, maintaned from the TIB: doi: /WDCC/W_Han_2003_MMB_ (Prefix) stands for the TIB as the registration agency. WDCCstands for the respective research institute. W_Han_2003_MMB_2is the internal name of the Data

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 20 Resolving the DOI These DOI can be resolved (and the data can be cited) in every browser worldwide in three ways: Or by Doi:// /WDCC/W_Han_2003_MMB_2 (after installing a browser plugin)

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 21

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 22

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 23

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 24 Usage scenario 1 Mr. Storm is reading publications from Mrs. Weather in a journal and would like to analyse her data under different aspects. In his publication ”Comparison of the weather from Hannover and Miami” Mr. Storm cites Mrs. Weathers data using its DOI, refering to the uniqueness and own identity of the original data. Citation example: Weather, 2003: Weather in Hannover for [doi: /WDCC/W_Han_2003_MMB_2]

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 25 Usage scenario 2 Mr. Nice is writing a paper about the sales figures of ice cream in Hannover in 2003, but he has no information about the weather. He uses the TIB as the central registration agency to start a metadata search over the registered primary data. The result is doi: /WDCC/W_Han_2003_MMB_2 He resolves the DOI to find the data sufficient. The metadata refers him to the WDCC as publisher and data archive. In his paper he cites the data again using their DOI.

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 26 URN In cooperation with the German Library (DDB) in Frankfurt, every dataset is also registered with an unique URN, having the same structure as the DOI: DOI-Structure: /WDCC/W_Han_2003_MMB_2 URN-Structure: Urn:TIB: /WDCC/W_Han_2003_MMB_2

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 27 Current situation In cooperation with  World Data Center Climate (WDCC), Max Plank Institut für Meteorologie, Hamburg Geoforschungszentrum Potsdam World Data Center MARE, Uni. Bremen and Alfred Wegener Institute Bremerhaven Learning Lab Lower Saxony, Uni. Hannover the TIB Hannover now is the world‘s first registration agency for scientific and technical data (STD-DOI).

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 28 Technical A Handle server is installed at the TIB Hannover, so TIB is able to register and resolve DOIs. The TIB officially received a DOI Prefix ( ) The first data sets have been stored at the TIB by hand. The automatic registration process is under development.

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 29 Technical realization Cocoon-Webserver XML-basiert XSL-Transformierung Handle Server International DOI Foundation DDB Central Library database Göttingen GFZ WDCs Metadata storage URN registration DOI registration Data URL with XML-file

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 30 Outlook 2004 We expect abaout datasets until the end of the year The system shall be widened for other science fields 2006 The TIB Hannover shall become the central registration agency for scientific primary data

J.Brase (L3S) + M.Lautenschlager (WDCC) / / 31 Further information Project webpage: TIB Handle Server: DOI Foundation: URN registration of the DDB: