Download presentation
Presentation is loading. Please wait.
Published byOctavia Douglas Modified over 9 years ago
1
M. Diepenbroek (MARUM), M. Lautenschlager (MPI-M), E. Paliouras (DLR), H. Grobe (AWI) CODATA General Assembly, Berlin 10.11.2004 World Data Center Cluster „Earth System Research“ - an approach for a common data infrastructure in geosciences WDC-MARE WDC-RSAT WDC-TERRA WDC-Climate (Candidate)
2
? Global increase in publications in empirical sciences
3
The poor availability of scientific data hampers complex and large scale approaches in research Scientific results cannot be verified without the data underlying publications Reproduction is often more expensive than archiving and recycling of data Main problems
4
founded during the International Geophysical Year (IGY) 1957-58 longterm funding and maintainance by their host countries on behalf of the international science community status of WDC is peer reviewed by international research institutes and programmes and funding organisations accept data from national and international scientific or monitoring programs as resources permit. all data held in WDCs are generally available to science scope of data collected: solar, geophysical, environmental, and human dimensions data, especially for monitoring changes in the geosphere and biosphere at present 52 Centers in 12 countries The World Data Center (WDC) System of the International Council for Science (ICSU)
5
Founded in April 2003 in Oberpfaffenhofen Members: WDC-MARE - World Data Center for Marine Environmental Sciences (AWI, MARUM) WDC-C - World Data Center for Climate (MPI, Hamburg) WDC-RSAT – World Data Center for Remote Sensing (DLR, Oberpfaffenhofen) WDC-TERRA – World Data Center of the Lithosphere (candidate) – GeoForschungsZentrum (Potsdam) The WDC cluster „Earth System Research“
6
WDC cluster „Earth System Research“ Data and thematic coverage atmosphere land models ocean
7
Activities & characteristics of the WDC cluster Longterm archiving facilities Clear commission as data libraries Data management infrastructure, expertise, and manpower Longterm commitment and funding Peer review for scientific data Completeness of data set descriptions (metadata) Validity of methods used data values (precision, sequence, and ranges) Data publication based on citable data entities having persistent identifiers (DOI) Userfriendly and reliable systems for data retrieval and distribution General nonrestricted online access Offline products (e.g. data collections, DVD) Fostering common standards and protocols Clear commitment to the rules for good scientific practice!
8
WDC infrastructure Metadataprofile (ISO 19115, subset compatible with Dublin Core and ISO 690) Metadata catalogues based on common protocols (ISO, W3C, OGC) Common internet portal (search engine) Cost models to support longterm archiving at universities and in scientific projects Data publication Migration of metadata into library catalogues and direct access of WDC archives Common search of scientific data and literature Peer review for scientific data Acceptance as citable publication through ISI WDC cluster - milestones
9
Data Publication: Problem and Solution Shortcomings in data provision and interdisciplinary use –Rules of good scientific practise are not taken into account in all cases. –Data sources are widely unknown. –Data are achived without context. –Data cannot be cited as independent entities Method of solution: publication of primary data as independent entities –Persitent Identifier with global resolving mechanism for data archive and context referencing (scientifc datamodel at archive level) –Integration into library catalogues in order to find data together with articles –STD-DOI application profile: meta data kernel + items for electronic publication (interface between scientific data archives and libraries)
10
Data Publication: Credits in Science "Citation Index": Scientific efficiency is "measured" by publications. Extra work for data publication is currently not acknowledged. –Data processing, context documentation, quality assurance. Recommendation: Data publications should be included in the standard scientific "Citation Index". –Motivation of the individual scientist. –Connection between person and primary dataset. Citable Data publications –support the rules of good scientific practise. –encourage inter-disciplinary data utilisation. –Make data searchable in library catalogues together with articles –Closes the gap between scientifc literature and related data sources
11
Data Publication: Metadata for primary data 1 AttributeExample 1. DOI10.1594/WDCC/IPCC_EH4_OPYC_SRES_ B2_MM 2. identifierURN:TIB:10.1594/WDCC/IPCC_EH4_OPY C_SRES_B2_MM 3. creatorMonika Esch (Author) 4. publisherWDCC, World Data Center for Climate 5. titleClimate Projection for the next Century calculated by the Global Climate Model ECHAM4-OPYC using the SRES B2 IPCC Scenario 6. languageen 7. StructuralTypeDigital 8. modeAbstract 9. resourceTypeDataset
12
AttributeExample 10.-12. registration information10.1594 (RA) / 1 (issue no.) / 2004-07-18 (issue date) 13. creationDate2001-12-31 14. publicationDate2004-07-18 15. descriptionThese data represent results from the ECHAM4/OPYC climate model running the SRES-B2 sceanrio. The data base tables contain monthly mean time series of …… 16. publicationPlaceHamburg 17. size614190228 Bytes 18. formatGRIB 19. edition1 20. relatedDOIs(none) Data Publication: Metadata for primary data 2
13
Data Publication: Criteria for Persistent Identifier Allocation Critical points are securing of data quality and stable connection between identifier and data entity –Allocation is restricted to syntax control and completeness, i.e. expert data description and long-term archiving –Scientific quality assurance is expected by the author and will be reviewed during the allocation process. –Published primary data cannot be changed like published articles. –Stable connection between identifier reference and data entity as well as long-term availability of the primary data are essential and must be ensured (e.g. ICSU WDC's)
14
GFZ Geophysics International DOI Foundation TIB Hannover Registr.Agency M&D/MPIM Climate Models Marum/AWI Observations Data Storage Long-term Archiving In WDC Data Storage Long-term Archiving In WDC Data Storage Long-term Archiving Global Handle System DDB URN-Knot DFG Project "Publication and Citation of Scientific Primary Data" TIB-ORDER Library Catalogue Data Publication:
15
Further information Project webpage: http://www.std-doi.de TIB Handle Server: http://doi.tib-hannover.de:8000 DOI Foundation: http://www.doi.org URN registration of the DDB: http://www.persistent-identifier.de
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.