PSI Meta Data meeting, Toulouse - 15 November 2005 - 1 The CERA C limate and E nvironment data R etrieval and A rchiving system at MPI-Met / M&D S. Legutke,

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

Forest Markup / Metadata Language FML
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, M. Lautenschlager, H. Luthardt,
1 CEOS/WGISS20 – Kyiv – September 13, 2005 Paul Kopp SIPAD New Generation: Dominique Heulet CNES 18, Avenue E.Belin Toulouse Cedex 9 France
Meta Dater Metadata Management and Production System for surveys in Empirical Socio-economic Research A Project funded by EU under the 5 th Framework Programme.
M.Lautenschlager (WDCC/MPI-M) / / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO.
CERA / WDCC Hannes Thiemann Max-Planck-Institut für Meteorologie Modelle und Daten zmaw.de NCAR, October 27th – 29th, 2008.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Information systems and databases Database information systems Read the textbook: Chapter 2: Information systems and databases FOR MORE INFO...
M.Lautenschlager (WDCC / MPI-M) / / 1 WS Spatiotemporal Databases for Geosciences, Biomedical sciences and Physical sciences Edinburgh, November.
M.Lautenschlager (WDCC / MPI-M) / / 1 GO-ESSP at LLNL Livermore, June 19th – 21st, 2006 World Data Center Climate: Status and Portal Integration.
ISO Standards: Status, Tools, Implementations, and Training Standards/David Danko.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
M. Lautenschlager (M&D/MPIM)1 The CERA Database Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie Workshop "Definition.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
1 Copyright © 2004, Oracle. All rights reserved. Introduction to Oracle Forms Developer and Oracle Forms Services.
CAA/CFA Review | Andrea Laruelo | ESTEC | May CFA Development Status CAA/CFA Review ESTEC, May 19 th 2011 European Space AgencyAndrea Laruelo.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
1 The NERC DataGrid DataGrid The NERC DataGrid DataGrid AHM 2003 – 2 Sept, 2003 e-Science Centre Metadata of the NERC DataGrid Kevin O’Neill CCLRC e-Science.
F. Toussaint (WDCC, Hamburg) / / 1 CERA : Data Structure and User Interface Frank Toussaint Michael Lautenschlager World Data Center for Climate.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute for Meteorology German Climate Computing Centre (DKRZ)
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
2Object-Oriented Analysis and Design with the Unified Process Objectives  Describe the differences and similarities between relational and object-oriented.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
The european ITM Task Force data structure F. Imbeaux.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
M.Lautenschlager (WDCC, Hamburg) / / 1 ICSU World Data Center For Climate Semantic Data Management for Organising Terabyte Data Archives Michael.
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
TPAC Tasmanian Partnership for Advanced Computing Partner in APAC (Australian Partnership for Advanced Computing) Expertise centre for Earth Systems Science.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
M. Lautenschlager (M&D) / / 1 ENES: The European Earth System GRID ENES – Alcatel WS , ANTWERPEN Michael Lautenschlager Model and.
Michael Lautenschlager, Hannes Thiemann, Frank Toussaint WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Joachim Biercamp, Ulf Garternicht,
H. Thiemann (M&D) / / 1 Hannes Thiemann M&D Statusseminar, 22. April 2004.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
The Repository of the World Data Centre for Climate Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie Repositories in Research.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
H. Widmann (M&D) Data Discovery and Processing within C3Grid GO-ESSP/LLNL / June, 19 th 2006 / 1 Data Discovery and Basic Processing within the German.
Lautenschlager + Thiemann (M&D/MPI-M) / / 1 Introduction Course 2006 Services and Facilities of DKRZ and M&D Integrating Model and Data Infrastructure.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
ESRI Education User Conference – July 6-8, 2001 ESRI Education User Conference – July 6-8, 2001 Introducing ArcCatalog: Tools for Metadata and Data Management.
Merging Metadata Standards: FGDC CSDGM and ISO and Sharon Shin Federal Geographic Data Committee Metadata Coordinator
Create XML from a template Browse available records WDCC Metadata Generation with GeoNetwork Hans Ramthun, Michael Lautenschlager, Hans-Hermann Winter.
The Proliferation of Metadata Standards and the Evolution of NASA’s Global Change Master Directory (GCMD) Standard for Uses in Earth Science Data Discovery.
British Atmospheric Data Centre ( Searching: Whither NDG? Bryan Lawrence.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
Hannes Thiemann Michael Lautenschlager Deutsches Klimarechenzentrum GmbH, Germany EGU 2010.
M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
ODP V2 Data Provider overview. 22 Scope Data Provider provides access to data and metadata of the local data systems. Data Provider is a wrapper, installed.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
AP7/AP8: Long-Term Archival of CMIP6 Data
World Conference on Climate Change October 24-26, 2016 Valencia, Spain
An Overview of Data-PASS Shared Catalog
Flanders Marine Institute (VLIZ)
Data Management: Documentation & Metadata
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Presentation transcript:

PSI Meta Data meeting, Toulouse - 15 November The CERA C limate and E nvironment data R etrieval and A rchiving system at MPI-Met / M&D S. Legutke, F. Toussaint, M. Lautenschlager

PSI Meta Data meeting, Toulouse - 15 November Content History, Architecture, Usage of the CERA DB WDCC, IPCC/DDC, CEOP : data archives hosted by CERA Core and Extensions of the CERA meta data model Relations with other meta data standards

PSI Meta Data meeting, Toulouse - 15 November CERA compliant with DIF (DirectoryInterchangeFormat), NASA Hierachic 2-layer structure: Experiments => Datasets Shortcomings: - static 2-layer horizontal structure of climate model data - restructuring needed History Architecture Usage

PSI Meta Data meeting, Toulouse - 15 November CERA-21997, compliant in addition with FGDC meta data standard 1-layer structure: RDBMS with tree-like / hierachical / network relations between entities Requirements: - geographically distributed archives - common meta data model for all archives => simple but extendible - one GUI for all archives History Architecture Usage Unchanged since 7 years

PSI Meta Data meeting, Toulouse - 15 November History Architecture Usage User Application Server DBMS (Oracle): 12 TB in 10/2002 Metadata, Blob-Data, Processing Fileserver (Unitree) Processed + Raw Data Mass Storage Archive ( 0.5 PB in 10/2002) FTP Data Migration SQL*Net IIOP CORBA-Client RMI/IIOP http, jdbc, iiop Direct file access 177 TB in 11/ PB in 11/2005

PSI Meta Data meeting, Toulouse - 15 November Mass Storage capacity/load " tape archive: STK Tape Silo > 3.4 PB " disks: 177 TB in Oracle RDBMS (web accessible; applet or servlet) " Bandwidth compute - data server 450 MB / sec " 1 TB/day automated filling at model run time (IPCC) " 3.4 PB data in files (no.=67263) " No. of experiments: 570 " > 1000 requests per day History Architecture Usage

PSI Meta Data meeting, Toulouse - 15 November WDCC IPCC/DDC CEOP Other CERA is hosting the data of World Data Centre of Climate Maintained by M&D in cooperation with DKRZ and MPI-Met Collection and dissemination of data related to climate change (focus on georeferenced data) Access: WWW or FTP (on request)

PSI Meta Data meeting, Toulouse - 15 November WDCC IPCC/DDC CEOP Other M&D and its CERA DB is acknowledged as Data Distribution Centre for IPCC model data Hosting (and distributing) a subset of IPCC data all monthly mean model data of AR4, TAR, SAR

PSI Meta Data meeting, Toulouse - 15 November WDCC IPCC/DDC CEOP Other

PSI Meta Data meeting, Toulouse - 15 November CERA-2 holds the CEOP data archive (Coordinated Enhanced Observing Period) " " Strong cooperation with GEWEX, CLIVAR, CLiC, IGOS-P, CEOS " web based access to xml meta data and data files WDCC IPCC/DDC CEOP Other

PSI Meta Data meeting, Toulouse - 15 November The Winter TopTen Program identifies the world’s largest and most heavily used databases. reached in September, 13 th : ….. Congratulations on achieving Grand Prize award winner status (1) in Database Size, Other, All and TopTen Winner status Database Size, Other, Linux;Workload, Other, Linux in Winter Corp.'s 2005 TopTen Program! (1) Grand prizes are awarded for first place winners in the All Environments categories only. WDCC's CERA DB has been identified as the largest Linux DB.

PSI Meta Data meeting, Toulouse - 15 November Collaborations within Climate Community Data Archive Initiative " DFD/DLR " IPA/DLR " DOD " DWD " GFZ " PANGAEA/AWI " xDAT/PIK " CERA-2/PIK " ECMWF " CERA-2/DKRZ " BADC Distributed Archive

PSI Meta Data meeting, Toulouse - 15 November CERA-2 Metat data model Core scheme: - valid for all entries Extensions: - community defined Module (e.g. PIK, DKRZ, PRISM to be defined?) - user defined local extension Structural flexibility: - definable fields, tables, entry types & various other - flexible lists of valid values (LOV): extensible but controlled Simple structure: - blockwise table groups - all CERA-2 blocks have a similar structure - more complex structures go into CERA Modules Core and Extensions

PSI Meta Data meeting, Toulouse - 15 November The CERA Core meta data: " only data common to most data in geophysics " compliant with 1 st level of FGDC standard " sufficient to answer: " What data are stored? " How to get assistance? " How to get the data? Little information is requireable, in order to make the model applicable for as many institutions/data as possible ! Schema and example at The core meta data system is extendible but not changeable (e.g. the CERA Core table structure may not be changed) Core and Extensions

PSI Meta Data meeting, Toulouse - 15 November Parameter Block describes data topic, variable and unit Metadata Entry This is the central CERA Block, providing information on the entry's title type and relation to other entries the project the data belong to a summary of the entry a list of general keywords related to data creation and review dates of the metadata Coverage Information on the volume of space-time covered by the data Reference Any publication related to the data together with the publication form Status Status information like data quality, processing steps, etc. Distribution Distribution information including access restrictions, data format and fees if necessary Contact Data related to contact persons and institutes like distributor, investigator, and owner of copyright Spatial Reference Information on the coordinate system used Core and Extension FGDC level 1 Extension needed for Grid description

PSI Meta Data meeting, Toulouse - 15 November The Core structure

PSI Meta Data meeting, Toulouse - 15 November Parameter Block describes data topic, variable and unit Metadata Entry This is the central CERA Block, providing information on the entry's title type and relation to other entries the project the data belong to a summary of the entry a list of general keywords related to data creation and review dates of the metadata Additionally: Modules / Local Extensions Module DATA_ORGANIZATION (grid structure) Module DATA_ACCESS (physical storage) Local extension for specific information on (e.g.) data usage data access and data administration Coverage Information on the volume of space-time covered by the data Reference Any publication related to the data together with the publication form Status Status information like data quality, processing steps, etc. Distribution Distribution information including access restrictions, data format and fees if necessary Contact Data related to contact persons and institutes like distributor, investigator, and owner of copyright Spatial Reference Information on the coordinate system used Core and Extension

PSI Meta Data meeting, Toulouse - 15 November Core and Extensions ENTRY entry_id. PARAMETER entry_id. data_org_id data_access_id.. DATA_ORG data_org_id data_org_descr space_id time_id DATA_ACCESS data_access_id access_structure_id storage1_id storage2_id storage3_id storage4_id rec_structure_id modification_date CORE

PSI Meta Data meeting, Toulouse - 15 November CERA: Module Example

PSI Meta Data meeting, Toulouse - 15 November Core and Extensions DATA_ORG module data_org_descr/name/acronym space_id: key of table with space information gridded or point data (station data, buoys, ships, …) gridded data only if lat/lon coordinates time_id : key of table with time information (grid) => any data value locatable in space / time

PSI Meta Data meeting, Toulouse - 15 November Meta data not in the CERA core can be defined in new modules. Presently: " DATA_ORG module " DATA_ACCESS module Presently there is little information on model code (= NMM code base) or on configurations of models (=NMM models) in CERA => define model meta data module A minimum of specifications should be required (allowing to exactly reproduce a model run) Most specifications should be optional Core and Extensions

PSI Meta Data meeting, Toulouse - 15 November A minimum of specifications should be required (allowing to exactly reproduce a model run) " Components involved " Code repository for each component " Code release numbers for each component " Compile scripts " Namelists " Initial data files " Forcing data files Core and Extensions

PSI Meta Data meeting, Toulouse - 15 November Most specifications should be optional: " All the required from above can be split into small pieces of informations and included to the right place of the meta data / tables Core and Extensions

PSI Meta Data meeting, Toulouse - 15 November CF standard CF standard compliancy: Any data file with any file format can be an entry of CERA CERA is primarily containing GRIB single variable data files Support for NetCDF/CF file format is being implemented: - adding meta data elements for the NetCDF/CF attributes if needed - e.g. additional CF_UNIT table - optional retrieval of data time windows of fine granularity - search along NetCDF-CF attributes

PSI Meta Data meeting, Toulouse - 15 November Other standards xsl scripts exists to transfer the CERA meta data into other standards/formats: xhtml DIF (NASA) - xml CSDGM (FGDC) - xml ISO/TC211 (19115/19139) - xml Dublin Core – xml

PSI Meta Data meeting, Toulouse - 15 November The End