M.Lautenschlager (WDCC/MPI-M) / 24.02.05 / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO.

Slides:



Advertisements
Similar presentations
Std-doi Publication of Climate Data at WDCC DataCite Summer Meeting 7./8. June 2010 Publication of climate data Heinke Höck World Data Center for Climate.
Advertisements

World Data Centre for Climate - WDCC -
New Resources in the Research Data Archive Doug Schuster.
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, M. Lautenschlager, H. Luthardt,
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Data.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
CERA / WDCC Hannes Thiemann Max-Planck-Institut für Meteorologie Modelle und Daten zmaw.de NCAR, October 27th – 29th, 2008.
Активное распределенное хранилище для многомерных массивов Дмитрий Медведев ИКИ РАН.
M.Lautenschlager (WDCC / MPI-M) / / 1 WS Spatiotemporal Databases for Geosciences, Biomedical sciences and Physical sciences Edinburgh, November.
German Cluster of WDCs for Earth System Research - Entwurf - Michael Lautenschlager 1, Michael Diepenbroek 2, Hannes Grobe 2, Michael Bittner 3, Jens Klump.
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, S. Kindermann, M. Lautenschlager,
M.Lautenschlager (WDCC / MPI-M) / / 1 GO-ESSP at LLNL Livermore, June 19th – 21st, 2006 World Data Center Climate: Status and Portal Integration.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
M.Lautenschlager (WDCC / MPI-M) / / 1 AGU Fall Meeting, San Francisco, December 2005 Michael Lautenschlager - WDC Climate (Max-Planck-Institut.
M. Lautenschlager (M&D/MPIM)1 The CERA Database Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie Workshop "Definition.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
CEOS/WGISS 20, Kyev, September 12-16, WTF-CEOP Implementation Plan #1 Status (WTF-CEOP first prototype, by JAXA) September 12, 2005 Osamu Ochiai.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Simple Database.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
1 The NERC DataGrid DataGrid The NERC DataGrid DataGrid AHM 2003 – 2 Sept, 2003 e-Science Centre Metadata of the NERC DataGrid Kevin O’Neill CCLRC e-Science.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
F. Toussaint (WDCC, Hamburg) / / 1 CERA : Data Structure and User Interface Frank Toussaint Michael Lautenschlager World Data Center for Climate.
CS4273: Distributed System Technologies and Programming Lecture 13: Review.
Scientific Investigations; Support from Research Data Archives for Joint Office for Science Support 26 February, 2002 Steven Worley SCD/DSS.
Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute for Meteorology German Climate Computing Centre (DKRZ)
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
Architecture Renovation Yoshiyuki Kudo (JAXA) WGISS-37.
Long-term Archiving of Climate Model Data at WDC Climate and DKRZ Michael Lautenschlager WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Wolfgang.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
M.Lautenschlager (WDCC, Hamburg) / / 1 ICSU World Data Center For Climate Semantic Data Management for Organising Terabyte Data Archives Michael.
Opendap dev - meeting, Boulder, Feb 2007 OPeNDAP infrastructure in European Operational Oceanography T Loubrieu (IFREMER) T Jolibois (CLS)
TPAC Tasmanian Partnership for Advanced Computing Partner in APAC (Australian Partnership for Advanced Computing) Expertise centre for Earth Systems Science.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
M. Lautenschlager (M&D) / / 1 ENES: The European Earth System GRID ENES – Alcatel WS , ANTWERPEN Michael Lautenschlager Model and.
Michael Lautenschlager, Hannes Thiemann, Frank Toussaint WDC Climate / Max-Planck-Institute for Meteorology, Hamburg Joachim Biercamp, Ulf Garternicht,
H. Thiemann (M&D) / / 1 Hannes Thiemann M&D Statusseminar, 22. April 2004.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
The Repository of the World Data Centre for Climate Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie Repositories in Research.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth System Science S. Kindermann, DKRZ, Germany.
PSI Meta Data meeting, Toulouse - 15 November The CERA C limate and E nvironment data R etrieval and A rchiving system at MPI-Met / M&D S. Legutke,
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
Dataset registration process Sergey Sukhonosov, Dr. Sergey Belov National Oceanographic Data Centre, Russia Training course on establishment of the ODP.
Research & Development Building a science foundation for sound environmental decisions Remote Sensing Information Gateway (RSIG)
H. Widmann (M&D) Data Discovery and Processing within C3Grid GO-ESSP/LLNL / June, 19 th 2006 / 1 Data Discovery and Basic Processing within the German.
Lautenschlager + Thiemann (M&D/MPI-M) / / 1 Introduction Course 2006 Services and Facilities of DKRZ and M&D Integrating Model and Data Infrastructure.
MarLIN: a research data metadatabase for CSIRO Marine Research Tony Rees Divisional Data Centre CSIRO Marine Research, Hobart contact:
Create XML from a template Browse available records WDCC Metadata Generation with GeoNetwork Hans Ramthun, Michael Lautenschlager, Hans-Hermann Winter.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
TIGGE Archive Access at NCAR Steven Worley Doug Schuster Dave Stepaniak Hannah Wilcox.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
Hannes Thiemann Michael Lautenschlager Deutsches Klimarechenzentrum GmbH, Germany EGU 2010.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
1. Gridded Data Sub-setting Services through the RDA at NCAR Doug Schuster, Steve Worley, Bob Dattore, Dave Stepaniak.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
World Conference on Climate Change October 24-26, 2016 Valencia, Spain
Flanders Marine Institute (VLIZ)
TIGGE Data Archive and Access System at NCAR
Information Technology Ms. Abeer Helwa
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Presentation transcript:

M.Lautenschlager (WDCC/MPI-M) / / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO Joint Meeting Feb. 28th – Mar. 4th, 2005, Tokyo Michael Lautenschlager Frank Toussaint, Hans Luthardt World Data Center for Climate Max-Planck-Institut für Meteorologie / Modelle und Daten, Hamburg

M.Lautenschlager (WDCC/MPI-M) / / 2 CEOP Data Triangle UCAR WDCCUni. Tokyo

M.Lautenschlager (WDCC/MPI-M) / / 3 Start: Approved in January 2003 Maintenance: Model and Data (M&D/MPI-M) and German Climate Computing Centre (DKRZ) Mission: Data for climate research are collected, stored and disseminated ICSU Policy: long-term archiving and unrestricted data access for scientists Restriction: Only climate data products in CERA DB, no raw data storage. Content: Emphasis is spent on climate modelling and related data products. Co-operation: with thematically corresponding data centres like WDC- MARE (Bremen) and WDC-RSAT (Oberpfaffenhofen) URL: The ICSU World Data Center for Climate (WDCC)

M.Lautenschlager (WDCC/MPI-M) / / 4 (I) Data catalogue and Unix files (pointer or BLOB-table- entry)  Enable search and identification of data  Allow for data access as they are (II) Application-oriented data storage  Time series of individual variables are stored as BLOB entries in DB Tables Allow for fast and selective data access  Storage in standard file-format (GRIB, NetCDF) Allow for application of standard data processing routines (PINGOs) CERA 1) Concept: Semantic Data Management 1) Climate and Environmental data Retrieval and Archiving

M.Lautenschlager (WDCC/MPI-M) / / 5 Climate Model Raw Data Application-oriented Data Storage (Interface level 2) Primary Data Processing

M.Lautenschlager (WDCC/MPI-M) / / 6 Entry ReferenceStatus Distribution ContactCoverageParameter Spatial Reference Local Adm. Data Access Data Org CERA-2 Data Model Blocks

M.Lautenschlager (WDCC/MPI-M) / / 7 The CERA2 data model … allows for data search according to discipline, keyword, variable, project, author, geographical region and time interval and for data retrieval. allows for specification of data processing (aggregation and selection) without attaching the primary data. is flexible with respect to local adaptations, to storage of different types of geo-referenced data, and to definition of data topologies (hierarchical, network, ….). is open for cooperation and interchange with other database systems (e.g. FGDC metadata standard and ISO included). But: is not the simplest data model for each single application. Data Model Functions

M.Lautenschlager (WDCC/MPI-M) / / 8 Level 1 - Interface: Metadata entries (XML, ASCII) + Data Files Level 2 – Interf.: Separate files containing BLOB table data in application adapted structure (time series of single variables) Experiment Description Unix-Files Table / Pointer Dataset 1 Description Dataset n Description BLOB Data Table BLOB Data Table WDCC Data Topology BLOB DB Table corresponds to scalable, virtual file at the operating system level.

M.Lautenschlager (WDCC/MPI-M) / / 9 WDCC Data Content  Climate model data IPCC (Hamburg model runs)  IPCC DDC  CEOP  Observational Data ERA15/40 (ECMWF), NCEP 40Y WOCE  Project support HOAPS CARIBIC BALTEX SFB512 Different model appplications  Size of CERA: 140 Tbyte  Number of experiments: 382  Number of datasets:  No. of blobs:4 billion  Downloads last 12 months >

M.Lautenschlager (WDCC/MPI-M) / / 10 WDCC

M.Lautenschlager (WDCC/MPI-M) / / 11 Availability of CEOP-Model data in the WDCC Total amount of CEOP data within WDCC: 1.6 TByte

M.Lautenschlager (WDCC/MPI-M) / / 12 WDCC DB Access Web Interface Binary Large Objects (BLOBS) Global coverage of 1 (more) parameter at 1 (more) level for 1 moment TAPE DB storage cache

M.Lautenschlager (WDCC/MPI-M) / / 13 WDCC Data Access Granularity Finest granularity for DB data access (SQL) is on BLOB level Sub-BLOB data access requires data processing outside the WDCC DB system BLOB sizes within WDCC Fine granularity: global field of 1 variable at 1 vertical level for 1 moment 20 – 100 KB / BLOB depending on model resolution Coarse granularity: global fields of all variables at all vertical levels for 1 moment around 25 MB / BLOB for CEOP model output Model output file sizes: global fields of of all variables at all vertical levels for many moments 500 MB – 1 GB / File for CEOP model output

M.Lautenschlager (WDCC/MPI-M) / / 14 CEOP DB Storage Storage of global coverages per file or BLOB : all levels, all parameters arbitrary time intervals all levels, all parameters 1 moment (6 by 6 hours) 1 level, 1 parameter 1 moment parameters levels days /4 parameters levels time how we get the grid data: Files from prediction model postprocessing step 1: homogenizing time MB per blob postprocessing step 2: isolation of levels & parameters

M.Lautenschlager (WDCC/MPI-M) / / 15 Web Access to WDCC METADATA:DATA: GUI:display in applet (1)JDBC (2) jblob-script:Search for DS names (2) JDBC (2) jblob –f … html-display (3) - xml-download (ISO, DC, …) (4) download http (5) URL:

M.Lautenschlager (WDCC/MPI-M) / / 16 Selection via CERA meta data:  selection of the experiment (=model run)  display of meta data: experiment, quality, datasets  selection of the dataset  display of dataset information  add datasets to “process list”  select time span and download from tape archive to data server and to the client User Interface 1: The Present Applet

M.Lautenschlager (WDCC/MPI-M) / / 17 User Interface 2: jblob Command for data download Retrieval by script : Java based – for Unix / Linux and Windows JDBC connection examples : jblob – datasetname -options jblob – datasetid -options jblob – showdatasets “ ” (% as wildcard)

M.Lautenschlager (WDCC/MPI-M) / / 18 User Interface 3: html metadata access Servlet and JSP Retrieval specification dataset selection from CERA catalogue three steps of specification: selected region in space and time download by http

M.Lautenschlager (WDCC/MPI-M) / / 19 User Interface 4: http Metadata Access Retrieval by browser (or Unix command wget): wget … … XML/CERA2WINIq.xsql?&id=50 XML ISO format http or: Browser display as html file

M.Lautenschlager (WDCC/MPI-M) / / 20 User Interface 5: http data download Java servlet URL includes: - specification of dataset (parameter) - specification of records (time span) - encrypted user / password in preparation: control by certificate

M.Lautenschlager (WDCC/MPI-M) / / 21 WDCC Networking DLR: WDCC catalogue links to external satellite data, data download by servlet CEOP: mutual data access for in-situ, model and satellite data BADC: distributed data holding (ERA40 and IPCC), certificates for authentication

M.Lautenschlager (WDCC/MPI-M) / / 22 Future activities may be summarized as improving the data granularity, harmonizing the metadata and completing the CEOP data sets. Problem: Model data provision does not fit the data inclusion interface of the WDCC in all details. How to organise the extra work? WDCC - CEOP