2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)

Slides:



Advertisements
Similar presentations
ASIAES Project Overview Satellite Image Network for Natural Hazard Management in ASEAN+3 region Pakorn Apaphant Geo-Informatics and Space Technology Development.
Advertisements

Lecture plan Information retrieval (from week 11)
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, M. Lautenschlager, H. Luthardt,
M.Lautenschlager (WDCC/MPI-M) / / 1 The CEOP Model Data Archive at the World Data Center for Climate as part of the CEOP Data Network CEOP / IGWCO.
CERA / WDCC Hannes Thiemann Max-Planck-Institut für Meteorologie Modelle und Daten zmaw.de NCAR, October 27th – 29th, 2008.
Center for Environmental Studies Arizona State University Digital Research Records at Center for Environmental Studies Peter McCartney.
Implementing ISO Aleta Vienneau and David Danko ESRI.
The Earth System Grid Discovery and Semantic Web Technologies Line Pouchard Oak Ridge National Laboratory Luca Cinquini, Gary Strand National Center for.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
M.Lautenschlager (WDCC / MPI-M) / / 1 WS Spatiotemporal Databases for Geosciences, Biomedical sciences and Physical sciences Edinburgh, November.
Preservation and Long Term Access of Data at the World Data Centre for Climate Frank Toussaint N.P. Drakenberg, H. Höck, S. Kindermann, M. Lautenschlager,
M.Lautenschlager (WDCC / MPI-M) / / 1 GO-ESSP at LLNL Livermore, June 19th – 21st, 2006 World Data Center Climate: Status and Portal Integration.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
M. Lautenschlager (M&D/MPIM)1 The CERA Database Michael Lautenschlager Modelle und Daten Max-Planck-Institut für Meteorologie Workshop "Definition.
Chapter 4: Core Web Technologies
AON Data Questionnaire Results 21 Respondents Last Updated 27 March 2007 First AON PI Meeting Scot Loehrer, Jim Moore.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Sep , 2006 v FME Worldwide User Conference - Vancouver Capturing SpatialDirect Usage Statistics to Support Intelligent Business Decisions Jason Close,
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Last News of and
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
F. Toussaint (WDCC, Hamburg) / / 1 CERA : Data Structure and User Interface Frank Toussaint Michael Lautenschlager World Data Center for Climate.
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Semantic Data Management for Organising Terabyte Data Archives Michael Lautenschlager World Data Center.
M.Lautenschlager (WDCC, Hamburg) / / 1 Training-Workshop Facilities and Sevices for Earth System Modelling Integrated Model and Data Infrastructure.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
M.Lautenschlager (WDCC, Hamburg) / / 1 ICSU World Data Center For Climate Semantic Data Management for Organising Terabyte Data Archives Michael.
Opendap dev - meeting, Boulder, Feb 2007 OPeNDAP infrastructure in European Operational Oceanography T Loubrieu (IFREMER) T Jolibois (CLS)
_______________________________________________________________________________________________________________ E-Commerce: Fundamentals and Applications1.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
H. Thiemann (M&D) / / 1 Hannes Thiemann M&D Statusseminar, 22. April 2004.
IPCC TGICA and IPCC DDC for AR5 Data GO-ESSP Meeting, Seattle, Michael Lautenschlager World Data Center Climate Model and Data / Max-Planck-Institute.
INFSO-RI Enabling Grids for E-sciencE A service oriented framework to create, manage and update metadata for earth system science.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
The Repository of the World Data Centre for Climate Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie Repositories in Research.
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth System Science S. Kindermann, DKRZ, Germany.
PSI Meta Data meeting, Toulouse - 15 November The CERA C limate and E nvironment data R etrieval and A rchiving system at MPI-Met / M&D S. Legutke,
H. Widmann (M&D) Data Discovery and Processing within C3Grid GO-ESSP/LLNL / June, 19 th 2006 / 1 Data Discovery and Basic Processing within the German.
Lautenschlager + Thiemann (M&D/MPI-M) / / 1 Introduction Course 2006 Services and Facilities of DKRZ and M&D Integrating Model and Data Infrastructure.
WEB SERVER SOFTWARE FEATURE SETS
DSpace System Architecture 11 July 2002 DSpace System Architecture.
4 m 9K Copyright 2002 Forum 9000, LLC Slide 1 Forum 9000 Quality Systems for Quality Care.
Create XML from a template Browse available records WDCC Metadata Generation with GeoNetwork Hans Ramthun, Michael Lautenschlager, Hans-Hermann Winter.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
IPCC WG II + III Requirements for AR5 Data Management GO-ESSP Meeting, Paris, Michael Lautenschlager, Hans Luthardt World Data Center Climate.
M. Lautenschlager (M&D/MPIM)1 WDC on Climate as Part of the CERA 1 Database System Michael Lautenschlager Modelle und Daten Max-Planck-Institut.
Ideas on Opening Up GEOSS Architecture and Extending AIP-5 Wim Hugo SAEON.
JAFER Toolkit Project Oxford University 1 JAFER Java-based high level Z39.50 toolkit Matthew Dovey; Colin Tatham; Antony Corfield; Richard Mawby Oxford.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
ODP V2 Data Provider overview. 22 Scope Data Provider provides access to data and metadata of the local data systems. Data Provider is a wrapper, installed.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
AIRS Meeting GSFC, February 1, 2002 ECS Data Pool Gregory Leptoukh.
INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,
AP7/AP8: Long-Term Archival of CMIP6 Data
An Overview of Data-PASS Shared Catalog
Flanders Marine Institute (VLIZ)
EVLA Archive The EVLA Archive is the E2E Archive
Introduction to J2EE Architecture
IP Publishing From IP Data Base to IP list to IP catalog
Intermountain West Data Warehouse
Enabling direct data access to social science research data
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
CREE: HEIRPORT lite Welcome screen:
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
Presentation transcript:

2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)

2005 – 06 – - ESSP2 Contents  The World Data Center for Climate (WDC-C) at Max-Planck-Institute  Input to the DB  Output of Metadata  Output of Data

2005 – 06 – - ESSP3 WDCC + M&D: Structure and: IPCC-DDC etc Administration Finance Facilities

2005 – 06 – - ESSP4 Hardware and DB The hardware  > 2 PB data tape cartridges  90 TB disk cache TAPE DB storage cache The database  170 TB web accessible via Oracle DB

2005 – 06 – - ESSP5 Input to the DB The metadata input: x m l  xml interface to Oracle DB for: - xml metadata from data providers - “hand knit” xml metadata The data input  cutting into single time steps one step = 1 blob (binary large object)  Java filling tool / JDBC

2005 – 06 – - ESSP6 Output from the DB What is desired – and what is desirable ?  browse, search, retrieve  easy access from everywhere  authentication and authorization for some data  no problems with the damnable firewalls TAPE DB storage cache http

2005 – 06 – - ESSP7 Metadata Output: JSP + Servlets Browsing  online access to all CERA projects and experiments  links to all datasets of any experiment

2005 – 06 – - ESSP8 Metadata Output: JSP + Servlets Search  search on different CERA tables  technique: JSP + html form  in preparation

2005 – 06 – - ESSP9 http – Access to Metadata DB xml Web access as :  xhtml table for quick view of raw data  any type of xml as: - ISO / ISO Dublin Core  project dependent xhtml design  raw DB data xsl mapping from/to xml and JSP / Servletts

2005 – 06 – - ESSP10 Data Retrieval Specification: JSP+Servlets html forms  dataset selection from CERA catalogue  three steps of specification: bounding box and period  GRIB & NetCDF  download region in space and time

2005 – 06 – - ESSP11 Data Download http / ftp data download: hyperlink to ftp request integrated into an html page docking to DB by JDBC …but :  piping ? easy for GRIB - difficult for NetCDF  postprocessing ? presently: cuts in time of 1-parameter-1-level global data  regional cuts: prototype ready

2005 – 06 – - ESSP12 Data Download (http/ftp) Authentication and authorization : html embedded hyperlink: DB login required by servlet DB login allows for “shopping cart”  future: certificated access

2005 – 06 – - ESSP13 DODS – an Alternative ? http data download: e.g., a DODS Server  multidimensional data selection and step width in an http request but (mainly open NetCDF files) :  does not work on piped GRIB - positioning  does not work directly on tape archives - positioning  does not work indirectly on tape archives – performance  has presently no access control at maximum: security by obscurity  possibly use on DB tables

2005 – 06 – - ESSP14 Finally :

2005 – 06 – - ESSP15 ? h  a  p

2005 – 06 – - ESSP16 Metadata: CERA-2 and ISO CERA meta data and data is web- enabled

2005 – 06 – - ESSP17 WDCC: CERA-2 DB The Database " 100 TB in Oracle RDBMS web accessible > 2 PB data in files " distributed system (5 DB servers) CERA-2 holds metadata & data " CERA-2 meta data model: PIK & DKRZ / M&D Access by web applet " data download by applet or servlet

2005 – 06 – - ESSP18 WDCC: CERA-2 Features What are the typical features of CERA-2 ? integration of catalogue data and mass data web access to catalogue data and mass data adequate for all types of data capacity: >1000 daily requests statistics by data volume and country cutting of data in spacetime formats: grib, ASCII, netCDF, IEEE integrated netCDF visualization under development

2005 – 06 – - ESSP19 WDCC: CERA-2 Structure What are the typical structures of CERA-2 ? " structural flexibility: - definable fields, tables, entry types, & various other - flexible lists of values (LOV): extensible but controlled - layouted for automated data access " simple structure: - blockwise table groups - CERA-2 Blocks have similar structure - more difficult structures go into CERA Modules

2005 – 06 – - ESSP20 WDCC: CERA-2 Structure Metadata Entry This is the central CERA Block, providing information on the entry's title type and relation to other entries the project the data belong to a summary of the entry a list of general keywords related to data creation and review dates of the metadata Additionally: Modules + Local Extensions Module DATA_ORGANIZATION (grid structure) Module DATA_ACCESS (physical storage) Local extension for specific information on (e.g.) data usage data access and data administration Coverage Information on the volume of space-time covered by the data Reference Any publication related to the data togehter with the publication form Status Status information like data quality, processing steps, etc. Distribution Distribution information including access restrictions, data format and fees if necessary Contact Data related to contact persons and institutes like distributor, investigator, and owner of copyright Parameter Block describes data topic, variable and unit Spatial Reference Information on the coordinate system used

2005 – 06 – - ESSP21 WDCC: CERA-2 Structure

2005 – 06 – - ESSP22 WDCC: CERA-2 Module

2005 – 06 – - ESSP23 CERA-2 Data Streams Access Client realised as web-based Java Applet. Middleware layer provides applet and DB connection DB-Server for catalogue operations and climate data retrieval.

2005 – 06 – - ESSP24 CERA-2: Collaboration as WebService Client applet receives foreign data URI from CERA-2 DB Foreign server provides DB data by http: German Aerospace Centre (DLR) CEOP in situ data ??

2005 – 06 – - ESSP25 CERA-2: Certificate Based Collaboration Certification of users Access for users from trusted authorities Exchange of authorisation information WebBrowser Gatekeeper at WDCC web portal for certified users Gatekeeper at a trusted site Partner Centre (BADC)

2005 – 06 – - ESSP26 CERA-2: Applet wdcc.dkrz.de

2005 – 06 – - ESSP27 CERA-2: User Interface Selection via CERA meta data:  selection of the experiment (=model run)  display of meta data: experiment, quality, datasets  selection of the dataset  display of dataset information  add datasets to “process list”  download from tape archive to data server and to the client

2005 – 06 – - ESSP28 o ????????