INFSO-RI Enabling Grids for E-sciencE A service oriented framework to create, manage and update metadata for earth system science K. Ronneberger, DKRZ, Germany S. Kindermann, DKRZ, Germany
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Structure Background: – What is Earthsystem science about – Motivation of this framework Design of the metadata framework –Requirements –Concept –Implementation Potential impact and vision
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Distributed Climate Data Process Result Dataset Earthsystem Sciences Goal: learn about the past, the present, and possible futures of the earth system Method: Modelling, analysing, comparing and processing data Input: data from observations and/or other modelling studies Community: internationally and interdisciplinary distributed but strongly interconnected Model Data Observation Data Scenario data Typical workflow Analysis Dataset Find & Retireve Share Result Dataset Compare/ reuse
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Data grid for Earthsystem Science A grid to –Share data –Find data –Process data in modular, independent steps –Compare/reuse data …needs Metadata to describe Content (unique variable description, temporal & spatial bound) Discovery (where to find, how to access) Use (format, size, etc.) Provenance (origin of data, performed processing steps etc.)
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand ISO-Metadata for earthsystem science Complex enough to reference geographic data Offers a hierarchical description XML implementation ISO available Used for ESS data by academic and business: –ANZLIC – ESRI – con terra GmbH C3Grid -> adapted ISO format for grid world ISO Content Discovery Use Provenance Tools are needed to create, manage and update metadata
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Share Data Find & retrieve Process Data Metadata in the grid EGEE ISO Metadata C3Grid data provider C3Grid data provider ISO Metadata C3Grid data provider C3Grid data provider ISO Metadata C3Grid data provider C3Grid data provider ISO Metadata C3Grid data provider C3Grid data provider ISO Metadata EGEE ISO Metadata Central Metadata catalogue Web Portal C3 Compare/ reuse Data ISO MD Data‘ ISO MD’ Create ISO MD for existing data resource Automatically update ISO MD during processing
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Metadata workflows Create ISO MD for existing data resource Automatically update ISO MD during processing ISO Content Discovery Use Provenance (1) Understand ISO- MD implementation ? tempBB geogrBB vertBB extent timeBegin timeEnd timeInt (2) Interactively create default file this data.. … DKRZ... (3) Produce property file Default A Ref_xml B,C update timeEnd with t1 add vertBB from A update cfvar = a with b delete keyword = c
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand System design Step(1)(2)(3) Function view structure, content and organization of the metadata Parsed creation/change of metadata via GUI Automatic, property file driven update of MD Requirements graphical depiction Schema template & specification file Example MD files GUI Secure login Parsing functionality Schema template & specification file Example MD files API property file language A&A Update & parsing functionality Schema template, specification & default file Original MD files System layout Interface Specification layer XML Databases Interface Security layer Manipulation layer Specification layer XML Databases Interface Abstraction layer Security layer Manipulation layer Specification layer XML Databases
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Implementation Metadata basis Metadata basisISO XMLs Specification layer Specification layer ISO XML template, Specification file, default XMLs Manipulation layer Manipulation layer XSLT & XQuery Secure login HTML + css Orbeon ExistExist Browser OGSA-DAI Property file interpreter API Program interface
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Summary Modular Differentiated access Common basis Easy maintainable Layered system Easy extendable/reusable Standardized technology Easy adaptable Metadata basisISO XMLs Specification layer ISO XML template, Specification file, default XMLs Manipulation layer XSLT & XQuery Secure login HTML + css Orbeon Exist Exist Browser OGSA-DAI Property file interpreter API Program interface
Enabling Grids for E-sciencE INFSO-RI EGEE Userforum 2008 – Clermont-Ferrand Impact & vision Tools are used in C3-EGEE framework Ease integration of further data providers/processors ISO is increasingly used as metadata format for ES data Foster the collaboration on ES data The schema related XMLs are limited and easy to exchange Adapt tools for further schemas/communities