Embedding Live Access Server into GFDL Data Portal Infrastructure
K. O’Brien (PMEL), S. Nikonov (GFDL), R. Schweitzer (PMEL), S. Hankin (PMEL), V. Balaji (GFDL)

The 6th GO-ESSP Workshop, June, Jussieu, Paris

Outline
- The Curator metadata DB, part of the GFDL FMS Runtime Environment (FRE), is a centralized metadata store for the entire modeling process.
- Live Access Server (LAS) is an important component of the GFDL Data Portal.
- The FRE -> Curator DB -> LAS metadata stream.
- Benefits of the symbiosis.

Live Access Server + Curator = LASurator (not LACerater !!!)
- Symbiotic, self-configuring system combining LAS and the Curator metadata DB.
- Part of FRE.
- Essential part of the GFDL Data Portal.
[Diagram labels: FRE; Data Portal; LASurator]

Curator DB is an important part of FRE
- The technical side of the modeling process consists of three important parts: assembling the model, configuring it, and running the simulation.
- There are two ways of conducting these stages:
  1. write, once and for all, sophisticated universal scripts that need many input parameters in order to run;
  2. write tools that generate disposable scripts in which the configuration parameters needed for the simulation are hardcoded.
- FRE was designed following the second approach, sparing scientists the long road of configuring a model experiment every time. The first version offers an XML metadata file as the user interface for configuring the model and the experiment.
- The second version will give scientists convenience and automation in controlling the model-building process through an interface that is more user-friendly than an XML file and is based on centralized storage: the Curator DB.
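The second approach (generating a disposable script with the parameters hardcoded) can be sketched as follows. This is only an illustration of the idea, not FRE code: the template text, the parameter names (`name`, `npes`, `months`, `executable`) and the executable name are all hypothetical.

```python
from string import Template

# Hypothetical template for a disposable run script.
# "$$" produces a literal "$" so the shell variables survive substitution.
RUN_SCRIPT = Template("""#!/bin/sh
# Generated for experiment $name; regenerate instead of editing by hand.
NPES=$npes
RUN_LENGTH_MONTHS=$months
./$executable --npes $$NPES --months $$RUN_LENGTH_MONTHS
""")


def generate_run_script(config: dict) -> str:
    """Return a run script with the experiment parameters hardcoded in it."""
    # substitute() raises KeyError if a parameter is missing, so an
    # incomplete configuration never yields a half-filled script.
    return RUN_SCRIPT.substitute(config)


# Hypothetical experiment configuration (in FRE this would come from the
# XML metadata file or, later, from the Curator DB).
script = generate_run_script(
    {"name": "CM2.0_run1", "npes": 60, "months": 12, "executable": "fms_model.x"}
)
print(script)
```

The generated script carries no input parameters of its own; a change in configuration means generating a new script rather than editing the old one.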

Modes of working with Curator DB within FRE
- Research mode (component oriented): the modeler introduces new physical processes, parameterizations, algorithmizations, or components from newly developed modules. The new entities have to be described in the database.
- Production mode (simulation oriented): the experimenter composes a coupled model from the available components described in the database, builds the scenario and the postprocessing plan, and runs the experiment. All this activity is recorded in the database.
  A thoroughly elaborated, very friendly GUI is critical for these two modes; otherwise users will avoid the database load stage and the DB will stay empty.
- Automatic mode: applications write metadata into the database, harvesting it from model output data files, or query it from the DB during data searches.
- Publishing mode: metadata is extracted from the DB by publishing tools to describe public data.

Curator DB at different stages of the modeling process
[Diagram labels: Metadata; Curator DB; Component Building; Model Assembling; Experiment Preparation; Postprocessing; Data Portal Service; FMS Runtime Environment (version currently in development); "currently in development"]

Curator DB Design Compartments
- Process Domains & Physical Process: decomposition of physical reality into homogeneous domains, with descriptions of the theoretical approaches to the processes considered for modeling there.
- Algorithmization: describes the program modules for elementary physical processes.
- Composition: components, couplers, drivers; the technical environment needed for assembling the model as a computer application.
- Simulation: metadata on simulations and model output data.
- Publishing: all metadata on data available to the public; includes the descriptions needed by Data Portal software (OPeNDAP, LAS).

Curator database
- Contains a large amount of the metadata needed for the entire modeling process
- Well-ordered “information architecture”
Live Access Server
- Configurable metadata access
- Configurable data browse/access

The GFDL IPCC hierarchy, for example:
- IPCC (Project)
- GFDL (Institution)
- CM2.0 (Model number)
- Climate of the 20th Century (Scenario)
- Realization 1 (Which realization)
- Run 1 (Which run)
- Atmosphere (Domain)
- 3 hourly data (Temporal domain)
- Jan 1991 – Dec 2000 (Dates of on-line data)

Metadata Tree
[Tree diagram. Levels from root to leaves:
- Institution: GFDL
- Models: CM2.0, CM2.1
- Scenarios: Climate of the 20th Century; 1%/year CO2 increase scenario (to quadrupling)
- Runs: Run 1, Run 2
- Domains: Atmos, Ocean, Ice
- Temporal resolution: annual, monthly, daily, 3hr
- Time spans: Jan 2046 – Dec 2050, Jan 2096 – Dec 2100, Jan 2146 – Dec 2150, Jan 2196 – Dec 2200
- Variables: Precipitation, Surface Air Temperature, Surface Latent Heat Flux, Sea Level Pressure]

Challenge:
- How to configure LAS so that user interaction is efficient and the full variety of available data is reflected
- How to minimize the work for the LAS installer
Solution:
- Use the Curator DB to pull together the pertinent metadata on experiments
- Modify the LAS utility addXML to read metadata from a MySQL database rather than from individual or aggregated files
- Create generic Velocity templates to present the metadata to users as friendly HTML
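The idea of building the category configuration from database rows instead of from data files can be sketched as below. This is not the addXML code: Python's built-in `sqlite3` stands in for MySQL so that the sketch runs without a server, and the table name, column names, URLs, and XML element names are all hypothetical rather than the real Curator schema or LAS configuration format.

```python
import sqlite3
import xml.etree.ElementTree as ET

# In-memory stand-in for the metadata database (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE aggregation (experiment TEXT, domain TEXT, frequency TEXT, url TEXT)"
)
conn.executemany(
    "INSERT INTO aggregation VALUES (?, ?, ?, ?)",
    [
        ("CM2.0 20C run1", "atmos", "monthly", "http://example.org/dods/a1"),
        ("CM2.0 20C run1", "ocean", "monthly", "http://example.org/dods/o1"),
        ("CM2.1 20C run1", "atmos", "daily", "http://example.org/dods/a2"),
    ],
)


def build_categories(db: sqlite3.Connection) -> ET.Element:
    """Build one <category> per experiment, with one <dataset> per aggregation."""
    root = ET.Element("categories")
    by_experiment = {}
    rows = db.execute(
        "SELECT experiment, domain, frequency, url FROM aggregation "
        "ORDER BY experiment, domain, frequency"
    )
    for experiment, domain, frequency, url in rows:
        if experiment not in by_experiment:
            by_experiment[experiment] = ET.SubElement(root, "category", name=experiment)
        ET.SubElement(
            by_experiment[experiment], "dataset", name=f"{domain} {frequency}", url=url
        )
    return root


xml_text = ET.tostring(build_categories(conn), encoding="unicode")
print(xml_text)
```

Because the configuration is derived from a query, no data file has to be opened at configuration time; only the metadata rows are read.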

Simple but effective architecture
[Diagram labels: GFDL Data Storage; metadata Extractor/XMLGenerator; metadata; THREDDS server; Aggregations’ URLs; addXML; Categories; LAS]

Behind the scenes
[Diagram labels: Data; metadata; Curator; LAS XML; Aggregation XMLs; OPeNDAP access to data; LAS access to products]

Tools populating Curator
- Populating the Curator DB is automatic.
- One piece of software scans the data storage, following the list of public experiments from the Curator DB. Traversing the data files, it extracts metadata and writes it to Curator. In the next version of FRE, metadata will populate the DB automatically in the course of all stages, from assembling the model through running the experiment.
- Another tool analyzes the metadata in Curator and creates aggregation records in accordance with the aggregation criterion. Currently, Experiment is used as the sorting criterion for the upper-level category.
- The last stage generates THREDDS configuration XML files based on the metadata prepared in Curator.
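The aggregation step above (grouping harvested file metadata by a criterion and recording the covered time span) can be sketched as follows. It assumes the scanning tool has already produced one record per data file; the record fields, experiment name, and file paths are hypothetical, and gaps between files are simply ignored when the overall time limits are computed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class FileRecord:
    """Metadata harvested from one data file (hypothetical fields)."""
    experiment: str
    domain: str
    frequency: str
    start: str  # "YYYY-MM"; this format sorts correctly as plain text
    end: str
    path: str


def aggregate(records, criterion=("experiment", "domain", "frequency")):
    """Group file records by the criterion and compute each group's time limits."""
    groups = defaultdict(list)
    for rec in records:
        key = tuple(getattr(rec, field) for field in criterion)
        groups[key].append(rec)

    aggregations = {}
    for key, members in groups.items():
        members.sort(key=lambda r: r.start)  # chronological file order
        aggregations[key] = {
            "start": members[0].start,
            "end": max(r.end for r in members),
            "files": [r.path for r in members],
        }
    return aggregations


records = [
    FileRecord("CM2.0_20C_run1", "atmos", "monthly", "2096-01", "2100-12", "/archive/atmos_2096.nc"),
    FileRecord("CM2.0_20C_run1", "atmos", "monthly", "2046-01", "2050-12", "/archive/atmos_2046.nc"),
    FileRecord("CM2.0_20C_run1", "ocean", "monthly", "2046-01", "2050-12", "/archive/ocean_2046.nc"),
]
aggs = aggregate(records)
for key, agg in aggs.items():
    print(key, agg["start"], agg["end"], len(agg["files"]))
```

Passing a different `criterion` tuple, for example `("experiment",)`, changes what counts as one aggregation without touching the grouping code.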

Interaction FRE ↔ Curator → LAS (final goal)
[Diagram labels: GUI; Model Configuration; XML metadata; FMS; Config data; metadata; FRE; LAS XML; Aggregation XMLs; LAS; "currently in development"]

LAS-oriented Curator DB design features
Specific tables were created for:
- Inventorying metadata about the time spans and variables of the available data;
- Describing projections in detail;
- Storing THREDDS aggregation descriptions, including such fields as type of averaging, domain of the variable (atmos, land, …), time limits, URL, and aggregation criterion.
Dynamic hierarchy
- The criteria for the category hierarchy are flexible, giving users the freedom to choose their own interface layout preferences.
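A dynamic hierarchy of this kind can be sketched as a function that nests the same flat aggregation records under whatever ordering of criteria the user chooses. The record fields and values below are hypothetical and the function is only an illustration, not the Curator or LAS implementation.

```python
def build_tree(records, levels):
    """Nest records under the given ordered list of criteria.

    Inner levels are dicts keyed by the criterion value; the last level
    maps to the list of variables found under that path.
    """
    tree = {}
    for rec in records:
        node = tree
        for level in levels[:-1]:
            node = node.setdefault(rec[level], {})
        node.setdefault(rec[levels[-1]], []).append(rec["variable"])
    return tree


# Hypothetical flat records, as they might be read from aggregation tables.
records = [
    {"model": "CM2.0", "domain": "atmos", "frequency": "monthly", "variable": "Precipitation"},
    {"model": "CM2.0", "domain": "atmos", "frequency": "monthly", "variable": "Sea Level Pressure"},
    {"model": "CM2.1", "domain": "atmos", "frequency": "daily", "variable": "Surface Air Temperature"},
]

# The same records, laid out under two different user preferences.
by_model = build_tree(records, ["model", "domain", "frequency"])
by_domain = build_tree(records, ["domain", "frequency", "model"])
print(by_model)
print(by_domain)
```

Only the `levels` argument differs between the two layouts, which is the point of keeping the hierarchy criteria as data rather than fixing them in the configuration.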

Curator DB table samples

Future development: Discovery/Navigation Interface
Imagine a search interface with Ajax talking to the Curator DB, or to a machine-oriented auxiliary DB that is designed automatically from an adopted ontology and populated with RDF triples from the main metadata DB, Curator (as proposed and explored in the ESG project) ….
As the user constrains the search, the output and menu lists adjust immediately.

Imagine a metadata summary of this experiment here … Experiment 2

Information Products
- Desktop: Matlab, IDL, IDV, Ferret, GrADS, …
- Files: netCDF, binary, spreadsheet, GIS layer, …

Benefits
- The Curator DB contains extensive metadata (aggregations, gridspec descriptions, time spans of data availability).
- The LAS configuration can be pulled directly from the Curator DB.
  - Simplifies the configuration of the complicated information hierarchy for the LAS installer (using LAS “categories”).
  - Simplifies the complex mosaic of datasets presented to the LAS user.
- The LAS addXML tool works faster.
The 5th GO-ESSP Workshop, June, LLNL

Thanks! Questions?

Compartment Structure of Curator Database
[Diagram. Compartments: Process Domain; Process; Algorithmization; Assembling; Simulation.
Labels, in slide order:
- Domains: Atmosphere, Ocean, Ice, Land, Surf_Boundary Layer, Rivers, Lakes
- Dynamics, Radiation, IceProc, BiotaProc, Hydrology, CloudProc, Chemistry, Convection
- ProcCodeBase, NumArtificies, GridSpecs, BoundCond, TracerModels, NameLists, DataSets
- Components, CmpPMIOD, CmpDrivers, Services, Versioning, Compiling, PlatformEnv, Others...
- Projects, Experiments, Scenarios, PostProc, Variables, Fields/Files, Aggregations, Realizations
- DomConstituents, InitCond, CouplModels]