A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.

Slides:



Advertisements
Similar presentations
James Gallagher OPeNDAP 1/10/14
Advertisements

Database System Concepts and Architecture
AHRT: The Automated Human Resources Tool BY Roi Ceren Muthukumaran Chandrasekaran.
LaTiS Doug Lindholm Laboratory for Atmospheric and Space Physics University of Colorado Boulder ESIP – July 8, 2014.
A Unified Data Model and Programming Interface for Working with Scientific Data Doug Lindholm Laboratory for Atmospheric and Space Physics University of.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
DLESE and NSDL The role of the Digital Library for Earth System Education* (DLESE) in the National SMETE Digital Library Presented by Dave Fulker Director.
7 +/- 2 Maybe Good Ideas John Caron June (1) NetCDF-Java (aka CDM) has lots of functionality, but only available in Java – NcML Aggregation – Access.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
Introduction Downloading and sifting through large volumes of data stored in differing formats can be a time-consuming and sometimes frustrating process.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
Developing a NetCDF-4 Interface to HDF5 Data
Oracle8 JDBC Drivers Section 2. Common Features of Oracle JDBC Drivers The server-side and client-side Oracle JDBC drivers provide the same basic functionality.
OPeNDAP and the Data Access Protocol (DAP) Original version by Dave Fulker.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
NetCDF-4 The Marriage of Two Data Formats Ed Hartnett, Unidata June, 2004.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
E-Infrastructure & Data Services Features and implementations 1.
Integrating netCDF and OPeNDAP (The DrNO Project) Dr. Dennis Heimbigner Unidata Go-ESSP Workshop Seattle, WA, Sept
DAP4 James Gallagher & Ethan Davis OPeNDAP and Unidata.
PROGRESS: ICCS'2003 GRID SERVICE PROVIDER: How to improve flexibility of grid user interfaces? Michał Kosiedowski.
Unidata TDS Workshop THREDDS Data Server Overview
Accessing Remote Datasets using the DAP protocol through the netCDF interface. Dr. Dennis Heimbigner Unidata netCDF Workshop August 3-4, 2009.
Convert generic gUSE Portal into a science gateway Akos Balasko 02/07/
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
NetCDF Data Model Issues Russ Rew, UCAR Unidata NetCDF 2010 Workshop
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
Remote Data Access with OPeNDAP Dr. Dennis Heimbigner Unidata netCDF Workshop October 25, 2012.
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
LAS and THREDDS: Partners for Education Roland Schweitzer Steve Hankin Jonathan Callahan Joe Mclean Kevin O’Brien Ansley Manke Yonghua Wei.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
CF 2.0 Coming Soon? (Climate and Forecast Conventions for netCDF) Ethan Davis ESO Developing Standards - ESIP Summer Mtg 14 July 2015.
OGC Web Services with complex data Stephen Pascoe How OGC Web Services relate to GML Application Schema.
1 Earth System Grid Center for Enabling Technologies OPeNDAP Services for ESG March 9, 2016 Peter Fox, Patrick West, Stephan Zednik RPI Performance Measures.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Update on Unidata Technologies for Data Access Russ Rew
TSDS (HPDE DAP). Objectives (1) develop a standard API for time series-like data, (2) develop a software package, TSDS (Time Series Data Server), that.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.
DAP+NETCDF Using the netCDF-4 Data Model
Efficiently serving HDF5 via OPeNDAP
Data sharing and visualisation
Remote Data Access Update
Accessing Remote Datasets through the netCDF interface.
Dispatch Layer and the NetCDF Architecture
ExPLORE Complex Oceanographic Data
Future Development Plans
OPeNDAP/Hyrax Interfaces
Adapting an existing web server to S3
Presentation transcript:

A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric and Space Physics University of Colorado, Boulder

The Problem ● Diverse, disparate data formats and conventions abound in scientific datasets. ● Not going to get everyone to agree on storing data in a common format. ● A common format is not enough. Need higher level semantics. e.g. time series ● Data access, not discovery, not storage ● Long time series, but not HPC (yet?)

Telemetr y Storage Data Processing Science Product Storage Legacy Science Products File Server Web Server Databas e Server UARS SORC E Glory SDO Telemetr y Storage Data Processing Science Product Storage Data Processing Stove Pipes

LASP Time Series Server (LaTiS) Telemetr y Storage Data Processing Science Product Storage Legacy Science Products File Server Web Server Databas e Server UARS SORC E Glory SDO Telemetr y Storage Data Processing Science Product Storage Data Processing Stove Pipes Interoperability via a Common Service

files database remote service s TSM L Common Data Model ASCII File Reader Service Reader CSV Writer Binary Writer OPeNDA P Writer Web Applicat ion (LISIRD ) Exc el IDL/Ma tlab Progra m... Analysi s Tools Interoperability via a Common Data Model Databas e Reader Binary File Reader... JSON LASP Time Series Server Data Source Dataset Descripto r Data Application

Unidata Common Data Model ● Merge NetCDF Classic, HDF5, OpeNDAP data models ● As implemented by NetCDF-Java ● NetCDF Markup Language (NcML) + IOServiceProvider (IOSP) ●

NetCDF Class Data Model

OPeNDAP Data Model

HDF5 Data Model

Unidata Common Data Model

Unidata CDM limitations (for our needs) ● Different intent, design goals – Unidata: enhance existing dataset – LASP: describe, reshape existing data ● Time Series: Sequence, not mature ● Aggregation limited ● NetCDF-Java API largely influenced by netCDF as a file format. ● Specialized scientific feature types (e.g. forecast models) are tightly coupled to the implementation. ● Unneeded complexity.

LaTiS Data Model ● Inspired by the Unidata CDM ● Largely consistent with CDM but different semantics ● Object Oriented over Array based ● Functional relationships ● Dimensions have shape, not each Variable ● Structure plays the role of Group, Compound type, or even Dataset. Just a collection of variables. ● Data storage agnostic, beyond file and type abstraction ● Virtual: subset, filter before reading data ● Implementation independent API ● Extensible with custom variable types as plugins

LaTiS Data Model

Example: Time Series of Spectra NetCDF Classic (CDL): dimensions: time = UNLIMITED; wavelength = 100; variables: double time(time); double wavelength(wavelength); double a(time,wavelength);

Example: Time Series of Spectra Unidata CDM (NcML):

Example: Time Series of Spectra LaTiS Data Model (TSML):

LASP Time Series Server (LaTiS) ● RESTful web service built around the reference implementation of the data model API ● Open Source, Java Servlet, portable, easy to install ● Independent implementation of OPeNDAP (DAP2) specification, and more ● Time Series Markup Language (TSML) as dataset descriptor. Inspired by NcML. ● Adapters (like IOSPs) to read various data sources via common data model interface (note: does not specify data representation), can use the TSML (unlike IOSPs) ● Writers to output various formats ● Filters to do server side processing ● Modular architecture. Plugin functionality.

LaTiS Data Access Interface Web Service URL (REST): host: Name (and port) of the computer running the server dataset: Name of a dataset that the server is configured to serve suffix: The requested type/format of the output constraint_expression: A collection of request parameters such as time range and filters to limit the results &format_time(yyyy-DDD)&time> Demos...

LaTiS Roadmap ● HDF Adapter and Writer modules ● Other formats ● More Filters ● December 2010 release (AGU) ● Go beyond the time series abstraction ● Run with distributed data in the cloud.

Bonus slides

● See Time Series Data Server poster (AGU 2009):