NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004.

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

Reading HDF family of formats via NetCDF-Java / CDM
Recent Work in Progress
The Model Output Interoperability Experiment in the Gulf of Maine: A Success Story Made Possible By CF, NcML, NetCDF-Java and THREDDS Rich Signell (USGS,
A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
® OGC Web Services Initiative, Phase 9 (OWS-9): Innovations Thread - OPeNDAP James Gallagher and Nathan Potter, OPeNDAP © 2012 Open Geospatial Consortium.
THREDDS, CDM, OPeNDAP, netCDF and Related Conventions John Caron Unidata/UCAR Sep 2007.
7 +/- 2 Maybe Good Ideas John Caron June (1) NetCDF-Java (aka CDM) has lots of functionality, but only available in Java – NcML Aggregation – Access.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
NextGen Network-Enabled Weather (NNEW) Concepts Aaron Braeckel.
Unidata TDS Workshop THREDDS Data Server Overview October 2014.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
John Caron Unidata October 2012
Developing a NetCDF-4 Interface to HDF5 Data
OPeNDAP and the Data Access Protocol (DAP) Original version by Dave Fulker.
NetCDF-4 The Marriage of Two Data Formats Ed Hartnett, Unidata June, 2004.
Developing a NetCDF-4 Interface to HDF5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC.
1 Russ Rew, Ed Hartnett, John Caron UCAR Unidata Program Center Mike Folk, Robert McGrath, Quincey Kozial NCSA and The HDF Group, Inc. Final Project Review,
1 High level view of HDF5 Data structures and library HDF Summit Boeing Seattle September 19, 2006.
NetCDF for Developers and Data Providers Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 14 April 2011.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
HDF5 A new file format & software for high performance scientific data management.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
NetCDF-Java Overview John Caron Oct 29, Contents Data Models / Shared Dimensions Coordinate Systems Feature Types NetCDF Markup Language (NcML)
NcML Aggregation vs Feature Collections. NcML functionality 1.Modify the objects found in CDM files – Especially Attributes – Don’t have to rewrite the.
Unidata’s TDS Workshop TDS Overview – Part II Unidata July 2011.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
THREDDS Data Server Unidata’s Common Data Model Background / Summary John Caron Unidata/UCAR Mar 2007.
Unidata TDS Workshop THREDDS Data Server Overview
A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.
Recent developments with the THREDDS Data Server (TDS) and related Tools: covering TDS, NCML, WCS, forecast aggregation and not including stuff covered.
NetCDF Data Model Issues Russ Rew, UCAR Unidata NetCDF 2010 Workshop
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
Unidata’s TDS Workshop TDS Overview – Part I July 2011.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
Unidata's Involvement in Developing and Supporting Climate Science Infrastructure Russ Rew UCAR Unidata April 2010.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
Data File Formats: netCDF by Tom Whittaker University of Wisconsin-Madison SSEC/CIMSS 2009 MUG Meeting June, 2009.
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
GIS for Atmospheric Sciences and Hydrology By David R. Maidment University of Texas at Austin National Center for Atmospheric Research, 6 July 2005.
Weathertop Consulting, LLC Server-side OPeNDAP Analysis – Concrete steps toward a generalized framework via a reference implementation using F-TDS Roland.
The NetCDF/OPeNDAP (NO) Project or “What part of NO do you not understand?”
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
Rich Signell Roland Viger Curtis Price USGS Community for Data Integration Feb 15, 2012.
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
Update on Unidata Technologies for Data Access Russ Rew
THREDDS Data Server (TDS) and Data Discovery John Caron Unidata/UCAR May 15, 2006.
TSDS (HPDE DAP). Objectives (1) develop a standard API for time series-like data, (2) develop a software package, TSDS (Time Series Data Server), that.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
NetCDF Data Model Details Russ Rew, UCAR Unidata NetCDF 2009 Workshop
Data Are from Mars, Tools Are from Venus
James Gallagher OPeNDAP
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Efficiently serving HDF5 via OPeNDAP
What is NetCDF ? And what are its plans for world domination?
Recent Work in Progress
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
OPeNDAP/Hyrax Interfaces
Presentation transcript:

NetCDF-Java version 2.2 Common Data Model John Caron Unidata/UCAR Dec 10, 2004

Outline 1.Data Models 2.NetCDF-4 and NetCDF-Java NcML & THREDDS

Acknowledgements NetCDF-4: Russ Rew, Ed Hartnett THREDDS: Ethan Davis, Ben Domenico, Yuan Ho, Robb Kambic IDV: Don Murray, Jeff McWhirter, Doug Lindholm NcML: Luca Cinquini, Ethan Davis, Stefano Nativi, Russ Rew, Bob Drach HDF5: Mike Folk, Quincey Kiozol, Robert McGrath OpenDAP: James Gallagher

Creating a Common Data Model from NetCDF, HDF5, OPeNDAP Data Models

NetCDF Machine and OS independent file format for “self-describing” scientific data C library (Fortran, C++, Perl, IDL, MatLab, Python, Ruby), Java library Multidimensional arrays, efficient subsetting. > 20,000 downloads last year (of complete netCDF-3 source by distinct hosts)

NetCDF-3 Data Model

HDF5 Machine and OS independent file format for “self-describing” scientific data (NCSA) C library (Fortran, Java, others??) Evolution from HDF4, but not compatible. HDF-EOS, HDF5-EOS Standard formats for EOSDIS, ASCI, NPOESS Parallel-IO, chunked storage, compression filters, many data types.

HDF5 Data Model

OPeNDAP Client-server protocol for scientific data access C++ client and server, Java client and server libraries. NetCDF-OPeNDAP client most popular (80/20) Current version 2.0 NASA ESE standard Working on new 4.0 protocol spec. Peter Cornillon (PI), James Gallagher (lead), et al, from Univ. Rhode Island

OpenDAP Data Model

Common Data Model (CDM)

Abstract Data Models An API is the interface to the Data Model for a specific language A file format is a persistence format for the Data Model. A data access protocol plays roughly the same role as a file format. The Abstract Data Model removes the details of any particular API and the persistence format.

Common Data Model Layers Coordinate Systems Scientific Datatypes GridStationImage Data Access

CDM Coordinate Systems

Implementing the CDM: Netcdf-4 NetCDF-Java 2.2

NetCDF-4 Project funded by NASA to create new version of netCDF using the HDF5 file format. “Extend and merge” netCDF and HDF5: –Widespread use and simplicity of netCDF-3 –Generality and performance of HDF5 Specifically, we are funded to create netCDF-4 C library API, using HDF5 library underneath. Russ Rew (PI), Ed Hartnett

17 NetCDF-4 C Library HDF5 Library netCDF-4 Library netCDF-3 Interface NetCDF-4 Architecture

NetCDF-4 and Java 100% Java library for netCDF-4 files possible? –Won’t implement MPI parallel-IO –netCDF-4 features are a subset of HDF5 –Reading easier than writing NetCDF-Java 2.1 already a 100% Java library for netCDF-3 files (and OPeNDAP) NetCDF-Java 2.2: read HDF5 to determine what netCDF-4 data model should be

Common Data Model NetCDF-Java 2.2: create one API (and data model) for access to netCDF-3, HDF5, and OPeNDAP: prototype for CDM. NetCDF, HDF5, and OPeNDAP groups are discussing a formal mapping between the three data models. –Opportunity to tweak the 3 data models to mitigate differences –Opportunity to make OPeNDAP 4.0 the remote access protocol for netCDF-4, and netCDF-4 the file persistence format for OPeNDAP.

Common Data Model NetCDF-Java 2.2 implements the CDM. NetCDF-4 C library will implement the CDM NetCDF-4 file format will be the persistence format for CDM. Caveats: –Not stable until C library and file format are finished (summer 05).

NetCDF-Java 2.2 (nj22) Alpha release: Nov 2004 Beta release: Mar 2005 Release: summer 2005

NetcdfDataset Application Scientific Datatypes GridStationImage NetCDF-Java version 2.2 architecture ADDEOpenDAP THREDDS Catalog.xml NetCDF-3 HDF5 I/O service provider GRIB GINI NIDS NetcdfFile NetCDF-4 … Nexrad DMSP

I/O Service Provider Implementations DMSP (Defense Meteorological Satellite Program) from NGDC (Ethan Davis) GINI (national radar mosaic) (Yuan Ho) GRIB-1, GRIB-2 (Robb Kambic) NEXRAD level II (NCDC archives, CRAFT compressed) NEXRAD level III (partial) (Yuan Ho) NetCDF-3 HDF5

Direct Grib reading – why? Grib is WMO standard, NCEP model data NetCDF/Grib file size = 6.6 to 40 –Grib-1 has scale/offset compression –Grib-2 has JPEG2000 (wavelet), complex compression Existing decoder (grib2nc) –needs predefined CDL –No Grib-2 decoder Want the convenience of netCDF API without actually writing a netCDF file.

ucar.grib library Standalone Java library to read Grib files –Author: Robb Kambic –Grib-1: started with JGrib library, but rewrote –Grib-2: from scratch, uses jpeg2000 library Grib file = collection of Grib records. Write index file first time it reads Grib file. Tested with only IDD/NCEP data so far. Goal: allow others to extend by adding new tables without programming. Basis for future Grib decoders.

ucar.nc2.iosp.grib Creates NetCDF / CDM objects on the fly. Collection of 2D arrays (Grib records) -> 5D dataset (netCDF). (not foolproof) Add CF-1 and _Coordinate Conventions. Looks like a CF compliant netCDF file. Can use FileWriter to write to netCDF file.

I/O Service Provider Implement this interface: public interface IOServiceProvider { boolean isValidFile( RandomAccessFile raf); void open( RandomAccessFile raf, NetcdfFile ncfile); Array readData( Variable v2, List section); // only if you use Structures Array readNestedData( Variable v2, List section); }

File Format #N File Format #2 File Format #1 CDM Visualization &Analysis Goal: N + M instead of N * M things on your TODO List NetCDF file Data Server Web Service

NcML THREDDS

NcML - NetCDF Markup Language XML representation of netCDF metadata Create new files, like ncgen uses CDL Modify existing datasets –Add, delete, rename Attributes, Dimensions, Variables, Groups –Create logical sections of existing variables. –Create unions and aggregations of multiple existing datasets.

<netcdf xmlns=" location="test/data/nids/N0R_ _2147"> NcML example

NcML Datasets Datasets NcML Dataset XML Application dataset NcML dataset

THREDDS Datasets nj22 library accepts URLs like thredds: THREDDS metadata can be used to know how to read the dataset. THREDDS metadata can be added to the Dataset as global attributes. NcML can be applied to a collection of datasets in a THREDDS catalog

THREDDS Datasets Datasets NcML Dataset XML Application Catalog.xml dataset 1 dataset 2 … Datasets Catalog.xml dataset 1 dataset 2 … dataset NcML dataset THREDDS dataset

Limitations Currently this functionality is available only through the netCDF-Java library. –NcML will probably eventually become available in the C library. –Not sure about THREDDS catalogs So your client has to be written in Java

HTTP Tomcat Server THREDDS Data Server Datasets Catalog.xml hostname.edu Data Server OPeNDAP WCS Application NJ22 library

Summary NetCDF-4 will have an extended data model based on experience with netCDF- 3, HDF5 and OPeNDAP. Lack of shared Dimensions biggest problem in mapping to other models. Currently available in alpha version of netCDF-Java 2.2 library.

Next Time Coordinates Scientific Data Types OpenDAP as remote access protocol for netCDF-4?

Warning! Danger! This is alpha quality, API still evolving! Please use and influence us: –Testing with real datasets –Convention parsing –IOServiceProvider

For More Info: Google: Netcdf-Java