Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder 2006-10-16.

Slides:



Advertisements
Similar presentations
STRING AN EXAMPLE OF REFERENCE DATA TYPE. 2 Primitive Data Types  The eight Java primitive data types are:  byte  short  int  long  float  double.
Advertisements

1 Projection Indexes in HDF5 Rishi Rakesh Sinha The HDF Group.
Recent Work in Progress
A Common Data Model In the Middle Tier Enabling Data Access in Workflows … HDF/HDF-EOS Workshop XIV September 29, 2010 Doug Lindholm Laboratory for Atmospheric.
ESCI/CMIP5 Tools - Jeudi 2 octobre CMIP5 Tools Earth System Grid-NetCDF4- CMOR2.0-Gridspec-Hyrax …
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
NetCDF An Effective Way to Store and Retrieve Scientific Datasets Jianwei Li 02/11/2002.
HDF4 and HDF5 Performance Preliminary Results Elena Pourmal IV HDF-EOS Workshop September
Data Type. A data type defines a set of values that a variable can store along with a set of operations that can be performed on that variable. Common.
NetCDF Ed Hartnett Unidata/UCAR
Introduction to NetCDF Russ Rew, UCAR Unidata ICTP Advanced School on High Performance and Grid Computing 13 April 2011.
Developing a NetCDF-4 Interface to HDF5 Data
1 Writing NetCDF Files: Formats, Models, Conventions, and Best Practices Russ Rew, UCAR Unidata June 28, 2007.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
Introduction to NetCDF4 MuQun Yang The HDF Group 11/6/2007HDF and HDF-EOS Workshop XI, Landover, MD.
HDF 1 HDF5 Advanced Topics Object’s Properties Storage Methods and Filters Datatypes HDF and HDF-EOS Workshop VIII October 26, 2004.
The HDF Group April 17-19, 2012HDF/HDF-EOS Workshop XV1 Introduction to HDF5 Barbara Jones The HDF Group The 15 th HDF and HDF-EOS Workshop.
NetCDF-4 The Marriage of Two Data Formats Ed Hartnett, Unidata June, 2004.
Developing a NetCDF-4 Interface to HDF5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC.
1 Russ Rew, Ed Hartnett, John Caron UCAR Unidata Program Center Mike Folk, Robert McGrath, Quincey Kozial NCSA and The HDF Group, Inc. Final Project Review,
1 High level view of HDF5 Data structures and library HDF Summit Boeing Seattle September 19, 2006.
HDF5 A new file format & software for high performance scientific data management.
NPP/ NPOESS Product Data Format Richard E. Ullman NASA/GSFC/NPP NOAA/NESDIS/IPOAlgorithm / System EngineeringData / Information Architecture
Unidata’s Common Data Model John Caron Unidata/UCAR Nov 2006.
February 2-3, 2006SRB Workshop, San Diego P eter Cao, NCSA Mike Wan, SDSC Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration Object-level.
Mid-Course Review: NetCDF in the Current Proposal Period Russ Rew
December 1, 2005HDF & HDF-EOS Workshop IX P eter Cao, NCSA December 1, 2005 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration.
The HDF Group HDF5 Datasets and I/O Dataset storage and its effect on performance May 30-31, 2012HDF5 Workshop at PSI 1.
N P O E S S I N T E G R A T E D P R O G R A M O F F I C E NPP/ NPOESS Product Data Format Richard E. Ullman NOAA/NESDIS/IPO NASA/GSFC/NPP Algorithm Division.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
October 15, 2008HDF and HDF-EOS Workshop XII1 What will be new in HDF5?
Data Types. Data types Data type tells the type of data, that you are going to store in memory. It gives the information to compiler that how much memory.
Integrating netCDF and OPeNDAP (The DrNO Project) Dr. Dennis Heimbigner Unidata Go-ESSP Workshop Seattle, WA, Sept
1 HDF5 Life cycle of data Boeing September 19, 2006.
NetCDF Data Model Issues Russ Rew, UCAR Unidata NetCDF 2010 Workshop
Unidata’s Common Data Model and the THREDDS Data Server John Caron Unidata/UCAR, Boulder CO Jan 6, 2006 ESIP Winter 2006.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
HDF Hierarchical Data Format Nancy Yeager Mike Folk NCSA University of Illinois at Urbana-Champaign, USA
Info stored in computer (memory) Numbers All in binaray – can be converted to octal, hex Characters ASCII – 1-byte/char Unicode – 2-byte/char Unicode-table.com/en.
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
Unidata’s Common Data Model and NetCDF Java Library API Overview John Caron Unidata/UCAR Nov 2008.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
NetCDF and Scientific Data Durability Russ Rew, UCAR Unidata ESIP Federation Summer Meeting
Data File Formats: netCDF by Tom Whittaker University of Wisconsin-Madison SSEC/CIMSS 2009 MUG Meeting June, 2009.
Advances in the NetCDF Data Model, Format, and Software Russ Rew Coauthors: John Caron, Ed Hartnett, Dennis Heimbigner UCAR Unidata December 2010.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
ESMF,WRF and ROMS. Purposes Not a tutorial Not a tutorial Educational and conceptual Educational and conceptual Relation to our work Relation to our work.
SDM Center Parallel I/O Storage Efficient Access Team.
Unidata Technologies Relevant to GO-ESSP: An Update Russ Rew
CF 2.0 Coming Soon? (Climate and Forecast Conventions for netCDF) Ethan Davis ESO Developing Standards - ESIP Summer Mtg 14 July 2015.
Developing Conventions for netCDF-4 Russ Rew, UCAR Unidata June 11, 2007 GO-ESSP.
Development of a CF Conventions API Russ Rew GO-ESSP Workshop, LLNL
NetCDF: Data Model, Programming Interfaces, Conventions and Format Adapted from Presentations by Russ Rew Unidata Program Center University Corporation.
Update on Unidata Technologies for Data Access Russ Rew
The HDF Group Introduction to HDF5 Session 7 Datatypes 1 Copyright © 2010 The HDF Group. All Rights Reserved.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
Libcf – A CF Convention Library for NetCDF Ed Hartnett Unidata Program Center Boulder Colorado June 11, 2007.
NetCDF Data Model Details Russ Rew, UCAR Unidata NetCDF 2009 Workshop
Copyright © 2010 The HDF Group. All Rights Reserved1 Data Storage and I/O in HDF5.
Other Projects Relevant (and Not So Relevant) to the SODA Ideal: NetCDF, HDF, OLE/COM/DCOM, OpenDoc, Zope Sheila Denn INLS April 16, 2001.
Moving from HDF4 to HDF5/netCDF-4
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Efficiently serving HDF5 via OPeNDAP
Unidata & NetCDF BoF Scientific File Formats
Recent Work in Progress
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
Libcf – A CF Convention Library for NetCDF
Presentation transcript:

Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder

2 Status of netCDF-3 Work l NetCDF 3.6 (C, Fortran, C++) eliminated most 2 GiByte size limitations u Supports 64-bit offset file format variant u Other improvements not very relevant to unstructured grids: shared libraries, portability, performance, C++ improvements, … l NetCDF Java (nj22) continues to advance u Through a Common Data Model interface, reads netcdf-3, HDF5 (most), OPeNDAP, GRIB1, GRIB2, BUFR, … u Provides CF conventions compliance, coordinate systems: u I/O provider framework for adding new data formats l libcf under development for a CF conventions API based on netCDF-3

3 Status of NetCDF-4 l NetCDF-4.0-alpha17 currently available for testing u Files created with alpha release use unsupported artifacts u We’re still seeking feedback on performance and functionality u Early users have obtained 4:1 compression and 7x speedups l NetCDF-4.0-beta waiting for HDF5 1.8-beta u Will finalize file format, eliminate necessity for artifacts u Expected within a few weeks of HDF5 1.8-beta release l HDF5 1.8 currently expected in January 2007 u Has enhancements specifically for netCDF-4: variable creation order, Unicode names, dimension scales, on-the-fly numeric conversions l Plans for netCDF-4.1 and beyond on netCDF-4 web site

4 NetCDF-3 Data Model Attribute name: String type: DataType values: 1D array Variable name: String shape: Dimension[ ] type: DataType array: read( ), … File location: Filename create( ), open( ), … Dimension name: String length: int isUnlimited( ) DataType char byte short int float double A file has named variables, dimensions, and attributes. Variables also have attributes. Variables may share dimensions, indicating a common grid. One dimension may be of unlimited length. Variables and attributes have one of six primitive data types.

5 NetCDF-4 Data Model (Common Data Access Model) Dimension name: String length: int isUnlimited( ) Attribute name: String type: DataType values: 1D array Variable name: String shape: Dimension[ ] type: DataType array: read( ), … Group name: String File location: Filename create( ), open( ), … DataType PrimitiveType char byte short int int64 float double unsigned byte unsigned short unsigned int unsigned int64 string UserDefinedType typename: String Compound VariableLength Enum Opaque A file has a top-level unnamed group. Each group may contain one or more named subgroups, variables, dimensions, attributes, and types. Variables also have attributes. Variables may share dimensions, indicating a common grid. One or more dimensions may be of unlimited length. Variables and attributes have one of twelve primitive data types or one of four user-defined types.

6 Some netCDF-3 Limitations l Relevant to representing unstructured grids: u No data structures, just scalars and multidimensional arrays u No ragged arrays or nested structures u Only one shared unlimited dimension u Flat name space for dimensions and variables l Not relevant (?) for unstructured grids: u No strings, just arrays of characters u Limited numeric types u Only ASCII characters in names u Changes to file schema can be expensive u Efficient access requires reads in same order as writes u No built-in compression u Only serial I/O

7 New Features of netCDF-4 l Relevant to representing unstructured grids: u User-defined compound types (portable structs) u User-defined variable-length types for ragged arrays u Groups for nested scopes u Multiple unlimited dimensions l Not relevant (?) for unstructured grids: u String type u Additional numeric types u Unicode names u Efficient dynamic schema changes u Multidimensional tiling (chunking) u Per variable compression u Parallel I/O

8 User-Defined Compound Type l Like C structs, but portable l May be nested Multiple variables may use same type l Attributes may be of compound type also (needed for units) l Efficiency note: members stored close together types: compound ob { int station_id; double time; float temperature; float pressure; } variables: ob obs(nstations);

9 User-Defined Variable Length Type l Has a name and a base type l Can be used for “ragged arrays” l Access to a variable-length value is atomic u Length and values written or read together u Can’t know length until value is read u In C/Fortran, library allocates memory for value l Multiple variables may use same type l May be nested to create multidimensional variable- length types types: float(*) row_of_floats; variables: row_of_floats ragged_array1(m);

10 Groups l A non-root Group has a name and a parent group l The root group is unnamed l A Group may have variables, dimensions, attributes, types, and subgroups l A Group is analogous to a netCDF-3 file (root group) AB CD

11 NetCDF-4 Architecture NetCDF Java applications NetCDF-3 applications NetCDF-4 applications HDF5 applications l NetCDF-4 uses HDF5 for storage, high performance u Parallel I/O u Chunking for efficient access in different orders, efficient use of compression u Conversion using “reader makes right” approach l Provides simple netCDF interface to subset of HDF5 l Also supports netCDF classic and 64-bit formats POSIX I/O MPI I/O HDF5netCDF-3 netCDF Java netCDF-4 … NetCDF Java application NetCDF-3 application NetCDF-4 application HDF5 application Java VM

12 Status of CF l White paper available on “Maintaining and Advancing the CF Standard for Earth System Science Community Data”, Bryan Lawrence, et al l CF becoming important to more communities New web site set up for discussions, maintenance: pcmdi.llnl.gov/ eventually: pcmdi.llnl.gov/ l Funded staff now supporting CF l CF Governance Panel now in existence (Oct 1), responsible for stewardship not technical content l Under WMO/WCRP Working Group on Coupled Modeling (WGCM) l Two CF committees u Conventions u Standard Names

13 Some Unstructured Grid Issues l Is netCDF-3 data model adequate for representing unstructured grids? l If not, what netCDF-4 features are needed for unstructured grid representations? l Can needed netCDF-4 features for unstructured grids be emulated in netCDF-3 data model? l Should means of emulation of particular netCDF-4 features in netCDF-3 be elevated to conventions level?