Efficiently serving HDF5 via OPeNDAP

Presentation transcript:

Efficiently serving HDF5 via OPeNDAP
Kent Yang, The HDF Group
This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C

Why OPeNDAP (Open-source Project for a Network Data Access Protocol)?
- Check metadata remotely (in various forms)
- Obtain a subset of the data easily and efficiently
- Hide the original data sources:
  - Hierarchical Data Format (HDF) 4 and 5
  - Network Common Data Form (NetCDF)
  - Geospatial Tagged Image File Format (GeoTIFF)
- OPeNDAP output (including subsets) can be downloaded in other formats
- Many popular earth science tools can visualize and analyze the data via OPeNDAP
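
As an illustration of remote metadata checks and subsetting, a DAP2 request is an ordinary URL: a response suffix such as .dds, .das or .ascii is appended to the dataset URL, and a subset is requested with a constraint expression of the form variable[start:stride:stop]. The sketch below only builds such URLs. The server address, file name and variable name are placeholders, not taken from the slides, and the helper dap2_url is hypothetical.

```python
def dap2_url(dataset_url, response, variable=None, slices=()):
    """Build a DAP2 request URL.

    response: DAP2 response suffix, e.g. "dds", "das", "dods" or "ascii".
    slices:   one (start, stride, stop) tuple per dimension; stop is inclusive.
    """
    url = f"{dataset_url}.{response}"
    if variable is not None:
        # A hyperslab is written as [start:stride:stop] for each dimension.
        hyperslab = "".join(f"[{a}:{s}:{b}]" for a, s, b in slices)
        url += f"?{variable}{hyperslab}"
    return url


# Placeholder dataset URL and variable name (assumptions for illustration).
base = "http://example.org/opendap/data/sample.h5"
metadata_url = dap2_url(base, "dds")
subset_url = dap2_url(base, "ascii", "soil_moisture", [(0, 1, 9), (0, 2, 19)])
print(metadata_url)
print(subset_url)
```

Some clients percent-encode the square brackets before sending the request; that step is omitted here.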

HDF5 Hyrax handler
- To visualize HDF5 data via Hyrax, we have to follow the Climate and Forecast (CF) conventions to translate the data layout.
- Diagram components: NASA HDF5 metadata layouts, HDF5 CF handler, CF-required metadata layout

How the HDF5 Handler (CF Option) Works
- Diagram components: NASA HDF5 files, HDF5 library, HDF5 handler (CF option), CF-required layout

Visualize a Soil Moisture Active Passive (SMAP) HDF5 variable via Hyrax
- This is a SMAP Level 3 example.
- An HDF5 variable can be displayed by Panoply through Hyrax.

HDF5 handler and NcML (NetCDF Markup Language)
- The NcML module can be used with the HDF5 handler to provide missing CF conventions information.

    <variable name="VNP_Grid_500m_2D_SurfReflect_I1_1">
      <!-- Rename attribute Scale and Offset -->
      <attribute name="scale_factor" orgName="Scale" />
      <attribute name="add_offset" orgName="Offset" />
    </variable>

- Callout: the attribute "Scale" is renamed to "scale_factor".

HDF5 handler and File NetCDF
- The File NetCDF module can work with the HDF5 handler to convert HDF5 files to NetCDF-3 or NetCDF-4 classic files.

Use NcML and File NetCDF to work with the HDF5 handler

    <variable name="VNP_Grid_500m_2D_SurfReflect_I1_1">
      <!-- Rename attribute Scale and Offset -->
      <attribute name="scale_factor" orgName="Scale" />
      <attribute name="add_offset" orgName="Offset" />
    </variable>

- Scale = 1e-4
- VIIRS (Visible Infrared Imaging Radiometer Suite) via Hyrax directly: value shown in the figure is 12391.0
- The NetCDF file of VIIRS via Hyrax with the NcML and File NetCDF modules: value shown in the figure is 1.2
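
The two figure values are consistent with the CF packing rule, unpacked = packed * scale_factor + add_offset, which a CF-aware client can apply once the attributes carry the CF names. A minimal sketch of that arithmetic follows. The slide gives Scale = 1e-4 but does not state the Offset value, so an offset of 0 is assumed here; the function name unpack is hypothetical.

```python
def unpack(packed, scale_factor=1.0, add_offset=0.0):
    """Apply the CF convention: unpacked = packed * scale_factor + add_offset."""
    return packed * scale_factor + add_offset


# Scale = 1e-4 comes from the slide; add_offset = 0.0 is an assumption.
value = unpack(12391.0, scale_factor=1e-4, add_offset=0.0)
print(round(value, 1))
```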

Service chain to access HDF5 via Hyrax
- OPeNDAP clients (netCDF, Ferret, Panoply, ...)
- Hyrax core
- Main engine: HDF5 handler
- Alternative output: File NetCDF
- Supplement info/aggregate: NcML
- Other diagram labels: files, NetCDF files

DAP4 support in the HDF5 handler (DAP: Data Access Protocol)
- CF option
  - DAP4 is strictly mapped from DAP2
  - The Dataset Metadata Response (DMR) replaces the Dataset Descriptor Structure (DDS) and Dataset Attribute Structure (DAS)
- Non-CF (generic) option
  - HDF5 groups map to DAP4 groups
  - HDF5 signed 8-bit and 64-bit integers map to DAP4
  - HDF5 dimensions follow the NetCDF-4 to DAP4 mapping

Interoperability enhancement
- CF option with DAP2
  - DAP2 and CF have restrictions: not all HDF5 objects can map to DAP2 or CF
  - An example: DAP2 doesn’t support 64-bit integers, while HDF5 does
  - Provide a way for service providers to check whether any objects are ignored when mapping from HDF5 to DAP2
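
The slides do not show how the handler reports ignored objects. The sketch below only illustrates the kind of check described, using the 64-bit integer example from the slide. The variable names, the NumPy-style type strings and the function ignored_for_dap2 are all hypothetical.

```python
# Types with no DAP2 counterpart. Only the 64-bit integer case named on the
# slide is listed; a real check would cover more HDF5 object kinds.
DAP2_UNSUPPORTED = {"int64", "uint64"}


def ignored_for_dap2(variables):
    """Return the names of variables whose type cannot be mapped to DAP2."""
    return sorted(name for name, dtype in variables.items()
                  if dtype in DAP2_UNSUPPORTED)


# Hypothetical file contents: variable path -> element type.
variables = {
    "/latitude": "float32",
    "/time_utc": "int64",
    "/soil_moisture": "float32",
}
ignored = ignored_for_dap2(variables)
print(ignored)
```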

Performance improvement
Options that may help reduce access time:
- Reducing data access time
  - HDF5 is efficient at retrieving raw data
  - Caching the raw data on disk
  - Best if the data is compressed
- Reducing DAP2 DDS and DAS access time
  - Caching DDS and DAS in memory
  - Caching DAS on disk

How the HDF5 Handler (CF Option) Memory Cache Works
- Diagram components: HDF5 CF handler, memory cache, HDF5 library, CF-required layout, NASA HDF5 files

How the HDF5 Handler (CF Option) Memory Cache Works (continued)
- Diagram components: HDF5 CF handler, CF-required layout, memory cache, HDF5 library, NASA HDF5 files
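
The transcript keeps only the diagram labels, not the cache logic. As a rough sketch of the general idea of caching DDS and DAS in memory, the example below keeps built metadata keyed by file path and evicts the least recently used entry when the cache is full. The eviction policy, the class name and the build callback are assumptions for illustration, not the handler's actual design.

```python
from collections import OrderedDict


class MetadataMemoryCache:
    """Tiny least-recently-used cache keyed by file path (illustrative only)."""

    def __init__(self, max_entries=100):
        self.max_entries = max_entries
        self._entries = OrderedDict()

    def get(self, file_path, build):
        """Return cached metadata, calling build(file_path) only on a miss."""
        if file_path in self._entries:
            self._entries.move_to_end(file_path)  # mark as recently used
            return self._entries[file_path]
        value = build(file_path)
        self._entries[file_path] = value
        if len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)  # evict least recently used
        return value


build_calls = []


def build_das(path):
    """Stand-in for the expensive step of building metadata from a file."""
    build_calls.append(path)
    return f"DAS for {path}"


cache = MetadataMemoryCache(max_entries=2)
first = cache.get("a.h5", build_das)
second = cache.get("a.h5", build_das)  # served from memory, no rebuild
print(len(build_calls))
```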

How the HDF5 Handler (CF Option) Disk Cache Works
- Diagram components: CF-required layout, HDF5 CF handler, DAS disk cache, HDF5 library, NASA HDF5 files

How the HDF5 Handler (CF Option) Disk Cache Works (continued)
- Diagram components: CF-required layout, HDF5 CF handler, DAS disk cache, HDF5 library, NASA HDF5 files
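
Again, only the diagram labels survive in the transcript. A minimal sketch of a DAS disk cache follows, assuming one cache file per data file, named by a hash of the data file's path. The naming scheme, the function cached_das and the build callback are hypothetical, and cache invalidation is left out.

```python
import hashlib
import os
import tempfile


def cached_das(file_path, cache_dir, build):
    """Return the DAS text for file_path, reading it from disk when cached."""
    key = hashlib.sha256(file_path.encode("utf-8")).hexdigest()
    cache_file = os.path.join(cache_dir, key + ".das")
    if os.path.exists(cache_file):
        with open(cache_file, encoding="utf-8") as f:
            return f.read()
    text = build(file_path)  # expensive step: open the file, build the DAS
    with open(cache_file, "w", encoding="utf-8") as f:
        f.write(text)
    return text


das_builds = []


def build_das_text(path):
    """Stand-in for building the DAS from the data file."""
    das_builds.append(path)
    return f"Attributes for {path}"


with tempfile.TemporaryDirectory() as cache_dir:
    das_first = cached_das("/data/sample.h5", cache_dir, build_das_text)
    das_second = cached_das("/data/sample.h5", cache_dir, build_das_text)
print(len(das_builds))
```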

Other new features
- Support access to the HDF-EOS5 sinusoidal projection in the HDF5 OPeNDAP handler

Future work
- CF option
  - Support the mapping of 64-bit integers to DAP4
  - Support access to other projections of HDF-EOS5-like products
  - Add DDS disk cache support (?)
- Non-CF (generic) option
  - Add the mapping of HDF5 variable-length data to DAP4

Access HDF5 via Hyrax in the cloud
- Three architectures
- The HDF5 handler can be enhanced for future work on Architectures 2 and 3

Architecture #2: Files with HTTP (Hypertext Transfer Protocol) Range-Gets
- Current implementation: Range-Gets index per HDF5 chunk
- Add an option to the handler: Range-Gets index per HDF5 variable
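
To make the per-chunk index concrete: an HTTP range request names an inclusive byte range in a Range header, so an index that records each chunk's byte offset and size is enough to fetch one chunk without downloading the whole file. The sketch below builds those headers. The chunk coordinates, offsets and sizes are made-up values, and the index layout is an assumption, not the format Hyrax uses.

```python
def range_header(offset, nbytes):
    """Build the HTTP Range header for nbytes starting at byte offset.

    HTTP byte ranges are inclusive at both ends, hence the "- 1".
    """
    if offset < 0 or nbytes <= 0:
        raise ValueError("offset must be >= 0 and nbytes must be > 0")
    return {"Range": f"bytes={offset}-{offset + nbytes - 1}"}


# Hypothetical index: chunk coordinates -> (byte offset in file, size in bytes).
chunk_index = {
    (0, 0): (4096, 1024),
    (0, 1): (5120, 980),
}
headers = {coords: range_header(*loc) for coords, loc in chunk_index.items()}
print(headers[(0, 0)])
```

An index per variable would instead hold one offset and size for the variable's whole storage, which is simplest when that storage is contiguous.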

Architecture #3: HDF5 datasets as S3 (Simple Storage Service) objects
- Current implementation: an HDF5 chunk in a variable is an S3 object
- Add an option to the handler: an HDF5 variable is an S3 object

This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C