Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC The HDF Group Oct. 16 20081HDF and.

Slides:



Advertisements
Similar presentations
The Seven Pillars of Open Language Archiving: A Vision Statement Gary Simons and Steven Bird Workshop on Web-based Language Documentation and Description.
Advertisements

The HDF Group Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC Mike.
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Remote Access via OPeNDAP Kent Yang and Joe Lee The HDF Group The 14 th HDF/HDF-EOS Workshop.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)
The HDF Group HDF4 Mapping Status and Proposal Mike Folk, Ruth Aydt The HDF Group Oct. 17, 2012 Apr , 2012HDF/HDF-EOS Workshop XV.
ILDG File Format Chip Watson, for Middleware & MetaData Working Groups.
Improving long-term preservation of EOS data by independently mapping HDF4 data objects The HDF Group.
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting ( Howard Besser UCLA School of Education & Information.
The HDF Group Improving long-term preservation of EOS data by independently mapping HDF4 data objects Mike Folk, Ruth Aydt, Joe Lee, Binh-Minh.
University of Illinois at Urbana-ChampaignHDF 1McGrath/Yang 2/27/02 Transitioning from HDF4 to HDF5 Robert E. McGrath Kent Yang.
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Access of NASA HDF data via OPeNDAP Kent Yang and Joe Lee The HDF Group September 28,2010.
The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Mike Folks, The HDF Group Ruth Duerr, NSIDC 1.
University of Illinois at Urbana-ChampaignHDF 9/19/2000 McGrath 9/19/ Transition from HDF4 to HDF5: Issues Robert E. McGrath NCSA University of Illinois.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
HDF5 Tools Update Peter Cao - The HDF Group November 6, 2007 This report is based upon work supported in part by a Cooperative Agreement.
HDF-EOS Workshop VII, An XML Approach to HDF-EOS5 Files Jingli Yang 1, Bob Bane 1, Muhammad Rabi 1, Zhangshi Yin 1, Richard Ullman 1, Robert McGrath.
EOSDIS User survey follow-up Mike Folk, Kent Yang, Elena Pourmal The HDF Group Oct. 17, 2012 Annual HDF Briefing to ESDIS1.
NASA EOS DATA COMPRESSION WITH HDF5 SCALEOFFSET FILTER This work was funded by the NASA Earth Science Technology Office under NASA award AIST and.
Important ESDIS 2009 tasks review Kent Yang, Mike Folk The HDF Group April 1st, /1/20151Annual briefing to ESDIS.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
The HDF Group September 28, 2010HDF/HDF-EOS Workshop XIV1 Easy Access of HDF data via NCL/IDL/MATLAB Kent Yang, Tong Qi, Ziying Li, Yi.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Usability Issues Documentation J. Apostolakis for Geant4 16 January 2009.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
HDF Converting between HDF4 and HDF5 MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Tools for Interoperability between HDF and NetCDF Mike Folk and MuQun Yang The HDF Group The HDF Group provides the following tools for the NASA HDF and.
1/14/200925th IIPS Conference 1 Challenges to Archive and Access NASA HDF-EOS Data in the long Term MuQun Yang (The HDF Group) Choonghwan Lee (The HDF.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
HDF Dimension Scales in HDF5 HDF-EOS Workshop IX San Francisco, CA November 30 - December 2, 2005 Pedro Vicente Nunes THG/NCSA Champaign-Urbana, IL HDF.
Support for NPP/NPOESS by The HDF Group Mike Folk The HDF Group HDF and HDF-EOS Workshop XII October 17, 2008 Oct HDF and HDF-EOS Workshop XII1.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
11/7/2007HDF and HDF-EOS Workshop XI, Landover, MD1 HDF5 Software Process MuQun Yang, Quincey Koziol, Elena Pourmal The HDF Group.
The HDF Group November 3-5, 2009 HDF-OPeNDAP Project Update HDF/HDF-EOS Workshop XIII1 Joe Lee and Kent Yang The HDF Group James Gallagher.
Towards Long-Term Archiving of NASA HDF-EOS and HDF Data Data Maps and the Use of Mark-Up Language Ruth Duerr, Mike Folk, Muqun Yang, Chris Lynnes, Peter.
October 15, 2008HDF and HDF-EOS Workshop XII1 What will be new in HDF5?
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII10/17/2008.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
HDF OPeNDAP Project Update MuQun Yang and Hyo-Kyung Lee The HDF Group March 31, Annual briefing to ESDIS10/31/2015.
A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.
- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.
MODLAND Volumes and Loads Status MODIS Land Science Team Workshop July 15, 2003 Robert Wolfe MODIS Land Team Support Group NASA GSFC Code 922, Raytheon.
The HDF Group Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal, Peter Cao The HDF Group November 5, 2009 November 3-5,
HDF4 OPeNDAP Project Progress Report MuQun Yang and Hyo-Kyung Lee 1 HDF Developers' Meeting11/24/2015.
Facilitating Access to EOS Data at the NSIDC DAAC Siri Jodha Singh Khalsa ECS Science Coordinator for the National Snow and Ice Data Center, Distributed.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1 HDF and HDF-EOS Workshop XII10/17/2008.
Using a Friendly OPeNDAP Client Library to Access HDF5 Data MuQun Yang and Hyo-Kyung Lee (The HDF Group) 1 25th IIPS Conference01/14/2009.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
Robert Wolfe NASA Goddard Space Flight Center Code 614.5, Greenbelt, MD Robert Wolfe NASA Goddard Space Flight Center Code 614.5,
July 20, Update on the HDF5 standardization effort Elena Pourmal, Mike Folk The HDF Group July 20, 2006 SPG meeting, Palisades, NY.
HDF and HDF-EOS Workshop VII September 24, 2003 HDF5, HDF-EOS and Geospatial Data Archives Don Keefer Illinois State Geological Survey Mike Folk Univ.
10/16/2012Annual HDF briefing1 HDF OPeNDAP support Kent Yang, Joe Lee, Mike Folk The HDF Group Oct. 16, 2012.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
Improving long-term preservation EOS data by independently mapping HDF4 data objects Mike Folk, Ruth Aydt, Peter Cao, Kent Yang Ruth Duerr Christopher.
The HDF Group November 3-5, 2009HDF/HDF-EOS Workshop XIII1 The New HDF-EOS Web Site - How it can help you Kent Yang, Joe Lee The HDF Group.
Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal The HDF Group Annual HDF Briefing to ESDIS March 31, 2009 March Annual HDF Briefing.
WInSAR Operations Update Christopher Crosby & Scott Baker AGU WInSAR Business Meeting, December 16, 2015.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
HDF and HDF-EOS Workshop XII
Transition from HDF4 to HDF5: Issues
Moving from HDF4 to HDF5/netCDF-4
SRNWP Interoperability Workshop
Mike Folk, Peter Cao, Kent Yang Ruth Duerr Christopher Lynnes
Persistent Identifiers Implementation in EOSDIS
Kent Yang, Mike Folk The HDF Group March 31, 2009
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
HDF Support for NASA Data Producers
HDF5 Tools Updates and Discussions
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting
Presentation transcript:

Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC The HDF Group Oct HDF and HDF-EOS Workshop XII

Background and basic concept Oct HDF and HDF-EOS Workshop XII2

Oct HDF and HDF-EOS Workshop XII3 HDF4 is FLEXIBLE EXTENSIBLE SELF- DESCRIBING I’m Plastic Man!

But There’s a cost… Oct HDF and HDF-EOS Workshop XII4

Complexity! Oct HDF and HDF-EOS Workshop XII5

Oct HDF and HDF-EOS Workshop XII6

Oct HDF and HDF-EOS Workshop XII7

Oct HDF and HDF-EOS Workshop XII8

Oct HDF and HDF-EOS Workshop XII9

Oct HDF and HDF-EOS Workshop XII10

Oct HDF and HDF-EOS Workshop XII11

Oct HDF and HDF-EOS Workshop XII12

Oct HDF and HDF-EOS Workshop XII13 How do we save HDF users from having to deal with all of the complexity under the hood?

Oct HDF and HDF-EOS Workshop XII14 Through the HDF software libraries, either by using the HDF APIs directly or by using HDF tools that depend on the HDF libraries. But what about the future…

There is a risk in depending solely on the HDF libraries to access HDF-formatted data over the long term. It is possible, especially in the distant future, that the libraries may not be available. Oct HDF and HDF-EOS Workshop XII

Really smart people and software? Oct HDF and HDF-EOS Workshop XII16 Maybe future data users and their computers will be so smart that the HDF4 format will be a piece of cake.

Oct HDF and HDF-EOS Workshop XII17 Maybe not.

We need an “easy” button Oct HDF and HDF-EOS Workshop XII18

Oct HDF and HDF-EOS Workshop XII19 read HDF data with an independent program that does not rely on the HDF API… “If only we could read HDF data with an independent program that does not rely on the HDF API… A possible approach [would be to] extend hdfls to print a hierarchical map of a data file, [and] write ncdump/hdp-like utilities to find, assemble and write out SDSes and vdatas.” “Leveraging HDF Utilities” Christopher Lynnes HDF Workshop X.

Oct HDF and HDF-EOS Workshop XII20

Oct HDF and HDF-EOS Workshop XII21 HDF4 file layout

Oct HDF and HDF-EOS Workshop XII22 HDF4 file layout

The project Oct HDF and HDF-EOS Workshop XII23

HDF4 mapping Problem  The complex internal byte layout of HDF files requires one to use the API to access HDF data.  This makes long-term readability of HDF data dependent on long-term allocation of resources to support HDF software. Proposed solution  Create a map of the layout of data objects in an HDF file, allowing a simple reader to be written to access the data. Oct HDF and HDF-EOS Workshop XII

HDF4 mapping project activities 1.Assess and categorize HDF4 data held by NASA  To determine what types of objects to map.  To get an idea of the magnitude of the project. 2.Develop prototype for proof of concept  Develop markup-language based layout specification.  Develop tool to produce layout for an HDF4 file.  Develop and test two independent tools to read HDF4 data based solely on the map files. Oct HDF and HDF-EOS Workshop XII

Project activities (continued) 3.Assess results and plan next steps  Present results and options for proceeding to the community.  Assess the likely usefulness of this approach, as well as any desirable modifications  Evaluate the effort required for a full solution that best meets community needs  Submit a proposal for the work needed to provide a full solution Oct HDF and HDF-EOS Workshop XII

1. Assess and categorize Oct HDF and HDF-EOS Workshop XII27

HDF and HDF- EOS Workshop XII How many HDF4 products? Data CenterHDF4 Products ASF0 GES-DISC236 GHRC54 ASDC63 LP-DAAC67 NSIDC47 ORNL-DAAC2 PO.DAAC22 SDAC0 MrDC95 Total586 Oct

HDF and HDF- EOS Workshop XII Data characteristics Product Identification  Product Name  Data Level  Archive Location  Product Version Whether the product was multi-file For HDF-EOS products  HDF-EOS version  For point data Number of point data sets Maximum number of levels  For swath data Number of swaths Maximum number of dimensions Organized by time, space, both, or other Whether dimension maps were used  For gridded data Number of grids Max number of dimensions in a grid Number of projections used Whether any grids were indexed HDF Version For raster data  Number of 8-bit rasters  Number of 24-bit rasters  Number of general rasters  Whether any rasters had attributes  Whether any rasters were compressed  Whether any rasters were chunked  Whether there were any palettes For SDS data  Number of SDSs  Maximum number of dimensions  Did any SDS have attributes  Was any SDS annotated  Were dimension scales used  Was compression used and if so what kind  Was chunking used For Vdata  Number of Vdata structures  Did any Vdata have attributes  Did any Vdata fields have attributes  Was compression used and if so what kind  Was chunking used Product Characteristics Examined Oct

HDF and HDF- EOS Workshop XII Other results Slightly more than half of the HDF4 products are in HDF-EOS 2 format Grids are the most common HDF-EOS data structures in use No products use a combination of grid, swath, and point data structures Oct

2. Prototype and proof of concept Oct HDF and HDF-EOS Workshop XII31

HDF4 mapping prototype workflow October 15-18, 2008HDF and HDF-EOS Workshop XII32 HDF4 File “H4.hdf” HDF4 File “H4.hdf” HDF4 Mapping File (XML document) “H4.hdf.map.xml” HDF4 Mapping File (XML document) “H4.hdf.map.xml” hmap linked with HDF4 library hmap linked with HDF4 library Reader 1 (C program) Object Data Groups, Data Objects, Structural and Application Metadata; Locations of Object Data Reader 2 (Perl Script) Reader 2 (Perl Script)

Proof-of-concept results The HDF Group created prototype map generation software and a draft map specification Map generator was tested on a wide variety of data products GES-DISC and NSIDC independently wrote software that uses maps to read data files in NSIDC’s and GES-DISC’s archives Summary - the concept is feasible! Oct HDF and HDF-EOS Workshop XII

Example map fragment Oct HDF and HDF-EOS Workshop XII

Next steps Oct HDF and HDF-EOS Workshop XII35

Effort for full implementation Generate maps for existing archives  GES-DISC approach: append the map XML to the XML files already kept for each file in their archive  NSIDC non-ECS data implementation: add an XML file for each data file in same directory  Other systems TBD Generate maps for new data  Add map generation as a step in the ingest process using stand alone tool  Request product generation systems to use new API calls that generate maps Develop production quality implementation of mapping tool, and possibly an API. Possibly do similar assessment for HDF5 maps. HDF and HDF- EOS Workshop XII36Oct

HDF and HDF- EOS Workshop XII How you can help Consider what it might take to implement this for your archive - contact Ruth if you’d like support Review the materials on the wiki and elsewhere - comment heavily! Oct

For more information Wiki page added to Confluence wiki Project page at The HDF Group website:  Paper at 2008 fall AGU Paper “Ensuring Long Term Access to Remotely Sensed Data with Layout Maps” in the upcoming TGRSS special issue on archiving and distribution HDF and HDF- EOS Workshop XII38Oct

Thank you. This report is based upon work supported in part by a Cooperative Agreement with the National Aeronautics and Space Administration (NASA) under NASA Award NNX06AC83A. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Aeronautics and Space Administration. Oct HDF and HDF-EOS Workshop XII39