HDF Update. Mike Folk, The HDF Group. The 13th HDF and HDF-EOS Workshop, November 3-5, 2009 (HDF/HDF-EOS Workshop XIII). www.hdfgroup.org

Presentation transcript:

HDF Update
Mike Folk, The HDF Group
The 13th HDF and HDF-EOS Workshop, November 3-5, 2009
HDF/HDF-EOS Workshop XIII

Topics

What’s up with The HDF Group?

What is The HDF Group, and why does it exist?

The HDF Group
- Established in years at University of Illinois National Center for Supercomputing Applications
- 4 years an independent non-profit company, “The HDF Group”
- The HDF Group owns HDF4 and HDF5
- Basic HDF4 and HDF5 formats, libraries and tools are open and free

Data challenges addressed by HDF
- Our ability to organize complex collections of data
- Efficient and scalable data storage and access
- A growing need to integrate a wide variety of types of data
- Long term preservation of data

The HDF Group Mission
To ensure long-term accessibility of HDF data through sustainable development and support of HDF technologies.

Goals
- Maintain and evolve HDF for sponsors and communities that depend on it
- Provide support to the HDF communities through consulting, training, tuning, development, research
- Sustain The HDF Group for the long term to assure data access over time

The HDF Group Services
- Helpdesk and Mailing Lists: available to all users as a first level of support
- Standard Support: rapid issue resolution and advice
- Consulting: needs assessment, troubleshooting, design reviews, etc.
- Training: tutorials and hands-on practical experience
- Enterprise Support: supporting many HDF activities across organizations
- Special Projects
  - Adapting customer applications to HDF
  - New features and tools
  - Research and development

Members of the HDF support community
- NASA – EOS
- NOAA/NASA/Riverside Tech – NPOESS
- Army Geospatial Center
- A leading U.S. aerospace company
- NIH/Geospiza (bio software company)
- University of Illinois/NCSA
- Sandia National Laboratory (2)
- Lawrence Berkeley National Lab
- Projects for petroleum industry, vehicle testing, weapons research, others
- “In kind” support

Some areas of increased recent interest
- Improvements
  - Concurrent access
  - Parallel I/O performance
  - Real-time write performance
  - High level language support
- Life sciences
  - Sequencing
  - Biomedical imaging
- Database integration
- Microsoft products (HPC, .NET, others)

Cool recent application - Imageworks’ Field3D
[Images: Spiderman 3, The Polar Express]

Topics

Basic Library Releases: HDF5, HDF4

Timeline of the HDF library releases

HDF minor release (May 09)
- New functions
  - Improve flexibility when traversing external links
  - Validate object identifier
- Enabled data chunk cache properties to be set per dataset (per file in previous releases)
- Forward/backward compatibility issues
  - Modified library to be able to open files with corrupt root group symbol table messages
  - Also corrects corruption errors if found

HDF minor release (Nov 09)
- Modified configure and make process to properly preserve user's CFLAGS and similar environment variables.
- Corrected a problem where the library would rewrite the superblock in a file opened for R/W access, even when no changes were made to the file.

HDF5 1.6 minor releases
- May 09
  - Minor bug fixes
  - Same tools improvements as in
- Nov 09
  - Minor bug fixes
  - Ability to embed library information in executable binaries
  - This is the last release of the 1.6 series; this was announced in May 2009, with no response
  - This is your last chance!

HDF 4r2.4 minor release (Feb 09)
- Minor bug fixing, enhancements
- New routines to get size of compressed data
- Support for C shared libraries
- Support for 32-bit version on Mac Intel
- Updated docs in HTML and PDF

HDF 4r2.5 minor release (Feb 10)
- Minor bug fixes, enhancements
- Support for 64-bit version on Mac Intel
- Restructured and cleaned up source code for easier maintenance
- Changes in versioning
  - Improves ability to maintain
  - Becomes similar to how HDF5 versioning works
  - Will use major, minor, release and sub-release suffix in the names of the source tar balls, e.g., hdf-4.2.5, hdf snap0
  - Library string will include the suffix, e.g., "HDF Version 4.2 Release 4-snap3, October 18, 2009"

H4-H5 Conversion Software 2.1 (Feb 09)
- Based on HDF4r2.4 and HDF
- h4toh5 utility
  - Recognizes HDF-EOS2 files (--with-hdfeos2 configuration option)
  - Can generate HDF5 files that can be read by netCDF-4
- h4toh5 library
  - Bug fixes
  - Performance improvements

H4-H5 Conversion Software 2.2 (Feb 10)
- Based on HDF4r2.5 and HDF

Topics

Major Improvements for Existing Tools
- h5dump additions
  - Ability to show data pointed to by dataset region references
  - More options for dumping data into ASCII
    - Compatible with MS Excel
    - Compatible with h5import
- h5diff
  - Improvements in accuracy, flexibility, and performance
  - Some new flags
    - Report non-comparable objects
    - Avoid NaN detection
    - Option to use system epsilon to compare floating-point numbers
  - Compares for strict equality first to improve performance
  - Treats two INFINITY values as equal
  - Fixed segmentation fault problem on variable length strings
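The h5diff comparison rules listed above can be illustrated with a small sketch. This is not h5diff's code: the function name `values_differ`, the choice to treat two NaN values as "no difference", and the use of Python's `sys.float_info.epsilon` as the system epsilon are all assumptions made for illustration.

```python
import math
import sys


def values_differ(a: float, b: float, epsilon: float = sys.float_info.epsilon) -> bool:
    """Return True when a and b would be reported as different (illustrative only)."""
    # Strict equality first: a cheap test that settles most elements and
    # also makes two infinities of the same sign compare as equal.
    if a == b:
        return False
    # Assumption: two NaN values are treated as "no difference";
    # a NaN against a number is a difference.
    if math.isnan(a) or math.isnan(b):
        return not (math.isnan(a) and math.isnan(b))
    # An infinity against anything that is not the same infinity differs.
    if math.isinf(a) or math.isinf(b):
        return True
    # Otherwise fall back to an absolute comparison against epsilon.
    return abs(a - b) > epsilon
```

Checking equality before anything else keeps the common case (identical values) to a single comparison, which is the performance point the slide makes.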

Major Improvements for Existing Tools
- h5stat
  - Fixed incorrect statistics on EOS big data files with corrupted headers
- h5repack
  - Added ability to preserve group creation order
  - When chunk size is not specified, uses heuristics to set chunk size
  - Fixed a problem where 1.8 fails on a file created with 1.6

Tool activities in the works
- New tool: h5tail
  - Display new records appended to a dataset
- Improved code quality and testing
- Tools library: general purpose APIs for tools
  - Tools library currently only for our developers
  - Want to make it public so that people can use it in their products
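The idea behind an h5tail-style tool, showing only the records appended since the last look, can be sketched without any HDF5 dependency. The class name `TailReader` and its `poll` method are hypothetical, and a plain Python list stands in for an appendable dataset; how the actual tool is built is not described in the slides.

```python
class TailReader:
    """Remembers how many records were already shown (illustrative only)."""

    def __init__(self) -> None:
        self._seen = 0  # number of records already reported

    def poll(self, records: list) -> list:
        """Return the records appended since the previous call."""
        new_records = list(records[self._seen:])
        self._seen += len(new_records)
        return new_records
```

A caller would invoke `poll` periodically on the growing dataset and print whatever comes back.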

Conversion Tools
- HDF4 to HDF5
- HDF5 to jpeg
- HDF5 to XML
- HDF5 to other formats?
Please send us your comments and requests regarding HDF5 conversion tools, such as

Topics

HDF-Java 2.6 is on the way
- Includes all HDF Java products
  - Java Wrapper API
  - Java Object API
  - HDFView
- Adds new features, such as better support for dataset region references
- Improves performance
- Release schedule
  - Beta 1: end of Nov. 09
  - Full release: end of Dec. 09

Full support of HDF5 1.8.x in hdf-java
- Full HDF5 1.8 support will be added to the release after version 2.6.
- We are looking for input
  - RFC:
- Java wrapper will be completed March 2010
- Object API and HDFView update to come later

Topics

Single-Writer/Multiple-Reader Access
- Situation: a long-running process is modifying an HDF5 file and simultaneously other processes want to inspect data in the file.
- Solution: Single-Writer/Multiple-Reader (SWMR) file access.
  - Allows simultaneous reading of an HDF5 file while the file is being modified by another process
  - No inter-process coordination necessary

Improved Multi-Threaded Concurrency
- Converting from “big lock” on code (entire library) to locks on internal library data structures
- Will improve ability to have multiple threads performing HDF5 operations simultaneously

Other Library Features
- Saving space
  - Store partial edge chunks more efficiently
  - Persistent file free space tracking/recovery
  - Allow a group’s link info to be compressed
- Saving time
  - Aggregate neighboring metadata for faster metadata cache I/O

New chunk indexing methods
Dataset type | Index type | Space improvements | Speed improvements
no unlimited dimensions, no filters, no missing chunks | “implicit” (no actual chunk index) | Same storage space as contiguous dataset storage (no index) | Constant time lookups; faster parallel I/O
no unlimited dimensions | “fixed sized” (smaller chunk index) | Smaller index overhead | Constant time lookups
1 unlimited dimension | “extensible array” | Smaller index overhead | Constant time lookups and appends
2+ unlimited dimensions | Improved B-tree* | Smaller index overhead | Faster
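Why an "implicit" index can offer constant time lookups is easiest to see in code: when every chunk exists, has the same size, and no dimension can grow, a chunk's location follows from arithmetic alone. The sketch below assumes chunks are laid out back to back in row-major order starting at `base_offset`; that layout, the function name, and its parameters are illustrative assumptions, not a description of the HDF5 file format.

```python
from math import prod


def implicit_chunk_offset(coords, dataset_shape, chunk_shape, element_size, base_offset=0):
    """Byte offset of the chunk holding the element at coords (illustrative only)."""
    # Number of chunks along each dimension (ceiling division covers edge chunks).
    chunks_per_dim = [-(-d // c) for d, c in zip(dataset_shape, chunk_shape)]
    # Which chunk, along each dimension, contains the element.
    chunk_index = [x // c for x, c in zip(coords, chunk_shape)]
    # Row-major linear chunk number.
    linear = 0
    for idx, count in zip(chunk_index, chunks_per_dim):
        linear = linear * count + idx
    # Every chunk occupies the same number of bytes (no filters, nothing missing).
    chunk_bytes = prod(chunk_shape) * element_size
    return base_offset + linear * chunk_bytes
```

No lookup structure is consulted, so the cost does not depend on how many chunks the dataset has.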

Parallel I/O Improvements
- Project with Lawrence Berkeley Nat’l Lab to improve HDF5 performance on parallel applications
- Up to 6x performance improvements on certain applications (so far)

Topics

HDF-EOS library

EOS support
- HDF-EOS2 and HDF-EOS5
  - Automatic configuration with szip enabled/disabled
  - Now tested daily with HDF4 and HDF5 development code
- Updated the HDF-EOS website

HDF-EOS5/netCDF-4 Augmentation Tool
Accessing HDF-EOS5 files via netCDF-4 API

The Main Challenge
- Would like netCDF-4 applications to be able to read and understand HDF-EOS5 files
- Problem: the netCDF-4 model follows the HDF5 dimension scale model but HDF-EOS5 does not.
[Figure: file structure HDFEOS / GRIDS / CloudFractionAndPressure / Data Fields, containing CloudFraction and CloudPressure. No HDF5 dimension scales are associated with either variable.]

Our Solution – Augmentation
- Provide dimensions required by netCDF-4
[Figure: file structure HDFEOS / GRIDS / CloudFractionAndPressure, now containing XDim and YDim, with Data Fields CloudFraction[XDim][YDim] and CloudPressure[XDim][YDim].]
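The effect of augmentation can be pictured with a toy in-memory model. Nothing here touches HDF5 or netCDF-4: the nested dictionary layout, the function name `augment_grid`, and the use of plain index ranges as the XDim and YDim values are assumptions chosen only to mirror the before and after pictures on the slides.

```python
import copy


def augment_grid(grid: dict) -> dict:
    """Return a copy of grid whose data fields name their dimensions (illustrative only)."""
    augmented = copy.deepcopy(grid)
    fields = augmented["Data Fields"]
    first = next(iter(fields.values()))
    n_x, n_y = len(first["data"]), len(first["data"][0])
    # Hypothetical dimension variables: index ranges stand in for real coordinates.
    augmented["XDim"] = list(range(n_x))
    augmented["YDim"] = list(range(n_y))
    # Associate every data field with the two dimensions.
    for field in fields.values():
        field["dimensions"] = ["XDim", "YDim"]
    return augmented
```

The original structure is left untouched, which matches the idea of adding information a netCDF-4 reader needs without altering the existing data fields.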

Special values in HDF5
- There are cases where a user may wish to specify more than one “special” value to describe non-standard data.
- We provide several examples (C, Fortran, IDL) of how to store special values

OPeNDAP

OPeNDAP
- HDF5-OPeNDAP handler
  - Served OMI Swath data
- HDF4-OPeNDAP handler
  - Tested with some AIRS data and some MODIS data
- More information in the Thursday morning session

Swath to Grid conversion Tool
- Request from NASA GES DISC
- Convert Swath to Grid
- Support both HDF-EOS2 and TRMM data
- Still in development
[Figure: MODIS Swath; Converted Grid]
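The slides do not say how the tool resamples a swath, so the following is only a generic sketch of one common approach: drop each swath sample into the regular grid cell that contains it and average the samples per cell. The function name, the `(lat, lon, value)` sample layout, and all parameters are hypothetical.

```python
def swath_to_grid(samples, lat_min, lon_min, cell_size, n_rows, n_cols, fill_value=None):
    """Average (lat, lon, value) samples into a regular grid (illustrative only)."""
    sums = [[0.0] * n_cols for _ in range(n_rows)]
    counts = [[0] * n_cols for _ in range(n_rows)]
    for lat, lon, value in samples:
        # Floor division picks the cell that contains the sample.
        row = int((lat - lat_min) // cell_size)
        col = int((lon - lon_min) // cell_size)
        # Samples outside the grid extent are ignored.
        if 0 <= row < n_rows and 0 <= col < n_cols:
            sums[row][col] += value
            counts[row][col] += 1
    # Cells that received no sample get the fill value.
    return [
        [sums[r][c] / counts[r][c] if counts[r][c] else fill_value for c in range(n_cols)]
        for r in range(n_rows)
    ]
```

Empty cells are a natural result of swath geometry, which is why the sketch carries an explicit fill value.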

Support for NPP/NPOESS by The HDF Group

Priorities for Data accessibility and usability
- Developed library of high level APIs to support NPP/NPOESS data management
- Modified h5dump to display region references
- Modified HDFView to view object and region references and quality flags
- System maintenance
- User support

NPOESS Project Information
- Project Web site

HDF4 LAYOUT MAPS

HDF4 Layout Map Project
- Problem: long-term readability of HDF data depends on long-term availability of software
- Proposed solution: create a map of the layout of data objects in an HDF file, allowing a simple reader to be written to access the data
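A "simple reader" driven by a layout map might look like the sketch below. The map entry here is a hypothetical dictionary with an offset, an element count, a type code, and a byte order; the project's actual map format is not described in these slides, and the function name is made up for illustration.

```python
import struct


def read_with_layout_map(raw: bytes, entry: dict) -> list:
    """Read one data object from raw file bytes using a layout-map entry (illustrative only).

    entry keys (all hypothetical): "offset" (bytes from file start), "count"
    (number of elements), "format" (a struct type code such as "h" or "f"),
    and "byte_order" ("<" for little-endian, ">" for big-endian).
    """
    fmt = f'{entry["byte_order"]}{entry["count"]}{entry["format"]}'
    # unpack_from reads at the mapped offset, so no HDF library is required.
    return list(struct.unpack_from(fmt, raw, entry["offset"]))
```

The point of the approach is visible in the signature: given the bytes and the map, the reader needs nothing beyond basic file I/O and number decoding.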

TRANSFORMING THE GEOCOMPUTATIONAL BATTLESPACE FRAMEWORK WITH HDF5
A Project with the Army Geospatial Center

Data Challenges
[Figure: data challenges for military decision making. Labels: wide variety, satellite, Buckeye, culture, large scale, high efficiency, high res., stream, accuracy, time.]

BIOHDF: TOWARD SCALABLE BIOINFORMATICS INFRASTRUCTURES
NIH STTR with Geospiza, Seattle WA

Next Generation DNA Sequencing
NGS is Powerful
- “Genome center in a mail room”
- “Democratizing genomics”
- “Changing the landscape”
- “Transforms today’s biology”

… And Daunting
- “Prepare for the deluge”
- “Byte-ing off more than you can chew”

BioHDF Project
- Goal: move bioinformatics problems from organizing and structuring data to asking questions and visualizing data
- Develop data models and tools to work with NGS data in HDF5
- Create HDF5 domain-specific extensions and library modules to support the unique aspects of NGS data (BioHDF)
- Integrate BioHDF technologies into Geospiza products
- Deliver core BioHDF technologies to the community as open-source software

Thank You All and Thank You NASA!

Acknowledgements
This report is based on work supported by cooperative agreement number NNX08AO77A from the National Aeronautics and Space Administration (NASA). Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author[s] and do not necessarily reflect the views of the National Aeronautics and Space Administration.

Questions/comments?