Update on HDF5 1.8 The HDF Group HDF and HDF-EOS Workshop X November 28, 2006HDF.

Slides:



Advertisements
Similar presentations
A PLFS Plugin for HDF5 for Improved I/O Performance and Analysis Kshitij Mehta 1, John Bent 2, Aaron Torres 3, Gary Grider 3, Edgar Gabriel 1 1 University.
Advertisements

INSTRUCTION SET ARCHITECTURES
Dr. Kalpakis CMSC 661, Principles of Database Systems Representing Data Elements [12]
The Binary Numbering Systems
File Systems.
The HDF Group November 3-5, 2009HDF/HDF-EOS Workshop XIII1 HDF5 Advanced Topics Elena Pourmal The HDF Group The 13 th HDF and HDF-EOS.
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
HDF Update Mike Folk The HDF Group HDF and HDF-EOS Workshop X November 29, 2006HDF.
NetCDF An Effective Way to Store and Retrieve Scientific Datasets Jianwei Li 02/11/2002.
HDF4 and HDF5 Performance Preliminary Results Elena Pourmal IV HDF-EOS Workshop September
Indexing Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata.
September 9, 2008SPEEDUP Workshop - HDF5 Tutorial1 New Features in HDF5.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
CHP - 9 File Structures. INTRODUCTION In some of the previous chapters, we have discussed representations of and operations on data structures. These.
HDF5 Tools Update Peter Cao - The HDF Group November 6, 2007 This report is based upon work supported in part by a Cooperative Agreement.
Parallel HDF5 Introductory Tutorial May 19, 2008 Kent Yang The HDF Group 5/19/20081SCICOMP 14 Tutorial.
HDF 1 HDF5 Advanced Topics Object’s Properties Storage Methods and Filters Datatypes HDF and HDF-EOS Workshop VIII October 26, 2004.
The HDF Group April 17-19, 2012HDF/HDF-EOS Workshop XV1 Introduction to HDF5 Barbara Jones The HDF Group The 15 th HDF and HDF-EOS Workshop.
9/17/2015The HDF Group1 HDF Update Mike Folk The HDF Group HDF and HDF-EOS Workshop XI November 7, 2007.
1 High level view of HDF5 Data structures and library HDF Summit Boeing Seattle September 19, 2006.
NASA EOS DATA COMPRESSION WITH HDF5 SCALEOFFSET FILTER This work was funded by the NASA Earth Science Technology Office under NASA award AIST and.
HDF5 A new file format & software for high performance scientific data management.
Important ESDIS 2009 tasks review Kent Yang, Mike Folk The HDF Group April 1st, /1/20151Annual briefing to ESDIS.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
May 30-31, 2012HDF5 Workshop at PSI1 HDF5 at Glance Quick overview of known topics.
The HDF Group HDF5 Datasets and I/O Dataset storage and its effect on performance May 30-31, 2012HDF5 Workshop at PSI 1.
HDF 1 New Features in HDF Group Revisions HDF and HDF-EOS Workshop IX November 30, 2005.
April 28, 2008LCI Tutorial1 Introduction to HDF5 Tools Tutorial Part II.
The HDF Group HDF5 Tools Updates Peter Cao, The HDF Group September 28-30, 20101HDF and HDF-EOS Workshop XIV.
HDF Dimension Scales in HDF5 HDF-EOS Workshop IX San Francisco, CA November 30 - December 2, 2005 Pedro Vicente Nunes THG/NCSA Champaign-Urbana, IL HDF.
Support for NPP/NPOESS by The HDF Group Mike Folk The HDF Group HDF and HDF-EOS Workshop XII October 17, 2008 Oct HDF and HDF-EOS Workshop XII1.
11/7/2007HDF and HDF-EOS Workshop XI, Landover, MD1 HDF5 Software Process MuQun Yang, Quincey Koziol, Elena Pourmal The HDF Group.
October 15, 2008HDF and HDF-EOS Workshop XII1 What will be new in HDF5?
Operating Systems COMP 4850/CISG 5550 File Systems Files Dr. James Money.
1 N-bit and ScaleOffset filters MuQun Yang National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Urbana, IL
1 HDF5 Life cycle of data Boeing September 19, 2006.
Module 4.0: File Systems File is a contiguous logical address space.
May 30-31, 2012 HDF5 Workshop at PSI May Shared Object Headers Dana Robinson The HDF Group Efficient Use of HDF5 With High Data Rate X-Ray Detectors.
- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.
The HDF Group Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal, Peter Cao The HDF Group November 5, 2009 November 3-5,
Overview of Previous Lesson(s) Over View  A program must be translated into a form in which it can be executed by a computer.  The software systems.
1 Error Handling Interface HDF-EOS Workshop IX Quincey Koziol and Ray Lu 30 Nov 2005.
September 9, 2008SPEEDUP Workshop - HDF5 Tutorial1 Introduction to HDF5 Command-line Tools.
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
The HDF Group HDF5 Chunking and Compression Performance tuning 10/17/15 1 ICALEPCS 2015.
Lecture 10 Page 1 CS 111 Summer 2013 File Systems Control Structures A file is a named collection of information Primary roles of file system: – To store.
© Janice Regan, CMPT 300, May CMPT 300 Introduction to Operating Systems File systems.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
Intro to Parallel HDF5 10/17/151ICALEPCS /17/152 Outline Overview of Parallel HDF5 design Parallel Environment Requirements Performance Analysis.
NTFS Filing System CHAPTER 9. New Technology File System (NTFS) Started with Window NT in 1993, Windows XP, 2000, Server 2003, 2008, and Window 7 also.
Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal The HDF Group Annual HDF Briefing to ESDIS March 31, 2009 March Annual HDF Briefing.
File Systems May 12, 2000 Instructor: Gary Kimura.
The HDF Group Introduction to HDF5 Session 7 Datatypes 1 Copyright © 2010 The HDF Group. All Rights Reserved.
Copyright © 2010 The HDF Group. All Rights Reserved1 Data Storage and I/O in HDF5.
HDF and HDF-EOS Workshop XII
Storage and File Organization
Moving from HDF4 to HDF5/netCDF-4
University of Central Florida COP 3330 Object Oriented Programming
HDF5 Metadata and Page Buffering
File System Structure How do I organize a disk into a file system?
OpenStorage API part II
Operation System Program 4
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
Peter Cao The HDF Group November 28, 2006
Moving applications to HDF
File Storage and Indexing
Hierarchical Data Format (HDF) Status Update
Real-World File Structures
HDF5 Tools Updates and Discussions
Chapter 5 File Systems -Compiled for MCA, PU
Presentation transcript:

Update on HDF5 1.8 The HDF Group HDF and HDF-EOS Workshop X November 28, 2006HDF

Why HDF5 1.8?

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD3 … as we know, there are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns -- the ones we don't know we don't know. Donald Rumsfeld

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD4 Some things we knew we knew Need high level APIs – image, etc. Need more datatypes - packed n-bit, etc. Need external and other links Tools needed – h5pack, etc. Caching embellishments Eventually, multithreading

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD5 Things we knew we did not know New requirements from EOS and ASCI New applications that would use HDF5 How HDF5 would really perform in parallel What new tools, features and options needed New APIs, API features

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD6 Things we didn’t know we didn’t know Completely unanticipated applications New data types and structures E.g. DNA sequences New operations E.g. write many real-time streams simultaneously

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD7 HDF5 1.8 topics Dataset and datatype improvements Group improvements Link Revisions Shared object header nessages Metadata cache improvements Other improvements Platform-specific changes High level APIs Parallel HDF5 Tool improvements

Dataset and Datatype Improvements

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD9 Text-based data type descriptions Why: Simplify datatype creation Make datatype creation code more readable Facilitate debugging by printing the text description of a data type What: New routine to create a data type through the text description of the data type: H5LTdtype_to_text

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD10 Text data type description – Example Create a datatype of compound type. /* Create the data type with text description */ ( dtype = H5Ttext_to_type( “typedef struct foo {int a; float b;} foo_t;”) “typedef struct foo {int a; float b;} foo_t;”) /* Convert the data type back to text */ H5Ttype_to_text(dtype, NULL, H5T_C, &tsize)

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD11 Serialized datatypes and dataspaces Why: Allow datatype and dataspace info to be transmitted between processes Allow datatype/dataspace to be stored in non- HDF5 files What: A new set of routines to serialize/deserialize HDF5 datatypes and dataspaces.

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD12 Int to float convert during I/O Why: Convert ints to floats during I/O What: Int to float conversion supported during I/O

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD13 Revised conversion exception handling Why: Give apps greater control over exceptions (range errors, etc.) during datatype conversion. What: Revised conversion exception handling

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD14 Revised conversion exception handling To handle exceptions during conversions, register handling function through H5Pset_type_conv_cb(). Cases of exception: H5T_CONV_EXCEPT_RANGE_HI H5T_CONV_EXCEPT_RANGE_LOW H5T_CONV_EXCEPT_TRUNCATE H5T_CONV_EXCEPT_PRECISION H5T_CONV_EXCEPT_PINF H5T_CONV_EXCEPT_NINF H5T_CONV_EXCEPT_NAN Return values: H5T_CONV_ABORT, H5T_CONV_UNHANDLED, H5T_CONV_HANDLED

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD15 Compression filter for n-bit data Why: Compact storage for user-defined datatypes What: When data stored on disk, padding bits chopped off and only significant bits stored Supports most datatypes Works with compound datatypes

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD16 N-bit compression example In memory, one value of N-Bit datatype is stored like this: | byte 3 | byte 2 | byte 1 | byte 0 | |????????|????SPPP|PPPPPPPP|PPPP????| S-sign bit P-significant bit ?-padding bit After passing through the N-Bit filter, all padding bits are chopped off, and the bits are stored on disk like this: | 1st value | 2nd value | |SPPPPPPP PPPPPPPP|SPPPPPPP PPPPPPPP|... Opposite (decompress) when going from disk to memory

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD17 Offset+size storage filter Why: Use less storage when less precision needed What: Performs scale/offset operation on each value Truncates result to fewer bits before storing Currently supports integers and floats Example H5Pset_scaleoffset (dcr,H5Z_SO_INT,H5Z_SO_INT_MINBITS_DEFAULT); H5Dcreate(……, dcr) H5Dwrite (…);

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD18 Example with floating-point type Data: { , , , } Choose scaling factor: decimal precision to keep E.g. scale factor D = 2 1. Find minimum value (offset): Subtract minimum value from each element Result: {5.102, 0, 1.086, 6.185} 3. Scale data by multiplying 10 D = 100 Result: {510.2, 0, 108.6, 618.5} 4. Round the data to integer Result: {510, 0, 109, 619} 5. Pack and store using min number of bits

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD19 “NULL” Dataspace Why: Allow datasets with no elements to be described NetCDF 4 needed a “place holder” for attributes What: A dataset with no dimensions, no data

Group improvements

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD21 Access links by creation-time order Why: Allow iteration & lookup of group’s links (children) by creation order as well as by name order Support netCDF access model for netCDF 4 What: Option to access objects in group according to relative creation time

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD22 “Compact groups” Why: Save space and access time for small groups If groups small, don’t need B-tree overhead What: Alternate storage for groups with few links Example File with 11,600 groups With original group structure, file size ~ 20 MB With compact groups, file size ~ 12 MB Total savings: 8 MB (40%) Average savings/group: ~700 bytes

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD23 Better large group storage Why: Faster, more scalable storage and access for large groups What: New format and method for storing groups with many links

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD24 Intermediate group creation Why: Simplify creation of a series of connected groups Avoid having to create each intermediate group separately, one by one What: Intermediate groups can be created when creating an object in a file, with one function call

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD25 Example: add intermediate groups Want to create “/A/B/C/dset1” “A” exists, but “B/C/dset1” do not / A / A B dset1 C H5Dcreate(file_id, “/A/B/C/dset1”,..) One call creates groups “B” & “C”, then creates “dset1”

Link Revisions

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD27 What are links? Links connect groups to their members “Hard” links point to a target by address “Soft” links store the path to a target root group Hard link dataset Soft link “/target dataset”

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD28 file2.h5 file1.h5 New: external Links Why: Access objects by file & path within file What: Store location of file and path within that file Can link across files root group “dataset EL” “file2.h5” “target dataset” root group dataset “target dataset”

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD29 New: User-defined Links Why: Allow applications to create their own kinds of links and link operations, such as Create “hard” external link that finds an object by address Create link that accesses a URL Keep track of how often a link accessed, or other behavior What: App can create new kinds of links by supplying custom callback functions Can do anything HDF5 hard, soft, or external links do

Shared Object Header Messages

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD31 Shared object header messages Why: metadata duplicated many times, wasting space Example: You create a file with 10,000 datasets All use the same datatype and dataspace HDF5 needs to write this information 10,000 times! Dataset 1 data 1 datatype dataspace Dataset 2 data 2 datatype dataspace Dataset 3 data 3 datatype dataspace

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD32 Shared object header messages What: Enable messages to be shared automatically HDF5 shares duplicated messages on its own! Dataset 1 data 1 datatype dataspace Dataset 2 data 2

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD33 Shared Messages Happens automatically Works with datatypes, dataspaces, attributes, fill values, and filter pipelines Saves space if these objects are relatively large May be faster if HDF5 can cache shared messages Drawbacks Usually slower than non-shared messages Adds overhead to the file Index for storing shared datatypes 25 bytes per instance Older library versions can’t read files with shared messages

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD34 Two informal tests File with 24 datasets, all with same big datatype 26,000 bytes normally 17,000 bytes with shared messages enabled Saves 375 bytes per dataset But, make a bad decision: invoke shared messages but only create one dataset… 9,000 bytes normally 12,000 bytes with shared messages enabled Probably slower when reading and writing, too. Moral: shared messages can be a big help, but only in the right situation!

Metadata cache improvements

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD36 Metadata Cache improvements Why: Improve I/O performance and memory usage when accessing many objects What: New metadata cache APIs control cache size monitor actual cache size and current hit rate Under the hood: adaptive cache resizing Automatically detects the current working size Sets max cache size to the working set size

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD37 Metadata cache improvements Note: most applications do not need to worry about the cache See “Advanced topics” for details And if you do see unusual memory growth or poor performance, please contact us. We want to help you.

Other improvements

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD39 New extendible error-handling API Why: Enable app to integrate error reporting with HDF5 library error stack What: New error handling API H5Epush - push major and minor error ID on specified error stack H5Eprint – print specified stack H5Ewalk – walk through specified stack H5Eclear – clear specified stack H5Eset_auto – turn error printing on/off for specified stack H5Eget_auto – return settings for specified stack traversal

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD40 Extendible ID API A ID management routines allow an application to use the HDF5 ID-to-object mapping routines

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD41 Attribute improvements Why: Use less storage when large numbers of attributes attached to a single object Iterate over or look up attributes by creation order What: Property to create index on the order in which the attributes are created Improved attribute storage

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD42 Support for Unicode Character Set Why: So apps can create names using Unicode netCDF 4 needed this What UTF-8 Unicode encoding now supported For string datatypes, names of links and attributes Example: H5Pset_char_encoding(lcpl_id, H5T_CSET_UTF8) H5Llink(file_id, "UTF-8 name", …, lcpl_id, …);

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD43 Efficient copying of HDF5 objects Why: Enable apps to copy objects efficiently What New routines to copy an object in an HDF5 file within the current file or to another file Done at a low-level in the HDF5 file, allowing Entire group hierarchies to be copied quickly Compressed datasets to be copied without going through a decompression/compression cycle

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD44 Performance of object copy routines

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD45 Data transformation filter Why: Apply arithmetic operations to data during I/O What: Data transformation filter Transform expressed by algebraic formula Only +, -, *, and /supported Example: Expression parameter set, such as x*(x-5) When dataset read/written, x*(x-5) applied per element When reading, values in file are unchanged When writing, transformed data written to file

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD46 Stackable Virtual File Drivers What is Virtual File Driver (VFD)?

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD47 Virtual file I/O (C only)  Perform byte-stream I/O operations (open/close, read/write, seek)  User-implementable I/O (stdio, network, memory, etc.) Virtual file I/O (C only)  Perform byte-stream I/O operations (open/close, read/write, seek)  User-implementable I/O (stdio, network, memory, etc.) Library internals Performs data transformations and other prep for I/O Configurable transformations (compression, etc.) Library internals Performs data transformations and other prep for I/O Configurable transformations (compression, etc.) Structure of HDF5 Library Object API (C, Fortran 90, Java, C++)  Specify objects and transformation properties  Invoke data movement operations and data transformations Object API (C, Fortran 90, Java, C++)  Specify objects and transformation properties  Invoke data movement operations and data transformations

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD48 Stackable VFD HDF5 VFD allows Storing data using different physical file layout. E.g., Family VFD (writes file as “family of files”) Doing different types of I/O. E.g., stdio (standard I/O); MPI-I/O (for parallel I/O)

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD49 Stackable VFD Why “stackable:” Before now, only one VFD could be used at a time VFDs could not inter-operative What is “stackable:” A Non-terminal VFD may stack on top of compatible non-terminal and eventually Terminal VFD’s Two kinds of VFD Non-terminal (e.g. Family) Terminal (e.g. stdio; MPI-I/O)

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD50 Stackable VFD HDF5 Files Application HDF5 API stdio Family Filesplit mpiio Sec2 Default I/O path Terminal VFD Non-terminal VFD metadatarawdata

Platform-specific changes

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD52 Platform-specific changes Why: Better UNIX/Linux Portability What: 1.8 uses latest GNU “auto” tools (autoconf, automake, libtool) improves portability between many machine and OS configurations Build can now be done in parallel with gmake “–j” flag speeds up build, test and install processes Build infrastructure includes many other improvements as well

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD53 Platforms to be dropped Operating systems HPUX MAC OS 10.3 AIX 5.1 and 5.2 SGI IRIX Linux 2.4 Solaris 2.8 and 2.9 Compilers GNU C compilers older than 3.4 (Linux) Intel 8.* PGI V. 5.*, 6.0 MPICH

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD54 Platforms to be added Systems Alpha Open VMS MAC OSX 10.4 (Intel) Solaris 2.* on Intel (?) Cray XT3 Windows 64-bit (32-bit binaries) Linux 2.6 BG/L Compilers g95 PGI V. 6.1 Intel 9.* MPICH MPICH2

High level APIs

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD56 High-Level Fortran APIs Fortran APIs have been added for H5Lite, H5Image and H5Table.

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD57 Dimension scales Similar to Dimension scales in HDF4 Coordinate variables in netCDF What is a dimension scale ? An HDF5 dataset with additional metadata that identifies the dataset as a “Dimension Scale” Associated with dimensions of HDF5 datasets Meaning of the association is left to applications A Dimension scale can be shared by two or more dataset dimensions

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD58 Dimension scales example HDF Explorer image

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD59 Dimension scales example HDF Explorer image

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD60 Sample dimension scale functions H5DSset_scale: convert dataset to a dimension scaleH5DSset_scale: convert dataset to a dimension scale H5DSattach_scale: attach scale to a dimensionH5DSattach_scale: attach scale to a dimension H5DSdetach_scale: detach scale from a dimensionH5DSdetach_scale: detach scale from a dimension H5DSis_attached: verify if scale attached to datasetH5DSis_attached: verify if scale attached to dataset H5DSget_scale_name: read name of scaleH5DSget_scale_name: read name of scale

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD61 HDF5Packet Why: High performance table writing For data acquisition, when there are many sources of data E.g. flight test What: Each row is a “packet”: a collection of fields, fixed or variable length Append only Indexed retrieval

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD62 Packets in HDF Data Variable-length records Fixed-length data records Time......

Parallel HDF5

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD64 Collective I/O improvements Why Collective I/O not available for chunked data Collective I/O not available for complex selections Collective I/O is key to improving performance for parallel HDF5 What Collective I/O works for chunked storage Works for irregular selections for both chunked and contiguous storage

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD65 Parallel h5diff (ph5diff) Compares two files in an MPI parallel environment. Compares multiple datasets simultaneously

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD66 Windows MPICH support Windows MPICH support: prototype

Tool improvements

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD68 New features for old tools h5dump Dump data in binary format Faster for files with large numbers of objects h5diff Can now compare dataset regions Parallel ph5diff now available h5repack Efficient data copy using H5Gcopy() Able to handle big datasets

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD69 New HDF5 Tools h5copy Copies a group, dataset or named datatype from one location to another Copies within a file or across files h5repart Partition file into a family of files h5import Import binary/ascii data into an HDF5 file h5check Verifies an HDF5 file against the defined HDF5 File Format Specification h5stat Reports statistics about a file and objects in a file

Thank You

Questions/comments?

Nov. 28, 2006HDF and HDF-EOS Workshop X, Landover MD72 For more information Go to Click on “Obtain HDF Alpha” Look at table “Information”

Acknowledgement This report is based upon work supported in part by a Cooperative Agreement with NASA under NASA NNG05GC60A. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Aeronautics and Space Administration.