NASA EOS DATA COMPRESSION WITH HDF5 SCALEOFFSET FILTER This work was funded by the NASA Earth Science Technology Office under NASA award AIST-02-0071 and.

Slides:

Advertisements

Similar presentations

The HDF Group Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC Mike.

Advertisements

The HDF Group July 8, Summer ESIP Federation Meeting How to Meet the CF Conventions with NcML for NASA HDF/HDF-EOS Hyo-Kyung.

Computer Engineering FloatingPoint page 1 Floating Point Number system corresponding to the decimal notation 1,837 * 10 significand exponent A great number.

HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)

An HDF5-WRF module -A performance report MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,

Datorteknik FloatingPoint bild 1 Floating point Number system corresponding to the decimal notation 1,837 * 10 significand exponent a great number of corresponding.

Prepared by: Hind J. Zourob Heba M. Matter Supervisor: Dr. Hatem El-Aydi Faculty Of Engineering Communications & Control Engineering.

The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.

Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Mike Folks, The HDF Group Ruth Duerr, NSIDC 1.

Binary Real Numbers. Introduction Computers must be able to represent real numbers (numbers w/ fractions) Two different ways:  Fixed-point  Floating-point.

HDF5 Tools Update Peter Cao - The HDF Group November 6, 2007 This report is based upon work supported in part by a Cooperative Agreement.

This material is based upon work supported by the U.S. Department of Homeland Security, Science and Technology Directorate, Office of University Programs,

HDF Windows Support MuQun Yang, Xuan Bai, Elena Pourmal, Barbara Jones, Pedro Vincent, Robert E. McGrath National Center for Supercomputing Applications.

Machine Instruction Characteristics

Fixed-Point Arithmetics: Part II

DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.

CH09 Computer Arithmetic  CPU combines of ALU and Control Unit, this chapter discusses ALU The Arithmetic and Logic Unit (ALU) Number Systems Integer.

Chapter 03 Data Representation. 2 Chapter Goals Distinguish between analog and digital information Explain data compression and calculate compression.

9.4 FLOATING-POINT REPRESENTATION

HDF Converting between HDF4 and HDF5 MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,

HDF Dimension Scales in HDF5 HDF-EOS Workshop IX San Francisco, CA November 30 - December 2, 2005 Pedro Vicente Nunes THG/NCSA Champaign-Urbana, IL HDF.

Support for NPP/NPOESS by The HDF Group Mike Folk The HDF Group HDF and HDF-EOS Workshop XII October 17, 2008 Oct HDF and HDF-EOS Workshop XII1.

11/7/2007HDF and HDF-EOS Workshop XI, Landover, MD1 HDF5 Software Process MuQun Yang, Quincey Koziol, Elena Pourmal The HDF Group.

Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps Ruth Duerr, NSIDC Christopher Lynnes, GES DISC The HDF Group Oct HDF and.

October 15, 2008HDF and HDF-EOS Workshop XII1 What will be new in HDF5?

HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII10/17/2008.

1 N-bit and ScaleOffset filters MuQun Yang National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Urbana, IL

HDF OPeNDAP Project Update MuQun Yang and Hyo-Kyung Lee The HDF Group March 31, Annual briefing to ESDIS10/31/2015.

A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.

The HDF Group HDF/HDF-EOS Workshop XV1 Tools to Improve the Usability of NASA HDF Data Kent Yang and Joe Lee The HDF Group April 17, 2012.

Pointers: Basics. 2 What is a pointer? First of all, it is a variable, just like other variables you studied  So it has type, storage etc. Difference:

- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.

The HDF Group Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal, Peter Cao The HDF Group November 5, 2009 November 3-5,

IDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF ).

1 Error Handling Interface HDF-EOS Workshop IX Quincey Koziol and Ray Lu 30 Nov 2005.

HDF Windows Support MuQun Yang, Xuan Bai, Elena Pourmal, Barbara Jones, Pedro Vincent, Robert E. McGrath National Center for Supercomputing Applications.

HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1 HDF and HDF-EOS Workshop XII10/17/2008.

Using a Friendly OPeNDAP Client Library to Access HDF5 Data MuQun Yang and Hyo-Kyung Lee (The HDF Group) 1 25th IIPS Conference01/14/2009.

HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,

Computer Engineering FloatingPoint page 1 Floating Point Number system corresponding to the decimal notation 1,837 * 10 significand exponent A great number.

HDF and HDF-EOS Workshop VII September 24, 2003 HDF5, HDF-EOS and Geospatial Data Archives Don Keefer Illinois State Geological Survey Mike Folk Univ.

An HDF5-WRF module -A performance report MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,

Introduction to Computer Organization & Systems Topics: Types in C: floating point COMP C Part III.

11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.

Leader Interviews Name, PhD Title, Organization University This project is funded by the National Science Foundation (NSF) under award numbers ANT

Support for NPP/NPOESS by The HDF Group Mike Folk, Elena Pourmal The HDF Group Annual HDF Briefing to ESDIS March 31, 2009 March Annual HDF Briefing.

Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package Christian Chilan, Kent Yang, Albert Cheng, Quincey Koziol, Leon Arber.

 Wind Power TEAK – Traveling Engineering Activity Kits Partial support for the TEAK Project was provided by the National Science Foundation's Course,

HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,

CSCI206 - Computer Organization & Programming

HDF and HDF-EOS Workshop XII

Plans for an Enhanced NetCDF-4 Interface to HDF5 Data

Floating Point Number system corresponding to the decimal notation

Discussion and Conclusion

| (269) | Western Michigan University

CSCI206 - Computer Organization & Programming

Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)

HDF Support for NASA Data Producers

Title of Poster Site Visit 2017 Introduction Results

People Who Did the Study Universities they are affiliated with

Title of session For Event Plus Presenters 12/5/2018.

Great Project Project 13 Lead investigator: I. Dunno, University of E. Missions Project manager: O. Perations, FAA September 27 & 28, 2018 Alexandria,

Results and Discussion

HDF5 Tools Updates and Discussions

Title of Poster Site Visit 2018 Introduction Results

LabVenture! Redesigning to Maximize Transfer Inside/Outside School

This material is based upon work supported by the National Science Foundation under Grant #XXXXXX. Any opinions, findings, and conclusions or recommendations.

1.6) Storing Integer: 1.7) storing fraction:

Presentation transcript:

NASA EOS DATA COMPRESSION WITH HDF5 SCALEOFFSET FILTER This work was funded by the NASA Earth Science Technology Office under NASA award AIST and UCAR Subaward No S Other support was provided based upon the Cooperative Agreement with the National Aeronautics and Space Administration (NASA) under NASA grant NNG05GC60A. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of NASA. For more information or documentation, contact Muqun Yang, Fang Guo, Xiaowen Wu, Robert E. McGrath, Quincey Koziol National Center for Supercomputing Applications University of Illinois, Urbana-Champaign ScaleOffset compression performs a scale and/or offset operation on each data value and truncates the resulting value to a minimum number of bits before storing it. The datatype is either integer or floating-point. Offset in Scale-Offset compression means the minimum value of a set of data values. If a fill value is defined for the dataset, the filter will ignore it when finding the minimum value. An example for Integer Type : Dataset : 32-bit integer array Maximum is 7065; Minimum is 2970 Maximum is 7065; Minimum is 2970 The "span" of dataset values = Max-Min+1 = 4076 Case 1 : No fill value is defined. Minimum number of bits per element to store = ceiling(log2(span)) = 12 Case2 : Fill value is defined in this array. Minimum number of bits per element to store = ceiling(log2(span+1)) = 13 Compression: 1. Subtract each element from minimum value. 2. Pack all data with minimum number of bits. Decompression: 1. Unpack all data. 2. Add each element to minimum value. Outcome: It can save about 60% disk space for this case. An example for Floating-point Type D-scaling factor: the decimal precision to keep for the filter output Floating-point data: { , , , } D-scaling factor: 2 Preparation for Compression Step 1: Determine the minimum value = Step 2: Subtract the minimum value from each data element Modified data: {5.102, 0, 1.086, 6.185} Step 3: Scale the data by multiplying 10 ^ D-scaling factor = 100 Modified data: {510.2, 0, 108.6, 618.5} Step 4: Round the data to integer Modified data: {510, 0, 109, 619} Compression and Decompression: using ScaleOffset for integer Restoration after decompression Step 1 : Divide each value by 10^ D-scaling factor Step 2: Add the offset The floating point data { , , , } Outcome: Produces lossy compression. Compression ratio will depend on D-scaling factor. Internal HDF5 filter Easy to understand - Integer: lossless - Floating-point: GRIB lossy compression Easy to control floating compression - D-scaling factor Easy to estimate the outcome How does ScaleOffset work? What is the ScaleOffset filter? Why use ScaleOffset filter? ScaleOffset Filter can gain better compression ratio for floating-point data. ScaleOffset Filter can use more encoding time and decoding time. Knowledge of data characteristics benefits the usage of ScaleOffset filter. EOS Data Compression Performance Comparison ScaleOffset vs. Deflate Filter Limitations Compressed floating-point data range is limited by the size of corresponding unsigned integer type. Long double is not supported. Conclusions