Kent Yang The HDF Group HDF Town Hall July 20, 2018

Slides:



Advertisements
Similar presentations
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Advertisements

The HDF Group July 8, Summer ESIP Federation Meeting How to Meet the CF Conventions with NcML for NASA HDF/HDF-EOS Hyo-Kyung.
The HDF Group ESIP Summer Meeting Easy access HDF files via Hyrax Kent Yang The HDF Group 1 July 8 – 11, 2014.
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Remote Access via OPeNDAP Kent Yang and Joe Lee The HDF Group The 14 th HDF/HDF-EOS Workshop.
The HDF Group HDF Group Support for NPP/JPSS Mike Folk, Elena Pourmal, Larry Knox, Albert Cheng The HDF Group The 15 th HDF and HDF-EOS.
The HDF Group ESIP Summer Meeting HDF-Java Overview Joel Plutchak The HDF Group 1 July 8 – 11, 2014.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.)
The HDF Group Apr , 2012HDF/HDF-EOS Workshop XV1 Interoperability with netCDF-4 Kent Yang, Larry Knox, Elena Pourmal The HDF Group.
1 Generalized Conversion of HDF-EOS Products to GIS Compatible Formats Larry Klein, Ray Milburn, Cid Praderas, and Abe Taaheri Emergent Information Technologies,
Support EOS: Review and Discussions Kent Yang and Joe Lee The HDF Group October 16, 2012 Oct. 16, 2012Annual HDF Briefing to ESDIS1.
The HDF Group HDF/HDF-EOS Workshop XIV1 Easy Access of NASA HDF data via OPeNDAP Kent Yang and Joe Lee The HDF Group September 28,2010.
The HDF Group July 8, 2014HDF 2014 ESIP Summer Meeting HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann The.
What is HDF-EOS? Information compiled from HDF-EOS Workshop II HDF-EOS Workshop III, 1999 ESDIS Project, Code 423 NASA/Goddard Space Flight Center Greenbelt.
Developing a NetCDF-4 Interface to HDF5 Data
EOSDIS User survey follow-up Mike Folk, Kent Yang, Elena Pourmal The HDF Group Oct. 17, 2012 Annual HDF Briefing to ESDIS1.
1 HDF-EOS and Related Tools Status Update. 2 Overview.
The HDF Group ESIP Summer Meeting HDF OPeNDAP update Kent Yang The HDF Group 1 July 8 – 11, 2014.
Important ESDIS 2009 tasks review Kent Yang, Mike Folk The HDF Group April 1st, /1/20151Annual briefing to ESDIS.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Improving the usability of HDF-EOS2 data Kent Yang, Joe Lee, Choonghwan Lee The HDF Group March 31 st, /26/2016Annual briefing to ESDIS1.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
HDF Converting between HDF4 and HDF5 MuQun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University of Illinois,
Tools for Interoperability between HDF and NetCDF Mike Folk and MuQun Yang The HDF Group The HDF Group provides the following tools for the NASA HDF and.
1/14/200925th IIPS Conference 1 Challenges to Archive and Access NASA HDF-EOS Data in the long Term MuQun Yang (The HDF Group) Choonghwan Lee (The HDF.
Page 1 Status of HDF-EOS, Related Software, and Tools Abe Taaheri, Raytheon IIS HDF & HDF-EOS Workshp XIII Riverdale, MD November 4, 2009.
1 HDF-EOS Development Current Status and Schedule Larry Klein, Shen Zhao, Abe Taaheri and Ray Milburn L-3 Communications Government Services, Inc. September.
The HDF Group November 3-5, 2009 HDF-OPeNDAP Project Update HDF/HDF-EOS Workshop XIII1 Joe Lee and Kent Yang The HDF Group James Gallagher.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII10/17/2008.
1 HDF-EOS Status, Related Tools and Issues. 2 Overview.
HDF OPeNDAP Project Update MuQun Yang and Hyo-Kyung Lee The HDF Group March 31, Annual briefing to ESDIS10/31/2015.
September 4, 2003MODIS Ocean Data Products Workshop, Oregon State University1 Goddard Earth Sciences (GES) Distributed Active Archive Center (DAAC) MODIS.
A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.
The HDF Group HDF/HDF-EOS Workshop XV1 Tools to Improve the Usability of NASA HDF Data Kent Yang and Joe Lee The HDF Group April 17, 2012.
- 1 - HDF5, HDF-EOS and Geospatial Data Archives HDF and HDF-EOS Workshop VII September 24, 2003.
NetCDF file generated from ASDC CERES SSF Subsetter ATMOSPHERIC SCIENCE DATA CENTER Conversion of Archived HDF Satellite Level 2 Swath Data Products to.
HDF4 OPeNDAP Project Progress Report MuQun Yang and Hyo-Kyung Lee 1 HDF Developers' Meeting11/24/2015.
Provenance in Earth Science Gregory Leptoukh NASA GSFC.
12/2/2015Fall 2002 AGU Meeting1 Generalized EOS Data Converter: Making Data Products Accessible to GIS Tools Larry Klein, Ray Milburn, Cid Praderas and.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1 HDF and HDF-EOS Workshop XII10/17/2008.
Using a Friendly OPeNDAP Client Library to Access HDF5 Data MuQun Yang and Hyo-Kyung Lee (The HDF Group) 1 25th IIPS Conference01/14/2009.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
1 Status of HDF-EOS, Related Software and Tools. 2 TOOLKIT / HDF-EOS Support.
Robert Wolfe NASA Goddard Space Flight Center Code 614.5, Greenbelt, MD Robert Wolfe NASA Goddard Space Flight Center Code 614.5,
HDF and HDF-EOS Workshop VII September 24, 2003 HDF5, HDF-EOS and Geospatial Data Archives Don Keefer Illinois State Geological Survey Mike Folk Univ.
10/16/2012Annual HDF briefing1 HDF OPeNDAP support Kent Yang, Joe Lee, Mike Folk The HDF Group Oct. 16, 2012.
11/8/2007HDF and HDF-EOS Workshop XI, Landover, MD1 Software to access HDF5 Datasets via OPeNDAP MuQun Yang, Hyo-Kyung Lee The HDF Group.
GES DISC Experience with HDF Formats for MeaSUREs Projects by James Johnson NASA/GES DISC (Wyle Inc.) April 17, 2012.
MODIS Data at NSIDC MODIS Science Team Meeting - Nov. 2, 2006.
NPP DataVisualization using McIDAS-V NPP DataVisualization using McIDAS-V Tommy Jasmin, Tom Rink, and Tom Achtor
NPP DataVisualization using McIDAS-V NPP DataVisualization using McIDAS-V Tommy Jasmin, Tom Rink, and Tom Achtor
Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package Christian Chilan, Kent Yang, Albert Cheng, Quincey Koziol, Leon Arber.
HDF5 OPeNDAP Project Update and Demo MuQun Yang and Hyo-Kyung Lee (The HDF Group) James Gallagher (OPeNDAP, Inc.) 1HDF and HDF-EOS Workshop XII, Aurora,
Making Satellite Datasets Accessible for Everyone A look into my NASA Internship – summer 2015 Aaron Scott University of North Dakota.
NASA Earth Science Data Stewardship
Can Data be Organized for Science and Reuse?
Adding CF Attributes to an HDF5 File
Data Are from Mars, Tools Are from Venus
Transition from HDF4 to HDF5: Issues
Moving from HDF4 to HDF5/netCDF-4
CERES Data Management Team
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Efficiently serving HDF5 via OPeNDAP
Rui Wu, Jose Painumkal, Sergiu M. Dascalu, Frederick C. Harris, Jr
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
HDF Support for NASA Data Producers
Status for Endeavor 6: Improved Scientific Data Access Infrastructure
HDF5 Performance Enhancements with the Elimination of Unlimited Dimension Debbie Mao, Daniel Ziskin, Merritt Deeter, Sara Martinez-alonso MOPITT is an.
HDF5 Tools Updates and Discussions
Presentation transcript:

Kent Yang The HDF Group HDF Town Hall July 20, 2018 NASA Terra Data Fusion Kent Yang The HDF Group HDF Town Hall July 20, 2018 July 20th, 2018 HDF TownHall

The Terra Data Fusion Project Team Department of Atmospheric Sciences, University of Illinois Larry Di Girolamo Guangyu Zhao Yizhe Zhan Landon Clipp Shashank Bansal Yat Long Lo Dongwei Fu Brandon Chen Department of Geography and GIS, University of Illinois Shaowen Wang Yan Liu Yizhao Gao The HDF Group MuQun (Kent) Yang H Joe Lee National Center for Supercomputing Applications, University of Illinois John Towns Kandace Turner Michelle Butler Sean Stevens David Ralia Jonathan Kim Donna Cox Stuart Levy Robert Patterson Andrew Christiensen Department of Atmospheric Sciences, Texas A&M Ping Yang Hioki Souichiro Yi Wang NASA Langely/SSAI Lusheng Liang NASA Goddard Space Flight Center Ralph Kahn Jim Limbacher July 20th, 2018 HDF TownHall

EOS Terra Flagship mission Launched in 1999 Projection ends in 2022 Longest single satellite climate record One of the most popular Earth Science satellite data Five instruments ASTER,CERES, MISR,MODIS,MOPITT Credits: NASA July 20th, 2018 HDF TownHall

Terra, in 2015 alone… More than 360 million files… Totaling more than 3.4 PB data… Delivered to more than 100,000 users around the world. More than 1,800 peer-reviewed publications (over 15,000 to date) Results from Terra cited more than 49,000 times (over 250K to date) “The high publication rate includes an increasing number of papers capitalizing on fusion of data among Terra sensors” – NASA Senior Review 2017: Terra

Terra Data Fusion Project Fuse existing Level 1B Terra radiance products from all Terra 5 instruments into one product July 20th, 2018 HDF TownHall

Scientific value added by data fusion of its five instruments. Why Terra Data Fusion? Scientific value added by data fusion of its five instruments. A key recommendation from the 2007 NRC Decadal Survey on Earth Science and Application from Space: “…experts should... focus on providing comprehensive data sets that combine measuremements from multiple sensors.” July 20th, 2018 HDF TownHall

Challenges for Terra Data Fusion Huge data volumes 1 PB input data from year 2000 to 2015 Need adequate cyberinfrastructure to tackle Input data residing at different locations Need to transfer huge data volumes July 20th, 2018 HDF TownHall

Solutions NCSA supercomputer clusters Blue Waters and other clusters were used for Terra Data fusion NCSA nearline tape archive system is used to store the input and fusion data NCSA experts helped transfer the huge input data to NCSA supercomputer facilities NCSA Blue Waters July 20th, 2018 HDF TownHall

More Challenges Input data Complicate fusion file organization Different granularities Different methods to store radiance and geo-location data Different file formats Complicate fusion file organization Metadata conventions need to catch up Overcoming these challenges is what The HDF Group contributed the most! July 20th, 2018 HDF TownHall

Different Instrument Granularities Map granules from different instruments to a common granule that contains data for a single Terra orbit. Contain multiple MODIS and ASTER input granules Subset CERES and MOPITT input granules July 20th, 2018 HDF TownHall

Different Methods to Store Data Unpacking MODIS, ASTER and MISR radiation data to physical units Need to unpack the data by following the specific packing schemes of individual instruments Interpolating MODIS, ASTER and MISR geolocation data to native radiance resolution Need to handle each instrument differently July 20th, 2018 HDF TownHall

Different File Formats of Input Granules All converted to HDF5 file format From HDF4, HDF-EOS2 and HDF-EOS5 Also netCDF-4 compatible Following netCDF-4 enhanced data model July 20th, 2018 HDF TownHall

Complicate Fusion file organization Use HDF5 group structure to organize different instruments and different input granules Each instrument represented by one group Each input granule stored as the subgroup of the instrument group July 20th, 2018 HDF TownHall

Metadata Conventions Catch-up Make the fusion HDF5 file follow CF conventions by adding key CF attributes Units Coordinates _FillValue Valid_min Valid_max July 20th, 2018 HDF TownHall

More usage of HDF5 features HDF5 chunking and compression are used to reduce the total fusion file size. July 20th, 2018 HDF TownHall

Fusion File Statistics About 1 million input files. 84,303 files – from Feb 25 2000 to Dec 31 2015 The total file size is 2.3 petabytes. Typical file sizes 15GB – 40GB. The largest file size is 68.7GB. Average file size is 26GB. HDF5 in-memory compression reduces the total file size by 60%. July 20th, 2018 HDF TownHall

Fusion File Statistics July 20th, 2018 HDF TownHall

Fusion HDF5 File Layout in HDFView Note the file hierarchy according to individual instrument July 20th, 2018 HDF TownHall

Fusion HDF5 File Layout in CDL Note the netCDF-CF information in the CDL. Dimension names and CF attributes. July 20th, 2018 HDF TownHall

Fusion file visualized in Panoply July 20th, 2018 HDF TownHall

Other Work Validate the generated data to ensure the high quality fusion product Implemented the advanced fusion resampling and reprojection tool Resample / reproject the radiance fields for one Terra instrument onto the grids used by another Terra instrument Generated the NASA CMR-compliant fusion Collection and granule metadata in ECHO 10 XML format May expand if more time is given. The metadata information can be added easily. July 20th, 2018 HDF TownHall

Fusion data visualization demo July 20th, 2018 HDF TownHall

Thank You! July 20th, 2018 HDF TownHall

This work was supported by NASA ACCESS Grant #NNX16AM07A. Acknowledgements This work was supported by NASA ACCESS Grant #NNX16AM07A. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author[s] and do not necessarily reflect the views of NASA. July 20th, 2018 HDF TownHall

Questions/comments? July 20th, 2018 HDF TownHall