HDF - 1 - Mike Folk National Center for Supercomputing Applications Science Data Processing Workshop February 26-28, 2002 HDF Update HDF.

Slides:



Advertisements
Similar presentations
Integrating ChemAxon technology into your End User Applications Java solutions for cheminformatics Ver. Mar., 2005.
Advertisements

How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
1 Projection Indexes in HDF5 Rishi Rakesh Sinha The HDF Group.
HDF and HDF-EOS Workshop VII, September 23-25, This work is supported in part by a Cooperative Agreement with the National Aeronautics and Space.
® Page 1 Intel Compiler Lab – Intel Array Visualizer HDF Workshop VI December 5, 2002 John Readey
Streaming NetCDF John Caron July What does NetCDF do for you? Data Storage: machine-, OS-, compiler-independent Standard API (Application Programming.
HDF Update Mike Folk National Center for Supercomputing Applications
1 Desktop Support for HDF and HDF EOS Ted Meyer President & CEO fortnersoftware LLC (703) ext.119.
HDF4 and HDF5 Performance Preliminary Results Elena Pourmal IV HDF-EOS Workshop September
1 Alternate Title Slide: Presentation Name Goes Here Presenter’s Name Infrastructure Solutions Division Date GIS Perfct Ltd. Autodesk Value Added Reseller.
Mgt 240 Lecture Website Construction: Software and Language Alternatives March 29, 2005.
University of Illinois at Urbana-ChampaignHDF Mike Folk HDF-EOS Workshop IV Sept , 2000 HDF Update HDF.
University of Illinois at Urbana-ChampaignHDF 1McGrath/Yang 2/27/02 Transitioning from HDF4 to HDF5 Robert E. McGrath Kent Yang.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Technical Track Session XML Techie Tools Tim Bornholt.
HDF 1 NCSA HDF XML Activities Robert E. McGrath Mike Folk National Center for Supercomputing Applications.
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
OCLC Online Computer Library Center CONTENTdm Migration Training Craig Yamashita Vice President, Technology and Product Development DiMeMa, Inc. July 2005.

Developing a NetCDF-4 Interface to HDF5 Data
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Experiments with Remote Access to Scientific Data using HDF5 Kun Yan Robert E. McGrath NCSA May 15, 2002.
Parallel HDF5 Introductory Tutorial May 19, 2008 Kent Yang The HDF Group 5/19/20081SCICOMP 14 Tutorial.
9/13/20151 Threads ICS 240: Operating Systems –William Albritton Information and Computer Sciences Department at Leeward Community College –Original slides.
HDF-EOS Workshop VII, An XML Approach to HDF-EOS5 Files Jingli Yang 1, Bob Bane 1, Muhammad Rabi 1, Zhangshi Yin 1, Richard Ullman 1, Robert McGrath.
Jason G. Caudill Assistant Professor of Business Administration Carson-Newman College.
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Developing a NetCDF-4 Interface to HDF5 Data Russ Rew (PI), UCAR Unidata Mike Folk (Co-PI), NCSA/UIUC Ed Hartnett, UCAR Unidata Quincey Kozial, NCSA/UIUC.
1 HDF-EOS Status, Related Tools and Issues. 2 Overview.
1 High level view of HDF5 Data structures and library HDF Summit Boeing Seattle September 19, 2006.
HDF5 A new file format & software for high performance scientific data management.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
1 Overview of HDF5 HDF Summit Boeing Seattle The HDF Group (THG) September 19, 2006.
February 2-3, 2006SRB Workshop, San Diego P eter Cao, NCSA Mike Wan, SDSC Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration Object-level.
December 1, 2005HDF & HDF-EOS Workshop IX P eter Cao, NCSA December 1, 2005 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration.
1 HDF-EOS Status and Development Larry Klein, Abe Taaheri, and Cid Praderas L-3 Communications Government Services, Inc. November 30, 2005.
Why do I want to know about HDF and HDF- EOS? Hierarchical Data Format for the Earth Observing System (HDF-EOS) is NASA's primary format for standard data.
OME-TIFF and Bio-Formats K. Eliceiri, E. Hathaway, M. Linkert, and C. Rueden
Page 1 Status of HDF-EOS, Related Software, and Tools Abe Taaheri, Raytheon IIS HDF & HDF-EOS Workshp XIII Riverdale, MD November 4, 2009.
11/7/2007HDF and HDF-EOS Workshop XI, Landover, MD1 HDF5 Software Process MuQun Yang, Quincey Koziol, Elena Pourmal The HDF Group.
1 HDF-EOS Development Current Status and Schedule Larry Klein, Shen Zhao, Abe Taaheri and Ray Milburn L-3 Communications Government Services, Inc. September.
1 HDF-EOS Status, Related Tools and Issues. 2 Overview.
March 17, 2006CIP Status Meeting March 17, 2006 Sponsored by NLADR, NFS PACI Project in Support of NCSA-SDSC Collaboration Project Report at CIP AG Meeting.
HDF and HDF-EOS Workshop VIII, October 26-28, /12 Peter Cao, National Center for Supercomputing Applications This work is supported in part by a.
A High performance I/O Module: the HDF5 WRF I/O module Muqun Yang, Robert E. McGrath, Mike Folk National Center for Supercomputing Applications University.
Page 1 TOOLKIT / HDF-EOS Status and Development Abe Taaheri, Raytheon IIS Aura DSWG meeting October 2007.
HDF Hierarchical Data Format Nancy Yeager Mike Folk NCSA University of Illinois at Urbana-Champaign, USA
May 2003National Coastal Data Development Center Brief Introduction Two components Data Exchange Infrastructure (DEI) Spatial Data Model (SDM) Together,
EXPRESS/Binary Report David Price ISO SC4 Vico Equense March 2006.
1 HDF Vendors/Software Developers Workshop HDF And HDF-EOS Tools R.Suresh NASA/GSFC/HSTX Ph: FAX:
HDF EOS Workshop David Han Code
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
Connections to Other Packages The Cactus Team Albert Einstein Institute
1 Status of HDF-EOS, Related Software and Tools. 2 TOOLKIT / HDF-EOS Support.
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package MuQun Yang, Christian Chilan, Albert Cheng, Quincey Koziol, Mike.
© 2002 IBM Corporation Transaction Processing Facility TPF Users Group Acapulco May 2004 © 2004 IBM Corporation Pete Nicholls IBM Toronto Lab TPF Toolkit.
HDF-EOS Workshop IV September 19-21, 2000 Richard E. Ullman ESDIS Information Architect NASA/ GSFC, Code 423.
SDM Center High-Performance Parallel I/O Libraries (PI) Alok Choudhary, (Co-I) Wei-Keng Liao Northwestern University In Collaboration with the SEA Group.
Parallel I/O Performance Study and Optimizations with HDF5, A Scientific Data Package Christian Chilan, Kent Yang, Albert Cheng, Quincey Koziol, Leon Arber.
1 © 2004 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Cisco Technical Support Seminar Using the Cisco Technical Support Website.
Hierarchical Data Formats (HDF) Update
Moving from HDF4 to HDF5/netCDF-4
Plans for an Enhanced NetCDF-4 Interface to HDF5 Data
Access HDF5 Datasets via OPeNDAP’s Data Access Protocol (DAP)
Hierarchical Data Format (HDF) Status Update
Presentation transcript:

HDF Mike Folk National Center for Supercomputing Applications Science Data Processing Workshop February 26-28, 2002 HDF Update HDF

HDF - 2 -Topics What is HDF? HDF4 update and plans HDF5 update and plans

HDF NCSA HDF Mission To develop, promote, deploy, and support open and free technologies that facilitate scientific data storage, exchange, access, analysis and discovery.

HDF What is HDF? Format and software for scientific data Stores images, arrays, tables, etc. Storage and I/O efficiency Free and commercial software support Promote standards Broad user base in engineering & science

HDF Who is supporting HDF? NASA/ESDIS –Earth science applications, instrument data –All aspects of data management DOE/ASCI (Accelerated Strategic Computing Init.) –Simulations on massively parallel machines –Emphasis on parallel I/O performance, functionality DOE Scientific Data Analysis & Computation Program –High performance I/O R & D NCSA –Grid, Vis, other R&D, user support Others –Applications, support, some R&D

HDF HDF4

HDF Goals for HDF4 Support Terra, Aqua & other EOS data users Maintain and upgrade library and tools Address differences between HDF4 & HDF5

HDF HDF4 Java API & tools: HDF4 milestones since Sept 2000 HDF4 library and utilities: Q4 ‘00Q1 ‘01Q2 ‘01Q3 ‘01Q4 ‘01Q1 ‘02 HDF4.1 r5 HDF4.1 r4 Version 2.6 Version 2.7

HDF HDF4 releases HDF4.1 r4 (Nov 2000) –Chunking and chunking with compression –JPEG compression fixed –hdp utility enhancements –HDF4-to-GIF and GIF-to-HDF4 converters HDF4.1 r5 (Nov 2001) –Compression query functions –Vdata: set size of Vdata link blocks –Tools updates –Added Solaris 2.8 –Retired Solaris 2.6, OSF1 V4.0 and HPUX 10.20

HDF HDF5

HDF HDF4-HDF5 conversion: Java & other tools: High level library: HDF5 milestones since Sept 2000 ♦ “table”  “lite” & “image”  Base library: Q4 ‘00Q1 ‘01Q2 ‘01Q3 ‘01Q4 ‘01Q1 ‘02 _ _ (patch) ♦ _ _ _ Thread-safe  _ Java 2.2 _ _ ♦ library  2 ♦ library  ♦ conversion utility

HDF HDF5 Library Work

HDF New platforms and languages New platforms –HP-UX –IBM SP –IA64 architecture –Solaris 2.8 with 64-bit –Linux in parallel mode –Metrowerks Code Warrior Windows compiler Fortran 90 –Solaris 2.6 & 2.7, O2K, DEC OSF, Linux, Cray T3E, SV1 –Parallel on T3E & O2K –Windows C++ –Solaris 2.6 and 2.7, Free BSD, Linux, Windows

HDF New to HDF5 Virtual File Layer – alternate I/O drivers Array datatype – array as atomic type File sizes greater than 2GB on Linux Performance improvements – parallel & serial Improvements in configuration Tutorials and documentation

HDF Next major release -- HDF5 1.6 New format and library features –Reclamation of free space within a file –Enhanced hyperslab/region selection –Dimension scale support –Bzip2 compression –Generic Properties –Performance improvements Parallel I/O performance benchmark suite Release date: Fall 2002

HDF High level APIs HDF5 routines that do more operations per call than the basic HDF5 interface Goals –Make HDF5 easier to use –Encourage standard ways to store objects in HDF5

HDF High level APIs Lite – done Image – done Table – partly done Dimension scale – in the works Unstructured grids – in the works

HDF HDF5 High Level APIs – HDF5 Image For datasets to be interpreted as images/palettes –2-D raster data like HDF4 raster images Image operations –Create, write, read, query Based on “HDF5 Image & Palette Specification”

HDF HDF5 High Level APIs – HDF5 Table For datasets to be interpreted as “tables” –A collection of records –All records have the same structure –Like Vdatas in HDF4, but more operations Table operations –Create, write, read, query –Insert, delete records or fields –Future: sort and search

HDF HDF5 tools activities

HDF H5View – new features Improved editing XML support Supports “image” data sets Line plots of row/column data Save data to a text file Set user preferences

HDF Future Viewer work Objectives –Support transition from HDF4 to HDF5 –Eliminate duplicate work on HDF4 and HDF5 tools –Provide plug-in architecture for tool development Three stages –Object model to cover both HDF4 & HDF5 Nearly done –Simple browser for both HDF4 and HDF5 Summer 2002 –Editor Fall 2002 –Modular toolkit for browser plug-ins Fall 2002, if all goes well

HDF XML and web experiments Explore XML, Web applications for HDF5 –Investigate use of XML –investigate Web XML technologies, use with HDF5 XML DTD for HDF5 Conversions involving XML –HDF4  HDF5  XML –Compression and timing study –netcdf to HDF5 translation, via XML style sheet XML Schema for HDF5 investigated

HDF Other Java-inspired explorations Tomcat web server and JSP servlets –HDF5 file  XML  html  display in a web browser –Shows promise for providing remote access to HDF5 HDF5 and CORBA –Investigating using CORBA with Java and HDF5. –Data access done in a CORBA servant written in C/C++ –Browsing and presenting the information done in Java –Uses CORBA is alternative to the Java Native Interface, which calls C directly from Java.

HDF Other Tools Activities HDF5-to-GIF and GIF-to-HDF5 converters H5dump subsetting

HDF Facilitating the transition from HDF4 to HDF5

HDF The transition from HDF4 to HDF5 1.We support both HDF4 and HDF5. 2.We will continue to maintain HDF4, as long as we are funded to do so. 3.We recommend: Use HDF5 for new projects Consider migrating from HDF4 to HDF5 to take advantage of the improved features and performance of HDF5

HDF HDF4-to-HDF5 Work Mapping specification Utility Library Experiments

HDF HDF4-to-HDF5 Work Later talk by Bob McGrath to cover –The key technical challenges –NCSA toolkit work –Experiments to support transition

HDF Other Activities of Interest

HDF Parallel HDF5 HDF5 supports parallel I/O Emphasis on performance Driven by DOE labs Tutorial See Elena’s talk

HDF Parallel I/O benchmark suite A set of parallel performance benchmarks that can be distributed with HDF5 Parallel HDF5 in MPI environment Measures performance of raw I/O, MPI-I/O, & HDF5 Draft documentation:

HDF Other performance studies See Elena’s talk

HDF Thread-Safe HDF5 (Beta) Uses Pthreads (POSIX threads) library. Implements Phase 1 of HDF5 thread safety plan –One lock per program Later phases –2: Separate locks per major operation (read, write, convert) –3: Separate locks for all operations. For details see –

HDF HDF5-DODS * Server HDF5 data available from DODS clients Demonstrated with Ferret client Documentation –HDF5-DODS Data Model and Mapping –The HDF5-DODS Server Prototype –Demo of HDF5-DODS Server Prototype with the DODS-Ferret Client * Distributed Oceanographic Data System

HDF Transform architecture prototype Provides a mechanism for operating on data as it is moved from one location to another Inspired by HDF5 I/O operations when moving data between memory and file: –Convert numbers – size, endianness, etc. –Selection – subsetting & subsampling –Linear order – row major vs. column major Later –Change value/Meaning of data – units, coordinate system, etc.

HDF Outreach Documentation –Expanded and improved –Searchable, printable –HDF5 User’s Guide Advanced and parallel HDF5 Tutorial – –Presented at SC

HDF szip Compression Reporting for Pen-Shu Yeh and Wei Xia

HDF Szip Compression in HDF 4 CCSDS lossless compression –Fast, effective compression method for EOS data –Outperforms other common techniques in size & speed Szip software –Implements CCSDS lossless compression –Originally developed at University of New Mexico –Implemented in HDF-4 by UNM HDF-4 with CCSDS-szip option (HDF-4-Szip) –Delivered to GSFC and tested –Tested with MODIS Level-1B data from GSFC DAAC

HDF Sample results RLE Huff szip RLE Huff szip Convert 343 Megabyte MODIS file Conversion performed on PC-Linux 7.1, Pentium II, 300Mhz

HDF Szip-HDF4 software HDF4 library routine SDsetcompress –compress existing dataset or create new compressed dataset Utilities –bin2hdf – input array from binary file, compress using HDF-4-Szip, and output compressed data in HDF file –hdf2chdf – input data from HDF file, compress using HDF- 4-Szip, and output compressed data in new HDF file –chdf2bin – input HDF file (with or without compression), and output user-selected arrays as separated binary files

HDF Future Work Compress IEEE 32 bit float data with HDF-4-Szip Use HDF chunking routines –Especially beneficial for subsetting performance Test HDF-4-szip on different computer platforms –Currently HDF-4-Szip has been tested under Linux only –Others to test: Sun, SGI and HP, Windows 98 and NT Advocacy in the scientific community –Add User’s Guides and documentation to HDF docs. –Present and demo at workshops and conferences. Resolve distribution issues with NCSA and UNM

HDF Thank you! HDF website – HDF5 Information Center – HDF Helpdesk HDF users mailing list Information Sources