Discussion on an HDF-GEO concept HDF Workshop X 30 November 2006.

Slides:



Advertisements
Similar presentations
Database Planning, Design, and Administration
Advertisements

SEBGIS 2005, Agia Napa, Cyprus, October 31 - November 4, 2005 MECOSIG Adapted to the Design of Distributed GIS F. Pasquasy, F. Laplanche, J-C. Sainte &
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Chapter 6 Database Design
Chapter 10: Analyzing Systems Using Data Dictionaries Instructor: Paul K Chen.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Ch1: File Systems and Databases Hachim Haddouti
Model Driven Architecture (MDA) Partha Kuchana. Agenda What is MDA Modeling Approaches MDA in a NutShell MDA Models SDLC MDA Models (an Example) MDA -
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
6 Chapter 6 Database Design Hachim Haddouti. 6 2 Hachim Haddouti and Rob & Coronel, Ch6 In this chapter, you will learn: That successful database design.
Lecture Nine Database Planning, Design, and Administration
The database development process
Configuration Management
University of Illinois at Urbana-ChampaignHDF 1McGrath/Yang 2/27/02 Transitioning from HDF4 to HDF5 Robert E. McGrath Kent Yang.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
What is Business Analysis Planning & Monitoring?
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
Chapter 9 Database Planning, Design, and Administration Sungchul Hong.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
05 December, 2002HDF & HDF-EOS Workshop VI1 SEEDS Standards Process Richard Ullman SEEDS Standards Formulation Team Lead
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
A Liaison Report from ISO TC211 to CEOS WGISS Dr. Liping Di
HDF-EOS Workshop VII, An XML Approach to HDF-EOS5 Files Jingli Yang 1, Bob Bane 1, Muhammad Rabi 1, Zhangshi Yin 1, Richard Ullman 1, Robert McGrath.
ITEC 3220M Using and Designing Database Systems
WMO BUFR &CREX Gil Ross, UK Met Office
HDF5 A new file format & software for high performance scientific data management.
Week 4 Lecture Part 3 of 3 Database Design Samuel ConnSamuel Conn, Faculty Suggestions for using the Lecture Slides.
Harmonizing HDF5 and NITF Alan M. Goldberg HDF Workshop X November 2006 The opinions expressed in this presentation are those of.
MODIS Land and HDF-EOS HDF-EOS Workshop Presentation September 20, 2000 Robert Wolfe NASA GSFC Code 922, Raytheon ITSS MODIS Land Science Team Support.
HDF5 for NPOESS Data Products Alan M. Goldberg The MITRE Corporation Organization: W803 Project: 1400NT01-SE This work was performed.
NPP/ NPOESS Product Data Format Richard E. Ullman NASA/GSFC/NPP NOAA/NESDIS/IPOAlgorithm / System EngineeringData / Information Architecture
An Introduction to MINC John G. Sled. What is MINC? A medical image file format based on NetCDF A core set tools and libraries for image processing A.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
N P O E S S I N T E G R A T E D P R O G R A M O F F I C E NPP/ NPOESS Product Data Format Richard E. Ullman NOAA/NESDIS/IPO NASA/GSFC/NPP Algorithm Division.
© 2012 The MITRE Corporation. All rights reserved. For internal MITRE use 13 June 2013 Meeting #3 hData Record Format Taskforce 1 © 2012 The MITRE Corporation.
Page 1 LAITS Laboratory for Advanced Information Technology and Standards ISO & Status Liping Di Laboratory for Advanced Information Technology.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Creating Archive Information Packages for Data Sets: Early Experiments with Digital Library Standards Ruth Duerr, NSIDC MiQun Yang, THG Azhar Sikander,
CSDGM Overview CSDGM Tools and Resources. Resources Series Materials: ftp://ftp.ncddc.noaa.gov/pub/Metadata/Online_ISO_Tr aining/Intro_to_Geospatial_Metadata/
Are Standards Really Standards Any More? Mélanie F. Meaux NASA / GCMD In response to Wyn Cudlip with regards to an IDN profile of ISO …
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Content Framework for Operational Environmental Remote Sensing Data Sets: NPOESS Concepts Alan M. Goldberg NOTICE This technical data.
Transitioning from FGDC CSDGM Metadata to ISO 191** Metadata
National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology Pasadena, California EDGE: The Multi-Metadata.
The HDF Group Data Interoperability The HDF Group Staff Sep , 2010HDF/HDF-EOS Workshop XIV1.
July 20, Update on the HDF5 standardization effort Elena Pourmal, Mike Folk The HDF Group July 20, 2006 SPG meeting, Palisades, NY.
Barry Weiss 1/4/ Jet Propulsion Laboratory, California Institute of Technology Quality Elements in ISO Metadata Design for Proposed SMAP Data.
Aura HDF-EOS File Format Guidelines: Overview and Status Cheryl Craig.
1 Chapter 11 Planning. 2 Project Planning “establishing a predetermined course of action within a forecasted environment” “establishing a predetermined.
IST 210 Database Design Process IST 210, Section 1 Todd S. Bacastow January 2004.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
P3 Business Analysis. 2 Section F: Project Management F1.The nature of projects F2. Building the Business Case F4. Planning,monitoring and controlling.
QA4EO in 10 Minutes! A presentation to the 10 th GHRSST Science Team Meeting.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
HDF/HDF-EOS Meeting Oct th 2008, Aurora CO Proposal for adding Named Dimensions to HDF5 Arrays Daniel Kahn Science Systems and Applications, Inc.
Project Management Chapter 3.
SRNWP Interoperability Workshop
TechStambha PMP Certification Training
Active Data Management in Space 20m DG
Data Model.
Project Management Process Groups
Database Systems Instructor Name: Lecture-3.
Palestinian Central Bureau of Statistics
Presentation transcript:

Discussion on an HDF-GEO concept HDF Workshop X 30 November 2006

Abstract At the past several HDF & HDF-EOS Workshops, there has been some informal discussion of building on the success of HDF-EOS to design a new profile, tentatively called HDF-GEO. This profile would incorporate lessons learned from Earth science, Earth applications, and Earth model data systems. It would encompass all types of data, data descriptions, and other metadata. It might support 1-, 2-, and 3-D spatial data as well as time series; and it would include raw, calibrated, and analyzed data sets. It would support data exchange by building its needed complexity on top of minimal specialized features; and by providing clear mechanisms and requirements for all types of appropriate metadata. The organizers propose to host a discussion among the workshop participants on the need, scope, and direction for HDF-GEO.

Which buzzwords would fit? profile best practices data model metadata content atomic & compound types self docu -mentation platform support markup & schema tools test suites … …ilities HDF-GEO naming rules data levels geo ref

Questions for discussion by Earth science practitioners - Bottom-up analysis: What are the successful features of existing community data formats and conventions? (HDF5, HDF-EOS, netCDF, CDF, GRIB BUFR, COARDS, CF-1, NITF, FITS, FGDC RS extensions, ISO, geoTIFF,...) Progress being made -- John Caron's Common Data Model Specific needs for geo- and time-referenced data conventions Specific needs to support observed (raw), calibrated, and analyzed data sets

Questions for discussion by Earth science practitioners - Top-down analysis: (1 of 2) What is a profile? Consider specifics of how a standard or group of standards are implemented for a related set of uses and applications. How does a profile relate to a format or other elements of a standard? What constitutes overkill? How much profile would be beneficial, and how much would be difficult to implement and of limited utility?

Questions for discussion by Earth science practitioners - Top-down analysis: (2 of 2) Why is it useful? Establishes specific meanings for complicated terms or relationships Establishes common preferred terms for attributes which can be described multiple ways. Establishes practices which are consistent with portability across operating systems, hardware, or archives Establishes common expectations and obligations for data stewardship Clarifies community (and sponsor) long-term expectations, beyond short-term necessity other...

Discussion

Wrap-up Send your list of provisional HDF-GEO requirements goals to be achieved How HDF-GEO would help to me at

Backup

Motivation: In many instances, application-specific 'profiles' or 'conventions' or best practices have shown their utility for users. In particular, profiles have encouraged data exchange within communities of interest. HDF provides minimal guidance for applications. HDF-EOS was a mission-specific profile; resulted in successes and lessons learned. HDF5 for NPOESS is another approach. Is it time for another attempt, benefitng from all the lessons, and targeted at a broader audience?

© 2005 The MITRE Corporation. All rights reserved NOTICE This technical data was produced for the U.S. Government under Contract No. 50-SPNA , and is subject to the Rights in Technical Data - General clause at FAR (JUN 1987) Approved for public release, distribution unlimited HDF Lessons from NPOESS & Future Opportunities (excerpt) Alan M. Goldberg HDF Workshop IX, December 2005

© 2005 The MITRE Corporation. All rights reserved Requirements for data products n Deal with complexity –Large data granules n Order of Gb –Complex intrinsic data complexity n Advanced sensors produce new challenges –Multi-platform, multi-sensor, long duration data production –Many data processing levels and product types n Satisfy operational, archival, and field terminal users –Multiple users with heritage traditions

© 2005 The MITRE Corporation. All rights reserved NPOESS products delivered at multiple levels

© 2005 The MITRE Corporation. All rights reserved Sensor product types n Swath-oriented multispectral imagery –VIIRS – cross-track whiskbroom –CMIS – conical scan –Imagery EDRs – resampled on uniform grid n Slit spectra –OMPS SDRs – cross-track spectra, limb spectra n Image-array fourier spectra –CrIS SDR n Directional spectra –SESS energetic particle sensor SDR n Point lists –Active fires n 3-d swath-oriented grid –Vertical profile EDRs n 2-d map grid –Seasonal land products n Abstract byte structures –RDRs n Abstract bit structures –Encapsulated ancillary data n Bit planes –Quality flage n Associated arrays (w/ stride?) –geolocation

© 2005 The MITRE Corporation. All rights reserved NPOESS product design development Requirements - Multi-platform, multi- sensor, long duration data production - Many data processing levels and product types - Satisfy operational, archival, and field terminal users Constraints - Processing architecture and optimization - Heritage designs - Contractor style and practices - Budget and schedule Intentions - Use simple, robust standards - Use best practices and experience from previous operational and EOS missions - Provide robust metadata - Maximize commonality among products - Forward-looking, not backward-looking standardization Resources - HDF5 - FGDC - C&F conventions - Expectation of tools by others Result Design Process - Experience - Trades& Analyses

© 2005 The MITRE Corporation. All rights reserved Lessons & Way Forward

© 2005 The MITRE Corporation. All rights reserved Observations from development to date n Avoid the temptation to use heritage approaches without reconsideration, but … n Novel concepts need to be tested n Data concepts, profiles, templates, or best practices should be defined before coding begins n Use broad, basic standards to the greatest possible extent –FGDC has flexible definitions, if carefully thought through n Define terms in context; clarity and precision as appropriate n Attempt to predefine data organizations in the past (e.g., HDF-EOS ‘swath’ or HDF4 ‘palette’) have offered limited flexibility. Keep to simple standards which can be built upon and described well. Lesson: be humble n It is a great service to future programs if we capture lessons and evolve the standards n How do we get true estimates of the life-cycle savings for good design?

© 2005 The MITRE Corporation. All rights reserved Thoughts on future features for Earth remote sensing products n Need to more fully integrate product components with HDF features n Formalize the organization of metadata items which establish the data structure –Need mechanism to associate arrays by their independent variables n Formalize the organization of metadata items which establish the data meaning –XML is a potential mechanism – can it be well integrated? –Work needed to understand the advantages and disadvantages –Climate and Forecast (CF) sets a benchmark n Need a mechanism to encapsulate files in native format –Case in which HDF is only used to provide consistent access n Need more investment in testing before committing to a design

Primary and Associated Arrays Index Attribute n-Dimensional Dependant Variable (Entity) Array Primary Array e.g., Flux, Brightness, Counts, NDVI Associated Array(s) e.g., QC, Error bars dimension  n

1-Dimensional Attribute Variables Index Attribute Primary e.g., UTC time or angle Additional e.g., IET time, angle, or presssure height Associated Independent Variable(s)

Multi-Dimensional Attribute Variables 2-Dimensional Independent Variable Array(s) e.g., lat/lon, XYZ, sun alt/az, sat alt/az, or land mask Key concept: Index Attributes organize the primary dependant variables, or entities. The same Index Attributes maybe used to organize associated independent variables. Associated independent variables may be used singly (almost always), in pairs (frequently), or in larger combinations.

© 2005 The MITRE Corporation. All rights reserved Issues going forward - style n Issues with assuring access understanding –How will applications know which metadata is present? –Need to define a core set with a default approach n Issues with users –How to make providers and users comfortable with this or any standard –How to communicate the value of: best practices; careful & flexible design; consistency; beauty of simplicity –Ease of use as well as ease of creation n Issues with policy –Helping to meet the letter and intent of the Information Quality Act n Capturing data product design best practices –Flexibility vs. consistency vs. ease-of-use for a purpose

© 2005 The MITRE Corporation. All rights reserved Issues going forward - features n Issues with tools –Tools are needed to create, validate, and exploit the data sets. n Understand structure and semantics n Issues with collections –How to implement file and collection metadata, with appropriate pointers forward and backward –How to implement quasi-static collection metadata n Issues with HDF –Processing efficiency (I/O) of compression, of compaction –Repeated (fixed, not predetermined) metadata items with the same not handled –Archival format

© 2005 The MITRE Corporation. All rights reserved Possible routes: Should there be an HDF-GEO? n Specify a profile for the use of HDF in Earth science applications: n Generalized point (list), swath (sensor coordinates), grid (georeferenced), abstract (raw), and encapsulated (native) profiles. n Generalized approach to associating georeferencing information with observed information. n Generalized approach to incorporating associated variables with the mission data n Generalized approach to ‘stride’ n Preferred core metadata to assure human and machine readability n Identification metadata in UserBlock n Map appropriate metadata items from HDF native features (e.g., array rank and axis sizes) n Preferred approach to data object associations: arrays-of-structs or structs-of-arrays? n Design guidelines or strict standardization?