10 th GHRSST Science Team Meeting Santa Rosa, CA 3 June 2009
The XML (Metadata)TAG Chair: Ed Armstrong Members: Jean-F Piolle, Ken Casey, Leon Majewski, Dave Poulter Advisors: Ted Habermann (NOAA NGDC), Etienne Charpentier (WMO) GHRSST-9 AI: Develop an appropriate ISO based metadata model/profile for GDS 2.0 At the breakout: Chair: Ed Armstrong Reporter: Leon Majewski Present: Ted Haberman, Thomas Huang, Dave Poulter, Jean-François Piollé, Bob Grumbine, John Sapper, Jon Mittaz, Viva Banzon, Tess Brandon, Others 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
Why... ? Current model based on NASA DIF has limitations: Data Provenance and Quality information L2 orbit characterization Other “clunky” stuff (addresses, liability statements, custom attributes) Limitations apparent even when mapped to FGDC model in GHRSST archive Maintain parity with other programs adopting ISO metadata model WMO Future Sentinel missions NOAA 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
Use cases L4 products Which are the exact buoys used in the assimilation?….currently there is no (easy) way to get at this information. L2P products What is the longitude (and time) of the ascending node? What is the swath, period and inclination? We need this information for the an orbital modeling program. L3P products What polar projection is this data in ? 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
The ISO components ISO (2003) Geographical information and associated services, data quality, access, contacts, and rights to use Over 400 metadata elements (attributes) 150 pages ISO (2009) Extended to describe imagery and gridded data > 100 attributes > 50 pages ISO (Final draft submitted July 2009) Extensions for sensor model descriptions > 300 attributes > 160 pages 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
The ISO components ISO components (superclasses, subclasses, containers, cointainees) Relationships diagrammed in UML 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
The ISO Metadata Model Crosswalk between FR/DSD DIF model to ISO (Excel spreadsheet) – completed and posted to GHRSST Project web site 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
The ISO Metadata Model UML diagrams for ISO core and crosswalked attributes – completed and posted to GHRSST Project web site UML further refined for sensor model attributes 4 June th GHRSST Science Team Meeting, Santa Rosa, CA
4 June th GHRSST Science Team Meeting, Santa Rosa, CA UML 1
4 June th GHRSST Science Team Meeting, Santa Rosa, CA UML 2
4 June th GHRSST Science Team Meeting, Santa Rosa, CA UML 3
4 June th GHRSST Science Team Meeting, Santa Rosa, CA Example ISO record for GOES data Presented by Ted Haberman Full ISO example in XML Demonstrated the potential of ISO to provide complete/rich metadata for data quality, processing lineage and consistency Web site (wiki) that breaks out the ISO components with descriptions and example XML le=ISO_Metadata_Standard
4 June th GHRSST Science Team Meeting, Santa Rosa, CA Example ISO component
4 June th GHRSST Science Team Meeting, Santa Rosa, CA LI_Lineage
4 June th GHRSST Science Team Meeting, Santa Rosa, CA Recommendation Summary Recommendations That GHRSST move to an ISO 19115/19130 implementation of metadata capture to promote interoperability and ensure that GHRSST’s metadata holdings remain relevant. That no additional, mandatory, fields are added to the metadata requirements at this time. That desired fields (orbits etc) are optional until the science team deems them mandatory. That the science team suggest additional fields that should be captured in the metadata. Once additional items (as determined by the science group) become routinely available, that the science team recommend they become mandatory. That the XML-WG provide examples of a) UML diagrams, b) the current DSD/FR records in ISO format and b) extended records that include fields suggested by the XML-WG (e.g. orbit information) to data providers. That no ISO XML be placed in netCDF 3 global structure. This will be reassessed when/if we move to netCDF 4.
4 June th GHRSST Science Team Meeting, Santa Rosa, CA Summary An ISO Metadata Model will be developed Based on existing crossswalk of FR/DSD and subset of attributes in UML diagrams Essentially an ISO “Lite” model with all current attributes mapped (required) and some optional ones AIs: Provide an updated UML structure Provide XSD/DTD XML description Provide example XML files for L2P/L3P/L4 products Input above to GDS 2.0 Investigate conversion methodologies for FR/DSD -> ISO XML TAG will: Provide the blueprints... It will be the challenge/responsibility of the data producers to implement and validate their ISO XML records