Download presentation
Presentation is loading. Please wait.
Published byGerard Nelson Modified over 9 years ago
1
Creating Archive Information Packages for Data Sets: Early Experiments with Digital Library Standards Ruth Duerr, NSIDC MiQun Yang, THG Azhar Sikander, NSIDC Choonghwan Lee, THG
2
Outline Motivation Goals Standards Plans and Status
3
Motivation Technologies change regularly, organizations come and go, but data must survive But preserving data takes more than just preserving the bits, all the components of an AIP are critical
4
Illustration =
5
Project Goals Prototype development of Archive Information Packages for HDF data: For entire data sets For individual “granules” Test usability of digital library standards with geospatial data
6
Metadata Standards - METS Metadata Encoding and Transmission Standard An initiative of the Digital Library Federation Provides the means to convey the metadata necessary for management of digital objects within a repository exchange of objects between repositories (or between repositories and their users) Designed to facilitate shared development of information management tools/services interoperable exchange of digital materials
7
METS - A very brief overview Describes the METS document itself e.g., creator or editor Describes the object using some external standard e.g., MARC, FGDC, Dublin Core Describes object creation, storage, intellectual property rights, source info, provenance, etc. e.g., PREMIS Provides an inventory of all of the files that are part of the object described A physical or logical map of the organization of the materials described Allows specification of hyperlinks between parts of the map (mostly useful when preserving websites) Used to associate executable code with parts of the content
8
ISO 19115 Geographic Information - Metadata Purpose Characterize geographic data properly Facilitate organization and management of metadata for geographic data Enable users to efficiently use such data Facilitate discovery, retrieval, and reuse Enable data assessment
9
ISO 19115 entities Identification Constraints Data Quality Maintenance Information Spatial Representation Reference System Content Information Portrayal Catalogue Reference Distribution Metadata Extension Information Application Schema Information
10
Metadata Standards - PREMIS Provide a core preservation metadata set with broad applicability across the digital preservation community Developed by an OCLC and RLG sponsored international working group Representatives from libraries, museums, archives, government, and the private sector. Maintained by the Library of Congress Based on the OAIS reference model
11
Current Program Plan NetCDF4 / HDF5 Data METS NSIDC/ ECS HDF4-data ISO-19115 H4to H5 ECS to METS (Data Set) CDM/NetCDF4 ECS to METS (Granule) NSIDC/ECS Metadata HDF5-AIP NetCDF4/HDF5-data NetCDF4 / HDF5 Data NSIDC/ ECS HDF4-data H4to H5 NetCDF4/HDF5-data
12
Data file HDF5 METS Primary Schema Extension Schema | |--- ---------------- |--- --------------|-- | |-- PREMIS | |-- |---- http://www.hdfgroup.uiuc.edu/papers/papers/AIP/HDF5_AIP_White_Paper.pdf HDF5 AIP Components Metadata file HDF5 File Level Archive Information Packages
13
METS Primary Schema Extension Schema | |--- ---------------- |--- --------------|-- | |-- PREMIS | |-- |---- Metadata file Data Set Level Archive Information Package HDF- AIP Contextual Infomation HDF- AIP Contextual Infomation Contextual Infomation Contextual Infomation Contextual Infomation HDF- AIP
14
File Level AIP Activity Status Development of a map from NSIDC/ECS metadata to METS/PREMIS/ISO 19115 completed Implementation underway Issues Auxillary file handling - own AIP or not? o E.g., browse files, processing history, PGE’s o Granules vs files Schema redundancy
15
Data Set AIP Activities Status Contextual information availability assessed for MODIS data Currently GCSRLTA information requirements are being met Much of the information is available via a variety of websites many of which are dynamically updated Format of the material varies widely Some material should be considered geographic data sets in their own right Much of the material applies to multiple data sets
16
Data Set AIP Activity Status Local sources of metadata identified ECS Earth Science Data Type (ESDT) definitions NSIDC data set catalog and documentation Data set catalog to ISO 19115 metadata translator implemented - to be released operationally soon
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.