Interagency Forum on Earth Data Preservation/LifeCycle /Stewardship January 8, 2009 Rob Raskin NASA/Jet Propulsion Lab
Introduction Objectives Objectives Identify common needs across agencies Identify common needs across agencies Support higher priority / more funding for long- term preservation of Earth system science data Support higher priority / more funding for long- term preservation of Earth system science data Possible outcomes Possible outcomes White paper White paper Congressional budget request Congressional budget request Chapman/Gordon Conference plan Chapman/Gordon Conference plan Formation of advisory body Formation of advisory body
Preserving What? Multiple Preservation Levels LevelPreservation Strategies BitsReliable storage media, fast cop BytesIEEE Standards Structure/Format /Access Datatype ontology, scientific formats Parameter specification Science ontology History/contextSpacecraft/instrument, processing history, quality, workflow ontologies Syntax Semantics
Issues Unique identifiers (UID) Unique identifiers (UID) Preserving Provenance Preserving Provenance Common naming conventions Common naming conventions Preserving Access Preserving Access Versioning Versioning Different versions have different UIDs Different versions have different UIDs Supporting Reprocessing Supporting Reprocessing
Purposes of Unique Object Identifiers Uniquely and unambiguously identify data no matter which copy a user has Uniquely and unambiguously identify data no matter which copy a user has Find and access the data no matter where the data currently resides Find and access the data no matter where the data currently resides Facilitate management of data over time by a data center Facilitate management of data over time by a data center Ensure that a copy is an exact replica Ensure that a copy is an exact replica Facilitate data citation in publications Facilitate data citation in publications Reproducibility of results Reproducibility of results Attribution of due credit Attribution of due credit
Potential UID Representations DOI DOI LSID LSID URI URI URN URN URL URL OID OID PURL PURL ARK ARK UUID UUID XRI XRI
Levels of Archival Service Level 1 Deep Archive Services Offsite back up storage of a data set Limited data access Level 2 Collection/Directory Level Archive Services Ancillary data and collection level metadata available Associated clearinghouse services Level 3 Data Set/Inventory Level Archive Services Delivery of a complete data product suite Level 4 Data System Level Archive Services Delivery of data and the system to process on demand or custom products