NSSDC Role and OAIS Implementation: Brief Overview. Don Sawyer. LOC, 13 June 2003.

Presentation transcript:


NSSDC Roles
- NSSDC is the NASA Office of Space Science (OSS) permanent archive
  - Astronomy, Solar & Space Plasma Physics, Planetary & Lunar data
  - Digital and film data from >1300 instruments flown on >375 spacecraft
  - Distinguished from OSS Active Archives (AAs)
- Interacts in a timely manner with all distributed OSS active archives in the space physics, solar physics, astrophysics, and planetary science disciplines to acquire the OSS data and supporting metadata needed for long-term preservation and understanding
  - Interacts directly with projects when mediated by an active archive
  - Interacts with PIs and related individuals when they have data needing long-term preservation

OSS Archive Relationships
[Diagram labels: Planetary AAs; Solar AAs; SEC AAs; Astrophysics AAs; various OSS S/C projects; NSSDC Permanent Archive (DLTs, tapes, CD/DVDs, film, paper); PDS and SEC data on media; anonymous FTP; OSS researchers, non-OSS researchers, education community, general public]

NSSDC Roles (concl’d)
- NASA's lead for the Consultative Committee for Space Data Systems (CCSDS) Archiving and Data Packaging/Registry Working Groups (on-ground data management)
  - Led development of the CCSDS/ISO Open Archival Information System (OAIS) reference model standard
- Comprehensive information base about all launched spacecraft (~6000)
- Host of the World Data Center for Satellite Information
  - Part of the worldwide World Data Center infrastructure established ~1958

NSSDC’s Permanent Archive Environment - Legacy View
- ~20 TB in ~2,300 digital data sets on ~40,000 offline media
  - Most on tape
  - Most newly arriving media are CDs or DVDs
- A "data set" is all data from a given source (e.g., an instrument on a spacecraft) at a given "processing level"
- Wide range of data characteristics (e.g., documented binaries specific to now-obsolete computers)
- Also ~2,000 data sets on a large number of film media of various form factors
  - Gradually being digitized into TIFF via scanning

Initial Drivers for OAIS Re-engineering
- Needed to solve a migration problem
  - Remove dependencies of VAX VMS files on the operating system
  - Include record-defining attributes in a standard form to accompany the data file content
  - Result was a package of data and metadata
- Had software, based on the CCSDS/ISO packaging standard, that could be augmented
- OAIS reference model provided an architectural view

Created Archival Information Package
- Single file (binary/ASCII content)
- Uses CCSDS/ISO packaging (SFDU) to hold multiple data objects
  - NSSDC-defined attribute object expressed in CCSDS/ISO Parameter Value Language (PVL)
  - NSSDC data file content in one of four canonical forms: two flavors each of binary and ASCII
  - 20-byte SFDU ASCII labels to separate data objects
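
The single-file packaging idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the slide says the labels are 20 bytes of ASCII, but the field layout used here (a 12-character object identifier followed by an 8-digit length) and the identifier strings are hypothetical placeholders, not the actual CCSDS/ISO SFDU label definition or NSSDC's registered identifiers.

```python
LABEL_SIZE = 20  # the slide gives 20-byte ASCII labels; the layout below is assumed


def make_label(object_id: str, length: int) -> bytes:
    """Build a hypothetical 20-byte ASCII label: 12-char id + 8-digit length."""
    if len(object_id) != 12 or not object_id.isascii():
        raise ValueError("object_id must be exactly 12 ASCII characters")
    if not 0 <= length <= 99_999_999:
        raise ValueError("length does not fit in 8 decimal digits")
    return f"{object_id}{length:08d}".encode("ascii")


def pack(objects: list[tuple[str, bytes]]) -> bytes:
    """Concatenate (id, payload) pairs, each preceded by its label."""
    return b"".join(make_label(oid, len(data)) + data for oid, data in objects)


def unpack(package: bytes) -> list[tuple[str, bytes]]:
    """Walk the package label by label and recover the (id, payload) pairs."""
    objects, pos = [], 0
    while pos < len(package):
        label = package[pos:pos + LABEL_SIZE]
        if len(label) != LABEL_SIZE:
            raise ValueError("truncated label")
        object_id = label[:12].decode("ascii")
        length = int(label[12:].decode("ascii"))
        start = pos + LABEL_SIZE
        payload = package[start:start + length]
        if len(payload) != length:
            raise ValueError("truncated payload")
        objects.append((object_id, payload))
        pos = start + length
    return objects


# Usage: one attribute object followed by one data object in a single file image.
package = pack([
    ("ATTR_OBJ_001", b"SIZE_IN_BYTES = 3;\r\n"),  # placeholder identifiers
    ("DATA_OBJ_001", b"\x00\x01\x02"),
])
```

Because every object carries its own length, a reader can skip from label to label without understanding the payloads, which is what lets binary and ASCII objects share one file.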

NSSDC Attribute Object
- Object identification and version
- Archival Storage Id (unique)
- Collection Id
- Checksum over rest of attribute object
- Attributes for original data stream: date/time created, operating system, size in bytes, record format, binary/ASCII flag, file name, checksum, etc.
- Attributes for canonical form of data stream: date/time created, operating system, size in bytes, record format, binary/ASCII flag, file name, checksum, processing report, format identifier (ADID), etc.
- Order of applied encodings (e.g., tar, gzip)
- Start date/time of data observations
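
To make the "checksum over rest of attribute object" item concrete, here is a minimal sketch that renders attributes as simple `keyword = value;` statements in the general style of PVL and prefixes a checksum computed over everything after the checksum line. The keyword names, the use of CRC-32, and the position of the checksum statement are all assumptions for illustration; the slides do not state which algorithm or keywords NSSDC used.

```python
import zlib


def render_pvl(attributes: dict[str, object]) -> str:
    """Render attributes as 'KEYWORD = value;' lines (simplified PVL-style text)."""
    lines = []
    for key, value in attributes.items():
        if isinstance(value, str):
            if '"' in value:
                raise ValueError("double quotes are not supported in this sketch")
            value = f'"{value}"'
        lines.append(f"{key} = {value};")
    return "\r\n".join(lines) + "\r\n"


def crc32_of(text: str) -> int:
    """CRC-32 of the ASCII text (assumed algorithm, not from the slides)."""
    return zlib.crc32(text.encode("ascii")) & 0xFFFFFFFF


def build_attribute_object(attributes: dict[str, object]) -> str:
    """Prefix the body with a checksum computed over the rest of the object."""
    body = render_pvl(attributes)
    return f"AO_CHECKSUM = {crc32_of(body)};\r\n" + body


def verify_attribute_object(text: str) -> bool:
    """Recompute the checksum over everything after the first statement."""
    first, _, body = text.partition("\r\n")
    stated = int(first.removeprefix("AO_CHECKSUM = ").rstrip(";"))
    return stated == crc32_of(body)


# Usage with hypothetical keyword names and values.
attribute_object = build_attribute_object({
    "ARCHIVAL_STORAGE_ID": "EXAMPLE-0001",
    "ORIGINAL_OPERATING_SYSTEM": "VMS",
    "CANONICAL_SIZE_IN_BYTES": 1024,
})
```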

NSSDC Permanent Archive - New Direction
- Bundle data files (objects) with a data-file-descriptive attribute file (object) and pointers to further documentation into an OAIS "Archival Information Package (AIP)"
  - Write to a Digital Linear Tape (DLT)-based jukebox in a Unix environment
  - Write data files and attribute files to RAID disk for FTP-based access by external customers
[Diagram: AIP Structure. Labels: CCSDS/ISO Label for Packaging; CCSDS/ISO Label for Attribute Object; Attribute Object (AO); CCSDS/ISO Label for Sensor Data Object; Sensor Data Object (SDO); Globally Unique Registry Identifiers; Expressed using CCSDS/ISO language]

“New Direction”

Migrating Data into AIPs
- Have created AIPs for data previously on NSSDC's newly retired 12" WORM data dissemination jukebox
  - VMS-based, so some attributes placed in attribute objects compensate for the loss of VMS/Files-11 support
  - Modified data files in cases of variable-length records, and introduced "CR/LF" for appropriate ASCII data
- Now creating a multi-data-file AIP and upgrading software to accommodate data migrating from legacy offline tapes
  - Will start ingest from tape imminently
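
The variable-length-record step can be illustrated with a short sketch. It assumes a simplified record layout in which each record is a 2-byte little-endian byte count followed by the data, padded to an even length; that layout is an assumption made for the example, and the slides do not describe the exact conversion NSSDC applied.

```python
import struct
from typing import Iterator


def iter_records(raw: bytes) -> Iterator[bytes]:
    """Yield record payloads from a count-prefixed byte stream (assumed layout)."""
    pos = 0
    while pos + 2 <= len(raw):
        (length,) = struct.unpack_from("<H", raw, pos)
        pos += 2
        record = raw[pos:pos + length]
        if len(record) != length:
            raise ValueError("truncated record")
        yield record
        pos += length + (length & 1)  # skip the pad byte after odd-length records


def to_crlf_text(raw: bytes) -> bytes:
    """Replace the record-count framing with explicit CR/LF terminators."""
    return b"".join(record + b"\r\n" for record in iter_records(raw))


# Usage: two records, "abc" (odd length, so one pad byte follows) and "de".
converted = to_crlf_text(b"\x03\x00abc\x00\x02\x00de")
```

After conversion the record boundaries live in the data itself, so the file no longer depends on the operating system to know where one record ends and the next begins.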

Facilitating Archiving via Data Supplier Support
- NSSDC has provided software to the IMAGE spacecraft project
  - Generates attribute objects and bundles these with data files into Archival Information Packages (AIPs)
  - IMAGE script transmits these to NSSDC
- Looking for other opportunities to support NASA spacecraft projects equivalently
  - Cost-effective data ingest
[Diagram labels: IMAGE Science Operations Centre; data files; configuration information; NSSDC Package Generator; IMAGE script; AIPs; ftp; National Space Science Data Center]

NSSDC Architecture Summary
- The system architecture is:
  - Compliant with the OAIS functional model: separates the ingest, archival storage, data management, and access functions
  - Compliant with the OAIS information model: defines an Archival Information Package (AIP) for preservation in Archival Storage
- Data are being migrated into Archival Information Packages for long-term storage on DLTs
- New data arrive as AIPs (e.g., from the IMAGE project) or are put into AIPs during the Ingest process

Current Activities
- Developing a better integration of our metadata databases
  - Many have grown up over the years
  - Taking advantage of Java and web capabilities
- Developing an Archival Information Package type that allows multiple "canonical data files" in a single package file
  - Needed for the migration of legacy data on magnetic tape
  - Needed to put small files together for ease of management
- Planning a better overall integration of our architecture
  - E.g., tighter coupling between AIPs and other information bases

Backups

NSSDC AIP Schematic

NSSDC Archive - Logical Architecture

Archive Challenges
- Making the most cost-benefit-favorable judgments on modernization of low-access-potential older data sets
  - Convert vendor-specific binaries to IEEE binary? Via EAST? Convert to ASCII?
- Implementing an efficient production process for migrating data from ~10,000 tapes through AIP-creation software to the nearline DLT-based permanent archive
- Defining the post-DLT permanent archive environment
- Ensuring the existence of all material needed to make data correctly and independently usable
  - Couple such material to the data being supported
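
As one concrete instance of the "vendor-specific binaries to IEEE binary" question, the sketch below decodes a 4-byte VAX F_floating value into a Python float (an IEEE 754 double). VAX F_floating is chosen here only as an example of a vendor format, since the slides mention VAX VMS data; the slides do not say which formats or tools (beyond EAST) would actually be used.

```python
import struct


def vax_f_to_float(raw: bytes) -> float:
    """Decode one 4-byte VAX F_floating value as stored in memory.

    Layout: two little-endian 16-bit words. The first word holds the sign bit,
    an 8-bit excess-128 exponent, and the top 7 fraction bits; the second word
    holds the low 16 fraction bits. The value is 0.1f (binary) * 2**(e - 128).
    """
    if len(raw) != 4:
        raise ValueError("VAX F_floating values are 4 bytes")
    word1, word2 = struct.unpack("<HH", raw)
    negative = bool(word1 & 0x8000)
    exponent = (word1 >> 7) & 0xFF
    if exponent == 0:
        if negative:
            raise ValueError("reserved operand (sign set, exponent zero)")
        return 0.0
    fraction = ((word1 & 0x7F) << 16) | word2      # 23 explicit fraction bits
    mantissa = 0.5 + fraction / float(1 << 24)     # hidden leading bit is 0.5
    value = mantissa * 2.0 ** (exponent - 128)
    return -value if negative else value


def convert_array(raw: bytes) -> bytes:
    """Re-encode a packed array of VAX F values as big-endian IEEE 754 doubles."""
    values = [vax_f_to_float(raw[i:i + 4]) for i in range(0, len(raw), 4)]
    return struct.pack(f">{len(values)}d", *values)
```

Writing doubles rather than IEEE singles avoids range and rounding edge cases in this sketch, at the cost of doubling the file size, which is the kind of cost-benefit trade-off the slide alludes to.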

NSSDC Metadata Environment
- Information base (JEDS) about
  - All launched spacecraft
  - Instruments on space science spacecraft
  - NSSDC-held data sets therefrom
  - Underlies the "NSSDC Master Catalog" interface
- Information base (DIOnAS) about data files
  - Written to the new nearline permanent archive
  - Written to anonymous nssdcftp/spacecraft_data/
- Attribute objects with technical information about data files
- Information base (JIN) about data media

NSSDC Metadata Environment (concl’d)
- Information base (CAOIS) of CCSDS-registered data set-descriptive information (e.g., formats)
  - Assigns globally unique registry identifiers
  - Relevant to a growing fraction of NSSDC data plus other data
- Array of "data set catalogs" with detailed information on NSSDC-held legacy data sets
  - Presently on CDs as TIFF and PDF images
- Other special-purpose information bases and metadata collections
- NSSDC data set IDs are the primary mechanism currently linking these "metadata modules"
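
The role of the data set ID as the linking key can be sketched as a simple grouping step. The module names come from the slides, but the record fields, the `data_set_id` key name, and the example IDs are hypothetical; the real information bases are separate databases whose schemas are not described here.

```python
from collections import defaultdict


def link_modules(modules: dict[str, list[dict]]) -> dict[str, dict[str, list[dict]]]:
    """Group records from every metadata module under their data set ID."""
    linked: dict[str, dict[str, list[dict]]] = defaultdict(lambda: defaultdict(list))
    for module_name, records in modules.items():
        for record in records:
            linked[record["data_set_id"]][module_name].append(record)
    # Convert the nested defaultdicts to plain dicts for predictable lookups.
    return {ds_id: dict(by_module) for ds_id, by_module in linked.items()}


# Usage with placeholder records (all field names and values are invented).
modules = {
    "JEDS": [{"data_set_id": "DS-EXAMPLE-01", "title": "Example data set"}],
    "DIOnAS": [
        {"data_set_id": "DS-EXAMPLE-01", "file": "a.dat"},
        {"data_set_id": "DS-EXAMPLE-02", "file": "b.dat"},
    ],
    "JIN": [{"data_set_id": "DS-EXAMPLE-01", "medium": "DLT"}],
}
view = link_modules(modules)
# view["DS-EXAMPLE-01"] now holds the JEDS, DIOnAS, and JIN records together.
```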

NSSDC’s Metadata Challenges
- To ensure the flow to NSSDC of material needed for the correct and independent use of data, along with the flow of the data themselves
- To optimally integrate metadata modules to support:
  - Users' finding, retrieval, and use of data
  - NSSDC staffers' archive management activities
- To ensure that all relevant supporting material is visible to and readily retrievable by NSSDC's data-accessing customers

Software
- NSSDC has a growing amount of low-processing-level (lpl) data
  - Started archiving such data only in the past decade
- NSSDC has very little data set-specific READ/PROCESS software
  - This greatly limits usability of lpl data
- Lpl data are handled by systems/formats like SDDAS/IDFS and IMAGE_Archive/UDF
- Major need for software standards/approaches to accompany lpl data into archives
  - Ensure long-term usability of such data
- Archiving of relevant software source code is a minimal requirement