NARA Report: NARA Persistent Archives Prototype Bill Underwood GTRI, Atlanta CCSDS, MOIMS DAI / IPR WGs Toulouse, 2 Nov-5 Nov 2004
Project Members Reagan Moore, PI, SDSC SDSC - Richard Marciano, Wayne Schroeder Univ. of Maryland - Joseph Jaja SLAC - Jean Deken GTRI - Bill Underwood
Project Objectives Virtual Data Grid Services Ingestion Workflow Prototype XML Schema for SIP Data Description Languages
Virtual Data Grid Services Archival services provided by GTRI –File System Packaging with NARA Metadata in XML DTD Manifest –File type identification –File conversion –File viewers and readers –Information Extraction Services described in WSDL Register Services Discover and request archival services Demonstrate on SLAC science data
Ingestion Workflow Prototype Addressing issues similar to the Producer- Archive Interface –Provides data to NARA based on a prior agreement with Records Creator –Consists of metadata server and an ingestion client –Provides initial arrangement, context and metadata NARA –validates digital objects and metadata, –stores objects in a digital repository and –stores metadata in a catalog Demonstrate on SLAC science data.
XML Schema for SIP Modifications of MET to meet NARA records management requirement. Client generates and receive METs documents. Client contacts Metadata server using X.509 certificates. Metadata server stores METS items in a MySQL database. Metadata server manages certificates. NARA server verifies metadata integrity against schema and specification document.
Data Description Languages for Data Grids BinX M. Westhead and M. Bull. Representing Scientific Data Sets on the Grid, EPPC, University of Edinburgh, Jan DFDL M. Westhead. Data Format Description Language – Primer, Global Grid Forum Data Format Description Language Working Group