MOIMS Internet Packaging and Registries WG XML Formatted Data Units (XFDU) XML Packaging of Binary and Text Data Lou Reich NASA/CSC MOIMS Plenary May 10,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Fujitsu Laboratories of Europe © 2004 What is a (Grid) Resource? Dr. David Snelling Fujitsu Laboratories of Europe W3C TAG - Edinburgh September 20, 2005.
09-Dec Toulouse,FR AD-initial - 1 MOIMS AREA: IPR STATUS REPORT, END of Sprint 2006 Meeting Louis Reich IPR WG Chairman Joint IPR/DAI Workshop 20.
METS: An Introduction Structuring Digital Content.
19/05/2011 CSTS File transfer service discussions CSTS-File Transfer service discussions (2) CNES position.
Information Management NTU Web Services. Information Management NTU What Are Web Services? Semantically encapsulate discrete functionality Loosely coupled,
METS Dr. Heike Neuroth EMANI – Project Meeting February 14 th - 16 th, 2002 Springer-Verlag Heidelberg Göttingen State and University Library (SUB)
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
METS What is METS ? What is METS ? A schema that provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
An Overview of Selected ISO Standards Applicable to Digital Archives Science Archives in the 21st Century 25 April 2007 Donald Sawyer - NASA/GSFC/NSSDC.
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Chinese-European Workshop on Digital Preservation Beijing (China), July.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 CCSDS Information Architecture Working Group SEA Plenary Daniel J. Crichton, Chair NASA/JPL 12 September 2005.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Introduction to MDA (Model Driven Architecture) CYT.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Reference Model for an Open Archival Information System (OAIS) ESIP Summer Meeting John Garrett – ADNET Systems at NASA/GSFC ESIP Summer Meeting.
Current Applications of the OAIS Model David Giaretta.
XML Registries Source: Java TM API for XML Registries Specification.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Extracts from NASA-NARA Research Report 12 June 2006 Rome, Italy.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
C. Huc/CNES, D. Boucon/CNES-SILOGIC, D.M. Sawyer/NASA/GSFC, J.G. Garrett/NASA-Raytheon Producer-Archive Interface Methodology Abstract Standard PAIMAS.
Ajh January 2007 CCSDS “Books” Adrian J. Hooke CMC Meeting, Colorado Springs 26 January 2007.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
CCSDS Meeting CNES Report, 2/11/2004 CNES 1 CCSDS - MOIMS Area Data Archive ingestion WG Information Packaging and Registry WG CNES Report on activity.
1 CCSDS Information Architecture Working Group Daniel J. Crichton, Chair NASA/JPL 14 September 2005.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
IPR WG REPORT November Information Packaging and Registries WG REPORT November 2004.
CCSDS MOIMS Falls Meeting 2007 – Colorado Springs - June 2006 SAFE Status Progress status & f Stéphane Mbaye
Wyn Cudlip BNSC/QinetiQ Presentation to WGISS25 China, February 2008 CCSDS Liaison Consultative Committee on Space Data Systems.
CCSDS MOIMS Springs Meeting 2006 – Rome - June 2006 XFDU & SAFE - ESA return from experience ESA return from experience & f Stéphane Mbaye
Information Architecture WG: Report of the Spring 2004 Meeting May 13, 2004 Dan Crichton, NASA/JPL.
NASA/NSSDC Report to MOIMS DAI/IPR Plenary 16 January 2007 Colorado Springs, USA.
SPASE and the VxOs Jim Thieman Todd King Aaron Roberts.
Modul 4 Struktur Informasi Mata Kuliah Preservasi Informasi Digital.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Information Architecture WG: Report of the Spring 2006 Meeting June 16, 2006 Dan Crichton, Chair NASA/JPL.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Information Architecture WG: Report of the Fall 2005 Meeting September 16, 2005 Dan Crichton, Chair NASA/JPL.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Djc -1 Daniel J. Crichton NASA/JPL 9 May 2006 CCSDS Information Architecture Working Group.
Information Architecture BOF: Report of the Fall 2003 Meeting October 28, 2003 Dan Crichton, NASA/JPL.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
XML Tools (Chapter 4 of XML Book). What tools are needed for a complete XML application? n Fundamental components n Web infrasructure n XML development.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Data Archive Ingest WG Report to MOIMS Plenary May 14, 2004.
31 st October – 4 th November 2011 Fall 2011 Meeting Agenda Boulder, Colorado, USA SOIS Application Support Services WG Device Virtualisation & EDS Coordination.
1 Steve Hughes Daniel J. Crichton NASA/JPL January 16, 2007 CCSDS Information Architecture Working.
1 SAIC XMSF Update XMSF Workshop & MOVES Open House 4-5 August 2003 Katherine L. Morse, Ph.D., David L. Drake, Ryan.
1 XFDU Issues and Status Report Louis Reich IPR WG Chair Sergey Nikhinsen NASA/CSC CCSDS/MOIMS IPR/DAI Workshop 05-December-2005.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
National Aeronautics and Space Administration 1 CCSDS Information Architecture Working Group Daniel J. Crichton NASA/JPL 24 March 2005.
ISC321 Database Systems I Chapter 2: Overview of Database Languages and Architectures Fall 2015 Dr. Abdullah Almutairi.
International Planetary Data Alliance Registry Project Update September 16, 2011.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
NASA/NSSDC Report to MOIMS DAI/IPR Plenary
OAIS Producer (archive) Consumer Management
An Architecture for Complex Objects and their Relationships
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Presentation transcript:

MOIMS Internet Packaging and Registries WG XML Formatted Data Units (XFDU) XML Packaging of Binary and Text Data Lou Reich NASA/CSC MOIMS Plenary May 10, 2004

XML Packaging Standard Rationale Physical media  Electronic Transfer No standard language for metadata  XML Homogeneous Remote Procedure Call  CORBA, SOAP Little understanding of long-term preservation  OAIS RM Record formats  Self describing data formats New Requirements describe multiple encodings of a data object better describe the relationships among a set of data objects.

Functionality by Release Version 1 should include : Support for XML document and ZIP/JAR type files The capabilities of current SFDU packaging and CCSDS Control Authority Concepts Support for data descriptions, MIME types, self describing formats, and detached data descriptions Flexible Metadata Model (Supports producer view of Metadata types Support for the OAIS RM Information Model Concepts and Types Flexible linkage to Metadata The ability to encapsulate related files/resources into a single file/container The ability to reference both content and metadata resources contained in the same container or at a known URL The ability to allow/reverse multiple transformations on files Behaviors Web Service Interfaces Portable Code

Functionality by Release Version 2 Functionality Enabled Behavior –Automatic execution –Scripting (Output of one behavior as input to another behavior) Software Updates Process Definition Relationship Definition Support for SOAP with attachments with no mandated packaging of files into a single object

Environment View of XFDU

Logical View metadataObject metadataSec metadata objects dataObjSec dataObject ContentUnit ipMapSec data objects structure map behaviorSec xfdu behavior metadata Category Pointers REP DED,SYNTAX, OTHER PDI CONTEXT, PROVENANCE REFERENCE, FIXITY, OTHER DMD DESCRIPTION, OTHER OTHER category class ANY metadataObject

DIAGRAM of XFDU XML SCHEMA

Expressions of Interest Interest in participation in 2004 Prototype integration Planetary Data System – JPL National Space Science Data Center (NSSDC) Deep Space MS Packaging Prototype – JPL GSFC Library – GSFC ESA – Data Distribution System CNES – Archives (e.g., SIPAD) General Interest But No Current Commitment GMSEC NASA/GSFC Code 581 HEASARC and Virtual Observatory EOSDIS Metadata Clearinghouse (ECHO) International Virtual Observatories

Status of XML Formatted Data Unit Structure and Construction Rules Interoperability Profile developed at the RAL Workshop. The Workshop Noted that agreed resources must be committed. Working Group editor and Toolkit prototype lead funding discontinued 11/ /2004 No progress in IPR WG during that timeframe A New draft of the XFDU Proposed Recommendation should be approved for TSG Review this Workshop Only prototype and testing activities will be able to improve the current solution

Review of IPR Charter

Required Resources Lead agency: NASA or CNES editor. Staffing needed: WG lead (NASA 25%) WG deputy (NASA 15%) Recommendations Editors (CNES 30%, NASA 30%) WG Contributors 10% per WG member Testing Coordinator 20% Prototype developers: 50% (NASA 1, CNES 1, ESA 0.5, BNSC 0.x) starting ASAP. Integrators: 25% for 3 months, then 15% continuing, at least 1 per environment (NASA 3+, CNES 2+, ESA 2+)

Risks Resources, Resources,Resources Regain Momentum from Working Group shutdown We cannot progress with multi-agency testing efforts Programmatic Risk Management The Packaging Recommendation functionality has been split between two planned releases of the XFDU Packaging Recommendation to allow early prototyping of required capabilities. A wide variety of use cases and testing environments including but not limited to: NASA PDS NASA/EOSDIS Libraries NASA SLE implementations CNES SLE implementations CNES Archive Ingest SIP development ESA Data Distribution System ESA CAOS

Registries Packaging partners (PDS, GSFC Library, ESA-DDS, various SLE implementations etc) should give us a good feel for a number of repositories. NASA wants to make XML descriptions of all its data available from a single logical repository. Work in other areas suggests that the ebXML registry will be a good fit to all CCSDS repository requirements. An Open Source implementation is available which some say is sufficiently mature for operational use. NASA/CSC is installing the ebXML s/w and will report back on its experience with this. At the Fall 2004 meeting a joint meeting with the Information Architecture BOF/WG will be essential to avoid duplication of work.

Backup Slides

MOIMS CCSDS ORGANIZATIONAL VIEW

File system Manifest Package Interchange File External Packages Conceptual View of Information Package

Information Package Map Logical View of XFDU Package

XML SPY DIAGRAM of XFDU XML SCHEMA

XML Schema for Metadata Linkage

XML Schema for Information Object

Data/Metadata Linkages Requirements Data Objects that are contained in the manifest are to be encoded in base64 or XML Data Objects that are included by reference from the manifest are to exist as files in the XFDU package or as files with known URIs either in a repository or in a location accessible via URL Metadata objects that are contained in the manifest are to be encoded in base64 or XML Metadata objects that are included by reference from the manifest are to exist as files in the XFDU package or as files with known URIs either in a repository or in a location accessible via URL Information Objects can reference applicable Metadata objects by ID where the name of the referencing attribute is used to classify the Metadata and the schema enables identification of the source of the metadata Allow metadata objects to be treated as data objects to enable direct mapping to the OAIS representation net where each metadata object is an information object containing both data object and representation information.

XML Schema for Metadata Linkage

XML Schema for Digital Object

Development Approach Develop Draft Concept Paper and XML Schemas for internal review Use automated tool (JAXR) to develop JAVA Classes from XML schema Modify XML Schema based on internal review comments and issues based on JAVA class implementations Develop draft CCSDS White Book for Working Group Review Begin staged implementation of API layer and crude GUI of a packaging toolkit Toolkit should provide useful functionality at a very early stage for demonstration to interested parties Present to Working Group for review and prototype commitments Develop specialization of schema that all international prototyping efforts agree to support

Technical Drivers Use of XML based technologies Designed to be extensible to include new XML technologies as they emerge Linkage of data and software Direct mapping to OAIS Information Models Support both media and network exchange Support for multiple encoding/compression on individual objects or on entire package Mapping to current SFDU Packaging and Data Description Metadata where possible Maximal use of existing standards and tools from similar efforts

Packaging Mechanisms Single XML Document Single XML document Simplest case All Binary must be encoded (base 64 or hex) Can be parsed and validated with standard XML parsers and shipped via standard WWW protocols Impractical with large binary files

Multi-file Packaging Approaches Archive Approach Encapsulate entire directory structure and all contained files into a single “file archive”using a common available technique such as ZIP Other “archive formats” such as JAR, show how the inclusion of a well-known file can include related metadata Message Approach Combines SOAP (RPC for the web) and MIME types Uses multi-part MIME/related, as a packaging format mechanism for messages that transfer multiple files Allow use of appropriate compression/encoding techniques for contained files. Use of a common “manifest” or “table of contents” object makes these two approaches symmetric Design Decision:XFDU version 1 must support the ZIP and single document forms. The SOAP/MIME/DIME forms should be prototyped but the underlying protocols may not be stable in the version 1 timeframe.

High Level Entities XFDU Schema (1 of 2) Package Header (packHeader): Administrative metadata for the whole XFDU, such as version, operating system, hardware, author, etc, and metadata about transformations and behaviours that must be understood Metadata Section (MetadataSec): This section contain or references all of the metadata for all items in the XFDU package. Multiple metadata objects are allowed so that the metadata can be recorded for each separate item within the XFDU object. The metadata schema allows the package designer to define any metadata model by providing attributes for both metadata categories and a classification scheme for finer definition within categories. The model also provides predefined metadata categories and classes via enumerate attributes that follow the OAIS information model as follows: Descriptive information is intended for the use of Finding Aids such as Catalogs or Search Engines. The Representation Section and its subsections, syntax information (syntaxMd), static semantics (dedMd), and unclassified metadata (otherMd) The classification of the PDI Section - reference, context, provenance, and fixity

High Level Entities XFDU Schema (2 of 2) Information Package Map Section (ipMapSec) outlines a hierarchical structure for the original object being encoded, by a series of nested contentUnit elements. Content units contain pointers to the data objects and to the metadata associated with those objects. Data Object Section (dataObjectSec) contains a number of dataObjEntry elements. A Data Object Entry contains some file content and any data required to allow the information consumer to reverse any transformations that have been performed on the object and restore it to the byte stream intended for the original designated community and described by the Representation metadata in the Content Unit Behavior Section (behaviorSec) can be used to associate executable behaviors with content in the XFDU object. A behavior section has an interface definition element that represents an abstract definition of the set of behaviors represented by a particular behavior section. A behavior section also has a behavior mechanism that is a module of executable code that implements and runs the behaviors defined abstractly by the interface definition.

Interoperability profile The Profile will indicate that ALL content for both metadata and data files will be referred to using dataObjPtr Transfer mechanism for XFDU : we do not support processing before all the data has come down the wire – assume XFDU file is on local file system before it is opened via HTTP in SOAP with attachment – where the XFDU zip file is an attachment Identifier – uniqueness issues package instance identifier could perhaps use UUID registry for xml Schema could be simple FTP server, with front-end index file Unique name for manifest file MANIFEST/ccsdsxfdu.xml