Data Management Planning Session Kevin Gomes Michael Meisinger Arcot Rajasekar Michael Wan October 19, 2007
Data Network Federation and Preservation of data and metadata via data streams, repositories and catalogs Two Co-operating/Complimentary Systems SSDS iRODS
List of Services Online Data Repository –Data and Metadata Persistent Archive Service –Cataloging,Preservation & Curation Asset Validation Service –Integrity and Authenticity Aggregation Service –Classification, Categorization, Grouping Attribution Service –Associate Attributes, Semantic Ontology Metadata Search & Navigation Services –Query & Browse by context Dynamic Data Distribution Services –Publish, Subscribe and Query for dynamic data resources Data Access Services –Suite of External Interfaces – OPeNDAP, THREDDS, LAS,…
Release 1 Services for Data Repository, Archive, Dynamic Data Distribution iRODS Resource: Archives, Unix File Systems API: Ingest/Access, Register, Metadata Form: Hierarchical (POSIX) Access: C,Java,PHP Protocol: Native Bin/XML Metadata Extraction: Link DataProcessing/Extraction Micro- Services, Rules Replication: supported Metadata Catalog: RDB System: Owner,ACL,Chksum,Audit,… User defined: KVU-Triplets SSDS Resource: RDB (File System for Backup) API: Ingest/Access, Register, Metadata Form: URI Access: Java,REST, WS Protocol: HTTP Metadata Extraction: API/XML and Services Replication: not supported Metadata Catalog: RDB System Ownership,provenance) User-defined: No
Main Components File Ingest Stream Ingest MD Extraction and Storage MD API for Input and Output Data Access Search/Navigation
Data Ingest/Access SSDS DP DataStreams File HTTP Ingest RDB BkUp WAN Registration File RDB DATA META DATA API HTTP/REST JMS URI ESB
Data Ingest iRODS DP File Register APIAPI Put Posix Web File System RDB APIAPI Get Web OPeNDAP THREDDS LAS DataStreams Distributed/ Replicated ESB SSDS WS HDF
iRODS/SSDS Services
Data Ingest