Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.

Slides:



Advertisements
Similar presentations
Research Data Access and Preservation Summit Panel 2 - Promoting Re-Use of Scientific Collections Some responses to the questions posed... John Harrison.
Advertisements

Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
TIPR: Repository Exchange Package Use Cases and Best Practices Joseph Pawletko and Priscilla Caplan IS&T Archiving 2011.
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
Institutional Repositories It’s not Just the Technology New England Archivists Boston College March 11, 2006 Eliot Wilczek University Records Manager Tufts.
PeDALS Persistent Digital Archives & Library System GladysAnn Wells, Director and State Librarian Lisa Maxwell, Division Director, Records Management Division.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
DESIGNING A PUBLIC KEY INFRASTRUCTURE
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform.
Supporting Customized Archival Practices Using the Producer-Archive Workflow Network (PAWN) Mike Smorul, Mike McGann, Joseph JaJa.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
July NAGARA 1 Producer-Archive Workflow Network Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
PAWN Progress July 06, Overview of changes New flexible environment for setting up and managing interactions between producers and the archive Domains.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Introducing Symposia : “ The digital repository that thinks like a librarian”
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph JaJa, Mike Smorul, Mike McGann.
7/26/2007 Review 1 A brief overview of major PAWN enhancements.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information PI: Joseph JaJa Co-PIs: Allison Druin and Doug Oard Major.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Persistent Digital Archives and Library System (PeDALS) SC Department of Archives and History.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Persistent Digital Archives and Library System (PeDALS)
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Fedora and the Preservation of University Electronic Records Project NHPRC Electronic Records Research Grant Kevin L. Glick Manuscripts and Archives, Yale.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Implementing PREMIS in DigiTool Michael Kaplan ALA 2007 Update.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
NARA Report: NARA Persistent Archives Prototype Bill Underwood GTRI, Atlanta CCSDS, MOIMS DAI / IPR WGs Toulouse, 2 Nov-5 Nov 2004.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
PAWN: Producer-Archive Workflow Network
Joint Meeting of CSUL Committees,
Ingest and Dissemination with DAITSS
OGF PGI – EDGI Security Use Case and Requirements
Building A Repository for Digital Objects
Policy-Based Data Management integrated Rule Oriented Data System
Statewide Digitization and the FCLA Digital Archive
OGSA Data Architecture Scenarios
The Application Lifecycle
Implementing an Institutional Repository: Part II
A Tale of Two Archives: Notes from the Dark Side
Technical Issues in Sustainability
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
ArchivesSpace – Archivematica – DSpace Workflow Integration
Presentation transcript:

Brief Overview of Major Enhancements to PAWN

Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the archive. Use of web/grid technologies – platform independent Ease of integration with data grids or digital libraries. XML Representation of metadata and bitstream Self describing bitstream submissions Accountability of transfer and guarantee of data integrity

Ingestion Workflow (PAWN) 1. Negotiate Submission Agreement. 2. Workflow Initialization and Submission Information Packet (SIP) creation. 3. Transfer of SIPs to receiving servers. 4. Validation of SIP transfer 5. Organization of data into collections and transfer into the distributed archive.

Distributed Ingestion

Each Producer registers and arranges files locally prior to transport. Multiple distributed archival receiving stations. X.509 based authentication between sites. Independent Certificate Authorities at each Producer. Persistent archive is geographically distributed and managed by a data grid.

Producer Provides data to an Archive based on a prior agreement. Consists of a management/metadata server and an ingestion client. Provides initial arrangement, context, and metadata.

Enhancements to the Producer Data submissions are organized through a logical hierarchy negotiated between the archive and the producer. Clients no longer see entire hierarchy, but rather attachments points Better state tracking and oversight of submissions METS documents are no longer merged together, but rather kept separate to support larger submissions. Submission can be broken into multiple METS documents linked together through pointers. Producer signed submissions to ensure integrity.

Different administrator and client views Manager / Record Manager Administrator Views entire producer hiearchy Producer / Record Creator View restricted to allowable submission points

New Interactions Between Client and Receiving Servers Ability of client to reserve resources before starting to transfer data into the archive. Client creates a session with a receiving server and uploads metadata. Clients upload bitstreams, and receiving server validates checksums during transfer Client can resume or retransmit failed submissions

Client-Receiver Interaction

Archive - receiving Receives data from a Producer Validates bitstreams and metadata, and sends acknowledgement to Producer. Arranges into collections and specifies preservation policy. Publishes bitstreams into a digital archive.

New Features for Receiver Validation Services Designed a standard API and test suite for rapid development of validation services. New classes of services can be easily developed. Receiving Server Configurable endpoints into storage or metadata repositories Better handling of multiple producers

Scheduler Allocates the processing of data streams from multiple clients to a cluster of receiving servers. Clients are required to request a resource reservation. Receiving server will acknowledge/deny the reservation. Client will be informed about reservation/receiving server. Currently, receiving server has hooks for scheduler