May 23 2007 Archiving 2007 1 PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.

Slides:



Advertisements
Similar presentations
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
Advertisements

Business Development Suit Presented by Thomas Mathews.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
6/1/2015Ch.31 Defining Enterprise Architecture Bina Ramamurthy.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform.
PAWN V0.7 University of Maryland Institute for Advanced Computer Studies.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.
Supporting Customized Archival Practices Using the Producer-Archive Workflow Network (PAWN) Mike Smorul, Mike McGann, Joseph JaJa.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
July NAGARA 1 Producer-Archive Workflow Network Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
PAWN Progress July 06, Overview of changes New flexible environment for setting up and managing interactions between producers and the archive Domains.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information Principal Investigator: Joseph JaJa Lead Programmers: Mike.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph JaJa, Mike Smorul, Mike McGann.
7/26/2007 Review 1 A brief overview of major PAWN enhancements.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information PI: Joseph JaJa Co-PIs: Allison Druin and Doug Oard Major.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
® How to Build IBM Lotus Notes Components for Composite Applications 정유신 과장 2007 하반기 로터스 알토란.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Workflow Development Overview Architecture Requirements Types of workflows Stages of workflow.
Session 7 Windows Platform Eng. Dina Alkhoudari. Learning Objectives Active Directory review Managing users and groups Single Master Operations Delegation.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
Kuali Rice A basic overview…. Kuali Rice Mission First and foremost to provide a consistent development framework and common middleware layer for Kuali.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
Introduction Database integral part of our day to day life Collection of related database Database Management System : software managing and controlling.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
National Archives and Records Administration Status of the ERA Project RACO Chicago Meg Phillips August 24, 2010.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Cooperative Print Archiving by Discipline Developing an Infrastructure to Sustain Scholarly Resources in Agriculture Amy Wood Center for Research Libraries.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
De Rigueur - Adding Process to Your Business Analytics Environment Diane Hatcher, SAS Institute Inc, Cary, NC Falko Schulz, SAS Institute Australia., Brisbane,
PAWN: Producer-Archive Workflow Network
PLM, Document and Workflow Management
An Introduction to Tessella and The Safety Deposit Box Platform
Joseph JaJa, Mike Smorul, and Sangchul Song
AMGA Web Interface Salvatore Scifo INFN sez. Catania
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Health Ingenuity Exchange - HingX
AMGA Web Interface Vincenzo Milazzo
Open Archival Information System
Robin Dale RLG OAIS Functionality Robin Dale RLG
Remedy Integration Strategy Leverage the power of the industry’s leading service management solution via open APIs February 2018.
SDMX IT Tools SDMX Registry
Presentation transcript:

May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital Preservation Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University of Maryland, College Park Sponsored by National Archives and Records Administration, Library of Congress and NSF

May Archiving Problems Facing Ingestion Ensure integrity of data ingestion Each producer-archive interaction is unique Final destination for items in an archive is unique. Differing roles between producer and archive Hostile producers

May Archiving What is PAWN? Software that provides an ingestion framework Distributed and secure ingestion of digital objects into an archive. Handles the process –From package assembly –To archival storage Simple, customizable interface for end-users Flexible interface for archive publication

May Archiving Package Workflow 1.Create Producer-Archive Agreement 2.Client package template. 3.Create package based on template 4.Once approved, packages can be archived 5.Rejected packages can be held until rectified or deleted for resubmission.

May Archiving Expanding a Simple Workflow Support for multiple workflows. –Grouped into logical domains Definable roles per workflow Pluggable components for assembly and archival publishing Distributed components –Web-service based components

May Archiving Domain Organization Producers organized into domains, each domain contains a transfer agreement negotiated with the archive. Each domain contains a hierarchical organization of data grouped into record sets/templates (convenient groupings from the transfer agreement). Each domain contains its own users. An end-user operates within a set of record sets.

May Archiving Domain Example

May Archiving Custom Roles Actions in PAWN can be grouped together to create roles. –There are no common roles between archives, so allow custom ones. Default roles –Producer – Individual data supplier –Records Manager – Oversight of producers –Archive Manager – Final review and archive publishing –Global Administrator – Creates domain, sysadmin-like account Sample Actions –Setting permissions on record sets –Record Schedule creation and modification –Add or delete whole packages –Modify items in a package …

May Archiving Custom Package Building PAWN provides an API for developing custom package builders Custom package builders can be written in JAVA and implement a simple interface. Builders interact with a hierarchical structured package Manifest  Namespace  Type  Descriptive Name Data  Type  Descriptive Name  Bits Metadata … Manifest … Metadata  Type  Bits  Name

May Archiving PAWN Archive Gateway Pluggable component that provides an API for developing gateways into various services. Each gateway may have multiple instances, each configured differently PAWN handles managing and associating gateways with the appropriate data.

May Archiving PAWN Architecture Divided into producer and archive side components –Producer: data supplying and domain management –Archive: data storage, resource allocation and archival publishing Web-service based communication Trust relationship between producer and archive components –SAML and PKI

May Archiving Components

May Archiving Case Studies ICDL Book Builder SLAC Record Ingestion 10,000 CDroms Remote ingestion Unskilled labor Custom hardware Sample NARA ingestion Model government roles DOE Record Schedule Custom package builder Multiple data sources Model logical books

May Archiving PAWN Summary Platform for ingestion Customizable Components –Roles, ingest and publishing Distributed architecture

May Archiving More information Web site: – Wiki link for technical details. Or “I’m feeling lucky” Google keywords: –ADAPT UMIACS