PAWN Progress July 06, 2006. Overview of changes New flexible environment for setting up and managing interactions between producers and the archive Domains.

Slides:



Advertisements
Similar presentations
OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
Advertisements

Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
Lesson 17: Configuring Security Policies
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform.
PAWN V0.7 University of Maryland Institute for Advanced Computer Studies.
Supporting Customized Archival Practices Using the Producer-Archive Workflow Network (PAWN) Mike Smorul, Mike McGann, Joseph JaJa.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
July NAGARA 1 Producer-Archive Workflow Network Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
New Pawn Process V0.7. Current Final step of workflow invokes an ‘archive’ process Processes manually invoked by use on selected items Archive process.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph JaJa, Mike Smorul, Mike McGann.
7/26/2007 Review 1 A brief overview of major PAWN enhancements.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
Understanding Active Directory
3SDW Project Workflow and Inventory Management
Module 8: Implementing Administrative Templates and Audit Policy.
©2011 Quest Software, Inc. All rights reserved. Steve Walch, Senior Product Manager Blog: November, 2011 Partner Training Webcast.
Salesforce Change Management Best Practices
© 2010 VMware Inc. All rights reserved Access Control Module 8.
Access Control Module 8. Module You Are Here VMware vSphere 4.1: Install, Configure, Manage – Revision A vSphere Environment Introduction to VMware.
MCTS Guide to Configuring Microsoft Windows Server 2008 Active Directory Chapter 3: Introducing Active Directory.
WebFOCUS 8: Best Practices for Migration
MCTS Guide to Configuring Microsoft Windows Server 2008 Active Directory Chapter 6: Windows File and Print Services.
Module 13: Configuring Availability of Network Resources and Content.
Module 5: Managing Public Folders. Overview Managing Public Folder Data Managing Network Access to Public Folders Publishing an Outlook 2003 Form Discussion:
9/10/20151 Hyperion Enterprise 6.5 New Features & Functionality Robert Cybulski, CPA Finit Solutions.
Week 9 Objectives Securing Files and Folders Protecting Shared Files and Folders by Using Shadow Copies Configuring Network Printing.
Implementing File and Print Services
Eric Westfall – Indiana University Jeremy Hanson – Iowa State University Building Applications with the KNS.
Module 6: Designing Active Directory Security in Windows Server 2008.
RECALL THE MAIN COMPONENTS OF KIM Functional User Interfaces We just looked at these Reference Implementation We will talk about these later Service Interface.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
September 18, 2002 Windows 2000 Server Active Directory By Jerry Haggard.
Kuali Days / November 2007 Tempe, Arizona. Kuali Research Administration Proposal Budget Module Presented by: Rhonda Dwyer, The University of Arizona.
Module 6: Configuring User Environments Using Group Policy.
Building Applications with the KNS. The History of the KNS KFS spent a large amount of development time up front, using the best talent from each of the.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Overview of the SAS® Management Console
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
Module 7 Planning and Deploying Messaging Compliance.
Module 3: Configuring File Access and Printers on Windows 7 Clients
Master Data Management & Microsoft Master Data Services Presented By: Jeff Prom Data Architect MCTS - Business Intelligence (2008), Admin (2008), Developer.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
KIM: Kuali Abstraction Layer for Identities, Groups, Roles, and Permissions.
©2012 Microsoft Corporation. All rights reserved. Content based on SharePoint 15 Technical Preview and published July 2012.
Module 6: Administering Reporting Services. Overview Server Administration Performance and Reliability Monitoring Database Administration Security Administration.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Introduction to FFI: Why and how FFI was developed Introduction to FFI: Why and how FFI was developed 04/02/2013.
Maintaining and Updating Windows Server 2008 Lesson 8.
De Rigueur - Adding Process to Your Business Analytics Environment Diane Hatcher, SAS Institute Inc, Cary, NC Falko Schulz, SAS Institute Australia., Brisbane,
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Open Science Grid Configuring RSV OSG Resource & Service Validation Thomas Wang Grid Operations Center (OSG-GOC) Indiana University.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
PAWN: Producer-Archive Workflow Network
Managing User Desktops with Group Policy
Using E-Business Suite Attachments
Template library tool and Kestrel training
06 | SQL Server and the Cloud
Presentation transcript:

PAWN Progress July 06, 2006

Overview of changes New flexible environment for setting up and managing interactions between producers and the archive Domains to organize accounts, record organization, and packages Definable roles that can be flexibly combined and assigned to accounts Interfaces for designing package builders and archival resource gateways

Components Producer Managed Archive Managed Management Server Producer data suppliers Receiving Server Distributed Archive Schedule Request Authentication Package Information Ingestion Status Validation Services

Overall Organization Producers organized into domains, each domain containing a record schedule negotiated with the archive. Each domain contains a hierarchy of the types of data and record sets (convenient groupings from the record schedule). An end-user operates within a domain with record sets associated with the account.

Package Workflow 1. Client selects a record set to use as a package template. 2. A package is built locally and then transferred to a PAWN receiving server. 3. Optionally lock package to signal complete submission. 4. Review and possible reject items. 5. Transfer items from PAWN into final archive. 6. Remove package from PAWN.

Record Organization Previous version had one hierarchy with attachment points for items as leaf nodes. Did not allow for linking of related leaf nodes Hierarchy performed multiple roles, record organization and administrative organization. Current version based on Record Sets. Separate administrative structure and record structure. Record Sets are template packages.

Record Organization Each domain contains a record schedule Record schedule is a hierarchy containing authorities as endpoints Domains also contain an organizational hierarchy. Offices, projects, etc. Record Sets group of authorities from the record schedule attached to a point in the record hierarchy. Have access permissions Presented to producers as package templates

College of Sciences Domain  Office of the Dean  Chemistry  Mathematics  Physics  Computer Science o Business Office o Research Groups o Labs … Record Sets Record Schedule Administrative oStrategic and Performance Plans oAppointment and Promotion oPolicies and Committees oAlumni Affairs Financial oContracts and Grants oPayroll oDonations Publication Reports oTechnical Reports - Archiving Rules oPresentations oPosters Record Set  Name: Research Results  Note: Reports, presentations, and other published research results  Allowed Accounts Record Schedule Mapping  Presentations o Presentations  Technical Reports o Technical Reports Domains  Offices of the President and Vice-Presidents  College of Sciences  College of Engineering  College of Medicine  College of Arts and Humanities  College of Behavioral and Social Sciences  ….. Record Set Sample

Flexible Account Roles Previous version had fixed accounts, producer, manager and administrator. Current version allows actions in PAWN to be grouped into roles. Each account is assigned a role. Sample actions in PAWN Record Set/Schedule management Package creation/deletion/modification Account management

SAML Usage SAML Assertions are issued by managers Contain manager namespace, domain, username Contain list of allowed actions by the client Contain client’s public key (holder-of-key) Signed by manager SAML Assertions authenticate and authorize a client for archive-side services. Package Management Calls Archive Management Calls Administrative Metadata Calls ArchiveProducer Call Overlap

Sample SAML Assertion umiacs:toaster urn:oasis:names:tc:SAML:1.0:cm:holder-of-key MIIDxjCCAy+gAwIBAgIDEAACMA0GCSqGSIb3DQEB.... view create modify...

SAML Assertion (cont) r7C4oNmlf4h8cXi1dGU+MIGmGbM= Rstfd1HKTe68WLQrgAvmS5hDm7SVbXnEgMlotW3aiu.... MIIDyjCCAzOgAwIBAgIDEAABMA0GCSqGSIb3DQ....

Package Creation Packages are built using a Record Set as a template. Each category in a Record Set has a hierarchy of manifests attached. Manifests are an abstraction of underlying METS documents Custom package builders use manifest interface. Manifest  Namespace  Type  Descriptive Name Data  Type  Descriptive Name  Bits Metadata … Manifest … Metadata  Type  Bits  Name

Package Builders Default Builder Create files and folders Attach descriptive metadata to files or folders ICDL Builder Create ‘books’ with dublin core metadata Uses ICDL database as source for book list and metadata

Package Scheduling and Submission Scheduler decides which receiving server to store a package Condor classad system used Receiving server periodically publishes available resources Client request space. Client Receiver Scheduler 2. Evaluate classad 1. Space Requirements 3. Create Reservation 4. Allocated Server 5. Package Transfer Receiver Classads

Publishing into Archival Resources PAWN provides an interface for registering gateways into archival resources Gateways provide: Configuration gui Client gui Mover to transfer data from PAWN to archive PAWN provides: Configuration storage Access to all items in a package Access to contextual information about a package Infrastructure for storing and loading gateway drivers.

SRB Publishing PAWN Package SRB Gateway SRB 5. GUID or Path3. Package Items Archival Context 4. Package Items PAWN Client 2. SRB Path & item list PAWN Scheduler 1. SRB Configuration

Screenshots Client Interface Configuration Interface Resulting Log Entry

XFDU publishing Create XFDU compatible Information Packet. XFDU is similar to METS. Separate data definitions from structural information Similar file attributes (size, checksum, etc..) PAWN mapping InformationPackageMap contains ContentUnits to recreate the hierarchy of data in a PAWN package. DataObjects register individual files. XFDU manifest and data files combined to form an Information Package.

Demo