Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information Principal Investigator: Joseph JaJa Lead Programmers: Mike.


Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information
Principal Investigator: Joseph JaJa
Lead Programmers: Mike Smorul and Mike McGann
Graduate Students: Sang Song and Muluwork Geremew
Institute for Advanced Computer Studies, University of Maryland, College Park

Research Objectives
Development of tools and technologies for:
- Automated Distributed Ingestion: a flexible platform for Producer-Archive Interactions.
- Management of Preservation Processes: Monitoring, Integrity Auditing, and Preservation Services.
Evaluation and demonstration of tools on widely different collections.

Recent Major Accomplishments
- ACE (Auditing Control Environment): a policy-driven software environment to continually verify the integrity of an archive’s holdings.
- FOCUS: a scalable and secure registry for persistent information and services, applied to formats.
- Substantial enhancements to PAWN, the Producer-Archive Workflow Network software platform.

ACE – Overview
[Diagram: Client, ACE-AM, ACE-IMS, and 3rd Party Auditor; labels: obj, Hash(obj), Integrity Token]

Basic Ideas
- Integrity auditing service that can interoperate with any archiving architecture.
- Active (periodic) and user-triggered auditing.
- Time-stamped certificates that enable the verification of the integrity of the object throughout its lifetime, giving an auditable record of every transformation.
- Cost-effective, scalable, and based on rigorous techniques.
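The auditing idea above can be illustrated with a minimal sketch. It is not ACE's actual token format or protocol, which the slides do not describe. The SHA-256 hash, the `IntegrityToken` class, and the `issue_token` and `audit` functions are all assumptions made for illustration.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


def digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of an object's bytes (hash choice is an assumption)."""
    return hashlib.sha256(data).hexdigest()


@dataclass(frozen=True)
class IntegrityToken:
    """Hypothetical stand-in for an integrity token: a digest plus a timestamp."""
    object_id: str
    object_hash: str
    issued_at: str


def issue_token(object_id: str, data: bytes) -> IntegrityToken:
    """Register an object: record its digest and the time of registration."""
    issued_at = datetime.now(timezone.utc).isoformat()
    return IntegrityToken(object_id, digest(data), issued_at)


def audit(token: IntegrityToken, data: bytes) -> bool:
    """Recompute the digest and compare it with the one recorded in the token."""
    return digest(data) == token.object_hash


original = b"archived object contents"
token = issue_token("obj-001", original)
print(audit(token, original))                # audit of an intact copy
print(audit(token, b"corrupted contents"))   # audit of an altered copy
```

A periodic audit would call `audit` on every stored object on a schedule, and a user-triggered audit would call it on demand. In this sketch the token is only trustworthy if it is stored where the archive cannot silently alter it, which is the role the slides assign to a third-party auditor.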

FOCUS: FOrmat CUration Service
- Maintains persistent information on digital formats, services, and applications to access and manipulate them.
- Accessible either directly through LDAP or indirectly through SOAP (Web Services).
[Diagram: Agent, Web Service, Format Registry; connections labeled LDAP and SOAP]

Answer to Question #1
Biggest surprise: none, but a number of small surprises, such as:
- OAIS may be too general to provide a useful framework??
- Significant differences between the push and pull models for automated ingestion.
- Not at all clear which communities will be able to handle or afford wide-area distributed infrastructure.

Answer to Question #2
What have you done that you never thought you would? Confuse my graduate students!! Trying to explain:
- authenticity of an archive’s holdings (the object is what it claims to be!!);
- ensuring access to data after hundreds of years without having any idea about how the technology will evolve over the next ten or twenty years!

Answer to Question #3
How has the area of your project changed? A lot and not much:
- Hardware (processor and storage) is changing very quickly, as expected.
- Web technologies are more mature and more widely used, as expected.
- Grid technologies did not progress as much as had been expected!
- Very little work has been done on preservation services.

Conclusion
- Three major pieces of software: ACE, FOCUS, and PAWN.
- Interoperable with any archiving architecture.
- Scalable, secure, and platform independent.
- Continued development of preservation services.