Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College London

Funded by: © AHDS Digital repositories: Dealing with the digital deluge, Manchester, 5 June 2007
SHERPA DP Project
- Development partners: AHDS at King’s College London (lead), Nottingham, Glasgow, Edinburgh, White Rose Consortium, London Leap Consortium
- Objective: to create a shared, distributed preservation environment for the SHERPA project, framed around the OAIS Reference Model
- Notes: the participating repositories are all based on DSpace or EPrints and hold relatively simple data objects (eprints)

Distributed OAIS Model

Distributed Workflow

System Architecture

Key preservation actions at ingest
- Integrity/fixity checks
- File format identification
- Preservation metadata creation
- Implement preservation strategy
  – File format normalisation
- Others …
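As an illustration of the first action in the list, a fixity check can be sketched as recomputing a checksum and comparing it with the value recorded at deposit. This is a minimal sketch, not the project's implementation: the choice of SHA-256 and the function names `sha256_of` and `fixity_ok` are assumptions made for the example.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Chunked reads keep memory use flat for large deposits.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: Path, expected_hex: str) -> bool:
    """Compare the recomputed digest with the digest recorded at deposit."""
    return sha256_of(path) == expected_hex.lower()
```

A mismatch would normally be recorded as a failed event rather than silently discarded, so that the audit trail shows when the problem was detected.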

Requirements
- Scalability: need to handle increasingly large quantities of data
- Generation and management of an extensive set of preservation metadata
- Audit trail/provenance metadata: knowledge held in explicit, machine-processable form

More Requirements
- Distributed architecture
- Integration of specialised tools
- Follow standards to allow flexible integration of future tools
- Automate workflow where possible, but also allow human interaction

Approach
- Web services encapsulating preservation actions
- Web interface for points in the process where human input is required
- Linked by a workflow management tool
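The approach above (automated actions chained by a workflow tool, with pauses where a person must decide) can be sketched in a few lines. This is a toy illustration only, not jBPM or any real workflow engine: the `Step` and `Workflow` classes and the step names in the usage note are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

Context = Dict[str, object]


@dataclass
class Step:
    name: str
    # An automated step carries an action; None marks a human task.
    action: Optional[Callable[[Context], None]] = None


@dataclass
class Workflow:
    steps: List[Step]
    position: int = 0
    log: List[str] = field(default_factory=list)

    def run(self, context: Context) -> Optional[str]:
        """Run automated steps until a human task or the end is reached.

        Returns the name of the human task being waited on, or None when done.
        """
        while self.position < len(self.steps):
            step = self.steps[self.position]
            if step.action is None:
                return step.name
            step.action(context)
            self.log.append(step.name)
            self.position += 1
        return None

    def complete_human_task(self, context: Context, **inputs: object) -> Optional[str]:
        """Store the human's input in the context, then resume the workflow."""
        if self.position >= len(self.steps) or self.steps[self.position].action is not None:
            raise RuntimeError("workflow is not waiting on a human task")
        context.update(inputs)
        self.log.append(self.steps[self.position].name)
        self.position += 1
        return self.run(context)
```

For example, a workflow built from `Step("identify-format", identify)`, `Step("approve-migration")` and `Step("normalise", normalise)` would stop at `approve-migration` on the first `run` call and continue once `complete_human_task(context, approved=True)` is called.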

Workflow management
- Large number of tools available
  – Taverna
  – BPEL (ActiveBPEL)
  – jBPM
  – others …
- Settled on jBPM

jBPM
- Web services and UI functions are chained together to form a workflow or “Business Process”
- Open source, flexible, extensible workflow management system
- Bridges the gap between users and developers by giving them a common language
- Packaged as a J2EE application: can run on a J2EE application server such as JBoss, or in a servlet container such as Tomcat

Preservation Metadata
- Approach based on the PREMIS data dictionary
- PREMIS data model based on five categories: intellectual entities, objects, agents, events, rights
- Implementing a subset of this model …
- … with some format-specific extensions (e.g. MIX for images)
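To make the event category concrete, the sketch below builds a record for a single preservation event, keyed by semantic unit names from the PREMIS data dictionary. It is an assumption-laden illustration: the helper name `premis_event`, the use of a plain dictionary instead of PREMIS XML, and the identifier types (`"UUID"`, `"local"`) are choices made for this example, and the subset of units the project actually implemented is not stated in the slides.

```python
import uuid
from datetime import datetime, timezone
from typing import Dict, Optional


def premis_event(event_type: str, outcome: str, object_id: str,
                 when: Optional[datetime] = None) -> Dict[str, object]:
    """Build a PREMIS-style event record as a nested dictionary."""
    when = when or datetime.now(timezone.utc)
    return {
        "eventIdentifier": {
            "eventIdentifierType": "UUID",  # assumed identifier scheme
            "eventIdentifierValue": str(uuid.uuid4()),
        },
        "eventType": event_type,
        "eventDateTime": when.isoformat(),
        "eventOutcomeInformation": {"eventOutcome": outcome},
        # Links the event to the object it was performed on.
        "linkingObjectIdentifier": {
            "linkingObjectIdentifierType": "local",  # assumed identifier scheme
            "linkingObjectIdentifierValue": object_id,
        },
    }
```

Recording one such event per preservation action is one way to obtain the explicit, machine-processable audit trail listed among the requirements.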

Available Tools
- Stand-alone specialised tools that perform preservation-related tasks
- File format identification, e.g. DROID-PRONOM
  – Developed by The National Archives
  – Identification of file formats based on their file signatures
- Technical metadata generation, e.g. JHOVE
  – Extensible framework for format validation
  – Performs format-specific identification, validation, and characterization of a digital object
- File format migration tools (e.g. Xena, OpenOffice)
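Signature-based identification can be illustrated by matching the leading bytes of a file against a table of known byte sequences. This is a deliberately simplified sketch and not how DROID itself is implemented: DROID works from PRONOM signature files that also describe offsets and internal patterns, whereas the `SIGNATURES` table and `identify_format` function here are hypothetical and only check fixed prefixes.

```python
from typing import Optional

# A few well-known leading byte sequences ("magic numbers").
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"\xff\xd8\xff", "JPEG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
]


def identify_format(data: bytes) -> Optional[str]:
    """Return a format name if the data starts with a known signature."""
    for magic, name in SIGNATURES:
        if data.startswith(magic):
            return name
    return None  # unknown: a candidate for human review
```

Identification by content rather than by file extension matters for repositories because deposited files are often misnamed.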

Available tools and workflow
- Tools written in different languages
- Define generic interfaces for preservation actions
- Wrap the tools used as web services to promote:
  – Interoperability
  – Loose coupling, flexibility
  – Reusability
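The idea of a generic interface over heterogeneous tools can be sketched with an abstract base class and one thin adapter per tool. Everything named here is hypothetical: `tool_a_checksum` and `tool_b_measure` stand in for real tools with incompatible outputs, and in the architecture described above the adapters would be exposed as web services, which this in-process sketch leaves out.

```python
import hashlib
from abc import ABC, abstractmethod
from typing import Dict, List, Tuple


def tool_a_checksum(data: bytes) -> str:
    """Hypothetical tool A: returns 'ALGORITHM hexdigest' as one string."""
    return "SHA-256 " + hashlib.sha256(data).hexdigest()


def tool_b_measure(data: bytes) -> Tuple[int, int]:
    """Hypothetical tool B: returns (exit_code, size_in_bytes)."""
    return (0, len(data))


class PreservationAction(ABC):
    """Generic interface: every action maps bytes to a flat result record."""

    @abstractmethod
    def execute(self, data: bytes) -> Dict[str, str]:
        ...


class ChecksumAction(PreservationAction):
    def execute(self, data: bytes) -> Dict[str, str]:
        algorithm, value = tool_a_checksum(data).split(" ", 1)
        return {"action": "fixity", "algorithm": algorithm, "value": value}


class SizeAction(PreservationAction):
    def execute(self, data: bytes) -> Dict[str, str]:
        code, size = tool_b_measure(data)
        status = "ok" if code == 0 else "error"
        return {"action": "measure", "status": status, "size": str(size)}


def run_all(actions: List[PreservationAction], data: bytes) -> List[Dict[str, str]]:
    """The caller depends only on the interface, not on any tool."""
    return [action.execute(data) for action in actions]
```

Because the caller sees only `PreservationAction`, a tool can be replaced by writing a new adapter, without touching the workflow that invokes it.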

Workflow in jBPM

jBPM (jPDL)

Node and ActionHandler

Workflow Inputs & Outputs

Workflow Outputs
- Multiple METS packages (atomic model), each containing (some of):
  – Data
  – Descriptive metadata
  – PREMIS object metadata (technical)
  – PREMIS event metadata
  – PREMIS relationship metadata
  – Format-specific technical metadata (e.g. MIX)
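A skeleton of one such METS package might look like the sketch below, which builds the main METS sections and leaves the metadata payloads empty. This is an assumption about layout, not the project's actual profile: the IDs, the `"DC"` descriptive metadata type, the placement of PREMIS object metadata under `techMD` and event metadata under `digiprovMD`, and the `build_mets` helper are illustrative, and the output has not been validated against the METS schema.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def q(tag: str) -> str:
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(object_id: str, file_href: str) -> ET.Element:
    root = ET.Element(q("mets"), {"OBJID": object_id})

    # Descriptive metadata (payload omitted).
    dmd = ET.SubElement(root, q("dmdSec"), {"ID": "DMD1"})
    ET.SubElement(dmd, q("mdWrap"), {"MDTYPE": "DC"})

    # Administrative metadata: PREMIS object and event records (payloads omitted).
    amd = ET.SubElement(root, q("amdSec"), {"ID": "AMD1"})
    tech = ET.SubElement(amd, q("techMD"), {"ID": "TECH1"})
    ET.SubElement(tech, q("mdWrap"), {"MDTYPE": "PREMIS"})
    prov = ET.SubElement(amd, q("digiprovMD"), {"ID": "PROV1"})
    ET.SubElement(prov, q("mdWrap"), {"MDTYPE": "PREMIS"})

    # The data file, linked to its administrative metadata.
    file_sec = ET.SubElement(root, q("fileSec"))
    group = ET.SubElement(file_sec, q("fileGrp"))
    file_el = ET.SubElement(group, q("file"), {"ID": "FILE1", "ADMID": "TECH1 PROV1"})
    ET.SubElement(file_el, q("FLocat"),
                  {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": file_href})

    # Structural map tying the file to the descriptive metadata.
    struct_map = ET.SubElement(root, q("structMap"))
    div = ET.SubElement(struct_map, q("div"), {"DMDID": "DMD1"})
    ET.SubElement(div, q("fptr"), {"FILEID": "FILE1"})
    return root
```

Calling `ET.tostring(build_mets("eprint-1", "eprint.pdf"), encoding="unicode")` serialises the skeleton for inspection.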

Fedora object model

Issues with automation
- Preserving content: what do we actually want to preserve?
- Significant properties: soft concept, hard to quantify (INSPECT)
- Lack of suitable tools: expensive, outputs unreliable

Next Steps
- SHERPA DP 2 ( ), looking at:
  – Additional repository types
  – More complex object types
  – Different methods of data transfer
- Generalise system
- Add post-ingest preservation actions
- Add semantics for dynamic service discovery
- Resource discovery metadata generation

Questions
Contact: