Digital Preservation through EPrints-Archivematica Integration
  Tomasz Neugebauer, Concordia University
  Justin Simpson, Artefactual
  Justin Bradley, EPrints Services
  International Conference on Open Repositories 2018
  Bozeman, Montana State University
  Tuesday, June 5th, 2018
  24x7 Presentation
Presentation Outline
  Digital Preservation as a Next Generation Repository Feature
  Motivation/Use Case from Spectrum Repository
  Archivematica-EPrints Integration
    Rationale/Approach
    Overview
    EPrints Export Structure
    Example Export
  Results/Benefits
  Challenges
  5/28/2018
Preservation as Core Functionality
https://www.coar-repositories.org/files/NGR-Final-Formatted-Report-cc.pdf
Archivematica – EPrints Integration Proposal Rationale
  EPrints is designed as an open access system
    Continue to use EPrints for depositor/ingest and access normalization
    No need to develop software to process and ingest Archivematica DIPs
  Archivematica is designed for preservation
    Use Archivematica for creating preservation copies (AIPs)
    Include eprint identifiers to be able to find AIPs by eprint ID
Spectrum: Open Access Research Repository
  Deposit (2011)
  PDF deleted in 2012
  Access request in 2018
Archivematica – EPrints Integration
  1. eprint moves to "Repository" or is updated
  2. eprint is prepared for digital preservation export
  3. Digital preservation export is ingested
  4. Preservation workflow creates AIP with new METS file
  5. AIP is stored in Archival Storage
Digital Preservation Export Structure
  repositoryid-eprintid-lastmoddate folder (for example library-981471-2016-11-05_05-20-53)
    data
    revisions
      1.xml
      2.xml
    derivatives
      Folder # on disk (<pos>)
        filename
    metadata
      EP3.xml
      DublinCore.txt
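The folder naming convention on this slide can be sketched in a few lines of Python. This is only an illustration: the function names `export_folder_name` and `parse_export_folder_name` are hypothetical, and the timestamp layout is inferred from the single example on the slide rather than from the EPrints export code.

```python
import re
from datetime import datetime

# Timestamp layout inferred from the slide's example (2016-11-05_05-20-53).
STAMP_FORMAT = "%Y-%m-%d_%H-%M-%S"

# repositoryid-eprintid-lastmoddate, anchored at the end so that a
# repository id containing hyphens still parses.
FOLDER_PATTERN = re.compile(
    r"^(?P<repo>.+)-(?P<eprint>\d+)-"
    r"(?P<stamp>\d{4}-\d{2}-\d{2}_\d{2}-\d{2}-\d{2})$"
)


def export_folder_name(repository_id, eprint_id, lastmod):
    """Build '<repositoryid>-<eprintid>-<lastmoddate>' (hypothetical helper)."""
    return f"{repository_id}-{eprint_id}-{lastmod.strftime(STAMP_FORMAT)}"


def parse_export_folder_name(name):
    """Recover (repository id, eprint id, last-modified time) from a folder name."""
    match = FOLDER_PATTERN.match(name)
    if match is None:
        raise ValueError(f"not an export folder name: {name!r}")
    lastmod = datetime.strptime(match.group("stamp"), STAMP_FORMAT)
    return match.group("repo"), int(match.group("eprint")), lastmod


name = export_folder_name("library", 981471, datetime(2016, 11, 5, 5, 20, 53))
print(name)  # library-981471-2016-11-05_05-20-53
print(parse_export_folder_name(name)[1])  # 981471
```

Because the eprint ID is part of the folder name, a stored package can be matched back to its eprint by parsing the name alone.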
Digital Preservation Export Structure
  Diagram labels: Folder# (<pos>), fileid-xxxxx, files, documentid-xxxxx, Documents, eprintid-xxxx, data
  data
    eprintid-XXXXXX
      documents
        documentid-XXXXX
          fileid-XXXXX
            folder#-on-disk (“<pos>”)
              filename
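Reading the labels on this slide as nested folders in the order listed (an assumption, since the original diagram is not reproduced here), the location of one file inside an export could be assembled as below. The helper `export_file_path` and the document, file, and filename values are hypothetical; only the eprint ID 981471 comes from the presentation.

```python
from pathlib import PurePosixPath


def export_file_path(eprint_id, document_id, file_id, pos, filename):
    """Assemble data/eprintid-N/documents/documentid-N/fileid-N/<pos>/filename.

    The nesting order is an assumption based on the slide text.
    """
    return PurePosixPath(
        "data",
        f"eprintid-{eprint_id}",
        "documents",
        f"documentid-{document_id}",
        f"fileid-{file_id}",
        str(pos),  # folder number on disk, the document's <pos>
        filename,
    )


# Made-up document id, file id, and filename, for illustration only.
print(export_file_path(981471, 12345, 67890, 1, "thesis.pdf"))
# data/eprintid-981471/documents/documentid-12345/fileid-67890/1/thesis.pdf
```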
Example https://spectrum.library.concordia.ca/981471/
Example - Bagged
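For readers unfamiliar with bagging: a BagIt bag keeps its payload under data/ and lists a checksum for every payload file in a manifest. The sketch below writes a minimal bagit.txt and manifest-sha256.txt with the Python standard library. It is not the code used in the integration; the helper names, the choice of SHA-256, and the BagIt version string are assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path):
    """Hash a file in chunks so large deposits do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def manifest_lines(bag_root):
    """Return '<checksum>  <path relative to bag root>' for each payload file."""
    data_dir = bag_root / "data"
    files = sorted(p for p in data_dir.rglob("*") if p.is_file())
    return [f"{sha256_of(p)}  {p.relative_to(bag_root).as_posix()}" for p in files]


def write_bag_tags(bag_root):
    """Write the bag declaration and the payload manifest (hypothetical helper)."""
    (bag_root / "bagit.txt").write_text(
        "BagIt-Version: 0.97\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag_root / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines(bag_root)) + "\n", encoding="utf-8"
    )


# Demonstration on a throwaway directory with one made-up payload file.
with tempfile.TemporaryDirectory() as tmp:
    bag = Path(tmp) / "library-981471-2016-11-05_05-20-53"
    (bag / "data").mkdir(parents=True)
    (bag / "data" / "example.txt").write_bytes(b"hello")
    write_bag_tags(bag)
    print((bag / "manifest-sha256.txt").read_text(encoding="utf-8"))
```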
Transfer -> Ingest -> Archival Storage
Example -> AIP
  files
  migrated files
New AIP Metadata
  METS file from SIP
  Transfer manifest
  METS file
  Format identification
  Logs
  New manifest
METS.xml
METS.xml
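As a rough illustration of how a METS file ties original files to their migrated copies, the sketch below lists file locations grouped by the USE attribute of each file group. The embedded sample is made up, heavily trimmed, and not schema-valid METS; the USE values "original" and "preservation" and the file names are assumptions, not content taken from the AIP shown in the presentation.

```python
import xml.etree.ElementTree as ET

# Standard METS and XLink namespaces.
NS = {
    "mets": "http://www.loc.gov/METS/",
    "xlink": "http://www.w3.org/1999/xlink",
}

# Made-up, trimmed sample for illustration only.
SAMPLE_METS = """
<mets:mets xmlns:mets="http://www.loc.gov/METS/"
           xmlns:xlink="http://www.w3.org/1999/xlink">
  <mets:fileSec>
    <mets:fileGrp USE="original">
      <mets:file ID="file-1">
        <mets:FLocat LOCTYPE="OTHER" OTHERLOCTYPE="SYSTEM"
                     xlink:href="objects/figure.bmp"/>
      </mets:file>
    </mets:fileGrp>
    <mets:fileGrp USE="preservation">
      <mets:file ID="file-2">
        <mets:FLocat LOCTYPE="OTHER" OTHERLOCTYPE="SYSTEM"
                     xlink:href="objects/figure-preservation.tif"/>
      </mets:file>
    </mets:fileGrp>
  </mets:fileSec>
</mets:mets>
"""


def files_by_use(mets_xml):
    """Map each fileGrp USE value to the list of file locations it contains."""
    root = ET.fromstring(mets_xml)
    href_key = f"{{{NS['xlink']}}}href"  # namespaced attribute name
    grouped = {}
    for group in root.findall("mets:fileSec/mets:fileGrp", NS):
        use = group.get("USE", "")
        for flocat in group.findall("mets:file/mets:FLocat", NS):
            grouped.setdefault(use, []).append(flocat.get(href_key))
    return grouped


for use, paths in files_by_use(SAMPLE_METS).items():
    print(use, paths)
```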
Challenges
  Exporting derivatives
    Efficiency
    Source location on disk
  Metadata
    How to reference programmatically?
    Efficiency
      Each modification to metadata would cause a new AIP to be archived – inefficient (duplication)
Thank you / Questions
  Tomasz Neugebauer, Digital Projects & Systems Development Librarian, tomasz.neugebauer@concordia.ca
  Justin Bradley, EPrints Services
  Justin Simpson, Artefactual