Preservation as a Process of a Repository David Tarrant University of Southampton (UK) Preserv Repository Preservation and Interoperability.org.uk.

A Few Definitions
- Repository: A repository is a place where data is stored and maintained. (Wikipedia)
- IR: A repository captures and preserves the intellectual output of an institution. (The Case for Institutional Repositories, Raym Crow, SPARC 2002)
- IR: In my view, a university-based institutional repository is a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. It is most essentially an organizational commitment to the stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution. (Institutional Repositories: Essential Infrastructure for Scholarship in the Digital Age, Clifford A. Lynch)
- Service: A service is something provided directly to a user or 3rd-party agent. (David Tarrant, 2008)
- Process: A process is something which is invisible to the user or agent. (David Tarrant, 2008)

The Library
- A building to store books in.
- A means by which new books/publications can be acquired.
- An indexing system to give order.
- A means by which books can be found.
- A way to borrow and return books.
- A preservation process, e.g. rebinding books when they get damaged or worn.
- …

The Digital Library
In my view, a university-based institutional repository provides a set of services. The repository itself consists of a set of PROCESSES …
The Library
- A building to store books in.
- A means by which new books/publications can be acquired.
- An indexing system to give order.
- A means by which books can be found.
- A way to borrow and return books.
- A preservation process, e.g. rebinding books when they get damaged or worn.
- …
The Digital Repository
- A server to store resources on.
- A way to ingest new resources.
- A database of resources and metadata.
- A search engine and dissemination pages.
- Open access and downloads.
- A preservation process, e.g. checking that the file on the server can still be read/accessed.
- …

Processes
- Service: A service is something provided directly to a user or 3rd-party agent.
- Process: A process is something which is invisible to the user or agent.
- There are lots of processes.
- Processes happen in parallel.
- Processes happen in different orders.

Processes and OAIS
Many existing models contain this notion of processes and services, just not necessarily in a modern light. This does not, however, mean they are wrong or right; they are just guiding principles.

Processes in the DCC Model

The 3 Stage Model

Breaking up the Repository Manager
- The manager may provide the capability to perform one or more of the processes.
- Typically the manager is all that is used.

Repository Management Software
A set of pipes/workflows* which know how to translate inputs into outputs.
*Depending on your own definition, you could also add middleware.
Examples:
- OAI-ORE input, which contains files and metadata, is split by the management software into file/metadata storage and indexes.
- A request for a set of objects related to a single author is translated into a query to an index and a retrieval from storage.
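The two examples on this slide can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Repository` class, its dictionaries, and the method names are hypothetical and are not taken from EPrints or any other repository software.

```python
from dataclasses import dataclass, field


@dataclass
class Repository:
    """Toy manager that routes deposits and requests to a store and an index."""

    storage: dict = field(default_factory=dict)       # object id -> files
    metadata: dict = field(default_factory=dict)      # object id -> metadata
    author_index: dict = field(default_factory=dict)  # author -> object ids

    def ingest(self, object_id, files, metadata):
        """Split one incoming package into file storage, metadata and index entries."""
        self.storage[object_id] = files
        self.metadata[object_id] = metadata
        for author in metadata.get("authors", []):
            self.author_index.setdefault(author, []).append(object_id)

    def objects_by_author(self, author):
        """Translate a request into an index query followed by storage retrievals."""
        object_ids = self.author_index.get(author, [])
        return [(oid, self.metadata[oid], self.storage[oid]) for oid in object_ids]
```

The caller only ever sees `ingest` and `objects_by_author` (the services); the splitting and the index lookup stay invisible (the processes).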

Manager: Storage Controller
The EPrints Storage Controller works!
- Local Storage Plugin (legacy)
- Honeycomb Storage Plugin
- Amazon Cloudfront (coming soon)
Honeycomb stats:
- 4MB/s ingest*
- 200MB/s retrieve
*USB 2.0 max speed
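The idea of a controller sitting in front of interchangeable storage plugins might be sketched as follows. This is not the EPrints Storage Controller API; the class and method names are hypothetical, and the in-memory plugin simply stands in for a real backend such as local disk.

```python
from abc import ABC, abstractmethod


class StoragePlugin(ABC):
    """Hypothetical interface that every storage backend implements."""

    @abstractmethod
    def store(self, key: str, data: bytes) -> None: ...

    @abstractmethod
    def retrieve(self, key: str) -> bytes: ...


class InMemoryPlugin(StoragePlugin):
    """Stand-in backend that keeps everything in a dictionary."""

    def __init__(self):
        self._blobs = {}

    def store(self, key, data):
        self._blobs[key] = data

    def retrieve(self, key):
        return self._blobs[key]  # raises KeyError if the key is absent


class StorageController:
    """Writes to every configured plugin and reads from the first that has the key."""

    def __init__(self, plugins):
        if not plugins:
            raise ValueError("at least one storage plugin is required")
        self.plugins = list(plugins)

    def store(self, key, data):
        for plugin in self.plugins:
            plugin.store(key, data)

    def retrieve(self, key):
        for plugin in self.plugins:
            try:
                return plugin.retrieve(key)
            except KeyError:
                continue
        raise KeyError(key)
```

Writing to all plugins and reading from the first available copy is just one possible policy; a real controller could equally choose a single plugin per file.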

The Preservation Process
- Bit checking and checksum calculation.
- What is the type of file, and is the file valid?
- Is the file at risk of not having an editor/reader?
- Is there a better format available? Lossless or lossy?
- File migration to avert risks found by analysis.
- Movement of the file to new storage.
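The first step, bit checking by checksum, can be sketched with Python's standard `hashlib` module. The function names and the choice of SHA-256 are assumptions for illustration; the slides do not say which algorithm is used.

```python
import hashlib


def checksum(path, algorithm="sha256", chunk_size=65536):
    """Return the hex digest of a file, read in chunks to bound memory use."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, recorded_checksum, algorithm="sha256"):
    """Compare the current checksum with the one recorded at ingest."""
    return checksum(path, algorithm) == recorded_checksum
```

A repository would typically record the checksum at deposit time and re-run `is_intact` on a schedule, or whenever a file is moved to new storage.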

Preservation - Analysis
- What is the type of file, and is the file valid? DROID is a good classification tool for this.
- Is the file at risk of not having an editor/reader? Functionality is being developed in the PRONOM technical registry.
- Is there a better format available? Lossless or lossy? See the Planets registry of tools.
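To give a feel for what file classification involves, here is a much simplified stand-in that matches leading "magic" bytes. It is not DROID and does not use the PRONOM registry; the signature table is a small hand-picked assumption, and the sketch says nothing about whether a file is valid.

```python
# Leading byte sequences for a few common formats (illustrative subset only).
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"PK\x03\x04", "ZIP container"),  # also the wrapper used by PPTX and DOCX
]


def classify(data: bytes) -> str:
    """Return a coarse format label based on the first bytes of the content."""
    for magic, label in SIGNATURES:
        if data.startswith(magic):
            return label
    return "unknown"
```

A real classification tool distinguishes format versions and looks deeper into the file, which is why the slides recommend an existing tool over home-made checks.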

Preservation - Analysis
What is the type of file, and is the file valid? DROID is a good classification tool for this.
[Figure with key]

PRONOM-ROAR (Preserv 1) Preservation - Analysis

EPrints File Classification Preservation - Analysis

Risk Analysis
Is the file at risk of not having an editor/reader?
- Functionality is being developed in the PRONOM technical registry.
- Simple SOAP web service.
- Takes file format identification IDs, hands back a risk score.
- A breakdown of the risk score may also be available in future releases.
- A stub that you can download and run, providing this functionality with mock-up risk scores before the official release, is available at
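The "format ID in, risk score out" contract can be mimicked locally with a sketch like the one below. It does not call the PRONOM service or reproduce the downloadable stub: the format identifiers, the scores, the 1 to 5 scale, and the threshold are all made-up assumptions.

```python
# Made-up scores on an assumed scale of 1 (low risk) to 5 (high risk).
MOCK_RISK_SCORES = {
    "example/pdf": 1,
    "example/ppt": 4,
}
UNKNOWN_FORMAT_RISK = 5  # assumption: unidentified formats are treated as highest risk


def risk_score(format_id: str) -> int:
    """Hand back a mock risk score for a format identifier."""
    return MOCK_RISK_SCORES.get(format_id, UNKNOWN_FORMAT_RISK)


def files_at_risk(classified: dict, threshold: int = 3) -> list:
    """Return the names of files whose format scores at or above the threshold."""
    return sorted(
        name for name, format_id in classified.items()
        if risk_score(format_id) >= threshold
    )
```

Feeding the classification results into `files_at_risk` yields the list of candidates for migration.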

EPrints File Classification + Risk Analysis Risk Analysis

Migration Tools: Mock-up Transformation Interface
[Figure: table with columns Transformation, Tool, Preservation Level; example transformations PPT -> PPTX and PPT -> PDF]

Summary – 1/2
Processes, Services and Glue
- The Storage Controller provides an API you can glue to.
- Preservation can be enabled for any repository model by writing small bits of glue.
- Portable services are more powerful, faster and cheaper.
- Make use of existing and supported software where possible.

Summary – 2/2
EPrints will provide one of the first platforms for the development of preservation services where direct interaction takes place between the Repository Software and Preservation Services.

Many Thanks!
David Tarrant, Steve Hitchcock, Neil Jefferies, Ben O'Steen, Sally Rumsey, Adrian Brown

Appendix Slides

Other Options for DROID Positioning
This is not the recommended solution, as DROID is a 3rd-party service for your repository, whereas all other services are provided by your repository.

DROID Alongside Your Resources