Digital Preservation for Digital Repositories David Tarrant University of Southampton (UK) Preserv Repository Preservation and Interoperability.org.uk.

Presentation transcript:


Grassroots Preservation
Small Science > Big Science: the smaller parts add up to more than the bigger parts combined.
Grassroots preservation for institutional and small business outputs.
Until now EPrints has mainly focused on encouraging the acquisition of data.
How do we create our global collection?

EPrints: History of…
Proposed as a build-your-own repository solution.
Enables institutions and groups to participate in the OAI metadata sharing initiative.
First released April 2000 (to coincide with OAI-PMH).
Version 3.1 released at the recent Open Repositories Conference 2008.
Used by over 240 registered repositories (58130 Records) (49132 Records) (32169 Records).
Number of records captured from the Registry of Open Access Repositories (ROAR).

EPrints: Management
Open source (GNU license).
The EPrints development model is more centralised than DSpace / Fedora:
faster turnaround on development cycles; more focused; easier quality management; better for the support model.
EPrints Services: repository hosting, bespoke development & training; sustains the development team.

EPrints: Core Objectives
Lower the barrier for depositors while improving metadata quality and ultimate collection value.
Time-saving deposits: import data from other repositories and services; autocomplete-as-you-type for fast data entry; name authorities.
Enter once, reuse often: works with bibliography managers, desktop applications and new Web 2.0 mashups; RSS feeds and alerts keep you up to date; easily integrate reports, bibliographic listings, author CVs and RSS feeds into your corporate web presence; used for corporate reporting and national Research Assessment.
Simple platform for open source contributions: tightly managed, quality-controlled code framework; flexible plug-in architecture for developing extensions.
[Diagram: import and export formats around the EPrints object store (metadata + data, fully searchable and scriptable): XML, EndNote, PubMed, Spreadsheet, CrossRef, ACM Digital Library, BibTeX, OAI-ORE, ORE Resource Map, Google Maps, Simile Timeline, BibTeX, EndNote, PubMed, OAI-PMH XML.]

Digital Preservation "It is important to build the concept of preservation from the outset. In the digital era, the 'outset' for most new research and educational materials will be the institutional repository."

Digital Preservation
Long-term reliable storage: open storage.
Maintaining readability: migration / emulation.
Interoperability for multiple usage scenarios.

Long Term Storage
Reliable: self-checking and self-healing file system.
Resilient: must remain robust when a part fails.
Simple & expandable: must be made of parts that are easy to expand or upgrade far into the future.
Open: any software developed to enable all of the above must be open, as must any hardware specifications.
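
The "self-checking and self-healing" requirement can be illustrated with a small sketch. It is not how any particular file system does it; it only assumes that several replicas of an object are kept alongside a known SHA-256 digest, and the function names `sha256_hex` and `check_and_heal` are made up for this example.

```python
import hashlib
from typing import List


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def check_and_heal(replicas: List[bytes], expected_digest: str) -> List[bytes]:
    """Self-check every replica and overwrite damaged ones with a good copy.

    Hypothetical helper: a replica counts as intact when its digest matches
    the digest recorded at deposit time.
    """
    good = next((r for r in replicas if sha256_hex(r) == expected_digest), None)
    if good is None:
        # Nothing left to heal from: resilience has been exhausted.
        raise ValueError("no intact replica left; object cannot be healed")
    return [r if sha256_hex(r) == expected_digest else good for r in replicas]
```

A single flipped byte in one replica is detected by the digest comparison and repaired from any surviving intact copy; if every copy is damaged the function raises instead of guessing.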

November 07


EPrints: Architecture
EPrints is expanding the number of places in which plug-ins can be used.
[Diagram: proposed EPrints 3.2 architecture, showing import plug-ins, export plug-ins and Honeycomb storage.]

The Storage Controller
Each item can be stored using a different storage plug-in (and hence in a different place), depending on file or metadata properties and values.
For example, large binary files of scientific data (raw machine results) can be stored on a large, slower-access disk system and sent to a tape company for long-term storage, while processed results can be stored locally and on a Honeycomb server, where they are preserved.
Allows a repository to use a 3rd-party storage platform: direct deposit into a Honeycomb, etc.
A great enabler for preservation: let the repository control the deposit process. This ensures that the complete object is preserved and not just the harvested bits.
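
The routing idea on this slide can be sketched as a rule table. This is an illustration only, not the EPrints storage controller: `FileInfo`, `MemoryStore`, `StorageController`, the `kind` field and the 1 GB threshold are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class FileInfo:
    name: str
    size_bytes: int
    kind: str  # hypothetical metadata value, e.g. "raw" or "processed"


class MemoryStore:
    """Stand-in for a storage plug-in; keeps objects in a dictionary."""

    def __init__(self, label: str) -> None:
        self.label = label
        self.objects: Dict[str, bytes] = {}

    def store(self, name: str, data: bytes) -> None:
        self.objects[name] = data


Rule = Tuple[Callable[[FileInfo], bool], List[MemoryStore]]


class StorageController:
    """Picks storage plug-ins per item from file or metadata properties."""

    def __init__(self, default: MemoryStore) -> None:
        self.default = default
        self.rules: List[Rule] = []

    def add_rule(self, predicate: Callable[[FileInfo], bool],
                 targets: List[MemoryStore]) -> None:
        self.rules.append((predicate, targets))

    def targets_for(self, info: FileInfo) -> List[MemoryStore]:
        # First matching rule wins; otherwise fall back to the default store.
        for predicate, targets in self.rules:
            if predicate(info):
                return targets
        return [self.default]

    def store(self, info: FileInfo, data: bytes) -> List[str]:
        targets = self.targets_for(info)
        for target in targets:
            target.store(info.name, data)
        return [target.label for target in targets]


# Example wiring that mirrors the slide's scenario.
bulk_disk, tape = MemoryStore("bulk-disk"), MemoryStore("tape")
local, honeycomb = MemoryStore("local"), MemoryStore("honeycomb")
controller = StorageController(default=local)
controller.add_rule(lambda f: f.kind == "raw" and f.size_bytes > 10**9,
                    [bulk_disk, tape])
controller.add_rule(lambda f: f.kind == "processed", [local, honeycomb])
```

With this wiring, a 5 GB raw file goes to the bulk disk and tape stores, a processed result goes to local disk and the Honeycomb stand-in, and anything else falls through to local disk. Keeping the rules as plain predicates means the repository, not the storage layer, decides where each part of an object lands.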

Open Storage for Repositories: Honeycomb
Simple, open, managed storage.
Advanced features built in: ZFS; RAID 6; error and bit shift correction; metadata layer.
Simple API: store, retrieve, delete.
Simple to interface with repository software.
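
A store / retrieve / delete interface with a metadata layer is small enough to sketch in full. The class below is an in-memory stand-in written for this example; it does not reproduce the Honeycomb API, and the names `SimpleObjectStore`, `store`, `retrieve` and `delete` are assumptions.

```python
import hashlib
import uuid
from typing import Dict, Tuple


class SimpleObjectStore:
    """In-memory sketch of a three-call object store with a metadata layer."""

    def __init__(self) -> None:
        self._data: Dict[str, bytes] = {}
        self._meta: Dict[str, dict] = {}

    def store(self, data: bytes, metadata: dict) -> str:
        """Save the object and its metadata; return a new object identifier."""
        oid = uuid.uuid4().hex
        self._data[oid] = data
        # Record a digest next to the caller's metadata for later checking.
        self._meta[oid] = dict(metadata, sha256=hashlib.sha256(data).hexdigest())
        return oid

    def retrieve(self, oid: str) -> Tuple[bytes, dict]:
        """Return (data, metadata), refusing to hand back corrupted data."""
        data = self._data[oid]
        if hashlib.sha256(data).hexdigest() != self._meta[oid]["sha256"]:
            raise IOError("checksum mismatch for object " + oid)
        return data, dict(self._meta[oid])

    def delete(self, oid: str) -> None:
        """Remove the object and its metadata."""
        del self._data[oid]
        del self._meta[oid]
```

A repository plug-in written against three calls like these needs no knowledge of the disks, RAID level or file system underneath, which is what makes such storage simple to interface with.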

Preserv Project Structure (May 2008)
[Diagram, layers as labelled:]
Repository software: EPrints, Fedora, DSpace.
Storage controller.
Physical storage: local disk, remote server, cloud service, Honeycomb.
Services: registry services (TNA API - PRONOM) covering file format identification, significant properties and migration tools (performance metrics); scheduler (Oxford); services & invocation API.
Interoperability: OAI-ORE specification & mapping; application program interface (API) + XML; relation exclusivity (1 to 1, 1 to many); content policy.


Characterisation & Migration Services
PRONOM: the online registry of technical information. PRONOM is a resource for anyone requiring impartial and definitive information about the file formats, software products and other technical components required to support long-term access to electronic records and other digital objects of cultural, historical or business value.
Free PRONOM tools and services support digital preservation, including DROID, the automatic file format identification tool, together with links to relevant external tools and services.
Under investigation: significant properties registry, migration tools registry, risk analysis and feedback.
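
Automatic file format identification generally works by matching byte signatures at known positions in a file. The sketch below shows that idea with a handful of well-known leading signatures; it is not DROID, it does not query PRONOM, and the `SIGNATURES` table and `identify` function are hypothetical names for this example.

```python
from typing import List, Tuple

# A few widely documented leading byte signatures ("magic numbers").
SIGNATURES: List[Tuple[bytes, str]] = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"PK\x03\x04", "ZIP container"),
]


def identify(data: bytes) -> str:
    """Return a format name when the data starts with a known signature."""
    for magic, name in SIGNATURES:
        if data.startswith(magic):
            return name
    return "unknown"
```

A registry-backed tool goes much further (format versions, signatures at offsets or at the end of a file, container inspection), but the lookup step has the same shape: signature in, format identifier out.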

Interoperability in Action
OAI-ORE: EPrints & Fedora. Which is which?

Digital Preservation
EPrints will provide one of the first platforms for the development of preservation services where direct interaction takes place between the repository software and preservation services.

One More Thing… Smart Storage
[Diagram: physical storage (Honeycomb, …) connected directly to the services layer (TNA API - PRONOM, file format identification, significant properties, migration tools (performance metrics), scheduler (Oxford), services & invocation API) and to interoperability (OAI-ORE specification & mapping).]
Storage which has the capability to perform actions directly upon the objects stored within it.
Autonomous classification and migration of objects.
No reliance on the repository software for processor time, yet the same results.
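
As a rough sketch of "smart storage", the store below classifies objects on ingest and migrates at-risk formats during its own maintenance pass, with no call back into the repository software. Everything here is assumed for illustration: the `SmartStore` class, the two toy format labels, and the Latin-1 to UTF-8 conversion standing in for a real migration tool.

```python
from typing import Callable, Dict, List, Tuple

Migration = Tuple[str, Callable[[bytes], bytes]]


def classify(data: bytes) -> str:
    """Toy classifier: bytes that decode as UTF-8 are 'utf8-text'."""
    try:
        data.decode("utf-8")
        return "utf8-text"
    except UnicodeDecodeError:
        return "legacy-text"


class SmartStore:
    """Storage that classifies and migrates its own objects."""

    def __init__(self, classifier: Callable[[bytes], str],
                 migrations: Dict[str, Migration]) -> None:
        self.classifier = classifier
        self.migrations = migrations  # at-risk format -> (target, converter)
        self.objects: Dict[str, Tuple[str, bytes]] = {}

    def store(self, name: str, data: bytes) -> None:
        # Classification happens inside the store, at ingest time.
        self.objects[name] = (self.classifier(data), data)

    def run_maintenance(self) -> List[str]:
        """Migrate at-risk objects, keeping the originals alongside."""
        migrated = []
        for name, (fmt, data) in list(self.objects.items()):
            if fmt in self.migrations:
                target, convert = self.migrations[fmt]
                self.objects[name + "." + target] = (target, convert(data))
                migrated.append(name)
        return migrated


store = SmartStore(
    classify,
    {"legacy-text": ("utf8-text", lambda b: b.decode("latin-1").encode("utf-8"))},
)
```

The original object is kept and the migrated copy is stored next to it, so a later change of migration tool can start again from the deposited bytes.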

Many Thanks!
Christopher Gutteridge, Tim Brody, Steve Hitchcock, Neil Jefferies, Ben O'Steen, Adrian Brown

Questions…?
David Tarrant, University of Southampton (UK)