MODELLING THE DIGITAL PRESERVATION COSTS Paul Wheatley Digital Preservation Manager British Library.

Slides:



Advertisements
Similar presentations
Max Kaiser: PLANETS Testbed
Advertisements

1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
Using the LIFE Costing Model Case studies from DK Anders Bo Nielsen, The Danish National Archives Ulla Bøgvad Kejser, The Royal Library, Denmark.
LIFE Project Lifecycle Information for E-literature Richard Davies LIFE Project Manager The British Library CARL Visit to the BL 27 November 2007.
LIFE 2 LIFE2 Conference The Life Model Paul Wheatley Digital Preservation Manager The British Library.
A centre of expertise in digital information management Developing a Quality Culture For Digital Library Programmes Author & Presenter Brian Kelly UKOLN.
Outcomes of The Living Murray Icon Sites Application Project Stuart Little Project Officer, The Living Murray Environmental Monitoring eWater CRC Participants.
LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library IFLA conference 27/02/10.
A centre of expertise in digital information management A QA Framework To Support Your Library Web Site Review Brian Kelly UKOLN University of Bath Bath.
How the University Library can help you with your term paper
Collaboration to Clarify the Costs of Curation The 4C Project – A Collaboration to Clarify the Costs of Curation APARSEN Webinar: 13 June 2013 Neil Grindley.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
UCL LIBRARY SERVICES Pulling Together the Threads Next Steps for Repositories Dr Paul Ayris Director of UCL Library Services and UCL Copyright Officer.
1 Phases & Impact on other Projects Definition and Scope –Relationship between Appraisal Policy/ Procedure, Technology and Management Overview, Components.
University of Leeds Department of Chemistry The New MCM Website Stephen Pascoe, Louise Whitehouse and Andrew Rickard.
Information Retrieval in Practice
Costs of Digital Archiving the case of DANS Anna Palaiologk | Heiko Tjalsma | Laurents Sesink |
"Keeping alert: issues to know today for long-term digital preservation with repositories" Neil Beagrie Fedora Users Group Open Repositories Southampton.
Overview of Search Engines
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
WMO UNEP INTERGOVERNMENTAL PANEL ON CLIMATE CHANGE NATIONAL GREENHOUSE GAS INVENTORIES PROGRAMME WMO UNEP IPCC Good Practice Guidance Simon Eggleston Technical.
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
Contract Lifecycle Management Improve responsiveness, efficiencies, and oversight & reduce risks and costs INSTRUCTIONS: 1. Apply your company template.
Introduction to BIM BIM Curriculum 01.
Metadata: Integral Part of Statistics Canada Quality Framework International Conference on Agriculture Statistics October 22-24, 2007 Marcelle Dion Director.
OSSE School Improvement Data Workshop Workshop #4 June 30, 2015 Office of the State Superintendent of Education.
LIBER Digitisation Conference, Copenhagen The cost of digitisation and preservation: The LIFE Project October 2007 Richard Davies LIFE 2 Project.
LIFE 3 LIFE3: Predicting Long Term Preservation Costs Paul Wheatley Digital Preservation Manager The British Library.
LIFE 3 LIFE 3 : Predicting Long Term Preservation Costs Brian Hole LIFE 3 Project Manager The British Library KeepIt training course 05/02/10.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
‘One Sky for Europe’ EUROCONTROL © 2002 European Organisation for the Safety of Air Navigation (EUROCONTROL) Page 1 VALIDATION DATA REPOSITORY Overview.
Franklin Consulting Programme X The Innovation Base The e-Framework: What do they mean for programme management? Tom Franklin Franklin Consulting Richard.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
VIRTUAL CLASSROOM TOUR Documents Web Links Innovative Teachers Date Title Creator/s Homepage Objective/s Learning Together in Dundee Initiative To raise.
What is Oracle Hyperion Planning  Centralized, web- based Budgeting and Planning application  Combines Operational and Financial measures to improve.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Outcome Based Evaluation for Digital Library Projects and Services
A generic tool to assess impact of changing edit rules in a business survey – SNOWDON-X Pedro Luis do Nascimento Silva Robert Bucknall Ping Zong Alaa Al-Hamad.
Supporting People Programme Review and Contract Monitoring.
Public Records Act 2005 Audit Programme Update presented to Government Recordkeeping Forum, Auckland John Roberts, Acting Group Manager, Government Recordkeeping.
Rev. 0 CONFIDENTIAL Mod.19 02/00 Rev.2 Mobile Terminals S.p.A. Trieste Author: M.Fragiacomo, D.Protti, M.Torelli 31 Project Idea Feasibility.
IV-3.1 JCOMMOPS SOT Technical Coordinator. 2 JCOMMOPS structure Programmes currently supported –Ship Observations Team (30% Mathieu Belbeoch) –Argo Profiling.
National Commission for Academic Accreditation & Assessment Developmental Reviews at King Saud University and King Faisal University.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
Assessing the Frequency of Empirical Evaluation in Software Modeling Research Workshop on Experiences and Empirical Studies in Software Modelling (EESSMod)
Sergey Parinov, euroCRIS Board meeting, Antwerp, February 2010 BP/DRIS TG progress report.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
Managing the Impacts of Change on Archiving Research Data A Presentation for “International Workshop on Strategies for Preservation of and Open Access.
The Government Recordkeeping Survey 2008 Natalie Dewson, Senior Advisor, Government Recordkeeping Programme, Archives New Zealand.
Process Quality in ONS Rachel Skentelbery, Rachael Viles & Sarah Green
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
MSE Portfolio Presentation 1 Doug Smith November 13, 2008
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
Presentation to the Ad-hoc Joint Sub-Committee on Parliamentary Oversight and Accountability Wednesday 20 March 2002 PUBLIC SERVICE MONITORING AND EVALUATION.
Simon Compton Methodology Directorate Office for National Statistics
EXPLORER project Elizabeth Lunt Project Manager De Montfort University.
Fourth UNICA Scholarly Communication Seminar, Prague The LIFE Project Costing Digital Preservation May 2008 Richard Davies LIFE 2 Project Manager,
Archiving CAD in Archaeology: Ingest to Dissemination (or The ADS experience to date) Kieron Niven Archaeology Data Service, University of York, UK.
GCE Software Systems Development A2 Agreement Trial Implementing Solutions October 2015.
Applying preservation metadata to repositories The British Library, 21 January 2008 Led by Steve Hitchcock With Bill Hubbard, Gareth Johnson.
Information Retrieval in Practice
DAITSS and the Florida Digital Archive
Digitisation in academic libraries: Experience from Makerere University Library, Kampala Uganda By Patrick Sekikome Presented at the CERN-UNESCO School.
Using the LIFE Costing Model Case studies from DK
COMP390/3/4/5 && COMP593 Final Year Projects Demonstration & Dissertation Irina Biktasheva
Presentation transcript:

MODELLING THE DIGITAL PRESERVATION COSTS Paul Wheatley Digital Preservation Manager British Library

2 2 Summary Overview of the model: Aims Development process Model Results Evaluation Conclusions

3 3 Scope Acquisition Ingest Metadata Storage Access Preservation

4 4 Background and aims Previous work (see Final Report): National Archief, Digital Bewaring – full costing/audit approach Oltmans, Kol – lifecycle and strategies Key aims: Make the first major step in defining and estimating the lifecycle cost of digital preservation activities. Propose a model for comment by the wider preservation community Enable the LIFE Case Studies to be compared and contrasted by providing some cost estimates for “P” in the Lifecycle Model. Attempt to identify the scale of preservation costs. Are they dramatically high as suggested previously by many in the preservation community or are they more achievable as suggested recently (see Rusbridge, C, “Excuse Me... Some Digital Preservation Fallacies?”)?

5 5 Development process Key cost factors, experimentation, iterative development and refinement Based on evidence or indications of trends where possible Editable inputs where key estimation or assumptions made Cost component review Application of draft model, refinement of inputs Team review, refinement of model weaknesses

6 6 The Generic LIFE Preservation Model Preservation = t * TEW + (t / ULE + PON) * (CRS + UME + PPA + QAA) Expansion of calculated components: ULE – Unaided Life Expectancy of a Format = BLE + 0.1*t CRS – Cost of new rendering solution = (1 - PTA) * TDC * FCX + PTA * COA PPA – Performing preservation action = PON * (SCM + n * HVM) QAA – Quality Assurance = n * BCT * FCX PTA – Proportion of Tool Availability = STA(1-t/20)+ETA(t/20) Expansion of scaling components: PON – Proportion of normalisation = 0.4 FCX - Format complexity (e.g. JPEG = 0.2, WMF = 0.4, PDF = 0.6, Word = 0.8) Expansion of cost component inputs: HVM – High volume migration cost per object = £0.05 BCT – Base cost of testing a preservation action per object = £0.17 UME – Update Metadata = 2 metadata officer £30k annual salary = £1250 TDC – Tool development cost = 24 programmer £30k annual salary - £60000 COA – Cost of available tool = £1500 TEW - Technology Watch = 1 metadata officer £30k annual salary = £625 BLE - Base life expectancy = 8 (years) STA – Starting tool availability = 0.5 ETA – Ending tool availability = 0.9 SCM – Setup cost of migration = £340

7 7 The Generic LIFE Preservation Model : key elements explained Preservation = t * TEW + (t / ULE + PON) * (CRS + UME + PPA + QAA) Frequency of action Tech Watch Preservation action Preservation cost of n objects of a particular format for the period 0 to t. Preservation = + * Eg objects of the GIF format for a period of 10 years. Monitoring formats and software for obsolescence Updating and managing metadata (Representation Information). The number of preservation actions within the time period calculated Q/A Update metadata Perform preservation action Cost of Preservation tool

8 8 Series of small technology watch events and spikes of preservation activity at increasing intervals The occurrence of costs (1 st detailed sample of the model) Preservation action Preservation = + * Tech Watch Frequency of action Example : FCLA Action Plans Base life expectancy = 8 years Increases by a year every decade

9 9 Q/A Update metadata Perform preservation action Cost of Preservation tool Complexity of file formats (2 nd detailed sample of the model) Size Complexity Proprietary Open Standardised Frequency of action Tech Watch Preservation action Preservation = + * = CategoryComplexityExamples Simple0.1ASCII, Unicode Bitmap0.2JPEG, GIF Mark-up0.3XML, HTML Vector0.4EMF, Draw Multimedia0.6MPEG3, WAV Document0.8Word, PDF Complex1Oracle database dump Format Complexity

10 Preservation tool cost (3 rd detailed sample of the model) Cost of Preservation Tool (CRS) Frequency of action Tech Watch Preservation action Preservation = + * Q/A Update metadata Perform preservation action = Proportion of tool Availability (PTA) = Cost of developing a new tool Cost of acquiring an existing tool + PTA (1- ) PTA Tool Development Cost (TDC) = Estimated as 24 programmer 30k annual salary (£60000) Format Complexity Cost of Available tool = Estimated as £1500 (1-t/20) + (t/20) STA ETA STA = 0.9 = 0.5 Average proportion across the time period Preservation = t * TEW + (t / ULE + PON) * (CRS + UME + PPA + QAA)

11 Estimated costs using the model File Format Format Complexity Number of objects Frequency of pres action GIF Case study nameSub categoryYear1Year 10 Percentage of total lifecycle cost VDEPe-monographs£0.89£1.454% VDEPe-serials£10£272% Web archiving£425£850962% File Format Technology watch Preservation tool cost Metadata Preservation action Quality assurance Total cost (over 10 years) GIF£6,250£7,027£1,889£7,008£11,564£33,738 Estimated preservation costs for GIF files in the Web Archiving Case Study Comparison of average object preservation costs across the Case Studies

12 Model outputs: WA Case Study, percentage breakdown Quality assurance Preservation action Metadata Tool cost Technology watch Time period (years) Breakdown of complete preservation costs over time in the WA Case Study

13 Self evaluation of the model Evaluation against key aims: Make the first major step in defining and estimating the lifecycle cost of digital preservation activities. Propose a model for comment by the wider preservation community Enable the LIFE Case Studies to be compared and contrasted by providing some cost estimates for “P” in the Lifecycle Model. Attempt to identify the scale of preservation costs. Are they dramatically high as suggested previously by many in the preservation community or are they more achievable as suggested recently (see Rusbridge, C, “Excuse Me... Some Digital Preservation Fallacies?”)?

14 Further work and refinement Refinement based on real cost data, removal of assumptions Level of detail Format complexity Re-ingest More detailed discussion in the Final Report…

15 Summary and conclusions Estimating the cost is not easy but appears to be possible! Provides a useful perspective on performing preservation Focuses on achieving cost effective preservation

16 Finally… Two appeals to the audience: Please cost, record and publish your preservation work Provide comment on the preservation model: Questions, comments, evaluation: