Planning for Digital Preservation. Planning for Preservation Digital preservation issues come up much faster than traditional preservation issues Digital.

Slides:



Advertisements
Similar presentations
Pulling it all together… with thanks to Sheila Anderson.
Advertisements

Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
DATA PROCESSING SYSTEMS
QA Focus Digital Preservation End of Programme Meeting: 5/99, 7/99, DiVLE and JISC/NSF International Digital Libraries.
A centre of expertise in digital information management A QA Framework To Support Your Library Web Site Review Brian Kelly UKOLN University of Bath Bath.
Windows XP Photo Workflow Tim Grey Imaging Strategist Microsoft Corporation.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
ICASAS305A Provide Advice to Clients
Quicktime Howell Istance School of Computing De Montfort University.
Depositing and Disseminating Digital Resources Alan Morrison Collections Manager AHDS Subject Centre for Literature, Linguistics and Languages.
Strategic Thinking and Significant Characteristics Hamish James.
HAN Conference © History Data Service The History Data Service : Promoting Good Practice and Standards of Scholarship Cressida Chappell Head of.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Introduction to Systems Analysis and Design
Installing software on personal computer
Software Documentation Written By: Ian Sommerville Presentation By: Stephen Lopez-Couto.
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
HYPERTEXT MARKUP LANGUAGE (HTML)
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Database Systems: Design, Implementation, and Management Ninth Edition
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Copyright © 2012 Certification Partners, LLC -- All Rights Reserved Lesson 5: Multimedia on the Web.
Learning from the past Standards, hardware, software and migration Gareth Knight
Chapter-4 Windows 2000 Professional Win2K Professional provides a very usable interface and was designed for use in the desktop PC. Microsoft server system.
Multimedia and the Web Chapter Overview  This chapter covers:  What Web-based multimedia is  how it is used today  advantages and disadvantages.
Tutorial 11 Installing, Updating, and Configuring Software
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
Module 7. Data Backups  Definitions: Protection vs. Backups vs. Archiving  Why plan for and execute data backups?  Considerations  Issues/Concerns.
What Agencies Should Know About PDF/A September 20, 2005 Susan J. Sullivan, CRM
Multimedia and The Web.
Challenges of Digital Media Preservation Karen Cariani, Director Media Library and Archives Dave MacCarn, Chief Technologist.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
What Agencies Should Know About PDF/A-1 April 6, 2006 Mark Giguere
12 Developing a Web Site Section 12.1 Discuss the functions of a Web site Compare and contrast style sheets Apply cascading style sheets (CSS) to a Web.
Week #3 Objectives Partition Disks in Windows® 7 Manage Disk Volumes Maintain Disks in Windows 7 Install and Configure Device Drivers.
European Commission on Preservation and Access Preservation of digital heritage Yola de Lusenet Lisbon, November
Test and Review chapter State the differences between archive and back-up data. Answer: Archive data is a copy of data which is no longer in regular.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
Computer Literacy for IC 3 Unit 1: Computing Fundamentals © 2010 Pearson Education, Inc. | Publishing as Prentice Hall.1 Chapter 4: Identifying Software.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Animal Shelter Activity 2.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Irwin/McGraw-Hill Copyright © 2000 The McGraw-Hill Companies. All Rights reserved Whitten Bentley DittmanSYSTEMS ANALYSIS AND DESIGN METHODS5th Edition.
Learning Aim B.  In this section, you will consider the resources necessary for designing your website.  You will also think about any constraints that.
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Introduction. Internet Worldwide collection of computers and computer networks that link people to businesses, governmental agencies, educational institutions,
ICAD3218A Create User Documentation.  Before starting to create any user documentation ask ‘What is the documentation going to be used for?’.  When.
Digital Preservation What, Why, and How? Dan Albertson’s Digital Libraries Class April 13, 2016 Jody DeRidder Head, Metadata & Digital Services University.
Preservation Planning Bojana Tasić FORS SEEDS Workshop I Belgrade, October.
Building A Repository for Digital Objects
Publishing and Maintaining a Website
Digital Project Lifecycle Curating Across the Curriculum
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
R082 Creating digital graphics
Executive Sponsor: Tom Church, Cabinet Secretary
Executive Sponsor: Tom Church, Cabinet Secretary
Executive Sponsor: Tom Church, Cabinet Secretary
Presentation transcript:

Planning for Digital Preservation

Planning for Preservation Digital preservation issues come up much faster than traditional preservation issues Digital resources need on-going attention Build a preservation strategy into your project from the start Keep dealing with the short-term issues and you wont ever need to face the long-term problem

Issues The content of digital resources is only accessible with the aid of intermediary technologies Digital resources are complex Reliance on specific combination of formats, software and hardware to operate correctly I.T. develops rapidly, and resources can become obsolete very quickly

Three Key Areas Content – the bits and bytes Technologies: software systems; hardware: websites, access and delivery systems Organisational

Planning for the Future Short-term: Initial technology still current and actively supported years Medium-term: Initial technology still in use and supported, but no longer used for new work years Long-term: Initial technology no longer used or supported years

….. In the Short Term Making digital assets available Website administration Website updates Software and operating system patches Periodic backups Periodic checks on master copies

…….. In the Medium Term Keeping your existing digital outputs up and running Upgrading operating systems and software Upgrading hardware Replacing hardware components Refreshing master copies Periodic backups Periodic checks on master copies

…….. In the Longer Term Overcoming technological obsolescence to preserve a usable digital resource Introducing completely new software Replacing entire hardware systems Enhancing functionality Periodic backups Periodic checks on master copies

During the Data Creation Phase Importance of backups Preferably more than one copy, on and off site Appropriate frequency More than one file format Check your backups But backup is not preservation!

What to Preserve? Significant Characteristics Very difficult to preserve everything (data, functionality and interaction) about a digital resource Documented or commonly understood significant characteristics help simplify preservation action

Analogue…… Book - Significant: Words, paragraphs, chapters, author, publication date, … Not Significant: Binding, print run, font, colour of paper, … Newspapers - Significant: Words, paragraphs, headlines, size of type, date, page number of article, … Not Significant: Size of page, spacing, text justification, colour of paper, …

Digital……… There is a shared understanding of what is important in a paper-based resource Less agreement about what is important in a digital resource Complicated to decide as software and formats support many options that are not knowingly used but have default settings

Questions to ask…. What are the significant characteristics of your digital outputs? What are the digital objects that make up your resource? What is the purpose of your digital resource? Think about the problem in terms of content and purpose Very difficult (if not impossible) to ensure your resource stays exactly the same in the future What can change without adverse effects? What changes must be limited, and by how much? How can you check changes are acceptable?

Assessing the scale of the Preservation Task Estimating volume and type: Textual Documents Still Images Moving Images Audio files Numeric dataset Database Markup Documents (XML etc.) CAD GIS Virtual reality Website Software executable

Risk Assessment for file formats used Review data types and file formats Assess the risks associated with those file formats Establish policy for dealing with them

Preservation Metadata Metadata needed to manage preservation of digital collections: technical; administrative Not necessarily a complete set of preservation metadata elements Possible sources: OCLC/RLG Working Group; the Consultative Committee for Space Data Systems; CEDARS project; The UK National Archives (formerly the Public Record Office); Arts and Humanities Data Service; NEDLIB project; California Digital Library; Harvard University Library

File Structure Create an overview of the file structure Create a list of all files Create a logical file strategy from the outset Choose consistent filenames Avoid using re-using same filename even in separate folders. Store files in a logical order with systems and contents files kept apart. Summary of contents may be included with each file. Keep a record of encryption keys – important for preservation.

Preservation Strategies: Content Migration: convert the data to work with new applications Emulation: convert the data, application (and operating system) to work on new hardware Technology preservation: Keep everything running Virtual computing: create a standard virtual runtime environment Migration on demand: convert original format directly into up-to-date format

Theory Practice In practice, migration is the simplest and most common approach Limitations of migration are: Can be difficult to ensure accurate migration Does not capture functionality, only (possibly partial) data May need to be repeated frequently Might lead to mutation over time

Migrating to new standards – but which one? "The good thing about standards is that there are so many to choose from (A. Tanenbaum) Quicktime MPEG Real Media1995 MPEG RealVideo1997 MPEG Quicktime Active Streaming Format1999 DIVX The number of A/V de-facto standard formats has exploded in the past five years, and this does not cover the dozens of audio and video codec combinations!

Measuring Longevity of Standard Who developed it? Microsoft, Motion Picture Expert Group, etc. Has it received mainstream support? Can your hardware save data in that format? What organisations are using it? Is it used in industry Is it widely accepted by the professional and amateur community? Technology watch – check web sites, developer forums and newsgroups. Has it been submitted as an ISO standard?

Measuring Longevity of Standard Are there any legal actions to change the standard? Is there a licensing fee? What tools are available to create and manipulate the format Open source vs. proprietary PRONOM – National Archive database of 250 software products, 550 file formats and 100 manufacturers Can I execute these tools on my computer? Java, Windows-only, Mac-only

Choosing a Suitable Migration Path What are the main features? Small file size, streaming support Will it support your specialist needs? Subtitles, DRM, Internet delivery, etc. Does it provide sufficient quality Lossless vs. lossy compression. Will it impose any restrictions on use? Can it actually be played by your target audience? Is the standard stable or does it change frequently? How will this affect your desire to use the format?

Migration problems Have you encountered any problems when accessing these files in other applications? Quirks (text not displaying, desynchronised audio/video, upside-down video playback). Version incompatibilities Migrating to other formats Are there any other problems when exporting to other formats? E.g. lossless-to-lossless conversion, in-editable Document quirks & incompatibilities for later.

Updating Hardware Hardware has changed dramatically in the last 3 years Memory – DDR vs. SD-RAM CPU – pin compatibility Graphics cards – AGP 2x, 4x, 8x Operating system – will Windows NT4/98 run on newer hardware? Do you upgrade existing hardware or replace it with new equipment?

Updating Software Software changes on a frequent basis Four service packs available for Windows Microsoft issues 3 patches per week on average. Legal action force changes to plugin handling. In addition, there is an estimated 20 un-patched vulnerabilities in Internet Explorer alone (PivX Solutions). Do you upgrade to a later operating system or continue to use an operating system & software with known security flaws?

Preserving Your Website: technical issues Standards And Formats Has the Web site been designed using open standards, which should help future-proofing? Have proprietary formats been used (for which backwards compatibility may not be considered) Architecture & Implementation Has the technical architecture of the Web site been documented? Can you continue to use technical systems after funding has finished?

Preserving Your Website: content issues Accuracy: Is the content of the Web site accurate today Who and how will changes be made Could the content of the Web site be misleading in the future? Usability: Maintaining links – short medium and long term Legal: Is the Web site legal (accessibility; copyright; defamation; IPR; …)? Will the Web site be legal tomorrow, if new legislation is enacted? How will you know – who will make necessary changes?

Maintaining a Website Run a link check across the Web site. Fix broken internal links and as many external links as is reasonable. Document the link report. Run HTML (and CSS) validation checks across the Web site. Fix as many invalid pages as is reasonable. Document the findings. Run an accessibility check across the Web site. Fix as many inaccessible pages as is reasonable. Document the findings.

Maintaining a Website Address technical areas: Remove any backend scripts which are no longer needed Remember that scripts, etc. are liable to go wrong. Ensure that applications are configured to break gracefully and provide meaningful errors – tell users who to contact if they find an error

Procedures framework From start to finish: Creation and Management Manuals within Procedures Framework Key File Format Conversion Guides Digital Object Preservation Handbook: a how-to guide

Options for Ensuring Preservation Once a project is completed…………… Live, (supported) system Archived Organisational Repository Shelved Abandoned

Not Recommended…….. Abandoned May be appropriate, probably isnt, think about archiving the resource instead Shelved Dont - shelving a digital resource without active, on- going attention is highly likely to result in its loss Media degradation Software and hardware obsolescence Loss of knowledge about the resource

Recommended……. But Think About Live System Importance of functionality/interface Organisational buy-in: who is running the system, and what is their commitment to it? What will happen if the system is shut down? Is the digital resource completed or on-going? Who Pays?

Recommended…… But Think About Deposit in an Archive Is the digital resource going to a trusted archive? Are only some aspects of the resource being archived? Will it be available for others to use? Will the resource be updated in the future? Costs?

Recommended…….. But think about Establish a Repository Business model and financial plan Management and administrative processes Policies and procedures Systems and tools Software and hardware Resource curation Metadata and documentation Preservation management

Establishing Requirements A pragmatic approach – workable and achievable Preservation requirements Establish common practices, procedures and use of standards Investigate and establish hardware, systems, and tools requirements Investigate and evaluate products Business planning and costings

Developing the Architecture The architecture must support: The entire activity cycle including ingest, data management, storage, long term preservation, discovery, access and delivery All necessary security aspects Complex resources Discovery and delivery options

Summary Build in preservation right from the start Document decisions/policies/procedures Balance longevity with innovation Be ruthless about what you must keep and what can be discarded Think content and functionality Planning Its a continuous process – not a one-off