Getting Started: An Introduction to Digital Preservation

Traditional Media: robust; tangible; independently understandable; experienced in assigning value; well-developed approaches to preservation. Traditional objects are generally quite robust. They are tangible: we can hold them in our hands. They are generally independently understandable (if you speak the language they are written in). We are quite experienced in understanding their worth and assigning value to such objects.

Digital Information: ephemeral; need technology & documentation to interpret; obsolescence (media, formats, software/hardware, documentation); how to estimate value?; new skills and solutions; but also new opportunities! Digital objects are ephemeral by their very nature. They are very susceptible to obsolescence, as they are entirely dependent on the media they are stored on and the accessibility of their file format, and they often require documentation to use and understand them. Managing issues such as rights can also be much more difficult, from protecting copyright to ensuring personal data is protected. They require us to gain new skills to care for them, or to work with new groups of colleagues with different skill sets (particularly IT specialists). But they also bring a whole host of new benefits, in particular the ability to make content accessible to users.

What’s the Problem? Digital data (images, documents etc.) have value and create opportunities ...but... access depends on software, hardware and people ...and... technology and people change, creating barriers to reuse ...therefore... we need to actively manage data to protect and create opportunities.

Why We Preserve: Legal and Regulatory Compliance; Increased Efficiency; New Revenue Streams; Improving Health; Protecting the Environment; Enabling Research; Documenting Cultural Heritage; Ensuring Transparency and Accountability.

Digital Preservation: A Tale of Three Models….

DP Models: Three Legged Stool http://dpworkshop.org/dpm-eng/conclusion.html

More on the Three Legged Stool. Legs: Technology; Organisation; Resources. Topics shown under the legs: Storage/Back-Up; Policy & Strategy; Business Planning; Repository Systems; Planning; Cost modelling; Procedures; Tools; Funding; Risks and Benefits; Fixity; Sustainability; Security; Staffing; Staff skills. Illustration by Jørgen Stamp, digitalbevaring.dk, CC BY 2.5 Denmark.

DP Models: DCC Life-Cycle http://www.dcc.ac.uk/resources/curation-lifecycle-model

DP Models: OAIS. While it is far from perfect, the Open Archival Information System (OAIS) model is one of the keystones of digital preservation. In particular, it provides much of the terminology used within the field. This diagram represents its functional model at the highest level. As well as the key functions of an OAIS (such as Ingest, Preservation and Access), it includes various information packages. These information packages contain the digital material to be preserved along with its accompanying metadata, and within OAIS they exist in three different forms across the lifecycle: the Submission Information Package; the Archival Information Package; the Dissemination Information Package. To accompany this functional model, the OAIS also describes an information model that lays out what types of metadata (specifically called Representation Information in OAIS) should be included in the information packages to facilitate preservation. Full OAIS Standard: https://public.ccsds.org/pubs/650x0m2.pdf Brian Lavoie’s Tech Watch Report on OAIS: http://dx.doi.org/10.7207/twr14-02

The full functional model

The OAIS Information Model

How do you measure compliance? Certification: CoreTrustSeal; ISO 16363; DIN 31644. Maturity Modelling: NDSA Levels; DPCMM; 5 Organizational Stages; E-ARK Maturity Model.

Some More Standards….

Why Do We Need Standards? They provide a common understanding and approach; allow us to share experiences, collaborate and build on the work of others; help build sustainable systems and operating models; accelerate capability-building; help to build trust in our services.

Metadata Standards: PREMIS; METS; domain-specific, e.g. MARC, ISAD(G); type-specific, e.g. Z39.87, DDI; container formats, e.g. BagIt, MPEG-21, WARC.

File Formats: Open Document Format & Office Open XML; JPEG-2000; PDF/A; TIFF; Industry Foundation Classes for BIM; STEP: Standard for the Exchange of Product model data (ISO 10303).

Cost Models: Keeping Research Data Safe (1 and 2); LIFE project (1-3); 4C project and survey of existing cost models.

Local Standards Metadata Transfer: Formats Documentation Packaging Agreements Metadata

Standards, beware!

Remember the Big Picture!

Models and standards are powerful tools but... Equipping people with the right skills is even more powerful

Preservation Methods

Approaches to Preservation: Bit-Level; Migration; Emulation; Hardware Preservation; Digital Archaeology; Virtualisation; etc. Illustration by Jørgen Stamp, digitalbevaring.dk, CC BY 2.5 Denmark.

Bit-Level Preservation: File Manifest/Digital Asset Register; Storage; Multiple copies; Back-up; Security; Fixity Checking.
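The file manifest and fixity checking items on this slide can be illustrated with a short Python sketch. It is only one possible approach, not a method taken from the presentation: the function names (`sha256_of`, `build_manifest`, `check_fixity`) are hypothetical, and SHA-256 is an assumed choice of checksum algorithm. The idea is to record a checksum per file once, then recompute later and report anything missing, unexpected or changed.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its checksum."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def check_fixity(root: Path, manifest: dict[str, str]) -> dict[str, list[str]]:
    """Compare the files under root with a previously stored manifest."""
    current = build_manifest(root)
    return {
        # Listed in the manifest but no longer on disk.
        "missing": sorted(set(manifest) - set(current)),
        # On disk but never registered in the manifest.
        "unexpected": sorted(set(current) - set(manifest)),
        # Present in both, but the checksum no longer matches.
        "changed": sorted(
            name
            for name in manifest.keys() & current.keys()
            if manifest[name] != current[name]
        ),
    }
```

In practice the manifest would be stored separately from the content it describes (and alongside each of the multiple copies), so that a single storage failure cannot damage both the files and the record used to check them.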

Migration: “Normalisation”; To New Versions. There are two main forms of migration for digital preservation, and it is possible to use one or both. The first is a method often referred to as ‘normalisation’. This is where all files of a particular type (for example, text documents) are ‘normalised’ to one file format. The example on the slide shows Word documents being normalised to PDF. For images this could be JPEGs and GIFs normalised to TIFFs. The choice of normalised file format will depend on the needs of the organisation and its users. The second method involves migrating old file formats to newer versions when they are at risk of becoming obsolete. This could be migrating an old .xls spreadsheet to the newer .xlsx format. Both methods have their positives and negatives. Normalisation creates homogeneous, easier-to-manage collections and means that users need to know how to use fewer file types. Migrating to new versions means that files can be accessed in current computer environments. Both processes can be automated, but quality control is incredibly important and careful consideration must be given to migration pathways to avoid loss of data and functionality.
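The two forms of migration can be sketched as simple lookup tables of migration pathways. The Python below is a minimal, hypothetical illustration: the table names and the `plan_migration` helper are assumptions made for this example, the pathways are just the examples mentioned in the notes, and the code only plans target file names. It performs no conversion, which would need dedicated tools and the quality control the notes call for.

```python
from pathlib import PurePosixPath

# Illustrative pathways based on the examples in the notes; a real policy
# would be chosen by the organisation to suit its users.
NORMALISATION_TARGETS = {
    ".doc": ".pdf",
    ".docx": ".pdf",
    ".jpg": ".tif",
    ".jpeg": ".tif",
    ".gif": ".tif",
}
VERSION_TARGETS = {".xls": ".xlsx"}


def plan_migration(filenames, pathways):
    """Return (planned, unmapped).

    planned maps each source name to its proposed target name;
    unmapped lists files with no pathway, which need a manual decision.
    """
    planned, unmapped = {}, []
    for name in filenames:
        path = PurePosixPath(name)
        target_ext = pathways.get(path.suffix.lower())
        if target_ext is None:
            unmapped.append(name)
        else:
            planned[name] = path.with_suffix(target_ext).as_posix()
    return planned, unmapped
```

Keeping the pathways in a table makes them easy to review and change, and the explicit list of unmapped files highlights formats that no one has yet made a decision about.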

Emulation: Emulation is the process of recreating the original environment in which a file was created and used via a layer of specially written software: the emulator. Emulation has been particularly successful in the world of computer games, where enthusiasts create emulators to allow them to play older games. It is also an increasingly popular preservation method, and several projects have produced emulators for everything from early browsers to old versions of PowerPoint. Many of these emulators are freely available, either online or as software downloads. Emulation perhaps seems like the ideal version of digital preservation, as it allows users to access the files in their original environment, providing a more authentic experience. It is, however, very resource intensive, and emulators will require updates (or their own emulators) as computer environments change. It can also be difficult to confirm that the emulator truly captures the original environment unless there is still access to an original example for comparison.

One More Thing….

Risk Management: Identifying risks is a key stage in successful digital preservation. Useful for: policy development; building a business case; identifying requirements; making preservation decisions. Resources: DRAMBORA, SPOT, TNA.

Some Key Messages: It’s about more than storage. Be prepared to advocate for DP. Know the opportunities and benefits. Collaboration is essential. Investing in people is as important as technology. The DPC is here to help!

Digital Preservation Coalition: 68? members; 4 strategic areas; shiny new website; responsive design; easier access to resources. http://www.dpconline.org

Blog and Case Notes

Networking: important part of all our events; peer to peer learning; Planning Day; Counting the Bits.

Webinars: covering a wide range of digital preservation issues; tool demos; research project updates; case studies; important resources.

Research Projects: E-ARK: http://eark-project.com/ ; VeraPDF: http://verapdf.org/ ; TIMBUS: http://timbusproject.net/ ; 4C: http://www.4cproject.eu/ ; APARSEN: http://www.alliancepermanentaccess.org/index.php/about-aparsen/aparsen-deliverables/ ; SPRUCE: http://wiki.opf-labs.org/display/SPR/Home

Tech Watch Reports http://www.dpconline.org/knowledge-base/tech-watch-reports

Training Workshops Getting Started and Making Progress: http://www.dpconline.org/events Short videos of GS online via the Handbook: http://dpconline.org/handbook/getting-started

Digital Preservation Handbook http://dpconline.org/handbook

Questions? sharon@dpconline.org www.dpconline.org @SharonMcMeekin (@williamkilbride @prwheatley)