Digital Preservation - It's all about the metadata, right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel, Saturday, August 16.


Presentation transcript:

Digital Preservation - It's all about the metadata, right?
“Metadata and Digital Preservation: How Much Do We Really Need?”
SAA 2014 Panel, Saturday, August 16, 8:30-9:45am
Mark Evans, Director of Digital Archives, History Associates Incorporated

Agenda
- Why metadata is important
- Metadata landscape in digital preservation
- Guiding efforts and standards
- Contrasting approaches

Why is Metadata Important?
- A binary sequence is meaningless by itself
- Metadata enables the information to be discovered and accessed
- Metadata informs future preservation actions to ensure continued access

The Big Question
What metadata is necessary to preserve digital objects so that they remain accessible and authentic over time?

Metadata Landscape
- Descriptive
- Structural
- Technical
- Administrative
- Preservation
- Access

We Have Some Guidance
Open Archival Information System (OAIS) Reference Model
- A Data Object, interpreted using its Representation Information, yields an Information Object
- Representation Information: the information necessary to render and understand the bit sequences constituting the digital object

OAIS Information Model
- Information Objects
  - Content Information
  - Preservation Description Information
    - Provenance
    - Context
    - Reference (Identity)
    - Fixity
  - Packaging Information
  - Descriptive Information
- Each Information Object has associated Representation Information

Examples of Object Metadata (describing a binary sequence)
- File Format: PDF v1.4 (Portable Document Format)
- Fixity: SHA1=2323A563DF4329DA234E1234
- Identity: FileName = HF2653.pdf, UUID = f81d4fae-7dec-11d0-a765-00a0c91e6bf6
- Environment: Operating System = Windows v7.1, Application = Adobe Acrobat 12
- Technical Properties: Size = bytes, Number of pages = 3, Number of images = 2, Valid = True, Well Formed = True
- Provenance: Created = 10/5/2011, Creating Application = Microsoft Office 2010, Last Modified = 8/12/2014
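As a rough illustration of how a few of these values could be gathered for a file, here is a minimal Python sketch using only the standard library. The function name `describe_object` and the dictionary keys are hypothetical and do not come from the presentation or from any standard; format identification, validation, and page counts would need dedicated tools and are left out.

```python
import hashlib
import os
import uuid
from datetime import datetime, timezone


def describe_object(path: str) -> dict:
    """Collect identity, fixity, size, and last-modified values for one file.

    The field names are illustrative assumptions, not a standard schema.
    """
    sha1 = hashlib.sha1()
    with open(path, "rb") as handle:
        # Read in chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(65536), b""):
            sha1.update(chunk)
    stat = os.stat(path)
    modified = datetime.fromtimestamp(stat.st_mtime, tz=timezone.utc)
    return {
        "identity": {
            "file_name": os.path.basename(path),
            "uuid": str(uuid.uuid4()),  # newly assigned identifier
        },
        "fixity": {"algorithm": "SHA-1", "digest": sha1.hexdigest().upper()},
        "technical": {"size_bytes": stat.st_size},
        "provenance": {"last_modified": modified.isoformat()},
    }
```

Calling `describe_object("HF2653.pdf")` on an existing file would return a nested dictionary that loosely mirrors the Identity, Fixity, Technical Properties, and Provenance groups on the slide.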

Beyond the Needs of Objects
What metadata-enabled functions does a digital preservation system need?
- Understand structural information:
  - Hierarchy of both records and files
  - Relationships between records
  - Relationships between files
  - Relationships between records and files
- Technology-dependent information:
  - Determine if preservation actions are needed (e.g., obsolete format)
- Technology-independent information:
  - Verify preservation actions were performed
  - Maintain provenance of all entities
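The "determine if preservation actions are needed" function could be sketched as a lookup against a list of at-risk formats. Everything in this Python example is an assumption made for illustration: the `AT_RISK_FORMATS` entries are placeholders rather than real registry data, and the record layout is hypothetical.

```python
# Hypothetical list of (format name, version) pairs a repository has
# decided to treat as at risk. Placeholder values, not registry data.
AT_RISK_FORMATS = {
    ("ExampleWordProcessor", "1.0"),
    ("ExampleImageFormat", "2"),
}


def needs_preservation_action(record: dict, at_risk=AT_RISK_FORMATS) -> bool:
    """Return True when the record's format is on the at-risk list."""
    key = (record["format_name"], record["format_version"])
    return key in at_risk


def select_for_action(records: list) -> list:
    """Return identifiers of all records flagged for a preservation action."""
    return [r["identifier"] for r in records if needs_preservation_action(r)]
```

A real system would more likely consult a shared technical registry than a hard-coded set, but the decision still depends on format metadata having been recorded for each object.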

We Have Some More Guidance
PREMIS: Preservation Metadata Implementation Strategies
- A data dictionary that defines a set of semantic units for capturing preservation metadata
- First developed in 2005, latest version is 2.3
- Object: a discrete unit of information in digital form. Three types:
  - File
  - Bitstream
  - Representation
- Event: an action that involves or impacts at least one Object or Agent known by the system
- Rights: assertions of one or more rights or permissions pertaining to an Object and/or Agent
- Agent: a person, organization, or software program/system associated with Events in the life of an Object, or with Rights attached to an Object

PREMIS Examples
Object Entity
- Object identifier
- Preservation level
- Significant characteristics
- Object characteristics
  - fixity
  - format
  - size
  - creating application
  - inhibitors
  - object characteristics extension
- Original name
- Storage
- Environment
  - software
  - hardware
- Digital signatures
- Relationships
- Linking event identifier
- Linking rights statement identifier
Event Entity
- Event identifier
- Event type (e.g. capture, creation, validation, migration, fixity check, ingestion)
- Event dateTime
- Event detail
- Event outcome
- Event outcome detail
- Linking agent identifier
- Linking object identifier
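To make the link between the Object and Event entities concrete, the following Python sketch records a fixity check as an event. The classes are plain stand-ins whose attribute names loosely follow the semantic units listed above; they are assumptions for illustration and are not the PREMIS XML schema or any library's API.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List


@dataclass
class PreservedObject:
    object_identifier: str
    original_name: str
    fixity_algorithm: str  # a name accepted by hashlib.new, e.g. "sha1"
    fixity_digest: str     # lowercase hex digest stored at ingest
    size: int
    linking_event_identifiers: List[str] = field(default_factory=list)


@dataclass
class Event:
    event_identifier: str
    event_type: str
    event_date_time: str
    event_outcome: str
    linking_object_identifier: str
    linking_agent_identifier: str


def run_fixity_check(obj: PreservedObject, content: bytes,
                     agent_id: str, event_id: str) -> Event:
    """Recompute the digest, compare it to the stored one, and log an Event."""
    digest = hashlib.new(obj.fixity_algorithm, content).hexdigest()
    outcome = "pass" if digest == obj.fixity_digest else "fail"
    event = Event(
        event_identifier=event_id,
        event_type="fixity check",
        event_date_time=datetime.now(timezone.utc).isoformat(),
        event_outcome=outcome,
        linking_object_identifier=obj.object_identifier,
        linking_agent_identifier=agent_id,
    )
    # Link the event back to the object so its history can be reconstructed.
    obj.linking_event_identifiers.append(event.event_identifier)
    return event
```

The two-way linking (event to object, object to event) is what lets a repository later show which agent performed which action on which object, and with what outcome.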

PREMIS Conformance: A Current “Hot Topic”
- PREMIS is implementation neutral (“what”, not “how”)
- A PREMIS semantic unit can be recorded in any way a repository finds convenient:
  - Using an alternative name
  - As a single metadata element or a set of metadata elements
  - Implicitly, provided the value is recoverable
  - Capturing additional levels of detail
- Conformance is important, especially for:
  - Exchange of digital objects between repositories
  - Use of shared technical registries
  - Use of automated metadata extraction tools

Don’t Forget Access and Management Needs
A digital preservation system must also support:
- Application and enforcement of access and rights restrictions
- Maintaining descriptive metadata with the appropriate entity and level in the hierarchy
- Ability for users to search on metadata
- Ability for users to view metadata
- Ability for users to add / edit metadata

What About Other Standards?
- Lots of existing standards for a variety of uses / content types
- An organization may have adopted, or be obligated to use, a particular standard
- Metadata may already exist in a particular standard

Dealing with Diversity
- A typical digital preservation system may have to deal with lots of ingest sources
- Each ingest source potentially contains metadata
  - Can be in addition to or embedded within a digital object (or both)
- Unlikely to be a consistent scheme across sources
  - Could be content specific
  - Could be standards based
  - Could be custom
- Ingest sources are unlikely to contain sufficient preservation metadata
  - Hopefully this will not be the case in the future
  - Today this is typically extracted and generated at ingest

Two Approaches
Approach 1: Convert or crosswalk existing metadata to a normalized form (or force a standard on the creators)
Diagram: Source / Schema 1, Source / Schema 2, Source / Schema 3 -> Convert to common schema -> Ingest (extract / create preservation metadata) -> OAIS Digital Archive
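The crosswalk approach can be sketched as a per-source field mapping onto one common schema. The schema names, field names, and the `crosswalk` function in this Python example are all hypothetical; a real crosswalk would also have to handle repeated fields, value normalization, and nested structure.

```python
# Hypothetical mappings from each source schema's field names to the
# field names of one common schema.
CROSSWALKS = {
    "schema_1": {"title": "title", "author": "creator", "created": "date"},
    "schema_2": {"dc:title": "title", "dc:creator": "creator", "dc:date": "date"},
}


def crosswalk(record: dict, schema: str):
    """Convert one source record to the common schema.

    Returns (normalized, unmapped) so that fields with no mapping are
    reported to the caller instead of being silently dropped.
    """
    mapping = CROSSWALKS[schema]
    normalized = {mapping[k]: v for k, v in record.items() if k in mapping}
    unmapped = {k: v for k, v in record.items() if k not in mapping}
    return normalized, unmapped
```

Returning the unmapped fields makes the main cost of this approach visible: anything the common schema cannot express is lost unless the archive keeps it somewhere else.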

Two Approaches, cont.
Approach 2: Embed source metadata into a fixed schema that
- Can represent structure
- Can understand preservation metadata
- Can embed metadata for any entity
Diagram: Source / Schema 1, Source / Schema 2, Source / Schema 3 -> Embed within a fixed schema -> Ingest (extract / create preservation metadata) -> OAIS Digital Archive
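The embedding approach can be sketched with Python's standard `xml.etree.ElementTree` module: the source metadata is kept verbatim inside a wrapper, next to preservation metadata generated at ingest. The element names (`package`, `entity`, `sourceMetadata`, `preservationMetadata`, `fixity`) are invented for this illustration and are not taken from METS, PREMIS, or any other real schema.

```python
import xml.etree.ElementTree as ET


def embed(entity_id: str, source_schema: str, source_xml: str,
          fixity_digest: str) -> ET.Element:
    """Wrap untouched source metadata and generated preservation metadata."""
    package = ET.Element("package")
    entity = ET.SubElement(package, "entity", {"id": entity_id})

    # The source record is parsed and attached as-is, labeled with its schema.
    wrapper = ET.SubElement(entity, "sourceMetadata", {"schema": source_schema})
    wrapper.append(ET.fromstring(source_xml))

    # Preservation metadata created at ingest lives beside it.
    preservation = ET.SubElement(entity, "preservationMetadata")
    fixity = ET.SubElement(preservation, "fixity", {"algorithm": "SHA-1"})
    fixity.text = fixity_digest
    return package
```

Unlike a crosswalk, nothing from the source record is discarded, at the price of every consumer having to understand each embedded schema.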

Conclusions
- Metadata is the mechanism that enables access and preservation
- OAIS and PREMIS provide great guidance for digital content
  - They provide a degree of freedom for implementation
  - Not a one-size-fits-all approach
  - The next presentations will illustrate this

Thank you for your attention
Mark Evans