PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009.

Slides:



Advertisements
Similar presentations
Implementing PREMIS in Container Formats Rebecca Guenther, Library of Congress Zhiwu Xie, Los Alamos National Laboratory IS&T’s.
Advertisements

October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
METS: An Introduction Towards a Digital Object Standard Rick Beaubien Library Systems Office U.C. Berkeley.
The future’s so bright…. DAITSS DIGITAL PRESERVATION SYSTEM: RE-ARCHITECTED, RE- WRITTEN, AND OPEN SOURCE Priscilla Caplan Florida Center for Library Automation.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
Workflows for Digital Curation and Preservation Stacy Kowalczyk PASIG Dublin 2012 October 17, 2012.
METS In order to reconstruct the archive, we will need to understand the METS files. METS is schema that provides a flexible mechanism for encoding descriptive,
METS Dr. Heike Neuroth EMANI – Project Meeting February 14 th - 16 th, 2002 Springer-Verlag Heidelberg Göttingen State and University Library (SUB)
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
1 Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration By Nancy J. Hoebelheinrich Metadata.
Joachim Bauer Senior System Engineer, CCS
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
PREMIS What is PREMIS? – Preservation Metadata Implementation Strategies When is PREMIS use? – PREMIS is used for “repository design, evaluation, and archived.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
METS What is METS ? What is METS ? A schema that provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
PREMIS in the Real World: some reflections on constraints Jan Lavelle Senior Librarian (Systems Development) State Library of Tasmania.
Descriptive Metadata o When will mods.xml be used by METS (aip.xml) ?  METS will use the mods.xml to encode descriptive metadata. Information that describes,
Active Data Curation in Libraries: Issues and Challenges ASEE ELD Presentation June 27, 2011 William H. Mischo & Mary C. Schlembach.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
A METS Application Profile for Historical Newspapers
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 The Universal Object Format - A METS Profile for an archiving and exchange format for digital objects.
PREMIS Implementation at The Royal Library of Denmark by Eld Zierau.
Glen Robson Head of Systems Unit National Library of Wales
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
Author(s): Paul Conway, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Noncommercial–Share.
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
Digital Preservation System ExLibris Rosetta OAI6 | Geneva | June 2009 Dr. Axel Kaschte, Strategy Director Europe.
Gathering Audio Metadata for the Monterey Jazz Festival Concerts OLAC 2006 By Nancy J. Hoebelheinrich, Stanford University Libraries.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
HUB AND SPOKE TOOL SUITE PREMIS Implementation Fair – 7 October 2009 Bill Ingram Visiting Research Programmer University of Illinois at Urbana-Champaign.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
PREMIS Implementation Fair – SF 2009 PREMIS use in Rosetta Yair Brama – Ex Libris.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
METS, Standards and Rights METS, Safonau a Hawliau Vicky Phillips Digital Standards Manager Rheolwr Safonau Digidol 4 th March ydd Mawrth 2014.
The State of PREMIS Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
IMPLEMENTATION ISSUES. How PREMIS can be used  For systems in development as a basis for metadata definition  For existing repositories as a checklist.
VITAL at the National Library of Wales Glen Robson
Interoperability and Collection of Preservation Metadata for Digital Repository Content Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan Florida Center for Library Automation (FCLA)
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
Florida Digital Archive PREMIS and DAITSS. Florida Digital Archive.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
NLW. Object Classes Class 1  1 MARC Record  1 Image  No METS Class 2  1 MARC Record  Many images  No METS Class 3  1 MARC Record  Many.
Arwen Hutt & Bradley D. Westbrook Metadata Analysis and Specification Unit UCSD Libraries For PREMIS Workshop La Jolla, CA, 11 Feb 2008.
Meeting of the Member States Expert Group on Digitisation and Digital Preservation , Luxembourg European Archival Records and Knowledge Preservation.
Joint Meeting of CSUL Committees,
and Transmission Standard overview – and case study
DAITSS: Dark Archive in the Sunshine State
DAITSS and the Florida Digital Archive
An Introduction to Tessella and The Safety Deposit Box Platform
Integrating PREMIS and METS
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
Medusa at the University of Illinois
Presentation transcript:

PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009

2 General Archival Information Package (AIP) AIP is just a conceptual entity Conceptual (generic) data model Content files stored on write once media Content files may be containerized (stored in ZIP or WARC files) One or more containers per AIP; files in containers may belong to various AIPs AIP Descriptor: METS file describes the content of the AIP structure, files, descriptive metadata, preservation metadata Different METS profiles for different content streams eJournals, newspapers (born digital and digitized), web archiving Common underlying document model for all AIPs

3 METS Descriptor What is stored in the METS Descriptor? Structure of the document (logical and physical in different structMaps) Not all content streams have two structMaps (born digital streams have only on) Descriptive metadata File Section Defines container files as well as content files (nested elements)

4 METS Descriptor What is stored in the METS Descriptor? Structure of the document (logical and physical in different structMaps) Not all content streams have two structMaps (born digital streams Descriptive metadata File Section Defines container files as well as content files (nested elements) Preservation metadata Preservation metadata for files and representations

5 METS Descriptor What is stored in the METS Descriptor? Preservation metadata: Preservation metadata for files and representations Focusses on: Audit trail – events and agents Technical metadata – basic technical metadata in METS and PREMIS Assumption: future migrations of files necessary No emulation considered; no environment information stored elements

6 Preservation Metadata (PREMIS) in METS Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output Newspapers uses PREMIS 2.0; MODS 3.3; METS 1.8 Web Archiving uses PREMIS 2.0; MODS 3.3; DC; METS 1.8

7 Preservation Metadata (PREMIS) eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output AIP model: One AIP per article, issue, journal, digital manifestation Any changes will lead to a new AIP; old version of AIP is referenced

8 Preservation Metadata (PREMIS) eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output AIP model: One AIP per article, issue, journal, digital manifestation Journal, Issue, Article: AIP consists just of a METS descriptor (mainly descriptive metadata (MODS) embedded and preservation metadata: PREMIS: regarded as representations of intellectual entities Relationships between representations are recorded in MODS record

9 Preservation Metadata (PREMIS) eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove dtd AIP model: One AIP per article, issue, journal, manifestation Digital Manifestation: AIP consists of content files and METS descriptor. METS descriptor contains PREMIS records for files and one for the Digital Manifestation itself Relationships to article recorded in PREMIS record (manifestationOf) Relationships to submission is recorded in PREMIS (containedInSubmission) Submission: received content files in ZIP (one AIP)

10 Preservation Metadata (PREMIS) and METS: eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output amdSec: one amdSec per PREMIS record; referenced from and elements Use of ; ; elements techMD: Extracted data from Jhove (files) PREMIS record of a file digiprovMD: PREMIS record of representations (journal, issue, article) PREMIS record of a file

11 Preservation Metadata (PREMIS) and METS: eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output PREMIS elements used: objectIdentifier objectCategory preservationLevel size fixity (MD5, SHA-512) format (PRONOM) Relationships, events and agents where necessary

12 Preservation Metadata (PREMIS) and METS: eJournal content stream Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove output PREMIS elements used: objectIdentifier objectCategory preservationLevel size fixity (MD5, SHA-512) format (PRONOM) Relationships, events and agents where necessary Redundantly in METS element }

13 Preservation Metadata (PREMIS): relationships PREMIS relationships: manifestationOf (between Manifestation and Article) containedInSubmission (between Manifestation and Submission) PREMIS relationships (between files: m-n relationships): migration uncompression modification Relationships are always stored in Premis records for files will have techMD and digiProvMD

14 Preservation Metadata (PREMIS): events PREMIS events (on file level): integrityCheck formatIdentification validation wellformness propertyExtraction PREMIS events (on representation level): metadataUpdate Relationships are always stored in Premis records for files will have techMD and digiProvMD

15 Preservation Metadata (PREMIS): events PREMIS events always have an agent Event and agents are stored in each PREMIS record: In case an event effects more than one object, it must be repeated in each object’s PREMIS record. Using the same identifier indicating it is the same event.

16 Preservation Metadata (PREMIS) in METS Content streams: eJournals uses PREMIS 1.1; MODS 3.2; METS 1.4; jhove dtd Newspapers uses PREMIS 2.0; MODS 3.3; METS 1.8 Web Archiving uses PREMIS 2.0; MODS 3.3; DC; METS 1.8 Move to PREMIS 2.0 Changes to AIP model

17 AIPs and PREMIS 2.0 Change of AIP: Newspapers need second structMap (and structLink) Hierarchy of AIPs no longer possible Instead: one AIP per issue Manifestations are modelled as a (various manifestations per AIP possible) Support of container files (ZIP, WARC) Modelled as nested elements; no PREMIS record for container files No file format specific technical metadata is captured

18 METS and PREMIS 2.0 METS and PREMIS 2.0: Use of new METS schema versions: instead of objectCategory just use Agent, object, event in separate elements within the same PREMIS record should be self containing

19 METS and PREMIS 2.0 Extended list of event types: deselection: files which are defined in the AIP descriptor but never ingested (no FLocat element) metadataExtraction vs. propertyExtraction Extended list of relationship types (relationshipSubType): modification vs. manipulation

20 METS and PREMIS 2.0 Extended list of event types: deselection: files which are defined in the AIP descriptor but never ingested (no FLocat element) metadataExtraction vs. propertyExtraction Extended list of relationship types (relationshipSubType): modification vs. manipulation

21 METS and PREMIS 2.0 Problems: Validation Using controlled vocabularies Considering dependencies between METS and PREMIS Standardized workflow for creating METS and PREMIS for all content streams Currently specific implementations for each content stream Extending the AIP Model Preservation metadata for metadata records

22 Thanks Markus Enders The British Library