An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program.

Slides:



Advertisements
Similar presentations
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
Advertisements

PREMIS Conformance. Agenda 1.NLNZ and NLB conformance exercise 2.History of PREMIS Conformance 3.Current status 4.Mapping to functionality.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
3. Technical and administrative metadata standards Metadata Standards and Applications.
PREMIS What is PREMIS? – Preservation Metadata Implementation Strategies When is PREMIS use? – PREMIS is used for “repository design, evaluation, and archived.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
Descriptive Metadata o When will mods.xml be used by METS (aip.xml) ?  METS will use the mods.xml to encode descriptive metadata. Information that describes,
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Documenting to preserve your data: metadata in support of digital preservation Michael Day, UKOLN, University of Bath
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
3. Technical and administrative metadata standards Metadata Standards and Applications Workshop.
Seminar: OAIS Model application in digital preservation projects Michael Day, Digital Curation Centre UKOLN, University of Bath.
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
Jenn Riley Metadata Librarian Indiana University Digital Library Program.
1 On the Record Report of the Library of Congress Working Group on the Future of Bibliographic Control Diane Boehr Head of Cataloging, NLM
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
ESIP 2009 Summer Meeting, UC Santa Barbara, CA, July 7 – 10, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich InfoAnalytics.
OAIS Open Archival Information System. “Content creators, systems developers, custodians, and future users are all potential stakeholders in the preservation.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Moving from a locally-developed data model to a standard conceptual model Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
Linked Digital Archive Institutional Repository Rathachai Chawuthai CSIM/SET/AIT.
PREMIS Implementation Fair – SF 2009 PREMIS use in Rosetta Yair Brama – Ex Libris.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
Health eDecisions Use Case 2: CDS Guidance Service Strawman of Core Concepts Use Case 2 1.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
AGENTS, RIGHTS, EVENTS. Agents  The Agent entity aggregates information about agents (persons, organizations, or software) associated with rights management.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
The OAIS Reference Model and Trustworthy Repositories Josh Lubell Manufacturing Engineering Laboratory NIST
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
An overview of the Reference Model for an Open Archival Information System (OAIS) Michael Day, Digital Curation Centre UKOLN, University.
2/26/2004 Dan Swaney 1 Preservation Metadata and the OAIS Information Model A Metadata Framework to Support the Preservation of Digital Objects A review.
Draft Data Foundation and Terminology (DFT) Vocabulary Development Process Prepared for WG-Core meeting 24/25.2 Munich/Garching Gary Berg-Cross Co-Chair.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Joint Meeting of CSUL Committees,
Building A Repository for Digital Objects
DAITSS: Dark Archive in the Sunshine State
COIT20235 Business Process Modelling
Integrating PREMIS and METS
Metadata for preservation
Metadata for digital long-term preservation
Metadata in Digital Preservation: Setting the Scene
Medusa at the University of Illinois
Presentation transcript:

An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program

2/7/2007Digital Library Brown Bag Series2 Outline  Background and context  PREMIS data model  PREMIS data dictionary  Implementing PREMIS  Adoption and ongoing developments

2/7/2007Digital Library Brown Bag Series3 What is preservation metadata?  Information that “supports and documents the digital preservation process”  The problem is, we don’t really know what that metadata looks like  Well, we know something, but not nearly enough  The community is thinking hard about the issues, but we still have very little real-world data

2/7/2007Digital Library Brown Bag Series4 Some related work  National Library of Australia Preservation Metadata for Digital Collections (October 1999)Preservation Metadata for Digital Collections  Reference Model for an Open Archival Information System (OAIS) (June 2001) Reference Model for an Open Archival Information System (OAIS)  RLG/OCLC report Trusted Digital Repositories: Attributes and Responsibilities (May 2002)Trusted Digital Repositories: Attributes and Responsibilities  OCLC/RLG Metadata Framework to Support the Preservation of Digital Objects (June 2002) OCLC/RLG Metadata Framework to Support the Preservation of Digital Objects  National Library of New Zealand Metadata Standards Framework –Preservation Metadata (November 2002)Metadata Standards Framework –Preservation Metadata  RLG/NARA Audit Checklist for the Certification of Trusted Digital Repositories (August 2005)Audit Checklist for the Certification of Trusted Digital Repositories

2/7/2007Digital Library Brown Bag Series5 What is PREMIS?  PREservation Metadata Implementation Strategies  A working group of over 30 members sponsored by OCLC and RLG  A data dictionary for preservation metadata included in the May 2005 final report of the working group

2/7/2007Digital Library Brown Bag Series6 Charge to the PREMIS working group  define an implementable set of “core” preservation metadata elements, with broad applicability within the digital preservation community;  draft a Data Dictionary to support the core preservation metadata element set;  examine and evaluate alternative strategies for the encoding, storage, and management of preservation metadata within a digital preservation system, as well as for the exchange of preservation metadata among systems;  conduct pilot programs for testing the group’s recommendations and best practices in a variety of systems settings; and  explore opportunities for the cooperative creation and sharing of preservation metadata.

2/7/2007Digital Library Brown Bag Series7 Working group structure  Implementation Strategies Subgroup “examined various strategies for encoding, storing, and managing preservation metadata within digital preservation systems” performed survey of existing and planned digital preservation systemssurvey  Core Elements Subgroup defined core elements drafted data dictionary

2/7/2007Digital Library Brown Bag Series8 How PREMIS defines preservation metadata  “The information a repository uses to support the digital preservation process”  Metadata that supports viability renderability understandability authenticity identity  Mandatory elements represent “the minimum amount for [a] second repository to accept custody of [a] digital object and assume responsibility for its long-term preservation”

2/7/2007Digital Library Brown Bag Series9 PREMIS goals  Build on the OAIS reference model  Be implementation independent  “Provide a starting point for improvements and enhancements based on community experience and feedback”

2/7/2007Digital Library Brown Bag Series10 Development strategies  Paid particular attention to documenting digital provenance relationships  “Whenever possible the group defined elements that do not require human intervention to supply or analyze,” but did not limit to these  Defined “semantic units” rather than “metadata elements”

2/7/2007Digital Library Brown Bag Series11 Defining “core”  “Things that most working preservation repositories are likely to need to know in order to support digital preservation”  “Core does not necessarily mean mandatory”  “Core elements define information that a repository needs to know, regardless of how, or even whether, that information is stored”  Core elements support checking of: Fixity – object is unchanged since some previous time Integrity – compliant with relevant specifications Authenticity – object is what it purports to be

2/7/2007Digital Library Brown Bag Series12 Outline  Background and context  PREMIS data model  PREMIS data dictionary  Implementing PREMIS  Adoption and ongoing developments

2/7/2007Digital Library Brown Bag Series13 The PREMIS data model

2/7/2007Digital Library Brown Bag Series14 Intellectual entities  “A coherent set of content that is reasonably described as a unit”  Can include other Intellectual Entities  May have one or more digital representations  May not be managed by all repositories

2/7/2007Digital Library Brown Bag Series15 Objects  “A discrete unit of information in digital form”  Is a static set of bits that cannot be modified  Three subtypes File Bitstream Representation

2/7/2007Digital Library Brown Bag Series16 File object  “A named and ordered sequence of bytes that is known by an operating system”  Defined like “file” in common usage  No restriction on format, etc.

2/7/2007Digital Library Brown Bag Series17 Bitstream object  “Contiguous or non-contiguous data within a file that has meaningful common properties for preservation purposes”  Defined differently than common usage Can’t span files Must have some sort of reformatting to be made into a file  Weird exception: filestreams Bitstreams that don’t need additional information to be transformed into a file Follow all the file rules in the data dictionary, not the bitstream rules

2/7/2007Digital Library Brown Bag Series18 Representation object  “The set of files, including structural metadata, needed for a complete and reasonable rendition of an Intellectual Entity”  More than one Representation may exist for each Intellectual Entity  Repository doesn’t necessarily have to track representations

2/7/2007Digital Library Brown Bag Series19 Objects example: ETD

2/7/2007Digital Library Brown Bag Series20 Events  The Event entity aggregates metadata about actions that involve at least one object or agent known to the preservation repository  Many types of Events might be of interest to a preservation repository Creation of a new version of an object Create/alter relationships Validity/integrity checking etc.  All Events have outcomes  Some Events have outputs

2/7/2007Digital Library Brown Bag Series21 Agents  “A person, organization, or software program associated with preservation events in the life of an object”  Agents represented minimally in PREMIS Means of identification Classification as person, organization, or software  Assumes other initiatives will more fully define Agents  Agents influence Objects only indirectly through Events

2/7/2007Digital Library Brown Bag Series22 Rights  Rights Statements “are assertions of one or more rights or permissions pertaining to an Object and/or Agent”  Semantic units related to rights restricted to those concerned with preservation activities  All expressible as “Agent A grants this permission for Object B.”  3 semantic units allowed act expiration date of the permission all other terms, conditions, restrictions and/or limitations  Acknowledges much more work needs to be done

2/7/2007Digital Library Brown Bag Series23 Relationships between entities  Between objects Structural relationships Derivation relationships Dependency relationships  Others defined by data model indicated in data dictionary by linking attributes

2/7/2007Digital Library Brown Bag Series24 Outline  Background and context  PREMIS data model  PREMIS data dictionary  Implementing PREMIS  Adoption and ongoing developments

2/7/2007Digital Library Brown Bag Series25 The PREMIS data dictionary  Defines semantic units for: Objects Events Agents Rights  Intellectual Entity is out of scope because it is “well served by descriptive metadata”

2/7/2007Digital Library Brown Bag Series26 Entries include information on:  Name  Semantic components  Definition  Rationale  Data constraint  Object category  Applicability  Examples  Repeatability  Obligation  Creation/Maintenance notes  Usage notes

2/7/2007Digital Library Brown Bag Series27 Sample data dictionary entry

2/7/2007Digital Library Brown Bag Series28 Outline  Background and context  PREMIS data model  PREMIS data dictionary  Implementing PREMIS  Adoption and ongoing developments

2/7/2007Digital Library Brown Bag Series29 Role of a preservation policy  PREMIS helps a repository to implement a preservation policy; it doesn’t set that policy  Policy can be complicated Is descriptive metadata part of an Intellectual Entity? If so, should we treat it as a file? Is PREMIS data itself a file (or a bitstream) that is managed by the repository? etc., ad infinitum…  The data dictionary is only a starting point, does not include all information needed to preserve an Object

2/7/2007Digital Library Brown Bag Series30 Relationship to technical metadata  PREMIS semantic units restricted to: intellectual characteristics characteristics common to all formats  Some overlap between PREMIS semantic units and elements defined by various technical metadata standards

2/7/2007Digital Library Brown Bag Series31 Other types of metadata  Structural and rights metadata fall at least partly within the scope of PREMIS, but perhaps not entirely  Descriptive metadata is useful for defining Intellectual Entities, which are managed by some repositories

2/7/2007Digital Library Brown Bag Series32 Lack of relevant content standards and controlled vocabularies  Only in some cases do definitions of semantic elements provide guidance on how to structure the data recorded  PREMIS semantic units largely outside the scope of most existing content standards  “PREMIS assumes that repositories will adopt or define controlled vocabularies useful to them”  Perhaps a common content standard isn’t needed, but the lack of one does mean more decisions have to be made when implementing a repository

2/7/2007Digital Library Brown Bag Series33 PREMIS conformance  “Local metadata can be used to extend but not modify the PREMIS semantic units”  “The mandatory semantic units of the Data Dictionary represent the information that a preservation repository must be able to associate with any archived digital object in its possession”  Don’t have to store PREMIS information, just have to know it  Currently no formal means of stating or testing “conformance”

2/7/2007Digital Library Brown Bag Series34 XML Schemas  Literal representations of the semantic units and attributes of the PREMIS data dictionary  Of use for exchange of preservation objects  Likely of less use for a repository’s internal representation  5 separate schemas PREMIS container Object entity Event entity Agent entity Rights entity

2/7/2007Digital Library Brown Bag Series35 Outline  Background and context  PREMIS data model  PREMIS data dictionary  Implementing PREMIS  Adoption and ongoing developments

2/7/2007Digital Library Brown Bag Series36 Adoption  Hard to tell, since preservation repositories operate behind the scenes  PREMIS Implementation Registry currently has 8 diverse entries PREMIS Implementation Registry  Active implementer's discussion listimplementer's discussion list  Working Group won the 2005 Digital Preservation Coalition Digital Preservation Award  Several PREMIS workshops scheduled

2/7/2007Digital Library Brown Bag Series37 Current PREMIS activity  PREMIS Maintenance Activity hosted at the Library of Congress PREMIS Maintenance Activity  Editorial Committee named Editorial Committee  Commissioned report on Rights in the PREMIS Data ModelRights in the PREMIS Data Model  Proposals for revisions of two semantic units in public comment period Proposals for revisions

2/7/2007Digital Library Brown Bag Series38 So where are we?  The data dictionary looks to be having a big impact  Discussion of preservation metadata is increasing  It seems the PREMIS goal of a “starting point” has been well fulfilled  PREMIS looks promising as a source of ideas for the IU DLP preservation repository

2/7/2007Digital Library Brown Bag Series39 For more information  PREMIS Working Group site:  PREMIS Maintenance Activity site:  PREMIS Implementors Listserv:  These presentation slides: 