Standards Showcase: PREMIS (Preservation metadata) Rebecca Guenther, Library of Congress ALA Annual 2006 LC booth presentation June 24-25, 2006.

Slides:



Advertisements
Similar presentations
The PREMIS Working Group: Preservation Metadata for Digital Repositories DLF Fall Forum October 26, 2004 Rebecca Guenther LC/NDMSO
Advertisements

Metadata for Digital Preservation: A Status Report on PREMIS Priscilla Caplan, FCLA Nancy Hoebelheinrich, Stanford University CNI Fall Task Force Meeting.
OCLC/RLG Working Group on Metadata for Digital Preservation Brian Lavoie Office of Research OCLC Online Computer Library Center, Inc. Information Infrastructures.
PRESERVATION METADATA: IMPLEMENTATION STRATEGIES Preservation Metadata: The PREMIS Experience Priscilla Caplan Florida Center for Library Automation (FCLA)
Preservation Metadata: Implementation Strategies (PREMIS) Rebecca Guenther Library of Congress IS&T Archiving Conference April 28, 2005.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Digital Preservation and Trusted Digital Repositories Priscilla Caplan Florida Center for Library Automation ALA 2005 Chicago IL.
Implementing PREMIS in Container Formats Rebecca Guenther, Library of Congress Zhiwu Xie, Los Alamos National Laboratory IS&T’s.
PREMIS Conformance Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
An Introduction to Preservation Metadata and the PREMIS Data Dictionary Rebecca Guenther, Library of Congress ALA Midwinter 2009 Intellectual Access to.
Understanding and Implementing the PREMIS Data Dictionary for Preservation Metadata Rebecca Guenther, Library of Congress Digital Preservation Partners’
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
The Promise of PREMIS: background, scope and purpose of the Data Dictionary for Preservation Metadata Rebecca Guenther, Library of Congress Long-term Repositories:
3. Technical and administrative metadata standards Metadata Standards and Applications.
An Introduction to the Hennepin County Hennepin County GIS Technical Advisory Group (eGTAG) 10/20/2009.
Promoting Digital Preservation Partnerships at the U.S. Library of Congress April 2004.
Rebecca Guenther Library of Congress
PREMIS Update Rebecca Guenther Library of Congress PREMIS Implementation Fair Vienna, Austria 22 September 2010.
A Brief Introduction Neil Beagrie Chinese National Academy of Sciences July 04 The Digital Preservation Coalition.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Documenting to preserve your data: metadata in support of digital preservation Michael Day, UKOLN, University of Bath
Metadata for Digital Preservation: A Status Report on PREMIS Priscilla Caplan, FCLA Nancy Hoebelheinrich, Stanford University CNI Fall Task Force Meeting.
3. Technical and administrative metadata standards Metadata Standards and Applications Workshop.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
Mid-Michigan Digital Practitioners, March 14, 2014 The National Digital Stewardship Alliance Agenda Mid-Michigan Digital Practitioners Meeting Abigail.
Metadata in support of digital preservation Michael Day, UKOLN, University of Bath Beginners Guide to Metadata:
Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
PREMIS Tutorial: Understanding & Implementing the PREMIS Data Dictionary for Preservation Metadata Rebecca Guenther, Library of Congress Brian Lavoie,
A survey based analysis on training opportunities Dr. Jūratė Kuprienė Framing the digital curation curriculum International Conference Florence, Italy.
RLG Digital Certification Task Force Don Sawyer 2 November 2004.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
AGENTS, RIGHTS, EVENTS. Agents  The Agent entity aggregates information about agents (persons, organizations, or software) associated with rights management.
Life Cycle Models & Principles Jake Carlson Associate Professor of Library Science Data Services Specialist Purdue University Libraries.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
Formalizing Project EMANI Ithaca, July 26 th, 2002.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
The State of PREMIS Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
PREMIS Tutorial: Understanding & Implementing the PREMIS Data Dictionary for Preservation Metadata Rebecca Guenther, Library of Congress Olaf Brandt, BStU.
April 12, 2005 WHAT DOES IT MEAN TO BE AN ARCHIVES? Trusted Digital Repository Model Original Presentation by Bruce Ambacher Extended by Don Sawyer 12.
NDSR Boston webinar: Digital Preservation Introduction Presenter: Nancy Y McGovern October 2015.
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
AGENTS, RIGHTS, EVENTS. Agents  The Agent entity aggregates information about agents (persons, organizations, or software) associated with rights management.
JISC/CNI Conference Edinburgh, 26th June 2002 Challenges of Digital Preservation – do we have a road map? Maggie Jones.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.
Implementing PREMIS in DigiTool Michael Kaplan ALA 2007 Update.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program.
Grant Writing for Digital Projects September 2012 IODE Project Office IODE Project Office Oostende, Belgium Oostende, Belgium Sustainability and.
Nancy J. Hoebelheinrich, Metadata Coordinator, Stanford University 1 Metadata for the NGDA: Developing a Shared Approach Joint UCSB / Stanford meeting.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
A Shared Commitment to Digital Preservation and Access.
Applying preservation metadata to repositories The British Library, 21 January 2008 Led by Steve Hitchcock With Bill Hubbard, Gareth Johnson.
RLG Digital Certification Task Force
Building A Repository for Digital Objects
DAITSS: Dark Archive in the Sunshine State
Statewide Digitization and the FCLA Digital Archive
Rebecca Guenther, Library of Congress Brian Lavoie, OCLC
Metadata for preservation
Digital Preservation and Trusted Digital Repositories
Presentation transcript:

Standards Showcase: PREMIS (Preservation metadata) Rebecca Guenther, Library of Congress ALA Annual 2006 LC booth presentation June 24-25, 2006

Overview  What is preservation metadata?  Background  PREMIS work Survey Data dictionary  Features of the data dictionary  Implementing PREMIS  Future

Digital preservation: advances & remaining challenges  Groups around the world and conferences continue to make significant progress in raising awareness about digital preservation imperative  Gradual shift in focus from articulating problem to solving it … Not so much “Why is digital preservation important” anymore; rather, “What must be done to achieve preservation objectives?”  Many practical challenges in implementing reliable, sustainable digital preservation programs  One key implementation challenge: preservation metadata

Preservation metadata includes:  Provenance: Who has had custody/ownership of the digital object?  Authenticity: Is the digital object what it purports to be?  Preservation Activity: What has been done to preserve the digital object?  Technical Environment: What is needed to render and use the digital object?  Rights Management: What IPR must be observed?  Makes digital objects self-documenting across time Content Preservation Metadata 10 years on 50 years on Forever!

PREMIS background …  Pre-2002: various preservation metadata element sets released Different scopes, purposes, underlying models/assumptions No international standard; little consolidation of expertise/best practice  June 2002: Preservation Metadata Framework International working group (jointly sponsored by OCLC, RLG) Comprehensive, high-level description of types of information constituting preservation metadata Used OAIS reference model as starting point Set of “prototype” preservation metadata elements Consensus-based foundation for developing formal preservation metadata specifications … but not an “off-the-shelf, ready to implement” solution  Post-2002: Needed implementable preservation metadata, with guidelines for application and use, relevant to a wide range of digital preservation systems and contexts Motivated formation of PREMIS Working Group

PREMIS Working Group  Preservation metadata: key component of sustainable digital preservation  June 2003: OCLC, RLG sponsored international working group: PREMIS: Preservation Metadata: Implementation Strategies  Objective: Define implementable, core preservation metadata, with guidelines/recommendations for management and use  Membership: > 30 experts from 5 countries, libraries, museums, archives, government agencies, private sector Co-Chairs: Priscilla Caplan (FCLA), Rebecca Guenther (LC)

Membership  Priscilla Caplan, FCLA (Chair)  Rebecca Guenther, LC (Chair)  Michael Alexander, British Library  George Barnum, GPO  Charles Blair, U. of Chicago  Olaf Brandt, U. of Göttingen  Adam Farquhar, British Library  David Gewirtz, Yale  Kevin Glavash, MIT/Dspace  Cathy Hartman, U. of N. Texas  Helen Hodgart, British Library  Nancy Hoebelheinrich, Stanford  Roger Howard/Sally Hubbard, Getty Museum  Pam Kircher, OCLC  John Kunze, Calif. Digital Library  Brian Lavoie, OCLC liaison  Robin Dale, RLG liaison  Vicky McCarger, LA Times  Jerry McDonough, NYU/METS  Evan Owens, JSTOR  Erin Rhodes, NARA  Madi Solomon, Walt Disney Co.  Angela Spinazze, ATSPIN  Gunter Waibel, RLG  Lisa Weber, NARA  Robin Wendler, Harvard  Hilde van Wijngaarden, KB  Andrew Wilson, NAA

Advisory Committee  Howard Besser, UCLA  Liz Bishoff, OCLC (via Colorado Digitization Program)  Gerard Clifton, National Library of Australia  Gail Hodge, CENDI  Steve Knight, National Library of New Zealand  Maggie Jones, Digital Preservation Coalition  Nancy McGovern, Cornell  Cliff Morgan, Wiley UK  Richard Rinehart, U. of California, Berkeley

Survey Report  September 2004: Implementing Preservation Repositories for Digital Materials: Current Practice and Emerging Trends in the Cultural Heritage Community  Survey of existing and planned digital repositories: Mission, content, funding, preservation policies/strategies, take up of OAIS, access mechanisms, and more … Use of metadata to support repository processes, functions, policies; types of metadata collected; metadata storage/management practices  ~50 responses: 28 libraries, 7 archives, 3 museums, and 11 other 13 different countries; 45% from U.S. 38% in planning; 33% development; 46% production  Snapshot of current practices and emerging trends related to managing preservation metadata in digital archiving systems Variety of preservation contexts, institution types, and domains

Survey findings  Little experience with digital preservation Most didn’t have active preservation strategy Many not yet in production Cannot assess adequacy of metadata  Lack of common vocabulary and conceptual framework Informed by OAIS reference model Difference of opinion as to meaning of OAIS compliance  Metadata Many recording rights, provenance, technical, administrative, descriptive and structural  Most repositories serve goals of both preservation and access

PREMIS Data Dictionary  May 2005: Data Dictionary for Preservation Metadata: Final Report of the PREMIS Working Group  237-page report includes: PREMIS Data Dictionary 1.0 Accompanying report (context, data model, assumptions) Special topics, glossary, usage examples Set of XML schema to support implementation  Data Dictionary: comprehensive, practical resource for implementing preservation metadata in digital archiving systems Comprehensive view of information requirements needed to support digital preservation Based on deep pool of institutional experiences in setting up and managing operational capacity for digital preservation Builds on previous work

From theory to practice … OAIS Digital Archiving Systems Framework PREMIS Data Dictionary PREMIS Data Dictionary Preservation Metadata Requirements

Winner: 2005 Digital Preservation Award

Some guiding principles and assumptions …  “Implementable, core, preservation metadata”: “Preservation metadata”: maintain viability, renderability, understandability, authenticity, identity in a preservation context “Core”: What most preservation repositories need to know to preserve digital materials over the long-term “Implementable”: rigorously defined; supported by usage guidelines/recommendations; emphasis on automated workflows  Implementation neutral: No assumptions on specific implementation Promote flexibility/interoperability Focus on semantic units: what you need to know (implementation-neutral) vs. metadata elements: how you record it (implementation-specific) Information that needs to be “recoverable” from the digital archiving system, independent of local implementation

Scope of data dictionary  Implementation independent  Descriptive metadata out of scope  Technical metadata applying to all or most format types  Media or hardware details are limited  Business rules are essential for working repositories, but not covered  Rights information for preservation actions, not access

PREMIS data model Intellectual Entities Objects Rights Agents Events

Sample Data Dictionary entry

Semantic units pertaining to objects  objectIdentifier  preservationLevel  objectCategory  objectCharacteristics  creatingApplication  originalName  Storage  environment  signatureInformation  relationship  linkingEventIdentifier  linkingIntellectual Entity Identifier  linkingPermission StatementIdentifier

Semantic units pertaining to Events  eventIdentifier  eventType  eventDateTime  eventDetail  eventOutcome  eventOutcomeDetail  linkingAgentIdentifier  linkingObjectIdentifier

Semantic units pertaining to Agents  agentIdentifier  agentName  agentType

Semantic units pertaining to Rights  permissionStatement  permissionStatementIdentifier  relatedObject  grantingAgent  grantingAgreement  permissionGranted  act  restriction  termOfGrant  permissionNote

Community interest  As of March 2006: ~25,000 “hits” on Data Dictionary More than 100 subscribers to the PREMIS Implementers’ Group discussion list  PREMIS Data Dictionary product of collaboration and consensus PREMIS membership reflects variety of institutions, domains, countries Multiplicity of perspectives promotes applicability in multiplicity of contexts Digital preservation is a shared problem; this invites shared solutions  Data Dictionary useful to any institution or organization committed to the long-term preservation of digital materials

PREMIS Maintenance Activity Permanent Web presence, hosted by Library of Congress Centralized destination for information, announcements, and other PREMIS-related resources Discussion list for PREMIS implementers (PIG list) Coordinate future revisions of Data Dictionary and XML schema Editorial committee being established to guide development and revisions

Current activities  Documenting errata and proposed revisions to Data Dictionary (feedback through PIG list)  PREMIS Implementers’ Registry  Consultancies, etc.: Rights issues for digital preservation (Karen Coyle) PREMIS implementation guidelines and recommendations (Deborah Woodyard-Robinson) PREMIS-to-OAIS mapping (Brian Lavoie)  PREMIS on the road: Digital Curation Center PREMIS workshop (July Glasgow) Repository workshop at National Library of Australia (Aug. 31) Investigating workshops in US

Going forward …  Establish Editorial committee  First revision of Data Dictionary  Work with other initiatives (e.g., METS, Z39.87) to integrate PREMIS with existing standards, technologies, best practices (e.g. METS)  Contribute preservation metadata resources to digital preservation community that are: Openly available Oriented toward practical implementation Supported by a long-term commitment Tools

Some implementers …  MathArc (Germany): A joint project funded by NSF (Cornell) and SUB Göttingen (DFG) to build a distributed archive for mathematical journals distributed between two archives to keep information redundant.  DAITTSS (Florida): a preservation repository for the use of the libraries of the public universities of Florida. Uses a locally- developed software application (DAITSS), which implements most of the PREMIS data elements.  Ex Libris (DigiTool): an enterprise solution for the management of digital assets in libraries and academic environments consisting of a number of modules, each designed to address different needs, functions, and workflows pertaining to the life cycle of a digital object  For more information see:

Conclusion  PREMIS Data Dictionary provides critical piece of reliable digital preservation infrastructure comprised of technology, standards, and best practice  PREMIS Data Dictionary is a building block with which effective, sustainable digital preservation strategies can be implemented  PREMIS Data Dictionary tightly focused on implementation: Practical implementation was guiding principle in all discussions Developed tools to support implementation; released with Data Dictionary Further work with encouragement for international participation and tools development is ongoing  Unglamorous but necessary infrastructure!

URLs, etc.  PREMIS Maintenance Activity:  PREMIS Working Group:  Data Dictionary for Preservation Metadata: Final Report of the PREMIS Working Group:  Please send project information to Implementers’ Registry and join the PIG list!