Rebecca Guenther Library of Congress

Presentation transcript:

Using Metadata Standards in Digital Libraries: Implementing METS, MODS, PREMIS and MIX
Introduction
Rebecca Guenther, Library of Congress
LITA Standards IG Program, ALA Annual 2007

Program overview
- Introduction to METS, MODS, PREMIS and MIX (Guenther)
- Using METS and MODS for presentations of LC content (Cundiff, Trail)
- Using METS in special collections at CDL (Tingle)
- Creating rich shareable metadata: the DLF Aquifer MODS implementation guidelines (Shreeves)
- METS, MODS and PREMIS, Oh My!: Integrating digital library standards for interoperability and preservation (Habing)
- MODS as metadata Hub (Olson)

Metadata standards in digital libraries
- XML is the de facto standard for metadata descriptions on the Internet
- Interoperability and object exchange require the use of established standards
- Many digital objects are complex and consist of multiple files
- Complex digital objects require many more forms of metadata than analog objects for their management and use:
  - Descriptive
  - Technical
  - Digital provenance/events
  - Structural
  - Rights/Terms and conditions

Descriptive metadata: MARCXML
- Millions of rich descriptive records in MARC systems: can be reused in an XML environment using MARCXML
- MARCXML uses the MARC data element set in an XML syntax
- Allows interoperability with other XML schemes by taking advantage of free XML tools
- Allows for collaborative use of metadata for access (e.g. OAI)
- Provides continuity with current data and flexible transition options

MARC 21 evolution to XML

MARCXML
- MARCXML record: an exact XML equivalent of the MARC (ISO 2709) record
- Lossless/roundtrip conversion to/from MARC 21 record
- Simple, flexible XML schema; no need to change when MARC 21 changes
- Presentations using XML stylesheets
- LC provides converters (open source): http://www.loc.gov/standards/marcxml
- Example: Music record in MARCXML
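
The record structure described above can be sketched in Python using only the standard library. This is a minimal illustration, not LC's converter: the helper names `to_marcxml` and `from_marcxml` and all sample values are hypothetical, and the round trip shown is between plain Python tuples and MARCXML, not between MARCXML and a binary ISO 2709 file.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"  # MARCXML namespace
ET.register_namespace("marc", MARC_NS)


def q(tag):
    """Qualify a tag name with the MARCXML namespace."""
    return f"{{{MARC_NS}}}{tag}"


def to_marcxml(leader, control_fields, data_fields):
    """Build a MARCXML <record> from a leader, control fields and data fields."""
    record = ET.Element(q("record"))
    ET.SubElement(record, q("leader")).text = leader
    for tag, value in control_fields:
        ET.SubElement(record, q("controlfield"), {"tag": tag}).text = value
    for tag, ind1, ind2, subfields in data_fields:
        field = ET.SubElement(
            record, q("datafield"), {"tag": tag, "ind1": ind1, "ind2": ind2}
        )
        for code, value in subfields:
            ET.SubElement(field, q("subfield"), {"code": code}).text = value
    return record


def from_marcxml(record):
    """Inverse of to_marcxml: recover the same tuples from the XML."""
    leader = record.find(q("leader")).text
    control = [(c.get("tag"), c.text) for c in record.findall(q("controlfield"))]
    data = [
        (
            d.get("tag"),
            d.get("ind1"),
            d.get("ind2"),
            [(s.get("code"), s.text) for s in d.findall(q("subfield"))],
        )
        for d in record.findall(q("datafield"))
    ]
    return leader, control, data


# Hypothetical sample values, for illustration only.
LEADER = "00000ncm a2200000 a 4500"
CONTROL = [("001", "demo0001")]
DATA = [
    ("100", "1", " ", [("a", "Composer, Example.")]),
    ("245", "1", "0", [("a", "Example sonata :"), ("b", "for piano.")]),
]

xml_text = ET.tostring(to_marcxml(LEADER, CONTROL, DATA), encoding="unicode")
round_tripped = from_marcxml(ET.fromstring(xml_text))
```

Because every MARC tag, indicator and subfield code is carried as an attribute rather than as an element name, the schema itself does not need to change when new MARC 21 fields are defined.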

What is MODS?
- Metadata Object Description Schema
- An XML descriptive metadata standard
- A derivative of MARC
  - Uses language-based tags
  - Contains a subset of MARC data elements
  - Repackages elements to eliminate redundancies
- MODS does not assume the use of any specific rules for description
- Element set is particularly applicable to digital resources

Uses of MODS
- Extension schema to METS
  - Rich description works well with hierarchical METS objects
- To represent metadata for harvesting (OAI)
  - Language-based tags are more user-friendly
- As a specified XML format for SRU
- As a core element set for convergence between MARC and non-MARC XML descriptions
- For original resource description in XML syntax that is simpler than full MARC

MODS high-level elements
Title Info, Name, Type of resource, Genre, Origin Info, Language, Physical description, Abstract, Table of contents, Target audience, Note, Subject, Classification, Related item, Identifier, Location, Access conditions, Part, Extension, Record Info
Example: Music record in MODS
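
To show how a few of these high-level elements look as language-based tags, here is a minimal Python sketch that builds a small MODS record. The function name `build_mods` and all sample values are hypothetical; only a handful of the elements listed above are used, and the output is not validated against the MODS schema.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"  # MODS version 3 namespace
ET.register_namespace("mods", MODS_NS)


def m(tag):
    """Qualify a tag name with the MODS namespace."""
    return f"{{{MODS_NS}}}{tag}"


def build_mods(title, creator, resource_type, date_issued, identifier):
    """Build a small MODS record from a few descriptive values."""
    mods = ET.Element(m("mods"))

    # Title Info
    title_info = ET.SubElement(mods, m("titleInfo"))
    ET.SubElement(title_info, m("title")).text = title

    # Name
    name = ET.SubElement(mods, m("name"), {"type": "personal"})
    ET.SubElement(name, m("namePart")).text = creator

    # Type of resource
    ET.SubElement(mods, m("typeOfResource")).text = resource_type

    # Origin Info
    origin = ET.SubElement(mods, m("originInfo"))
    ET.SubElement(origin, m("dateIssued")).text = date_issued

    # Identifier
    ET.SubElement(mods, m("identifier"), {"type": "local"}).text = identifier
    return mods


# Hypothetical sample values, for illustration only.
record = build_mods(
    title="Example sonata",
    creator="Composer, Example",
    resource_type="notated music",
    date_issued="1901",
    identifier="demo0001",
)
mods_xml = ET.tostring(record, encoding="unicode")
```

Compared with a MARC data field such as 245 with subfield a, the element path titleInfo/title can be read without consulting a tag table, which is the sense in which the tags are language based.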

MODS Development
- Developed in 2002 through open listserv discussion among prospective implementers (LC coordinated)
- Version 1 in late 2002; now in version 3.2, with 3.3 almost complete
- Companion for authority metadata (MADS) in version 1.0 (2005)
- Endorsed as METS extension schema for descriptive metadata section
- Registered with NISO
- Widely used in digital library projects
- MODS Implementation registry: http://www.loc.gov/mods/registry.php

What is METS?
- METS records the (possibly hierarchical) structure of digital objects, the names and locations of the files that comprise those objects, and the associated metadata
- A container for metadata and file pointers
- A METS document may be a unit of storage or a transmission format
- METS is extensible and modular, using “wrappers” or “sockets” where elements from other schemas can be plugged in
- METS uses the XML Schema facility for combining vocabularies from different namespaces
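
The container idea can be sketched in Python as follows. This is an illustrative skeleton under stated assumptions: the function name `build_mets`, the IDs, labels and URLs are hypothetical, the descriptive payload is a tiny MODS fragment, and optional parts such as the METS header and administrative metadata are omitted.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
MODS_NS = "http://www.loc.gov/mods/v3"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("mods", MODS_NS)
ET.register_namespace("xlink", XLINK_NS)


def mets(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(label, descriptive_xml, files):
    """Wrap descriptive metadata, file pointers and structure in one document.

    files is a list of (file_id, mimetype, url, page_label) tuples.
    """
    root = ET.Element(mets("mets"), {"LABEL": label})

    # "Socket" for descriptive metadata from another schema (here MODS).
    dmd = ET.SubElement(root, mets("dmdSec"), {"ID": "DMD1"})
    wrap = ET.SubElement(dmd, mets("mdWrap"), {"MDTYPE": "MODS"})
    ET.SubElement(wrap, mets("xmlData")).append(descriptive_xml)

    # Names and locations of the files that make up the object.
    file_sec = ET.SubElement(root, mets("fileSec"))
    group = ET.SubElement(file_sec, mets("fileGrp"), {"USE": "master"})

    # Hierarchical structure: one item divided into ordered pages.
    struct = ET.SubElement(root, mets("structMap"), {"TYPE": "physical"})
    item = ET.SubElement(
        struct, mets("div"), {"TYPE": "item", "LABEL": label, "DMDID": "DMD1"}
    )

    for order, (file_id, mimetype, url, page_label) in enumerate(files, start=1):
        f = ET.SubElement(group, mets("file"), {"ID": file_id, "MIMETYPE": mimetype})
        ET.SubElement(
            f, mets("FLocat"), {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": url}
        )
        page = ET.SubElement(
            item, mets("div"), {"TYPE": "page", "ORDER": str(order), "LABEL": page_label}
        )
        ET.SubElement(page, mets("fptr"), {"FILEID": file_id})
    return root


# Hypothetical descriptive payload and file list, for illustration only.
payload = ET.Element(f"{{{MODS_NS}}}mods")
title_info = ET.SubElement(payload, f"{{{MODS_NS}}}titleInfo")
ET.SubElement(title_info, f"{{{MODS_NS}}}title").text = "Example sonata"

doc = build_mets(
    "Example sonata",
    payload,
    [
        ("F1", "image/tiff", "http://example.org/sonata/0001.tif", "Page 1"),
        ("F2", "image/tiff", "http://example.org/sonata/0002.tif", "Page 2"),
    ],
)
mets_xml = ET.tostring(doc, encoding="unicode")
```

The structural map points at files only through their IDs, so files can be relocated by changing the FLocat entries without touching the structure.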

What is PREMIS?
- A data dictionary for metadata to support the long-term preservation of digital objects
- A piece of the necessary infrastructure for implementing reliable, sustainable preservation programs
- A supporting set of XML schemas for implementation in a variety of contexts
- A maintenance activity hosted at LC, including an Implementers’ Group and Editorial Committee

What is preservation metadata?
- Provenance: Who has had custody/ownership of the digital object?
- Authenticity: Is the digital object what it purports to be?
- Preservation Activity: What has been done to preserve the digital object?
- Technical Environment: What is needed to render and use the digital object?
- Rights Management: What IPR must be observed?
- Makes digital objects self-documenting across time
[Figure: timeline of content at 10 years on, 50 years on, forever]

Guiding principles and assumptions …
- “Implementable, core, preservation metadata”:
  - “Preservation metadata”: maintain viability, renderability, understandability, authenticity, identity in a preservation context
  - “Core”: What most preservation repositories need to know to preserve digital materials over the long term
  - “Implementable”: rigorously defined; supported by usage guidelines/recommendations; emphasis on automated workflows
- Implementation neutral:
  - No assumptions on specific implementation
  - Promote flexibility/interoperability
  - Focus on semantic units: what you need to know (implementation-neutral) vs. metadata elements: how you record it (implementation-specific)
  - Information that needs to be “recoverable” from the digital archiving system, independent of local implementation

Scope
- What PREMIS is:
  - Common data model for organizing/thinking about preservation metadata
  - Guidance for local implementations
  - Standard for exchanging information packages between repositories
- What PREMIS is not:
  - Out-of-the-box solution: need to instantiate as metadata elements in repository system
  - All needed metadata: excludes business rules, format-specific technical metadata, descriptive metadata for access, non-core preservation metadata
  - Lifecycle management of objects outside repository
  - Rights management: limited to permissions regarding actions taken within repository

PREMIS data model
[Diagram of entities: Intellectual Entities, Rights, Objects, Agents, Events]

Semantic units pertaining to Objects: technical metadata
- objectIdentifier
- preservationLevel
- objectCategory
- objectCharacteristics
- creatingApplication
- originalName
- storage
- environment
- signatureInformation
- relationship
- linkingEventIdentifier
- linkingIntellectualEntityIdentifier
- linkingPermissionStatementIdentifier

Semantic units pertaining to Events: provenance and preservation activity
- eventIdentifier
- eventType
- eventDateTime
- eventDetail
- eventOutcome
- eventOutcomeDetail
- linkingAgentIdentifier
- linkingObjectIdentifier

Semantic units pertaining to Rights: terms and conditions
- permissionStatement
- permissionStatementIdentifier
- relatedObject
- grantingAgent
- grantingAgreement
- permissionGranted
- act
- restriction
- termOfGrant
- permissionNote

Semantic units pertaining to Agents
- agentIdentifier
- agentName
- agentType
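
The way the linking identifiers tie Objects, Events and Agents together can be sketched with plain Python data classes. This is a deliberately simplified illustration, not the PREMIS schema: attribute names reuse the semantic unit names from the slides, identifiers are reduced to plain strings, only a few units per entity are modeled, and the `provenance` helper and all sample values are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Agent:
    agentIdentifier: str
    agentName: str
    agentType: str


@dataclass
class Event:
    eventIdentifier: str
    eventType: str
    eventDateTime: str  # ISO 8601 text, so string order matches time order
    eventOutcome: str
    linkingAgentIdentifier: list = field(default_factory=list)
    linkingObjectIdentifier: list = field(default_factory=list)


@dataclass
class PremisObject:
    objectIdentifier: str
    objectCategory: str
    originalName: str
    linkingEventIdentifier: list = field(default_factory=list)


def provenance(obj, events, agents):
    """Follow the links from an object to its events and their agents.

    Returns (date, type, outcome, agent names) tuples, oldest event first.
    """
    events_by_id = {e.eventIdentifier: e for e in events}
    agents_by_id = {a.agentIdentifier: a for a in agents}
    linked = [events_by_id[i] for i in obj.linkingEventIdentifier]
    linked.sort(key=lambda e: e.eventDateTime)
    return [
        (
            e.eventDateTime,
            e.eventType,
            e.eventOutcome,
            [agents_by_id[a].agentName for a in e.linkingAgentIdentifier],
        )
        for e in linked
    ]


# Hypothetical sample data, for illustration only.
agents = [Agent("agent-1", "Example ingest service", "software")]
events = [
    Event("ev-2", "fixity check", "2007-06-01T09:00:00", "pass",
          ["agent-1"], ["obj-1"]),
    Event("ev-1", "ingestion", "2007-05-01T12:00:00", "success",
          ["agent-1"], ["obj-1"]),
]
obj = PremisObject("obj-1", "file", "0001.tif", ["ev-2", "ev-1"])

history = provenance(obj, events, agents)
```

Because the entities refer to each other only by identifier, a repository is free to store them in separate tables or files, which is consistent with the implementation-neutral stance described earlier.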

PREMIS maintenance activities
- First revision of Data Dictionary (PREMIS 2.0)
- Documenting errata and proposed revisions to Data Dictionary (feedback through PIG list): http://www.loc.gov/standards/premis/changes.html
- PREMIS Implementers’ Registry: http://www.loc.gov/standards/premis/premis-registry.html
- Consultancies (funded by Library of Congress):
  - Rights issues for digital preservation (Karen Coyle)
  - PREMIS implementation guidelines and recommendations (Deborah Woodyard-Robinson)
- PREMIS Tutorials: Glasgow, Boston, Stockholm, Albuquerque, Washington

What is MIX?
- Metadata for Images in XML
- An XML Schema designed for expressing technical metadata for digital still images
- Based on the NISO Z39.87 Data Dictionary – Technical Metadata for Digital Still Images
- Used to express attributes of digital images such as file format, file size, dimensions, resolution, compression, etc.
- Version 1.0 (recently released) includes support for GIS images and JPEG 2000 images; data element names harmonized with PREMIS
- Can be used standalone or as an extension schema with METS

How do these standards work together for digital libraries?
- A container format such as METS allows for packaging together forms of metadata with objects or pointers to objects
- There are about 5 years of experience experimenting with METS in combination with other standards for managing and using digital objects in digital libraries
- These standards are all freely available
- METS profiles detail how METS is used for particular object types or applications
- Best practices are needed (and being developed) for use of PREMIS with METS and MIX
- Using METS, MODS, PREMIS and MIX: http://www.loc.gov/premis/louis.xml
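
One way to picture the packaging is as a routing table from each form of metadata to a METS section. The Python sketch below is an assumption-laden illustration rather than an agreed best practice: the `PLACEMENT` table, the `package` function, the IDs and the placeholder payloads are hypothetical, the payloads stand in for real MODS, MIX and PREMIS XML, and the file section and structural map are left out for brevity.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def mets(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


# Form of metadata -> (METS section, MDTYPE attribute value).
# Assumed placement for this sketch; local profiles may choose differently.
PLACEMENT = {
    "descriptive": ("dmdSec", "MODS"),
    "technical": ("techMD", "NISOIMG"),  # MIX expresses NISO Z39.87
    "rights": ("rightsMD", "PREMIS"),
    "provenance": ("digiprovMD", "PREMIS"),
}

# Order in which METS expects the subsections of amdSec.
AMD_ORDER = ["techMD", "rightsMD", "sourceMD", "digiprovMD"]


def wrap(parent, section, md_id, md_type, payload):
    """Place one metadata payload inside a METS wrapper section."""
    sec = ET.SubElement(parent, mets(section), {"ID": md_id})
    md_wrap = ET.SubElement(sec, mets("mdWrap"), {"MDTYPE": md_type})
    ET.SubElement(md_wrap, mets("xmlData")).append(payload)
    return sec


def package(metadata):
    """Route (kind, id, payload) triples into one METS fragment."""
    root = ET.Element(mets("mets"))
    # Descriptive metadata sections come first.
    for kind, md_id, payload in metadata:
        section, md_type = PLACEMENT[kind]
        if section == "dmdSec":
            wrap(root, section, md_id, md_type, payload)
    # Everything else goes under one administrative metadata section.
    amd = ET.SubElement(root, mets("amdSec"))
    for wanted in AMD_ORDER:
        for kind, md_id, payload in metadata:
            section, md_type = PLACEMENT[kind]
            if section == wanted:
                wrap(amd, section, md_id, md_type, payload)
    return root


def placeholder(name):
    """Stand-in for a real MODS, MIX or PREMIS payload."""
    return ET.Element("placeholder", {"for": name})


fragment = package(
    [
        ("provenance", "DP1", placeholder("PREMIS event")),
        ("descriptive", "DMD1", placeholder("MODS record")),
        ("technical", "TM1", placeholder("MIX record")),
        ("rights", "RM1", placeholder("PREMIS rights")),
    ]
)
amd_sections = [child.tag for child in fragment.find(mets("amdSec"))]
```

The input order does not matter; the function emits the administrative subsections in the order the METS schema expects.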