Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
A centre of expertise in digital information management Approaches To The Validation Of Dublin Core Metadata Embedded In (X)HTML Documents Background The.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
Natalia Wehler: Dublin Core Requirements on Metadata  multiple softwares to use metadata  management of changing standards  needs to be functional,
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Introduction to Metadata for Cultural Heritage Organizations Jenn Riley Metadata Librarian Indiana University Digital Library Program.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
An introduction to metadata in digital projects Jenn Riley Metadata Librarian L566 Fall 2006.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials Arwen Hutt, University of Tennessee.
Is Dublin Core Dying? Kayla Willey – Brigham Young University Cheryl Walters – Utah State University Utah Library Association Annual Conference St. George,
Introduction to Metadata for Cultural Heritage Organizations Jenn Riley Metadata Librarian Indiana University Digital Library Program For technical support:
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Metadata for Music: Understanding the Landscape Jenn Riley Indiana University Digital Library Program.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Introduction to metadata
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Libraries and Museums Jenn Riley Metadata Librarian Indiana University Digital Library.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
A Whirlwind Tour Through Part of the Metadata Landscape Jenn Riley Metadata Librarian IU Digital Library Program.
21 June 2001Managing Information Resources for e-Government1 The Dublin Core Makx Dekkers, Managing Director, Dublin Core Metadata Initiative
Introduction to the Semantic Web and Linked Data
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Jenn Riley Metadata Librarian IU Digital Library Program
Metadata (and cataloging?) Jenn Riley Metadata Librarian IU Digital Library Program.
Sally McCallum Library of Congress
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Metadata for your Digital Collections Jenn Riley Metadata Librarian IU Digital Library Program.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The ___ is a global network of computer networks Internet.
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
Dublin Core Basics Workshop Lisa Gonzalez KB/LM Librarian.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Session 3 Metadata & Workflow
Dublin Core Metadata Initiative Abstract Model
Metadata Standards - Types
Introduction to Metadata
Lifecycle Metadata for Digital Objects
Attributes and Values Describing Entities.
Metadata to fit your needs... How much is too much?
A Whirlwind Tour Through Part of the Metadata Landscape
PREMIS Tools and Services
Introduction to Metadata
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Metadata in Digital Preservation: Setting the Scene
Some Options for Non-MARC Descriptive Metadata
Introduction to metadata for IDAH fellows
Attributes and Values Describing Entities.
Integrated Access and Shareable Metadata
Presentation transcript:

Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005

Descriptive metadata Enables users to find relevant materials Used by many different knowledge domains Many potential representations Controlled by Data structure standards Data content standards Data value standards

Some data structure standards Dublin Core (DC) Unqualified (simple) Qualified MAchine Readable Cataloging (MARC) MARC in XML (MARCXML) Metadata Object Description Schema (MODS)

How do I pick one? Genre of materials being described Format of materials being described Nature of holding institution Robustness needed for the given materials and users What others in the community are doing Describing original vs. digitized item Mechanisms for providing relationships between records Plan for interoperability, including repeatability of elements Formats supported by your delivery software More information on handout

Dublin Core (DC) 15-element set National and international standard 2001: Released as ANSI/NISO Z39.85 2003: Released as ISO 15836 Maintained by the Dublin Core Metadata Initiative (DCMI) Other players DCMI Working Groups DC Usage Board

DCMI mission The mission of DCMI is to make it easier to find resources using the Internet through the following activities: Developing metadata standards for discovery across domains, Defining frameworks for the interoperation of metadata sets, and, Facilitating the development of community- or disciplinary-specific metadata sets that are consistent with items 1 and 2

DC Principles “Core” across all knowledge domains No element required All elements repeatable 1:1 principle

DC encodings HTML <meta> XML RDF [Spreadsheets] [Databases]

Content/value standards for DC None required Some elements recommend a content or value standard as a best practice Coverage Date Format Language Identifier Relation Source Subject Type

Some limitations of DC Can’t indicate a main title vs. other subordinate titles No method for specifying creator roles W3CDTF format can’t indicate date ranges or uncertainty Can’t by itself provide robust record relationships

Good times to use DC Cross-collection searching Cross-domain discovery Metadata sharing Describing some types of simple resources Metadata creation by novices

By novices, by specialists, and by derivation DC [record] QDC [collection] MARC MARCXML MODS Record format XML RDF (X)HTML Field labels Text Reliance on AACR None Common method of creation By novices, by specialists, and by derivation

Qualified Dublin Core (QDC) Adds some increased specificity to Unqualified Dublin Core Same governance structure as DC Same encodings as DC Same content/value standards as DC Listed in DMCI Terms Additional principles Extensibility Dumb-down principle Extensibility: via Application Profiles and local qualifiers. Local qualifiers maybe not kosher but there are no metadata police. Usually.

Types of DC qualifiers Additional elements Element refinements Encoding schemes Vocabulary encoding schemes Syntax encoding schemes

DC qualifier status Recommended Conforming Obsolete Registered Recommended: Elements, Element Refinements, and DCMI-maintained Vocabulary Terms (e.g., member terms of the DCMI Type Vocabulary) useful for resource discovery across domains. Conforming: Elements, Element Refinements and Application Profiles may be assigned a status of conforming. Elements and Element Refinements assigned a status of conforming are those for which an implementation community has a demonstrated need and which conform to the grammar of Elements and Element Refinements, though without necessarily meeting the stricter criteria of usefulness across domains or usefulness for resource discovery. Obsolete: For Elements and Element Refinements that have been superseded, deprecated, or rendered obsolete. Such terms will remain in the registry for use in interpreting legacy metadata. Registered: Used for Vocabulary Encoding Schemes and language translations for which the DCMI provides information but not necessarily a specific recommendation.

Limitations of QDC Widely misunderstood No method for specifying creator roles W3CDTF format can’t indicate date ranges or uncertainty Split across 3 XML schemas No encoding in XML officially endorsed by DCMI

Best times to use QDC More specificity needed than simple DC, but not a fundamentally different approach to description Want to share DC with others, but need a few extensions for your local environment Describing some types of simple resources Metadata creation by novices

By novices, by specialists, and by derivation DC [record] QDC [collection] MARC MARCXML MODS Record format XML RDF (X)HTML Field labels Text Reliance on AACR None Common method of creation By novices, by specialists, and by derivation

MAchine Readable Cataloging (MARC) Format for the records in IUCAT and other OPACs Used for library metadata since 1960s Adopted as national standard in 1971 Adopted as international standard in 1973 Maintained by: Network Development and MARC Standards Office at the Library of Congress Standards and the Support Office at the National Library of Canada

More about MARC Actually a family of MARC standards throughout the world U.S. & Canada use MARC21 Structured as a binary interchange format ANSI/NISO Z39.2 ISO 2709 Field names Numeric fields Alphabetic subfields

Content/value standards for MARC None required by the format itself But US record creation practice relies heavily on: AACR2r ISBD LCNAF LCSH

Limitations of MARC Use of all its potential is time-consuming OPACs don’t make full use of all possible data OPACs virtually the only systems to use MARC data Requires highly-trained staff to create Local practice differs greatly

Good times to use MARC Integration with other records in OPAC Resources are like those traditionally found in library catalogs Maximum compatibility with other libraries is needed Have expert catalogers for metadata creation

By novices, by specialists, and by derivation DC [record] QDC [collection] MARC MARCXML MODS Record format XML RDF (X)HTML ISO 2709 [ANSI Z39.2] Field labels Text Numeric Reliance on AACR None Strong Common method of creation By novices, by specialists, and by derivation By specialists

MARC in XML (MARCXML) Copies the exact structure of MARC21 in an XML syntax Numeric fields Alphabetic subfields Implicit assumption that content/value standards are the same as in MARC

Limitations of MARCXML Not appropriate for direct data entry Extremely verbose syntax Full content validation requires tools external to XML Schema conformance

Best times to use MARCXML As a transition format between a MARC record and another XML-encoded metadata format Materials lend themselves to library-type description Need more robustness than DC offers Want XML representation to store within larger digital object but need lossless conversion to MARC

By novices, by specialists, and by derivation DC [record] QDC [collection] MARC MARCXML MODS Record format XML RDF (X)HTML ISO 2709 [ANSI Z39.2] Field labels Text Numeric Reliance on AACR None Strong Common method of creation By novices, by specialists, and by derivation By specialists By derivation

Metadata Object Description Schema (MODS) Developed and managed by the Library of Congress Network Development and MARC Standards Office First released for trial use June 2002 MODS 3.0 released December 2003 “Schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications.”

Differences between MODS and MARC MODS is “MARC-like” but intended to be simpler Textual tag names Encoded in XML Some specific changes Some regrouping of elements Removes some elements Adds some elements

Content/value standards for MODS Many elements indicate a given content/value standard should be used Generally follows MARC/AACR2/ISBD conventions But not all enforced by the MODS XML schema Authority attribute available on many elements

Limitations of MODS No lossless round-trip conversion from and to MARC Still largely implemented by library community only Some semantics of MARC lost

Good times to use MODS Materials lend themselves to library-type description Want to reach both library and non-library audiences Need more robustness than DC offers Want XML representation to store within larger digital object

Common method of creation DC [record] QDC [collection] MARC MARCXML MODS Record format XML RDF (X)HTML ISO 2709 [ANSI Z39.2] Field labels Text Numeric Reliance on AACR None Strong Implied Common method of creation By novices, by specialists, and by derivation By specialists By derivation By specialists and by derivation

Mapping between metadata formats Also called “crosswalking” To create “views” of metadata for specific purposes Mapping from robust format to more general format is common Mapping from general format to more robust format is ineffective

Types of mapping logic Mapping the complete contents of one field to another Splitting multiple values in a single local field into multiple fields in the target schema Translating anomalous local practices into a more generally useful value Splitting data in one field into two or more fields Transforming data values Boilerplate values to include in output schema

Common mapping pitfalls Cramming in too much information Leaving in trailing punctuation Missing context of records Meaningless placeholder data ALWAYS remember the purpose of the metadata you are creating!

No, really, which one do I pick? It depends. Sorry. Be as robust as you can afford Plan for future uses of the metadata you create Leverage existing expertise as much as possible Focus on content and value standards as much as possible

More information Dublin Core MODS MARC MARCXML DC Element Set version 1.1 DCMI Metadata Terms MODS MARC MARCXML

Questions? Jenn Riley, Metadata Librarian, IU Digital Library Program: jenlrile@indiana.edu These presentation slides: <http://www.dlib.indiana.edu/~jenlrile/presentations/bbspr05/descMDBB/>