Introduction to Metadata for Cultural Heritage Organizations
Jenn Riley, Metadata Librarian, Indiana University Digital Library Program

Presentation transcript:

Introduction to Metadata for Cultural Heritage Organizations
Jenn Riley, Metadata Librarian, Indiana University Digital Library Program
For technical support: Jonathan Robinson, x3971

Show Us Where You Are! On the toolbar, click on the yellow star, then click on your location!

Using the Centra Software
Microphone
Raising your hand
Green check / Red X
Laughing / Clapping
Stepping out
Text chat
Feedback
Full Screen
Exiting

A little about me

Many definitions of metadata
“Data about data”
“Structured information about an information resource of any media type or format.” (Caplan)
“Structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource.” (NISO)
“Metadata is constructed, constructive, and actionable.” (Coyle)
…

Metadata and cataloging
Depends on what you mean by:
  – metadata, and
  – cataloging!
But, in general:
  – Metadata is broader in scope than cataloging
  – Much metadata creation takes place outside of libraries
  – Good metadata practitioners use fundamental cataloging principles in non-MARC environments
Metadata can be created by humans, or by software programs
Metadata is NOT only for Internet resources!

Some types of metadata
Type | Use
Descriptive metadata | Searching, browsing, display, interoperability
Technical metadata | Interoperability, digital object management, preservation
Preservation metadata | Interoperability, preservation
Rights metadata | Interoperability, digital object management
Structural metadata | Navigation
Markup languages | Navigation, enhancing content

How metadata is used

Creating descriptive metadata
Digital library content management systems
  – CONTENTdm
  – ExLibris Digitool
  – Greenstone
  – DSpace
Library catalogs
Spreadsheets & databases
Directly in XML (only use in special cases!)

Creating other types of metadata
Technical
  – Generated by and stored in content management system
  – Generated by specialized tools such as JHOVE
  – Stored in separate Excel spreadsheet
Structural
  – Created and stored in content management system
  – METS XML
Content markup
  – In XML

Descriptive metadata
This is what we’ll cover today
Purpose
  – Discovery
  – Description to support use and interpretation
Some common general schemas
  – Dublin Core (simple and qualified)
  – MARC
  – MARCXML
  – MODS
LOTS of domain-specific schemas

Levels of control
Data structure standards (e.g., MARC, MODS)
Data content standards (e.g., AACR2r, RDA)
Encoding schemes
  – Vocabulary (a.k.a. controlled vocabularies)
  – Syntax
High-level models (e.g., FRBR, DCAM)
We’ll focus on structure standards today

Simple Dublin Core (DC)
15-element metadata structure standard
National and international standard
  – 2001: Released as ANSI/NISO Z39.85
  – 2003: Released as ISO 15836
Maintained by the Dublin Core Metadata Initiative
“Core” across all knowledge domains
No element required
All elements repeatable
Simple DC required for sharing metadata via the Open Archives Initiative Protocol for Metadata Harvesting
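To make "no element required, all elements repeatable" concrete, here is a minimal Python sketch that builds a simple DC record in the oai_dc XML container used by OAI-PMH. The helper name build_simple_dc and all field values are invented for illustration; only the two namespace URIs and the 15 element names come from the standards themselves.

```python
import xml.etree.ElementTree as ET

OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC_NS)
ET.register_namespace("dc", DC_NS)

# The 15 elements of simple (unqualified) Dublin Core.
DC_ELEMENTS = {
    "contributor", "coverage", "creator", "date", "description",
    "format", "identifier", "language", "publisher", "relation",
    "rights", "source", "subject", "title", "type",
}


def build_simple_dc(fields):
    """Build an oai_dc record from (element, value) pairs.

    Hypothetical helper: any element may be absent or repeated,
    mirroring "no element required, all elements repeatable".
    """
    root = ET.Element(f"{{{OAI_DC_NS}}}dc")
    for name, value in fields:
        if name not in DC_ELEMENTS:
            raise ValueError(f"not a simple DC element: {name}")
        child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        child.text = value
    return root


# Invented sample values; note the repeated subject element.
record = build_simple_dc([
    ("title", "Street scene, Bloomington"),
    ("creator", "Example, Photographer"),
    ("subject", "Streets"),
    ("subject", "Automobiles"),
    ("date", "1952-06-14"),
    ("type", "Image"),
])
dc_xml = ET.tostring(record, encoding="unicode")
print(dc_xml)
```

An empty field list also produces a structurally acceptable record here, which is exactly why local guidelines usually add their own minimum requirements on top of the standard.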

Content/value standards for DC
None required
No reason you can’t use AACR2!
Some elements recommend a content or value standard as a best practice
  – Relation
  – Source
  – Subject
  – Type
  – Coverage
  – Date
  – Format
  – Language
  – Identifier

Some limitations of DC
Can’t indicate a main title vs. other subordinate titles
No method for specifying creator roles
W3CDTF format can’t indicate date ranges or uncertainty
Can’t by itself provide robust record relationships
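The W3CDTF limitation can be seen with a small format check. The sketch below is an approximation written for this illustration (the name is_w3cdtf is hypothetical): it tests only the shape of the string, not whether the day exists in that month, and it shows that common ways of writing ranges or uncertainty fall outside the profile.

```python
import re

# Approximate W3CDTF shapes: YYYY, YYYY-MM, YYYY-MM-DD, and date-times
# with hours and minutes, optional seconds, and a required time zone.
W3CDTF = re.compile(
    r"\d{4}"
    r"(?:-(?:0[1-9]|1[0-2])"
    r"(?:-(?:0[1-9]|[12]\d|3[01])"
    r"(?:T(?:[01]\d|2[0-3]):[0-5]\d(?::[0-5]\d(?:\.\d+)?)?"
    r"(?:Z|[+-](?:[01]\d|2[0-3]):[0-5]\d))?"
    r")?)?"
)


def is_w3cdtf(value):
    """Return True if value has one of the W3CDTF shapes (format check only)."""
    return W3CDTF.fullmatch(value) is not None


for candidate in ["1952", "1952-06-14", "1950/1955", "ca. 1952", "1952?"]:
    print(candidate, is_w3cdtf(candidate))
```

Single dates pass, while the range and the two uncertain dates do not, so that information has to go somewhere else (or into a different date encoding) if it matters to users.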

Good times to use DC
Cross-collection searching
Cross-domain discovery
Metadata sharing
Describing some types of simple resources
Metadata creation by novices

Schema | Record format | Field labels | Reliance on AACR | Common method of creation
DC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
QDC | | | |
MARC | | | |
MARCXML | | | |
MODS | | | |

Qualified Dublin Core (QDC)
Adds some increased specificity to Unqualified Dublin Core
  – Additional elements
  – Element refinements
  – Encoding schemes (vocabulary and syntax)
Defined by DCMI Terms
Most implementations expand beyond official qualifiers
Same encodings as DC
Same content/value standards as DC
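Element refinements narrow the meaning of a simple DC element without changing what kind of statement it is, so a qualified record can be reduced to simple DC by replacing each refinement with the element it refines (DCMI calls this "dumbing down"). The sketch below is an illustration under stated assumptions: the REFINES table covers only a handful of DCMI refinements, and the function name and sample values are invented.

```python
# A partial table of DCMI element refinements and the simple DC
# element each one refines. Not a complete list.
REFINES = {
    "alternative": "title",
    "created": "date",
    "issued": "date",
    "modified": "date",
    "spatial": "coverage",
    "temporal": "coverage",
    "abstract": "description",
    "tableOfContents": "description",
    "extent": "format",
    "medium": "format",
}


def dumb_down(qualified_record):
    """Map (term, value) pairs to simple DC; unrefined terms pass through."""
    return [(REFINES.get(term, term), value) for term, value in qualified_record]


# Invented sample record.
qualified = [
    ("title", "Street scene, Bloomington"),
    ("alternative", "Kirkwood Avenue looking east"),
    ("created", "1952-06-14"),
    ("spatial", "Bloomington (Ind.)"),
]
print(dumb_down(qualified))
```

The reduced record is less specific (the main and alternative titles can no longer be told apart) but still usable by anything that understands simple DC. Locally invented qualifiers need their own entries in such a table before records are shared.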

Limitations of QDC
Widely misunderstood
No method for specifying creator roles
W3CDTF format can’t indicate date ranges or uncertainty
XML encoding has never been very stable; few implementations conform to newest DCMI proposed recommendation
The DCMI Abstract Model changes everything

Best times to use QDC
More specificity needed than simple DC, but not a fundamentally different approach to description
Want to share DC with others, but need a few extensions for your local environment
Describing some types of simple resources
Metadata creation by novices

Schema | Record format | Field labels | Reliance on AACR | Common method of creation
DC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
QDC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
MARC | | | |
MARCXML | | | |
MODS | | | |

MAchine Readable Cataloging (MARC)
Format for records in library catalogs
Used for library metadata since 1960s
  – Adopted as national standard in 1971
  – Adopted as international standard in 1973
Actually a family of MARC standards throughout the world
  – U.S. & Canada use MARC21
Field names
  – Numeric fields
  – Alphanumeric subfields

Content/value standards for MARC
None required by the format itself
But US record creation practice relies heavily on:
  – AACR2r
  – ISBD
  – LCNAF
  – LCSH

Limitations of MARC
Use of all its potential is time-consuming
OPACs don’t make full use of all possible data
OPACs virtually the only systems to use MARC data
Requires highly-trained staff to create
Local practice differs greatly

Good times to use MARC
Integration with other records in OPAC
Resources are like those traditionally found in library catalogs
Maximum compatibility with other libraries is needed
Have expert catalogers for metadata creation

Schema | Record format | Field labels | Reliance on AACR | Common method of creation
DC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
QDC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
MARC | ISO 2709 [ANSI Z39.2] | Numeric | Strong | By specialists
MARCXML | | | |
MODS | | | |

MARC in XML (MARCXML)
Copies the exact structure of MARC21 in an XML syntax
  – Numeric fields
  – Alphanumeric subfields
Implicit assumption that content/value standards are the same as in MARC
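The "exact structure" point is easiest to see in a fragment. The Python sketch below builds two MARCXML data fields with the standard library; the helper name datafield and the field contents are invented for illustration, and a full record would also carry a leader and control fields, which are omitted here.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
ET.register_namespace("marc", MARC_NS)


def datafield(record, tag, ind1, ind2, subfields):
    """Append one data field: numeric tag, two indicators, coded subfields."""
    field = ET.SubElement(
        record,
        f"{{{MARC_NS}}}datafield",
        {"tag": tag, "ind1": ind1, "ind2": ind2},
    )
    for code, value in subfields:
        sub = ET.SubElement(field, f"{{{MARC_NS}}}subfield", {"code": code})
        sub.text = value
    return field


# Invented sample content: a 245 title statement and a 650 subject heading.
record = ET.Element(f"{{{MARC_NS}}}record")
datafield(record, "245", "1", "0", [
    ("a", "Street scene, Bloomington /"),
    ("c", "photographed by an example creator."),
])
datafield(record, "650", " ", "0", [("a", "Streets"), ("z", "Indiana")])

marcxml = ET.tostring(record, encoding="unicode")
print(marcxml)
```

The XML carries the same numeric tags, indicators, and subfield codes as the MARC record, including ISBD punctuation in the data, which is why the syntax is verbose and why it converts back to MARC without loss.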

Limitations of MARCXML
Not appropriate for direct data entry
Extremely verbose syntax
Full content validation requires tools external to XML Schema conformance

Best times to use MARCXML
As a transition format between a MARC record and another XML-encoded metadata format
Materials lend themselves to library-type description
Need more robustness than DC offers
Want XML representation to store within larger digital object but need lossless conversion to MARC

Schema | Record format | Field labels | Reliance on AACR | Common method of creation
DC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
QDC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
MARC | ISO 2709 [ANSI Z39.2] | Numeric | Strong | By specialists
MARCXML | XML | Numeric | Strong | By derivation
MODS | | | |

Metadata Object Description Schema (MODS)
Developed and managed by the Library of Congress Network Development and MARC Standards Office
For encoding bibliographic information
Influenced by MARC, but not equivalent
Usable for any format of materials
First released for trial use June 2002
MODS 3.4 to be released soon

MODS differences from MARC
MODS is “MARC-like” but intended to be simpler
Textual tag names
Encoded in XML
Some specific changes
  – Some regrouping of elements
  – Removes some elements
  – Adds some elements

Content/value standards for MODS
Many elements indicate a given content/value standard should be used
  – Generally follows MARC/AACR2/ISBD conventions
  – But not all enforced by the MODS XML schema
Authority attribute available on many elements
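As a rough illustration of textual tag names and the authority attribute, the Python sketch below assembles a small MODS fragment. The helper el and the sample values are invented for this example; the element and attribute names (titleInfo, name, namePart, role, roleTerm, subject, topic, authority) are taken from the MODS schema.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"
ET.register_namespace("mods", MODS_NS)


def el(parent, name, text=None, **attrs):
    """Hypothetical helper: append a MODS element with optional text and attributes."""
    node = ET.SubElement(parent, f"{{{MODS_NS}}}{name}", attrs)
    node.text = text
    return node


mods = ET.Element(f"{{{MODS_NS}}}mods")

# Title: a readable tag name instead of MARC field 245.
title_info = el(mods, "titleInfo")
el(title_info, "title", "Street scene, Bloomington")

# Name with an explicit role, something simple DC cannot express.
name = el(mods, "name", type="personal")
el(name, "namePart", "Example, Photographer")
role = el(name, "role")
el(role, "roleTerm", "photographer", type="text")

# The authority attribute records which vocabulary the value is meant to come from.
subject = el(mods, "subject", authority="lcsh")
el(subject, "topic", "Streets")

mods_xml = ET.tostring(mods, encoding="unicode")
print(mods_xml)
```

Note that nothing in the XML schema checks that the topic really is a valid heading in the named vocabulary; the attribute only declares the intent, which is the "not all enforced" caveat above.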

Limitations of MODS
No lossless round-trip conversion from and to MARC
Still largely implemented by library community only
Some semantics of MARC lost

Good times to use MODS
Materials lend themselves to library-type description
Want to reach both library and non-library audiences
Need more robustness than DC offers
Want XML representation to store within larger digital object

Schema | Record format | Field labels | Reliance on AACR | Common method of creation
DC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
QDC | XML, RDF, (X)HTML | Text | None | By novices, by specialists, and by derivation
MARC | ISO 2709 [ANSI Z39.2] | Numeric | Strong | By specialists
MARCXML | XML | Numeric | Strong | By derivation
MODS | XML | Text | Implied | By specialists and by derivation

Visual Resources Association (VRA) Core
Grew out of a professional association
Separates Work from Image
Library focus
Inspiration from Dublin Core
Version 4.0 exists in “restricted” and “unrestricted” versions

Categories for the Description of Works of Art (CDWA) Lite
Reduced version of the Categories for the Description of Works of Art (512 categories)
From J. Paul Getty Trust
Museum focus
Conceived for record sharing

Structure standards for learning materials
Gateway to Educational Materials (GEM)
  – From the U.S. Department of Education
  – Based on Qualified Dublin Core
  – Adds elements for instructional level, instructional method, etc.
  – “GEM's goal is to improve the organization and accessibility of the substantial collections of materials that are already available on various federal, state, university, non-profit, and commercial Internet sites.”*
IEEE Learning Object Metadata (LOM)
  – Elements for technical and descriptive metadata about learning resources
* From

Encoded Archival Description (EAD)
Maintained by the Society of American Archivists
Markup language for archival finding aids
Designed to accommodate multi-level description
Requires specialized search engine
Delivery requires specialized software or offline conversion to HTML

Text Encoding Initiative (TEI)
Best Practices for TEI in Libraries
For encoding full texts of documents
  – Literary texts
  – Letters
  – …etc.
Requires specialized search engine
Delivery requires specialized software or offline conversion to HTML

How do I pick standards? (1)
Institution
  – Nature of holding institution
  – Resources available for metadata creation
  – What others in the community are doing
  – Capabilities of your delivery software
The standard
  – Purpose
  – Structure
  – Context
  – History

How do I pick standards? (2)
Materials
  – Genre
  – Format
  – Likely audiences
  – What metadata already exists for these materials
Project goals
  – Robustness needed for the given materials and users
  – Describing multiple versions
  – Mechanisms for providing relationships between records
  – Plan for interoperability, including repeatability of elements

Assessing materials for ease of metadata creation
Number of items?
Homogeneity of items?
Foreign language?
Published or unpublished?
Specialist needed?
How much information is known?
Any existing metadata?

Assessing currently existing metadata
Machine-readable?
Divided into fields?
What format?
What content standards?
Complete?
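When existing metadata is already machine-readable and divided into fields, the "Complete?" question can be answered with a quick count. The Python sketch below assumes a hypothetical spreadsheet export in CSV form; the column names, sample rows, and function name are all invented for illustration.

```python
import csv
import io


def field_completeness(csv_text):
    """Return, for each column, the fraction of rows with a non-blank value."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    if not rows:
        return {}
    return {
        field: sum(1 for row in rows if (row.get(field) or "").strip()) / len(rows)
        for field in reader.fieldnames
    }


# Invented sample export: four items, some fields left blank.
SAMPLE = (
    "identifier,title,date,creator\n"
    "img001,Street scene,1952,\n"
    "img002,,1953,Example Photographer\n"
    "img003,Market day,,\n"
    "img004,Parade,1955,\n"
)

report = field_completeness(SAMPLE)
for field, share in report.items():
    print(f"{field}: {share:.0%} filled")
```

A count like this says nothing about whether the values follow a content standard, but it quickly shows which fields would need the most work before the records could be mapped to DC, MODS, or MARC.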

Assessing software capabilities
Are there templates for standard metadata formats?
Can you add/remove fields to a template?
Can you create new templates?
Can you add additional clarifying information without creating a separate field?
  – Personal vs. corporate names
  – Subject vocabulary used
Is there an XML export? Does it produce valid records?

Building “Good digital collections” *
Interoperable – with the important goal of cross-collection searching
Persistent – reliably accessible
Re-usable – repositories of digital objects that can be used for multiple purposes
Good metadata promotes good digital collections.
* Institute of Museum and Library Services. A Framework of Guidance for Building Good Digital Collections. Washington, D.C.: Institute of Museum and Library Services, 3rd edition, December

Where your metadata can go
[Figure: diagram of destinations for shared metadata, with the labels “Collection Registries” and “?????”. Photograph from the Indiana University Charles W. Cushman Collection.]

Why share metadata?
Benefits to users
  – One-stop searching
  – Aggregation of subject-specific resources
Benefits to institutions
  – Increased exposure for collections
  – Broader user base
  – Bringing together of distributed collections
Don’t expect users will know about your collection or remember to visit it.

Sample OAI request:
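An OAI-PMH request is an ordinary HTTP request whose query string names one of six protocol verbs plus that verb's arguments. The Python sketch below only builds such a URL and does not contact any server; the base URL and set name are placeholders, not the request shown in the original slide.

```python
from urllib.parse import urlencode

# The six verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify", "ListMetadataFormats", "ListSets",
    "ListIdentifiers", "ListRecords", "GetRecord",
}


def oai_request_url(base_url, verb, arguments=None):
    """Build an OAI-PMH request URL from a verb and a dict of arguments."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb}
    query.update(arguments or {})
    return f"{base_url}?{urlencode(query)}"


# Placeholder repository address and set name, for illustration only.
url = oai_request_url(
    "https://repository.example.org/oai",
    "ListRecords",
    {"metadataPrefix": "oai_dc", "set": "photographs"},
)
print(url)
```

Asking for metadataPrefix=oai_dc requests simple Dublin Core, which every OAI-PMH data provider must support; a provider may offer richer formats such as MODS or MARCXML under other prefixes.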

Three possible OAI-PMH architectures
[Figure: three data provider setups, each harvested by an OAI Harvester]
  – Digital asset management system: metadata creation module, transformation, OAI data provider module
  – Metadata creation system, transformation, stand-alone OAI data provider
  – Metadata creation system, transformation, Static Repository Gateway
Formats shown: DC, QDC, MODS, MARCXML

Putting it all into practice
Develop written documentation
Develop a quality control workflow for metadata creation
Share your metadata openly
Discuss your findings with others
Get better with every new online collection

Thank you!
Please fill out class evaluation form –
  – You’ll get a Certificate of Participation for the class if you fill out the evaluation form
For more information:
  – Metadata librarians listserv:
My best advice:
  – Read
  – Talk to colleagues
  – Know WHY you are doing things the way you’re doing them