Metadata standards and interoperability. The world of standards A standard is any agreed-upon means of doing something. Standards can be formally created.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

Catherine Worrall Slide Library Co-ordinator, University College Falmouth.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
IMT530- Organization of Information Resources1 Feedback Like exercises –But want more instructions and feedback on them –Wondering about grading on these.
Presented by Karen W. Gwynn LS – Metadata University of Alabama Prof. Steven MacCall Spring 2011.
RDA Test “Train the Trainer Module 1: What RDA is and isn’t [Content as of Mar. 31, 2010]
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Chinese-European Workshop on Digital Preservation Beijing (China), July.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
Metadata standards Guidelines, data structures, and file formats to facilitate reliability and quality of description.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
Linked data the next network?. The Web of documents is for people The Web of data is for computers The Web of documents is difficult for computers to.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Semantics and Syntax of Dublin Core Usage in Open Archives Initiative Data Providers of Cultural Heritage Materials Arwen Hutt, University of Tennessee.
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
RDA Toolkit is an integrated, browser-based, online product that allow user to interact with a collection of cataloging-related documents and resources.
Metadata for Music: Understanding the Landscape Jenn Riley Indiana University Digital Library Program.
Metadata standards Guidelines, data structures, and file formats to facilitate reliability and quality of description.
Overview of EAD Jenn Riley Metadata Librarian Digital Library Program.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Evidence from Metadata INST 734 Doug Oard Module 8.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
The physical parts of a computer are called hardware.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Appropriate representation of the resource through metadata Metadata as a view of the resource Standards promote interoperability Appropriate formats Appropriate.
Future of Cataloguing: how RDA positions us for the future for RDA Workshop June, 2010.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Sally McCallum Library of Congress
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
Metadata standards and interoperability 384C – Organizing Information Spring 2016 Karen Wickett School of Information University of Texas at Austin.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
The Use of EAD in Archival Based Repositories
Metadata Standards - Types
Information modeling and infrastructures for metadata
INFORMATION STRUCTURES FOR VISUAL WORKS
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Metadata standards Guidelines, data structures, and file formats to facilitate reliability and quality of description INF 384 C, Spring 2009.
Applications of IFLA Namespaces
Attributes and Values Describing Entities.
Introduction to Metadata
Accommodating local cataloguing traditions in a global context
Some Options for Non-MARC Descriptive Metadata
RDA in a non-MARC environment
The new RDA: resource description in libraries and beyond
Attributes and Values Describing Entities.
FRBR and FRAD as Implemented in RDA
Presentation transcript:

Metadata standards and interoperability

The world of standards A standard is any agreed-upon means of doing something. Standards can be formally created and adopted or merely customary. With standards, products and processes have a certain level of consistency and predictability that can make production and use more efficient.

Goals of metadata standards Metadata standards enable more reliable and consistent description. For example, by agreeing to use separate fields to indicate first names and last names of resource creators, displays of search results by author can be properly alphabetized and more easily read, no matter if first name or last name comes first in the display. Reliable description facilitates the sharing of data across different systems—interoperability.

Interoperability: for money as well as love Interoperable records facilitate information access and exchange across contexts: not just for cultural heritage. A distributor like Amazon sells products from many different providers. To the extent that Amazon can get interoperable product records from its suppliers, its job is easier—and you can find that book, or pair of shoes, or compost bin.

Interoperability: for money as well as love Interoperable wine records? Astor Wines Wine.com Sherry-Lehmann 67 Wine

Types of standards Elings and Waibel describe four types of metadata standards: Data structure (attributes, elements, or fields): Dublin Core; CDWA (museums), EAD (archives), MARC (libraries). Data content (values): CCO (museums), RDA (libraries), DACS (archives). Data format: XML (aaaannddd...MARC; EAD is also built around XML). Data exchange: Z39.50 and OAI. These are useful categories, but standards may straddle them. You could say, for example, that MARC reflects RDA and not the other way around—although MARC defines data fields in a technical sense, RDA defines the content with which the fields are populated and to some degree conceptually determines the MARC fields; in practice these two become functionally intertwined.

Multiple standards at work A cataloger uses RDA to determine: That a book’s title should be part of its description. The wording, spelling, capitalization, and punctuation of the title. The cataloger uses MARC to record the title information in a consistent form that computers can process.

Multiple standards at work Two computer networks can use Z39.50 to determine how to exchange their MARC catalog records. The result? A user at Library A can search Library B’s catalog and not discern a difference in the way that information is structured and presented. It just works.

Multiple standards at work An archivist uses EAD to determine that an archival finding aid should include a scope and content note. The archivist uses DACS for guidance on what to include in the scope and content note and how to express that content.

Multiple standards at work The archivist uses EAD to include the scope and content note in a machine-accessible document. The result? A researcher can access finding aids from Archive A online, and these documents have similar content and structure. Other areas of the finding aid document might appear as links in the Scope and Content note.

Multiple standards at work A museum curator is documenting a new acquisition in proprietary museum database software. The collection management system includes a field for the “Work Type,” which is a core attribute from CDWA. Guidance for describing the work type is given in CCO. The Art and Architecture Thesaurus (AAT) includes vocabulary terms that can be used to describe the work type.

Multiple standards at work Later, collection data is mapped from CDWA (data structure) to the Europeana Data Model (EDM) (data structure), for aggregation into Europeana and subsequent data reuse. In this mapping, the proprietary database format (data format) is translated to the EDM’s RDF/XML schema (data format).

Developing and adopting standards Organizations agree to adopt standards because the benefits of creating products or services that work together can be great. However, developing standards and forging that agreement can be a difficult process. For metadata content standards, using them can be complicated, and there is plenty of room for interpretive flexibility.

Content standards: considerations Why are content standards so complicated? Because documents are various! Most content standards will try to implement a few basic guidelines supplemented by rules and options for special cases. Ideally, the basic guidelines will be based on clearly articulated goals and principles.

Example: RDA goals RDA has articulated a concrete set of descriptive goals and principles. A few goals: Enable description of any resource (not just printed materials). Align with the FRBR conceptual model (works, expressions, manifestations, resources) and its objectives (finding, selecting, understanding, and so on). Create content descriptions that can be used in multiple encodings and displays. Retain backward compatibility with existing records.

Example: RDA Principles One principle is that descriptions should reflect “the resource’s representation of itself.” This is a longstanding principle in library cataloging: where possible, description = transcription. This can be linked to the objective of finding known items: the catalog description should match how the item is known to others, which is most likely from the item itself.

Example: RDA guidelines This principle of transcription underlies the basic guideline for RDA titles, which is that the “title proper” or primary title should come from the preferred source of information, which for books is the title page. While the wording comes from the title page, though, the capitalization and punctuation are standardized for all titles.

Example: RDA special cases What if... Some introductory words on the title page seem like they’re not really part of the title (e.g., Walt Disney Presents Sleeping Beauty)? The title is given in two languages (e.g., Canadian Literature/Literature Canadienne)? There is a spelling mistake in the title? The document is a manifestation of a commonly known work but has a slightly different title than most manifestations (e.g., William Shakespeare’s Hamlet)? A subtitle appears under what seems to be the main title (e.g., Museum Informatics an introductory textbook)? The title is over one paragraph long?

Keeping standards relevant Standards are immediately out of date. Particular institutions, such as the Library of Congress, will issue their own rules for interpreting the standards, which smaller organizations (such as the University of Texas) may or may not choose to adopt.

Levels of interoperability Different kinds of standards enable different kinds of interoperability. Let’s say someone gives you a metadata record to incorporate in your database of records from your schema. What can you do with it? Your computer can read the file—system interoperability. Your database understands the file format—syntax interoperability. The attributes match other records in the database—structural interoperability. The values in the fields are consistent with other records in the database—semantic interoperability.

Derivation New schemas are subsets, supersets, or direct translations of existing schemas: CDWA Lite is a subset of CDWA (removes some attributes). French Dublin Core is a translated version of Dublin Core (same attributes, different labels). Gateway to Educational Materials (GEM) adds elements to Dublin Core.

Application profiles Application profiles mix attributes from different existing schemas or mix usage rules for attributes from different existing schemas. The application profile for the Digital Public Library of America (DPLA) uses elements from: Dublin Core. The Europeana data model (EDM). A “Basic Geo” schema created by the W3C (wgs84) for simple geographic information. The DPLA itself (published separately from the profile).

Crosswalks Crosswalks are mappings between one schema to another. For example, a crosswalk might specify that the Title element in CDWA should be mapped to the Title element in Dublin Core. Crosswalks can map only schema elements that are semantically equivalent, or they can map semantically “close” elements to each other.

Switching languages Switches map multiple schemas to a single switching language. For example, multiple content schemas could all be mapped to Dublin Core. The Dublin Core content could then in turn be mapped to something else. (This is more efficient than mapping each individual schema to the result.) Imagine a multilingual conversation in which everyone has a different native language but speaks French...

Frameworks A basic set of concepts and specifications that are agreed upon by a particular group. For example, the Warwick Framework is an early specification that designates the idea of a “container” as an aggregation of metadata sets, or “packages.” Agreements on ideas like containers and packages facilitate the sharing of different sorts of units. (The DPLA, for example, relies on “service hubs” that aggregate metadata sets from individual contributing institutions.)

Registries Registries publish information about metadata schemas. Registries constitute reference information that facilitate the development of new application profiles, crosswalks, and so on. Open Metadata Registry

Aggregated infrastructures Some examples of systems that are enabled via all of this stuff: Europeana, the European cultural heritage data aggregation.Europeana, The Digital Public Library of America (DPLA).Digital Public Library of America Europeana and the DPLA describe themselves primarily as platforms: they want you (really, they want you) to create applications and other cool stuff with the data (really metadata) that they aggregate and publish.

Schema assignment notes Consider whether attributes should be: Mandatory or optional. Repeatable. You might include general guidelines that apply to all attributes in your schema, as well as guidelines for each attribute. (Check the CDP best practice guidelines for an example.)