A Logical Framework for Metadata Interoperability 16th August 2007 The Advanced Digital Library Seminar 2007 Guilin, China.

Slides:



Advertisements
Similar presentations
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Advertisements

DC Architecture WG meeting Monday Sept 12 Slot 1: Slot 2: Location: Seminar Room 4.1.E01.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
An ontology server for the agentcities.NET project Dr. Manjula Patel Technical Research and Development
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
From content standards to RDF Gordon Dunsire Presented at AKM 15, Porec, 2011.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
CS570 Artificial Intelligence Semantic Web & Ontology 2
Pete Johnston & Andy Powell, Eduserv Foundation 28 June 2006 Update.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
A centre of expertise in digital information management UKOLN is supported by: XML Schema for DC Libraries AP DC Libraries WG Meeting,
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Chinese-European Workshop on Digital Preservation Beijing (China), July.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
The role of metadata schema registries XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
MPEG-21 : Overview MUMT 611 Doug Van Nort. Introduction Rather than audiovisual content, purpose is set of standards to deliver multimedia in secure environment.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
Metadata Modularization Concepts and Tools Carl Lagoze CS
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Moving from a locally-developed data model to a standard conceptual model Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Creating an Application Profile Tutorial 3 DC2004, Shanghai Library 13 October 2004 Thomas Baker, Fraunhofer Society Robina Clayphan, British Library Pete.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Modularization and Interoperability: Dublin Core and the Warwick Framework Sandra D. Payette Digital Library Research Group Cornell University November.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Application Profiles: interoperable friend or foe? Rachel Heery Michael Day TEL Milestone Conference Frankfurt am Main, April 2002.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
RELATORS, ROLES AND DATA… … similarities and differences.
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Metadata Interaction, Integration, and Interoperability MODS, MARC and Metadata Interoperability, ALA Conference, June 27, 2005, Chicago, IL William E.
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lotzi Bölöni.
Metadata Interaction, Integration, and Interoperability NISO Workshop: Metadata Practices on the Cutting Edge, May 20, 2004, Washington, DC William E.
Application Profiles Application profiles -- are schemas which consist of data elements drawn from one or more namespaces, combined together by implementers,
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
DC Architecture WG meeting Wednesday Seminar Room: 5205 (2nd Floor)
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Dublin Core Basics Workshop Lisa Gonzalez KB/LM Librarian.
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Metadata standards and interoperability 384C – Organizing Information Spring 2016 Karen Wickett School of Information University of Texas at Austin.
The Semantic Web By: Maulik Parikh.
Introduction to Metadata
Lifecycle Metadata for Digital Objects
Attributes and Values Describing Entities.
LOD reference architecture
The new RDA: resource description in libraries and beyond
Attributes and Values Describing Entities.
Presentation transcript:

A Logical Framework for Metadata Interoperability 16th August 2007 The Advanced Digital Library Seminar 2007 Guilin, China

Contents 1. Metadata interoperability in libraries 2. Metadata interoperability 3. Two interoperability prototypes 4. Consistency issues 5. Models and interoperability 6. Semantics and syntax of metadata 7. Interoperability across models 8. Dublin Core application profile 9. Discussion: library metadata practices

1. Metadata interoperability in libraries Metadata interoperability is increasingly accepted as something that libraries have to deal with.

New challenges for libraries A new information environment – Variety of resources – Distributed locations – Heterogeneous structures

New challenges for libraries New library services – putting everything together

New challenges for libraries New library system architecture

WorldCat WorldCat achieves interoperability across geographically distributed library catalogues with a unique metadata system, MARC21. Centralized architecture.

Z39.50 Z39.50 performs interoperability with decentralized architecture.

2. Metadata interoperability When we talk about interoperability in relation to metadata, we are generally talking about search interoperability, or the ability to perform a search over a diverse set of metadata records to obtain meaningful results. (Caplan, 2003) Caplan, P. (2003). Metadata Fundamentals for All Librarians: American Library Association

Levels of metadata interoperability Schema level – focused on interoperability between elements of the schemas. Record level – integrate and convert metadata records. Repository level – harvested or integrated records from varying sources, mapping value strings. (Chen & Zeng, 2006) Chen, L. M., & Zeng, M. L. (2006). Metadata Interoperability and Standardization – A Study of Methodology Part I. D-Lib Magazine, 22(6).

Machine-understandable When does “metadata interoperability” become a problem? – Human understandable – Machine understandable The core issue of metadata interoperability is how to make the relationships among metadata systems machine-understandable.

Machine-understandable Real world Human understandable Metadata Machine world Machine understandable

3. Two interoperability prototypes There are two basic mechanisms to achieve interoperability between metadata systems: mapping and integrating. Mapping is a process of matching original metadata systems to target metadata systems. It assigns a binary relationship between a pair of members in different metadata systems. Integrating is a process of combining different metadata systems together into a unique metadata system. The principle of the federating is integrating different metadata systems into one metadata system, both on the record level and the element level, and making them work together.

Mapping and Integrating Metadata Interoperability Mapping Integrating Crosswalk Switching schema Metadata packages Warwick Framework Element reusing Application Profile

Crosswalk Element 1 Element 2 Element 3 Element n Element 1 Element 2 Element 3 Element n Metadata AMetadata B

Switching schema MARC-XML GEM Dublin Core MARC ONIX EDA Interoperable Core Godby, C. J., Smith, D., & Childress, E. (2003). Two paths to interoperable metadata. Retrieved 31 July, 2005, from

Metadata packages: Warwick Framework Metadata module –Container –Packages Metadata set Indirect Container Lagoze, C. (1996). The Warwick Framework [Electronic Version]. D-Lib Magazine. Retrieved 8 July, 2007 from

Application profiles The principle of the application profile approach is the reuse of existing metadata schemas. Application profiles consist of data elements drawn from one or more namespace schemas combined together by implementors and optimised for a particular local application. (Heery & Patel, 2000) Heery, R., & Patel, M. (2000). Application profiles: mixing and matching metadata schemas [Electronic Version]. Ariadne. Retrieved July 8, 2007 from

Canada government metadata project: an example of application profiles Devey, M. & Côté, M. (2006). The development and use of metadata application Profiles: the Government of Canada experience. The Serials Librarian, 51(2)

4. Consistency issues “ Two communities may agree about the meaning of the term title or creator or identifier, but until they have a shared convention for identifying and encoding values, they cannot easily exchange their metadata. ” (Duval, Hodgins, Sutton, & Weibel, 2002) Duval, E., Hodgins, W., Sutton, S., & Weibel, S. L. (2002). Metadata Principles and Practicalities [Electronic Version]. D-Lib Magazine, 8. Retrieved July 10, 2007 from

Barriers to interoperability Semantic inconsistency e.g. dc.Title vs. MARC.245 dc.Creator vs. EAD. Author Syntax inconformity e.g. ISO2709 vs. XML Representation inconformity e.g. Hillmann, Diane I. vs. Hillmann, D. I. Vocabulary inconformity e.g. LCSH vs. MeSH

5. Models and interoperability A metadata model is an abstract construct that represents metadata by a set of components and a set of logical relationships between them. A metadata model may help us to gain a better understanding of metadata and its relationship with the real word, as well as the relationship within an encoding system that is machine-understandable. A metadata model facilitates the development of better interoperability between metadata systems.

Models and interoperability Real World Machine World (i.e. computer) Metadata System Metadata Model Better understanding Interoperability Better understanding

DC Abstract Model (DCAM) Dublin Core Abstract Model is for encoding metadata. It acts as a grammar for Dublin Core. DCAM is intended to be independent from any specific encoding syntax, such as RDF framework, and to be an universal meta syntax model for metadata encoding.

Principles of DC Abstract Model (DCAM) One to one principle A description describes exactly one resource. Dumb-down principle Refinements principle for narrower-broader relationship consistency Appropriate values Well-typed values to ensure that the usage of metadata elements in a particular context will be well guided

One to one principle: Based on FRBR Model A Book Title Publisher Subject ISBN Author Subject SchemeTerms Author Name Address Description A Description B Description C Group 1 : work Group 3 : Subject Group 2 : People, Co.

DCAM resources model Any object that the metadata is intended to describe is a resource. A resource is composed of property/value pairs. Each value is either a literal value or a non- literal value. Property Value literal Non-literal Resourc e

Property/value pair with different formats Property/value pairs are the core of the DCAM. They are the minimal semantic units of metadata.

DCAM description set model Property/value pairs are represented by a statement which is made up of a property URI/value surrogate pair. A set of statements, which describes a resource, is a description. A set of descriptions is a description set. Property URIValue Surrogate Statement Description Description Set

Description set model Description Statement Property -Value

DCAM vocabulary model Vocabulary is a set of terms. The terms denote the semantics both of the property and the value. In the DCAM model, the vocabulary may be one of three things, value vocabulary, property vocabulary, or class. Property vocabulary includes metadata elements set and it’s definitions. Value vocabulary includes vocabulary encoding schemes, e.g. LCSH, DDC; syntax encoding schemes, e.g or or

DCAM : a summary Domain Model Syntax Model

6. Semantic and syntax of metadata The semantics of metadata is about machine understandable meanings of the metadata. All of machine understandable meanings are inherited from human understandable meanings. So the human understandable meaning space of the metadata is also a significant part of the metadata model. The vocabulary model of metadata is the human understandable semantic space that defines and identifies the attributes and content of the resource. It is also a mechanism which transforms the meanings between humans and machines.

Domain models The meanings of the two functional components, property and value, are based on a general view of the resource which the metadata intends to describe. Property denotes the attributes of the resource, therefore, definitions of property relies on how the resource is structured into a set of attributes. The value is extracted from the resource under guidelines for selecting, formulating and presenting values. The general view of the resource can be indicated by domain models, such as FRBR in the bibliography domain and CIDC CRM in the cultural heritage domain.

Syntax of metadata The syntax of metadata is about machine processable formats. Metadata should be encoded into a machine readable format e.g. MARC, XML, RDF, etc. following standards for encoding. These standards are designed to provide a common way to describe metadata information and make it easily read and understood by computer applications. The framework for encoding metadata into a particular kind of format is the syntax of the metadata.

Metadata model structure Semantic Domain ModelFRBR, ABC, CRM Vocabulary Model Property Model Metadata elements schemes Value Model Vocabulary encoding schemes and syntax encoding schemes, AACRII, RDA Resources ModelProperty /Value Pairs Syntax Description Set Model Syntax ModelRDF, SKOS

7. Interoperability across models Metadata interoperability may occur between each layer of the metadata models. Domain Model Property Model Value Model Vocabulary Model Resource Model Description Set Model Syntax Model Semantic Syntax Domain Model Property Model Value Model Vocabulary Model Resource Model Description Set Model Syntax Model Semantic Syntax

Semantic interoperability Interoperability at the semantic layer makes agreement between semantic models, such as metadata element sets, encoding schemes and, at the top level, domain models. It must be seriously considered when achieving an interoperability at schema level. Terminology Harmonization Elements Mapping Common Ontology

Terminology harmonization The simplest way to achieve semantic interoperability is by extracting a common set of terminology for metadata standards, (Pierre & LaPlant, 1998) which is called harmonization. However, terminology can not carry whole meanings for achieving semantic interoperability. Sometimes, the precise definition of the same terms are implemented differently. Pierre, M. S., & LaPlant, W. P. (1998). Issues in Crosswalking Content Metadata Standards [Electronic Version]. Retrieved July 8, 2007 from

Elements mapping

Interoperability between domain models To achieve perfect semantic interoperability, the property in each of the metadata systems should be defined under the same conceptual framework. This common conceptual framework is a common ontology. An ontology is a formal explicit description of concepts. It consists of classes, slots, facets, and instances. An ontology of metadata consists of a set of individual instances of concepts, and constitutes a knowledge base for metadata interoperability. A common ontology can sufficiently present the semantics of metadata using knowledge representation languages, such as OWL et al., and achieve common understanding of metadata across different domains.

ABC model, example of common ontology ABC ontology, developed within Harmony International Digital Library Project, provides a common conceptual model to facilitate interoperability between metadata ontologies from different domains. Lagoze, C., & Hunter, J. (2001). The ABC Ontology and Model [Electronic Version]. Journal of Digital Information, 2. Retrieved July 2007 from

Interoperability between domain models Ontology mapping between ABC and CRM Doerr, M., Hunter, J., & Lagoze, C. (2003). Towards a Core Ontology for Information Integration [Electronic Version]. Journal of Digital Information, 4. Retrieved July, 2007 from (Doerr, Hunter, & Lagoze, 2003)

Interoperability between domain models Extending ABC with MPEG-7 Hunter, J. (2003). Hunter, J. (2003). Enhancing the semantic interoperability of multimedia through a core ontology. IEEE Transactions on Circuits and Systems for Video Technology, 13(1),

Syntax interoperability The interoperability at the syntax layer is to ensure format conformity. Syntax or format inconformity often happens during metadata record conversion at the record level of metadata interoperability. For example, when we convert a MARC record into MARCXML format, we must follow exactly the syntax of MARCXML. e.g |a Arithmetic / |c Carl Sandburg. Arithmetic / Carl Sandburg a Arithmetic / |c Carl Sandburg.

Syntax interoperability The approach to metadata interoperability at the syntax level is now more focused on developing syntax- independent data models. The models could be equally applicable in a variety of syntax contexts. According to such models, metadata property/value pairs and their statements can be encoded with different encoding frameworks, such as RDF,, SHOE etc..

In DCAM, a value in the property/value pair could be another resource, which is identified by URI. It means that a resource could be described by another resource. This principle provides a framework which links different metadata description sets together URI and interoperability Metadata B Metadata A Property Value URI Resource A Property URI Value Resource B

8. Dublin Core application profile CEN MMI-DC Workshop, a research group of the European Committee for Standardization, developed application profile guidelines to give guidance on how to create a customized or adapted metadata schema for a particular application. DCAP is based on Dublin Core. It allows people to extend Dublin Core by drawing the elements from other existing metadata schema or creating new elements to meet particular needs. DCAP follows DCAM, and especially follows the Principle of Appropriate Identification, Principle of Readability. (CEN MMI-DC Workshop, 2003) CEN MMI-DC Workshop. (2003). Dublin Core Application Profile Guidelines. Retrieved July, 2007, from ftp://ftp.cenorm.be/PUBLIC/CWAs/e-Europe/MMI-DC/cwa Nov.pdfftp://ftp.cenorm.be/PUBLIC/CWAs/e-Europe/MMI-DC/cwa Nov.pdf

Dublin Core application profile Name of Termtitle Term URIhttp://purl.org/dc/elements/1.1/title LabelTitle Defined Byhttp://dublincore.org/documents/dcmi-terms/ Source DefinitionA name given to the resource DC-Lib Definition- Source CommentsTypically, a title will be a name by which the resource is formally known. DC-Lib CommentsA parallel/transliterated title is considered a main title, i.e. the Title element is repeated. Either a title or identifier is mandatory. If no title is available, best practice is to give a constructed title, derive a title from the resource or supply [no title]. If using qualified Dublin Core, an element refinement for titles other than the main title(s) should be included. Retain initial articles and use local sorting algorithms based on language. A language qualifier may be used to indicate language of title if appropriate. (For example, see: Initial Definite and Indefinite Articles for a list of articles in various languages). Type of termelement Refines Refined Byalternative Has Encoding Scheme ObligationM Occurence CEN MMI-DC Workshop. (2003). Dublin Core Application Profile Guidelines. Retrieved July, 2007, from ftp://ftp.cenorm.be/PUBLIC/CWAs/e-Europe/MMI-DC/cwa Nov.pdfftp://ftp.cenorm.be/PUBLIC/CWAs/e-Europe/MMI-DC/cwa Nov.pdf

9. Discussion: library metadata practices Information organizing paradigm in library community shifts Single metadata schema to multiple metadata schemas. Application profile becomes cornerstone of information organization in library community. Working for both users and machines – new library professional.

Questions and Thanks! Special thanks to Dr. MARCIA LEI ZENG Professor of School of Library and Information Science, Kent State University, U.S.A. Dr. LIU WEI Senior Research Librarian of Shanghai Library, China Ms. KAAREN HIYAMA Asian Languages Librarian, Asian Studies Subject Librarian of University of Auckland Library, New Zealand for their help and contribution with this presentation.