RDA data and context Gordon Dunsire

Slides:

Advertisements

Similar presentations

Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011

Advertisements

Interoperability and semantics in RDF representations of FRBR, FRAD and FRSAD Gordon Dunsire Presented at the Cologne Conference on Interoperability and.

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.

From content standards to RDF Gordon Dunsire Presented at AKM 15, Porec, 2011.

How to publish local metadata as linked data Gordon Dunsire Presented at Linked Open Data: current practice in libraries and archives (3rd Linked Open.

RDA: a new international standard for resource discovery and access Gordon Dunsire Based on a presentation created by the Joint Steering Committee for.

IFLA Namespaces Gordon Dunsire Chair, IFLA Namespaces Technical Group Session 204 — IFLA library standards and the IFLA Committee on Standards – how can.

National libraries and identity in the Semantic Web Gordon Dunsire BNE, Madrid, 14 Dec 2011.

Engaging with RDA: governance and strategy Gordon Dunsire Presented to RDA Forum, ALA Annual 2015, San Francisco, USA.

Multilingual Issues in the Representation of International Bibliographic Standards for the Semantic Web Gordon Dunsire Independent Consultant; Chair of.

RDA data and applications Gordon Dunsire Presented to staff of the British Library, Boston Spa, 20 Mar 2014.

Report on recent activity of the IFLA Namespaces Task Group and the DCMI/RDA Task Group Gordon Dunsire Presented to the Semantic Web Special Interest Group,

Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.

Linked Data by Dr. Barbara B. Tillett Chief, Policy and Standards Division Library of Congress For Texas Library Association Conference April 12, 2011.

RDA for linked data Gordon Dunsire Presented to CILIP Linked Data Executive Briefing 24 November 2015, London.

Future of Cataloguing: how RDA positions us for the future for RDA Workshop June, 2010.

RDA data capture and storage Gordon Dunsire Chair, RDA Steering Committee Presented to Committee on Cataloging: Description and Access II (CC:DA) - ALCTS.

RSC Strategy and RDA Internationalization Gordon Dunsire, Chair, RDA Steering Committee Presented at Selmathon 2, 10 May 2016, Stockholm, Sweden.

RDA and Linked Data Gordon Dunsire Presented at Cita BNE - RDA and Linked Data, 15 April 2016, Madrid, Spain.

RDA and Linked Data Gordon Dunsire Presented at Selmathon 1, 9 May 2016, Stockholm, Sweden.

Marathon RDA Gordon Dunsire Chair, RDA Steering Committee Presented at Bibliothèque national de France, Paris, 2 May 2016.

RSC Strategy and RDA Internationalization Gordon Dunsire, Chair, RDA Steering Committee Presented at the Cervathon, 14 April 2016, Madrid, Spain.

RDA: international linked data for cultural heritage resources Gordon Dunsire Presented to ALCTS/LITA session, ALA Annual, Orlando, USA, June 25, 2016.

Engineering, 7th edition. Chapter 8 Slide 1 System models.

Where is RDA in the bibliographic universe? Gordon Dunsire Presented to Standard RDA – korzyści i problemy związane z jego wdrożeniem = RDA standard –

RDA and linked data Gordon Dunsire Presented to Code4Lib Ottawa, MacOdrum Library, Carleton University, Ottawa, 27 April 2016.

RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee

LRM-RDA Gordon Dunsire

RDA work plan: current and future activities

Aligning RDA with the LRM

RDA, linked data, and development

Authority versus authenticity: the shift from labels to identifiers

From the old to the new… Towards better resource discoverability

RDA Update Edinburgh 2017 Gordon Dunsire

Instructions, interfaces, and interoperable data: the RIMMF experience with RDA Gordon Dunsire Presented to Session 093 Let’s make IT usable! Formats,

RDA, linked data, and development

Telling tails: metadata standards and the digital humanities

IFLA FRBR-Library Reference Model and RDA

LRM and RDA : overview of the 3R Project

Lifecycle Metadata for Digital Objects

Recording RDA data as linked data

RDA and the IFLA Library Reference Model (LRM)

UNIMARC and linked data

RDA, linked data, and update on development

Applications of IFLA Namespaces

The new RDA: resource description in libraries and beyond

A model to link them all IFLA-LRM as a driver for harmonization of cataloguing standards related to serials and other continuing resources Gordon Dunsire.

RDA and translations Gordon Dunsire, Chair, RSC

Appellations, Authorities, and Access

LRM-RDA Gordon Dunsire.

Attributes and Values Describing Entities.

Metadata for research outputs management

Introduction into Knowledge and information

RDA and practical linked open data

Gordon Dunsire, Françoise Leresche, Mirna Willer

Accommodating local cataloguing traditions in a global context

From Big Bang to beta An overview of the 3R Project

Cataloguing with RDA Gordon Dunsire, Chair, RSC

RDA and semantic data Gordon Dunsire

Introducing IFLA-LRM Gordon Dunsire, Chair, RSC

Metadata in Digital Preservation: Setting the Scene

Extending RDA (briefly)

RDA cataloguing and linked data

RDA in a non-MARC environment

Content of beta RDA A brief overview Gordon Dunsire, Chair, RSC

RDA Community and linked data

The new RDA: resource description in libraries and beyond

Describing Documents Ch3 in textbook Organizing Knowledge: An

RDA linked data update Gordon Dunsire, RDA Technical Team Liaison Officer Presented at RDA Linked Data Forum, ALA Midwinter January 28, 2019, Seattle,

Future directions for RDA

Presentation transcript:

RDA data and context Gordon Dunsire Presented at "Desarrollo de RDA y casos de implementación en Europa" National Library of Spain, Madrid, 27 Oct 2017

The digital deluge Libraries are facing a flood of digital resources: "Born digital" publications Digitization of publishers' back catalogues Mass digitization of library collections Self-published resources: Blogs Email Twitter Many libraries are adding an increasing number of information resources in digital formats to their collections. These come from a variety of sources. The costs of digitization have reduced significantly in recent years, and the tools for publishing information online are freely and openly available to billions of people.

Costs of cataloguing Libraries must acquire digital resources : Electronic legal deposit National bibliographies Special collections Digitization for access and preservation National libraries and bibliographic agencies must obtain and create digital resources as part of their collecting and access policies. At the same time, there is usually little or no extra funding assigned to catalogue these important resources. In many cases, overall funding for professional cataloguing has been reduced. This may be due to the "Google effect": web search engines are seen as sufficient for general information retrieval requirements, despite their lack of precision and coverage. But there is no extra money for professional metadata creation  Just use Google!***?

Sources of metadata Professional cataloguers Smart, standard, and expensive Amateur cataloguers (crowd-sourcing) Smart, non-standard, and voluntary Publishers Smart, non-standard, and reluctant They have costs too At the same time, libraries are using metadata from a wider variety of sources. These metadata are created by different kinds of agent in different contexts. The result is a mix of metadata that range from standard to non-standard, in varying degrees of completeness and accuracy. Machines Dumb, cheap, and slaves 24/7, no food, shelter, etc.

The metadata maelstrom Libraries must obtain metadata from a variety of sources Get it when you can; take it when offered Metadata can be standard, non-standard, smart, dumb, incomplete, contextual, and non-contextual Created for non-library applications Libraries cannot afford to create their metadata by themselves, and must accept and use whatever metadata are available. It is a challenge for libraries to incorporate this data into information discovery and access services. But it must be processed for coherent and consistent information retrieval applications

(Meta)Data values Data value = "descriptive string"  Attribute element Data value = identified thing  Relationship element (local) Linked data value = URI  Relationship element (global) The values assigned in metadata fall into two distinct categories: attribute and relationship. A string value that describes an entity in itself is an attribute. A related entity can be indicated by a string identifier that is assigned in a local relational data system, or by a URI designed to have global applicability. Linked open data and the Semantic Web use the more fundamental distinction between a string value and a "thing" value represented by a IRI (an extended form of URI). The new IFLA Library Reference Model is optimized for Semantic Web applications, and reflects the dual string and thing categories by blurring the distinction between attributes and relationships. IFLA Library Reference Model has string/thing duality

RDA database scenarios Flat-file data Catalogue cards, printed bibliographies, etc. Description and access Bibliographic and authority records Relational databases RDA has always accommodated a variety of scenarios for storing and processing RDA metadata. The RDA Steering Committee is now developing RDA to accommodate linked data for Semantic Web applications. Linked data in the Semantic Web

RDA development contexts (2-3 years) Governance transition to international representation Strategic communities: international, cultural heritage, linked data IFLA Library Reference Model (LRM) and other international standards (IFLA, ISSN, …) RDA is being developed in several contexts over the next few years. The composition of the RDA Steering Committee and RDA Board is moving from its Anglo-American basis to international representation. The RDA Board strategy for RDA includes development to meet the needs of international communities, cultural heritage collections, and linked data applications. The Library Reference Model is a development of the IFLA models used by RDA, including FRBR, FRAD, and FRSAD. Other standards related to RDA are undergoing similar development. These form the background to the RDA Toolkit Review and Re-organization (3R) Project. Toolkit review and re-organization (3R Project)

RDA data “RDA is a package of data elements, guidelines, and instructions for creating library and cultural heritage resource metadata that are well-formed according to international models for user-focussed linked data applications.” RDA Toolkit provides the user-focussed elements, guidelines, and instructions. RDA Registry provides the infrastructure for well-formed, linked, RDA data applications. Open Metadata Registry (OMR) provides linked data representation of RDA elements. The RDA data ecosystem is summed up in this sentence introducing the RDA Board’s announcement of RDA’s strategic directions. The RDA package is delivered by an infrastructure of two interacting services. The human-facing components, including the guidelines and instructions, are the Toolkit. The data-facing components are contained in the Registry. Applying the data capture and storage techniques in RDA Toolkit to the data architecture in the RDA Registry produces well-formed data for RDA applications.

RDA data recording methods Unstructured description Structured description Identifier RDA has always provided a range of methods for recording RDA metadata. An entity related to the one being described can be referenced by an unstructured description, a structured description, or a local identifier. These methods are being extended to cover the finer granularity of RDA elements, and to explicitly include global linked open data identifiers. Internationalized Resource Identifier (IRI-URI)

Unstructured description Transcribed data: what you see is what you get Free text notes Uncontrolled terminology for description Non-standard metadata from non-professional sources or lacking provenance (authority) An unstructured description is a data value that has no specific internal structure. In general, the data can only be processed for keyword extraction and indexing. An unstructured description can include data that is non-standard, not authenticated, or created by non-professional agents.

Structured description Data recorded in regular, standard, structured formats for human consumers Sources of data have "authority" Authorized and variant access points, or controlled terminology A structured description is usually composed of data with finer granularity that is assembled according to standard pattern. The data can come from external authority file and other knowledge organizations systems, or be derived locally using RDA guidance and instructions. Data from authority files, vocabulary encoding schemes, and knowledge organization systems

Identifiers Coded labels intended for machine identification Identification and disambiguation within a local domain Authority control numbers, standard identifier schemes, machine-readable database keys, terminology and vocabulary notations An identifier value can be taken from an external, authoritative source, or from a local database system. Identifiers are generally intended for machine processing.

IRI/URI Internationalized Resource Identifier > Uniform Resource Identifier Identification within a global domain: the Semantic Web of linked data Data for "smart" machine applications An IRI value can be processed by applications that are aware of the semantics of the metadata schema used for linked data.

Recording methods for related data "note on related entity" "access point for related entity" RDA Entity 1 is related to "identifier for related entity" This diagram shows all four recording methods in use to reference a related entity. The diagram can be interpreted as an RDF graph for linked data, or as a relational schema for a local database. RDA Entity 2

RDA as data integration framework An RDA data element can (usually) accommodate all four types of data value Multiple types can be assigned to a single element Metadata from multiple sources can be used in integrated catalogues and other applications Most RDA elements will accommodate all four recording methods and types of value. An example of an exception is a "note" element, which can only accept an unstructured description. As shown in the previous diagram, a single element for a specific entity can take values from multiple recording methods. This allows RDA applications to accommodate and integrate metadata values from multiple resources, ranging from "dumb" data of dubious provenance to "smart" semantic linked data.

Getting the best out of data Unstructured: keyword extraction and indexing Structured: authority and terminology control Identifier: relational database applications IRI: Semantic web and linked open data This flexibility is intended to help libraries use metadata from a variety of sources to describe and access information resources of all types, and to store metadata for appropriate machine-processing. RDA is for a smart future, not a dumb past

Thank you!