RDA Update Edinburgh 2017 Gordon Dunsire

Slides:



Advertisements
Similar presentations
Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011
Advertisements

Interoperability and semantics in RDF representations of FRBR, FRAD and FRSAD Gordon Dunsire Presented at the Cologne Conference on Interoperability and.
IFLA Namespaces Gordon Dunsire Chair, IFLA Namespaces Technical Group Session 204 — IFLA library standards and the IFLA Committee on Standards – how can.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen Cornell University May 16, 2006
RDA in the wild: taking RDA into the global Gordon Dunsire Presented at the RDA Forum “RDA in the wild”, ALA Midwinter, 1 February 2015.
RDA AND AUTHORITY CONTROL Name: Hester Marais Job Title: Authority Describer Tel: Your institution's logo.
Engaging with RDA: governance and strategy Gordon Dunsire Presented to RDA Forum, ALA Annual 2015, San Francisco, USA.
Multilingual Issues in the Representation of International Bibliographic Standards for the Semantic Web Gordon Dunsire Independent Consultant; Chair of.
RDA data and applications Gordon Dunsire Presented to staff of the British Library, Boston Spa, 20 Mar 2014.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen RDA Forum ALA Annual Meeting, New Orleans, June 24,
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
RDA development and implementation overview Gordon Dunsire Presented at Cataloguing & Indexing Group in Scotland RDA for Implementers 27 May 2015.
RDA progress on governance and strategy Gordon Dunsire Chair, RDA Steering Committee Presented to RDA Forum, ALA Midwinter 2016, 9 January 2016, Boston,
RDA data capture and storage Gordon Dunsire Chair, RDA Steering Committee Presented to Committee on Cataloging: Description and Access II (CC:DA) - ALCTS.
RDA: history and background Ann Huthwaite Library Resource Services Manager, QUT ACOC Seminar, Sydney, 24 October 2008.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
RSC Strategy and RDA Internationalization Gordon Dunsire, Chair, RDA Steering Committee Presented at Selmathon 2, 10 May 2016, Stockholm, Sweden.
RDA and Linked Data Gordon Dunsire Presented at Cita BNE - RDA and Linked Data, 15 April 2016, Madrid, Spain.
RDA and Linked Data Gordon Dunsire Presented at Selmathon 1, 9 May 2016, Stockholm, Sweden.
Marathon RDA Gordon Dunsire Chair, RDA Steering Committee Presented at Bibliothèque national de France, Paris, 2 May 2016.
RSC Strategy and RDA Internationalization Gordon Dunsire, Chair, RDA Steering Committee Presented at the Cervathon, 14 April 2016, Madrid, Spain.
RDA: international linked data for cultural heritage resources Gordon Dunsire Presented to ALCTS/LITA session, ALA Annual, Orlando, USA, June 25, 2016.
RDA internationalization and application profiles: applying the global to the local Gordon Dunsire Presented to the CC:DA meeting, ALA Annual, Orlando,
Where is RDA in the bibliographic universe? Gordon Dunsire Presented to Standard RDA – korzyści i problemy związane z jego wdrożeniem = RDA standard –
RDA and linked data Gordon Dunsire Presented to Code4Lib Ottawa, MacOdrum Library, Carleton University, Ottawa, 27 April 2016.
Subjects in the FR family
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
LRM-RDA Gordon Dunsire
RDA work plan: current and future activities
Aligning RDA with the LRM
National Library of Scotland, Edinburgh, 19 May 2014
RDA, linked data, and development
Authority versus authenticity: the shift from labels to identifiers
Instructions, interfaces, and interoperable data: the RIMMF experience with RDA Gordon Dunsire Presented to Session 093 Let’s make IT usable! Formats,
By Dr Hester Marais
RDA, LC-PCC Policy Statements, and MLA Best Practices Updates
RDA, linked data, and development
Telling tails: metadata standards and the digital humanities
RDA data and context Gordon Dunsire
IFLA FRBR-Library Reference Model and RDA
LRM and RDA : overview of the 3R Project
Recording RDA data as linked data
RDA 3R Project: Status Report
RDA and the IFLA Library Reference Model (LRM)
UNIMARC and linked data
RDA, linked data, and update on development
Aggregates and Diachronic Works
The new RDA: resource description in libraries and beyond
A model to link them all IFLA-LRM as a driver for harmonization of cataloguing standards related to serials and other continuing resources Gordon Dunsire.
RDA and translations Gordon Dunsire, Chair, RSC
Appellations, Authorities, and Access
LRM-RDA Gordon Dunsire.
RDA and practical linked open data
Accommodating local cataloguing traditions in a global context
From Big Bang to beta An overview of the 3R Project
Cataloguing with RDA Gordon Dunsire, Chair, RSC
RDA and semantic data Gordon Dunsire
Introducing IFLA-LRM Gordon Dunsire, Chair, RSC
The LRM and its impact on RDA and related standards
Extending RDA (briefly)
Application profiles and cataloging a manifestation
RDA cataloguing and linked data
RDA in a non-MARC environment
Content of beta RDA A brief overview Gordon Dunsire, Chair, RSC
RDA Community and linked data
The new RDA: resource description in libraries and beyond
FRBR and FRAD as Implemented in RDA
RDA linked data update Gordon Dunsire, RDA Technical Team Liaison Officer Presented at RDA Linked Data Forum, ALA Midwinter January 28, 2019, Seattle,
Future directions for RDA
Customizing RDA for local applications
Presentation transcript:

RDA Update Edinburgh 2017 Gordon Dunsire Presented at a Cataloguing and Indexing Group Scotland seminar National Library of Scotland, Edinburgh, 4 August 2017

Overview Implementation of the new governance model for RDA 3R project to redesign RDA Toolkit Design Content Recent meetings with specialist cataloguing communities

RDA Governance RDA Steering Committee Europe North America Latin Oceania Asia Africa EURIG NARDAC RDA Board ORDAC IFLA 2018? CILIP/BL Committee http://www.rda-rsc.org/europe Chair: Jenny Wright, BDS

3R Project RDA Toolkit Restructure and Redesign Project New IFLA Library Reference Model (LRM) Resolves gaps and inconsistencies in RDA Time for a better design Increasing number of translations, policy statements, etc. Current tools and practices are inefficient

3R goals Better meet needs of users Provide a more productive environment Greater flexibility and utility in access and display of Toolkit content Efficient and reliable work processes and tools for RDA editors, translators, and creators of derivative products

Toolkit schedule during 3R English text frozen with April 2017 release German translation updated: August 2017 release (next week) Catalan and French translation updated, and Norwegian translation added: October 2017 Finnish translation updated: November 2017 Redesigned Toolkit: April 2018 Old Toolkit available until: 1 year after

Open Metadata Registry RDA Reference data flow Atomic audit trail Deprecation policy Entities, element sets, value vocabularies Semantic versioning: Application roll-back RDA editors: Secretary Translators Open Metadata Registry RDA/RDF Developers RDA Vocabularies (GitHub) Developers RDA Toolkit content RIMMF3 data editor RDA Vocabulary Server RDA Registry This diagram shows the basic flow of RDA Reference data through the RDA content management infrastructure. RDA Reference is the content of labels, definitions, and scope notes of RDA entities, element sets and value vocabularies. Editors of RDA Reference, including translators, maintain the content in the Open Metadata Registry in linked data format. From there, the content flows via software processes to various RDA products and services aimed at different audiences. Further information can be found in the RDA Registry at http://www.rdaregistry.info/rgAbout/rdaref/dataflow/ and in the appendix of RSC/Chair/17 - RDA Toolkit Glossary and RDA Reference (http://www.rda-rsc.org/sites/all/files/RSC-Chair-17-fix.pdf). Cataloguers Trainers; Prototypers Applications Developers

3R Toolkit structure General chapters: Introductory material, guidance and instructions on general topics Entity chapters: One chapter per entity, containing all elements for the entity. Relationship hierarchies Supplementary material (e.g. Books of bible) User generated material (e.g. workflows)

3R element layout Each element is a "chunk" of content Reference data definition, user tasks, etc. may have graphic/active display Instructions for recording data values Unstructured description Structured description Identifier Linked data Navigation to related elements and entities

3R Examples Context Appropriate examples for each method of recording (4-fold path) Correct level of granularity "View in context" or set of examples for the same entity Display [Design consultants] User-controlled on/off switch

3R 4-fold example example Unstructured description “The author writes in English” * Structured description “English” * Identifier “eng” * Linked data IRI http://id.loc.gov/vocabulary/languages/eng * String value

Library Reference Model (LRM) Consolidation of FRBR, FRAD, FRSAD, and report of Working Group on Aggregates: Seamless (no gaps) Generalized Entity-relationship model Optimized for linked data implementation

National institution (meta)data Significant quantities of metadata from "lightly" curated sources e.g. Electronic legal deposit (publisher community) Metadata values from a wide variety of un/controlled terminologies e.g. "Bob Dylan" (natural order) e.g. "blueish" (folksonomy) Strategies to reduce human costs by automation, re-use, and system integration

FRBR-LRM and RDA entities Any RDA Thing: Covers all other types of entity FRBR-LRM and RDA entities Res is sub-class of RDA Entity has appellation Nomen W is created by Place E Agent is associated with is sub-class of Time-span M Collective Agent The LRM uses a super-entity, "Res", to model high-level relationships and attributes for all other entities. In RDA, the super-entity "RDA Entity" is used in place of Res for all other RDA entities. RDA Entity is a sub-type (sub-class in RDF) of Res. This RDF graph shows new RDA entities taken from the LRM: Nomen, Place, Time-span, Collective Agent, and Agent. Current RDA entities are labelled only with their initials. The graph also shows the high-level relationships between the new and current entities. The only RDA entity which does not fit without significant modification is Person. In the LRM, the definition of this entity restricts it to a human being, and non-humans including animals, fictitious and legendary beings, and natural phemomena, are excluded. The integrated semantic structure of the LRM and RDA entities allows the RDA relationships to be refinements of the high-level LRM relationships, as element sub-types (sub-properties in RDF). I RDA refines LRM relationships as element sub-types (RDF sub-properties) is modified by P* F C

Nomens and appellations has appellation RDA Entity Nomen has title proper has nomen string M1 N1 “My title” has title proper Nomen is a new LRM entity for RDA, and represents the class of strings (names, titles, etc.) used to label and identify any other entity. The high-level relationship between RDA Entity and Nomen is "has appellation". This essentially says "All things have names". The current RDA relationships between an entity and an identifying label are refinements of the high-level relationships. So "[has] title proper" is a refinement of the "has appellation" relationship between a Manifestation and a Nomen. The Nomen entity is always associated with the string of characters, symbols, etc. that constitutes the "name" or other label by which the entity is known or called. The "has nomen string" relationship associates the Nomen with its string. The chain of relationships "has title proper" + "has nomen string" can be short-cut to give the current RDA model of "appellation" attributes. Similarly, the RDA "[has] identifier for …" attributes are also refinements of "has appellation". Note that the nomen string is this example may look like an ISSN, but it could be some other kind of identifier. More information about the Nomen is needed; this is one reason for treating such string labels as an entity or class that can have other attributes and relationships. has identifier … has nomen string M1 N2 “0123-4567” has identifier …

4-fold path for attributes LRM blurs the distinction between attributes and relationships – an echo of the 4-fold path A relationship with string data (unstructured or structured description, or identifier) is like an attribute The 4-fold path accommodates both string and thing data. This is compatible with the LRM which allows attributes to be treated as relationships, and relationships to be treated as attributes. An attribute with "thing" data (IRI), e.g. SKOS concept, is like a relationship

"Descriptions" (path 1 and path 2) An unstructured description (path 1) has no internal structure that can be parsed by machine; only keywords can be extracted. Example: a transcription or a note A structured description (path 2) has some form of internal or external structure. Example: An aggregated string composed of sub-element values Example: A term from a vocabulary encoding scheme or authority file The distinction between an unstructured description and a structured description lies with the string containing structured data. An unstructured description is assumed to have no internal structure that can be parsed by an application, except for standard string manipulation such as keyword extraction. A structured description has some kind of internal structure. An "aggregated statement" for a super-element is a string composed of the string values of its sub-elements with an indication of what string iss associated with what sub-element, through the use of punctuation or name/value pairs. A structured description may also have external structure, such as a term taken from a controlled vocabulary or authority file.

Identifiers (path 3 and path 4) "A nomen consisting of a code, number, or other string, usually independent of natural language and social naming conventions." (Draft) Identifier is distinct from language-based "descriptions" Identifier is "local": not unique at global level RDA does not currently distinguish between general identifiers and IRIs. Such a distinction is required for the 4-fold path to be recorded as RDA data in a well-formed way. The distinction lies in the guaranteed global uniqueness of an IRI required for machine-processing. Other identifiers cannot be guaranteed to be unique at global level, including international identifiers such as ISBNs and ISSNs. For example, the same ISBN is often used for different Manifestations. There is also a requirement to distinguish an identifier from other forms of Nomen; they all "identify" or "describe" an entity. The distinction is linguistic: identifiers are usually coded and intended for machine processing, and can be considered distinct from language-based labels, even if they carry hints of human or social labelling. Path 4: International Resource Identifier (IRI) or URI is unique at global level

4-fold path for related entities “[Unstructured description]” has nomen string “[Structured description]” RDA Entity 1 has related entity “[Identifier]” N1 The current RDA instructions allow an related entity to be described using three distinct types of string: an unstructured description, a structured description, or an identifier. In addition, RDA implicitly allows a related entity to be identified by an Internationalized Resource Identifier (IRI) or URI; the related entity is represented as a thing, not a string. But all things have names: the related entity represented as a thing may have each of the equivalent strings as a nomen string of some related Nomen. It is a moot point whether an unstructured description is a nomen string … RDA's 4-fold path is thus an extension of LRM's "has appellation" relationship. RDA Entity 2 N2 has appellation

Implications for authority control No need for "preferred" nomen (string) if local Identifier or global IRI is available for user task Identify Human-readable nomens still required for user tasks Find and Explore The user task Identify is best served with an identifier (Identifier or IRI path). If these are available, there is no need for a unique and distinct string label; there is no requirement for a preferred nomen. Of course, human-readable nomens are required for the user tasks Find and Explore. This results in a shift of emphasis in traditional authority control systems, from developing preferred or authoritative forms of nomen to maintaining multiple forms that can be used for Find and Explore. This is essentially what happens in VIAF, where all local "preferred" forms are treated equally; that is, there is no selection of one local form to be the preferred form overall. This also applies to Vocabulary Encoding Schemes and other forms of KOS (knowledge organization systems) in linked data environments. Although SKOS accommodates a "preferred label", it is not mandatory; the concept or instance is uniquely identified by an IRI. Emphasis shifts from "authority form" to maintaining multiple forms of nomens: cf VIAF

Nomen granularity and hierarchies Current RDA elements form categories of nomens Titles: essentially unstructured descriptions with no "authority"; hierarchical (sub-types) Names: essentially unstructured labels or structured labels with "authority"; hierarchical (sub-types) The concept of "identifying" or "known by" labels is already present in RDA. Resources, described as Works, Expressions, Manifestations, and Items, have Nomens usually referred to as "titles"; Agents, described as Persons, Families, and Corporate Bodies, have Nomens referred to as "names". RDA has sets of elements for titles and names, arranged in hierarchies of element sub-types. RDA also covers identifiers; there are no sub-types. RDA does not, however, currently represent structured descriptions, in the form of access points, as elements. This now seems to be a requirement for the 4-fold path, and will be considered in the 3R Project. Identifiers; no sub-types No current elements for access points: structured descriptions

Work to Nomen relationships 4-fold path 1: Unstructured 2: Structured 3: Identifier Work to Nomen relationships [has] related nomen (work) [has] represented name of creator (work) [has] subject (nomen) [has] appellation of work [has] identifier for work [has] title of work [has] access point of work This diagram shows the RDA relationship elements between Works and Nomens. The diagram can be interpreted as an RDF graph of the relationship ontology if the connectors are assumed to be the RDFS sub-property relationship, or as a relationship hierarchy if the connectors are treated as element sub-type relationships. Nodes with solid outlines are existing RDA elements; nodes with dashed outlines are new RDA elements. The "title" elements form a hierarchical cluster. But there is also the current relationship "[has] identifier for work": this is not a "title", so there is a requirement for a higher-level relationship of which both are sub-types or sub-properties; this is the high-level "has appellation" relationship between a Work and a Nomen. And there is also the new relationship "[has] subject (nomen)" required for consistency with similar RDA relationships; this is not a refinement of the "has appellation of work" relationship, requiring an even higher-level relationship that is equivalent to the LRM's "has associated entity" relationship between two entities. This allows the possibility of other new relationships, for example to link names found in a statement of responsibility directly with a Work. 3 [has] preferred title of work [has] variant title of work [AAP] [VAP] 1 2

Person to Nomen 4-fold path 1: Unstructured 2: Structured [has] related nomen (person) 4-fold path 1: Unstructured 2: Structured 3: Identifier [has] appellation of person [has] name of person [has] access point for person [has] identifier for person [has] preferred name of person [has] variant name of person 3 This diagram shows the RDA relationship elements between Persons and Nomens. It has a similar structure to the Work to Nomen relationships diagram It exposes the conflation of "name of …" elements as unstructured descriptions, for example the form of name appearing on sources of information, and as structured descriptions, for example a normalized form of name used as the basis of an access point. The 3R Project is investigating the clarification and distinction between different forms of name for the same agent. For example, "Gordon Dunsire", "G. J. Dunsire", etc. are forms of name that appear on sources of information, and are unstructured descriptions, while "Dunsire, Gordon", "Dunsire, G. J.", etc. are normalized forms that are the basis of access points or structured descriptions. [AAP] [VAP] 2 1

LRM-E4-A4 Manifestation statements A statement appearing in the manifestation and deemed to be significant for users to understand how the resource represents itself. … normally transcribed from a source … in a manifestation. Transcription conventions are codified by each implementation. The LRM attribute for Manifestation statement supports the principle of representation – how a resource (manifestation) describes itself. The data is usually transcribed from an exemplar of the manifestation, and supports the user task Identify only. Principle of representation User task: Identify

RDA Manifestation statement elements Broad level of granularity: Covers wide range of layouts on manifestation One level of hierarchy: All specific statements are sub-types Manifestation statement > Manifestation title and responsibility statement > Manifestation edition statement > Manifestation identifier statement > … The RDA implementation of Manifestation statement keeps specific kinds of statement at a broad level of granularity in order to cover a wide range of ways in which the data is presented on the manifestation. The specific kinds of statement do not have internal structure; there is only one level of hierarchy, and the specific manifestation statements are all sub-types of the general element. Internal structure is not required to support the user task Identify, and in many cases can be counter-productive. For example, it may be difficult to make a useful transcription of just the place(s) of publication. These are some of the new RDA elements for manifestation statements. They use a labelling pattern for consistency and to distinguish them from the current hybrid transcription/recording elements, which are being retained to accommodate current practice.

Non-human agents N Agent? W creator of Manifestation title and responsibility statement Geronimo Stilton THE CHEESE EXPERIMENT Title proper [normalized] The cheese experiment Statement of responsibility relating to title proper Geronimo Stilton creator of N Agent? W To return to the issue of non-human persons (Martians?) being named in a statement of responsibility, this is an example of how it might be resolved. This volume has a non-human entity, a cartoon animal, named in a statement of responsibility. The top set of metadata shows a new manifestation statement element with a basic WYSIWYG transcription and two of the existing hybrid elements that are used for "normalized" transcription. We have a name, but we do not know which agent (one or more human beings) the name is an appellation of, so it is difficult to link the name to the creator role. We could use a placeholder Agent entity, with the name as a pseudonym, or we can introduce a new relationship designator that links the name directly to the work. This approach is being investigated by the RSC Fictitious Entities Working Group Represented name of creator (work) Stilton, Geronimo

Special communities "Pop-up" meeting held at ALA Annual 2017 Took over vacant CC:DA slot Brief reports and lots of discussion about developing RDA for special materials Other dialogue via email and documentation Extension and refinement of RDA elements LRM > RDA > special/local

Low-hanging fruit Some suggestions easy to incorporate in 3R New elements similar to current elements Local (RDA) extension to RDA/ONIX Framework categories E.g. new elements for cartographic materials Prime meridian, Relief type E.g. new content type for choreographic materials: Performed movement Otherwise, planning for post-3R development

Thank you! rscchair@rdatoolkit.org http://access.rdatoolkit.org/ http://www.rdaregistry.info/ http://www.rda-rsc.org/