Toward an International Sharing and Use of Subject Authority Data

Slides:



Advertisements
Similar presentations
FRBR and Cataloguing Rules: Impact on IFLAs Statement of Principles and AACR/RDA by Barbara B. Tillett FRBR Workshop Dublin, Ohio May 4, 2005.
Advertisements

Terminology Services Diane Vizine-Goetz Senior Research Scientist OCLC Research.
OCLC Online Computer Library Center Terminology Services Diane Vizine-Goetz OCLC Research.
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
Alexandria Digital Library Project Concept-based Learning Spaces Apply domain-specific KOS principles for organizing collections/services for given applications.
Subject Analysis: An Introduction Based on BASIC SUBJECT CATALOGING USING LCSH edited by Lori Robare.
FRAD QUIZ Start quiz Take quiz by pointing to the answer and clicking on it.
Future of Cataloging RDA and other innovations pt.1a.
Module 5a: Authority Control and Encoding Schemes IMT530: Organization of Information Resources Winter 2007 Michael Crandall.
FRBR – A Refresher Course Marjorie E. Bloss RDA Project Manager April 9, 2008.
8/28/97Information Organization and Retrieval Metadata and Data Structures University of California, Berkeley School of Information Management and Systems.
Module 6a: Intro to Controlled Vocabularies, Taxonomies and Classification IMT530: Organization of Information Resources Winter 2007 Michael Crandall.
Change is in the Air: What’s New with RDA & FRBR? 2006 OLC Technical Services Retreat April 25, 2006 Glenn Patton.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen OLAC 2006 Conference October 27, 2006
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen Cornell University May 16, 2006
The Library Cataloging Tradition
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Federal Department of Home Affairs FDHA Swiss Federal Office of Culture FOC Swiss National Library SNL Multilingual Access to Subjects (MACS) Patrice Landry.
LIS510 lecture 9 Thomas Krichel Organization of information Libraries organize information. Otherwise nothing that is an library could ever.
The DDC at Universitätsbibliothek Wien Universitätsbibliothek Wien 29 April 2009 Joan S. Mitchell Editor in Chief, DDC OCLC
FRAD: Functional Requirements for Authority Data.
Inside the DDC Dewey goes Europe: On the use and development of the Dewey Decimal Classification (DDC) in European libraries Austrian National Library.
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
OCLC and Vocabulary Identifiers Eric Childress Andrew Houghton Diane Vizine-Goetz DC-2005 Vocabularies in Practice 13 September 2005 Madrid, Spain Presented.
7/14/09. Robert L. Maxwell RDA Lecture Series National Library of South Africa 22 July /14/09 Cataloging: Still a Professional Asset to Become Excited.
History of Bibliographic Control. Library and Information-type Work –Undertaken through much of human history. –Information packages of various types,
The Library Cataloging Tradition Marty Kurth CS 431 February 9, 2005 [slides stolen from Diane Hillmann]
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
IME ICC5 Report Working Group 3: Seriality Working Group Leader: Elise Roberts Co-leader: Martha de Waal Working Group Recorder: Marion Chibambo IME ICC5,
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen RDA Forum ALA Annual Meeting, New Orleans, June 24,
RDA Toolkit is an integrated, browser-based, online product that allow user to interact with a collection of cataloging-related documents and resources.
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
What users want & how FRBR can help Diane Vizine-Goetz Research Scientist OCLC Research.
FRSAD Functional Requirements for Subject Authority Data in its finalizing stages : a very brief report Päivi Pekkarinen Helsinki University Library Meilahti.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Setting a new standard Resource Description and Access Deirdre Kiorgaard 18 September 2006.
What Does FRBR Mean To You? Jenn Riley Metadata Librarian IU Digital Library Program
Evidence from Metadata INST 734 Doug Oard Module 8.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
Building blocks for RDA Theory behind RDA ALLUNY Annual Meeting September 28-30, 2012.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
FRBR: Cataloging’s New Frontier Emily Dust Nimsakont Nebraska Library Commission NCompass Live December 15, 2010 Photo credit:
Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata Applications Marcia Lei Zeng NSDL All Project Meeting October, 2003.
Twenty-Minute Library Tips A presentation about the Library of Congress classification system and how you can use it to find books in our library! Beth.
Margherita Sini, FAO 1 / 19 Using RSS to Share KOS Metadata Margherita Sini, Gauri Salokhe IV Ecoterm Vienna, Austria April.
Sally McCallum Library of Congress
RDA: history and background Ann Huthwaite Library Resource Services Manager, QUT ACOC Seminar, Sydney, 24 October 2008.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
8/28/97Information Organization and Retrieval Introduction University of California, Berkeley School of Information Management and Systems SIMS 245: Organization.
Diane Vizine-Goetz Senior Research Scientist, OCLC Research Joan S. Mitchell Editor in Chief, DDC Michael Panzer Assistant Editor, DDC Publisher and Librarian.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Charlyn P. Salcedo Instructor Types of Indexing Languages.
Semantic Web Overview Diane Vizine-Goetz OCLC Research.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
D3.4 Report on Cross-Language Subject Access Options Subject access seminar, Prague Patrice Landry Swiss National Library.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Introduction to FRBR Functional Requirements for Bibliographic Records GACOMO Oct. 16, 2008.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
Subject Analysis: An Introduction
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Diane Vizine-Goetz OCLC Research
Metadata - Catalogues and Digitised works
FRBR and FRAD as Implemented in RDA
Presentation transcript:

Toward an International Sharing and Use of Subject Authority Data Marcia Lei Zeng Athena Salaba Kent State University FRBR Workshop, OCLC, 2005

Outline Background information Current State Authority Data Sharing Authority Data FRBR Workshop, OCLC, 2005

1. Background 1.1 Subject access Seeking information on a topic is still the predominant user task Subject access includes: Subject searching Keyword searching Subject browsing It is still very problematic for the majority of searchers FRBR Workshop, OCLC, 2005

1.2 Functions of a catalog regarding subject access (1) Cutter (1897) To find a book if the subject is known To show what a library has on a given subject (collocate) To assist in the choice as to its character (identify) FRBR Workshop, OCLC, 2005

1.2 Functions of a catalog regarding subject access (2) FRBR (1998) To find entities of Group 1 that have entities from Group 1, 2, 3 as their subject To identify To select To obtain FRBR Workshop, OCLC, 2005

1.3 What is a subject? Group 1 Group 3 Group 2 FRBR – Functional Requirements for Bibliographic Records Group 1 Work Expression Manifestation Item Group 2 Persons Families Corporate bodies Group 3 Concepts Objects Place Event FRBR Workshop, OCLC, 2005

Revisiting Group 3? Time Process Event is a combination of place and time Concrete vs. abstract concept Ranganathan Personality Matter Energy Space FRBR Workshop, OCLC, 2005

2. Current State Subject Authority Data 2.1 Structure (heterogeneous) 2.2 Existing Knowledge Organization Systems/Structures/Schemas (KOS) 2.3 Rules and guidelines 2.4 Communication/Encoding FRBR Workshop, OCLC, 2005

2.1 Structures Relationship Groups: Classification & Categorization: Ontologies Semantic networks Thesauri Relationship Groups: Strongly-structured Classification schemes Taxonomies Categorization schemes Classification & Categorization: Subject Headings Synonym Rings Authority Files Glossaries/Dictionaries Gazetteers Weakly-structured Term Lists: Pick lists Natural language Controlled language

Structures: Coordination Pre-coordination ……….. Post-coordination e.g. subject headings e.g. thesauri - LCSH - AAT, INSPEC MeSH FAST UMLS FRBR Workshop, OCLC, 2005

2.2 Existing KOS (1) Library of Congress Subject Headings (LCSH) Medical Subject Headings (MeSH) ERIC Thesaurus (ERIC) Inspec Thesaurus Inspec Classification Dewey Decimal Classification (DDC) Library of Congress Classification (LCC) Universal Decimal Classification (UDC) HEREIN Thesaurus Alexandria Digital Library (ADL) Gazetteer and Thesaurus Schlagwortnormdatei (SWD) Regenburger Verbund Klassifikation (RVK) RAMEAU: repertoire d'authorite de matieres encyclopedique unifie Art and Architecture Thesaurus (AAT) National Agriculture Library Subject Headings … … FRBR Workshop, OCLC, 2005

2.2 Existing KOS (2) Combined V C Global environment Verbal based Language Structure Verbal based AAT LCSH RAMEU INSPEC Thesaurus MeSH Code based UDC RVK DDC LCC Integrated INSPEC MeSH Hierarchy FRBR Workshop, OCLC, 2005

2.3 Rules of KOS Construction Different rules and guidelines AACR2, Z39.19, RAK (Regeln für die alphabetische Katalogisierung), ISO5964, ISO2788, IFLA Principles Underlying Subject Heading Languages (SHLs) … No rules Indirect/Inherent use of rules (by example) FRBR Workshop, OCLC, 2005

2.4 Communication/Encoding for authority data MARC MARC21 (1xx, 2xx, etc.) UNIMARC (1xx, 2xx, etc. different definition) etc. Guidelines for Authority Records and References (GARR) (>, <, >>, <<) NISO Z39.19 (BT, NT, RT, etc.) XML-based: OWL Web Ontology Language, RDF Schema, Voc-ML, etc. Vocabulary ML: Metacode strawman DTD FRBR Workshop, OCLC, 2005

3. Authority Data 3.1 Use of authority data Direct use of authority data Index Identify/Verify Search & Browse the authority data Indirect use of authority data Searching bibliographic file Browsing bibliographic file Users Information professionals Searcher/end-user FRBR Workshop, OCLC, 2005

3.2 Common Authority Data Authorized/established term Variations Related terms Notes Linked/Parallel terms Numbering, International numbering? Other: language, rules, links to external resources, roles, etc. FRBR Workshop, OCLC, 2005

Do we need one authorized term? Keep USER in mind! Preference, language, script Trends: all are preferred Synonym rings (included in NISO Z39.19 now) Sets of terms that are considered equivalent for the purpose of retrieval. Usually found as sets of lists that allow users to access all content containing any of the terms. Used in conjunction with search engines. B and T World Seeds http://www.b-and-t-world-seeds.com/synony.html DrugStore.com http://www.drugstore.com/ SEARCH and LOAD Synonym list http://www.sliccware.com/WebHelp/ATSHelp2_1.htm#Translate_Table/Translate_Table_Lists/Synonym_List.htm FRBR Workshop, OCLC, 2005

3.3 Common Semantic Relationships in Authority Data Broad categories Equivalence (Use, Used For, UF, See) Hierarchical (BT, NT, see also) Associative (RT, see also) More specific relationships, such as: Is part of Is instance of Agent/process Process/product Need for other types of relationships? ADL, such as: Overlap; administrativePartOf; SubFeatureOf UMLS, such as: Like; Parent; Child; Sibling WordNet, such as: Familiarity; derivationally related ADL: Footprints: Inherent spatial relationships: contains, overlaps, is-contained-by, adjacent (versus explicit statements of relationships) FRBR Workshop, OCLC, 2005

Unanswered Question What authority data currently exist in an authority record? or What authority data should be included in an authority record? FRBR Workshop, OCLC, 2005

4. Sharing Authority Data in a Global Environment Combined V C Global environment Language Structure 4.1. Challenges Structures Languages and scripts Rules Encoding FRBR Workshop, OCLC, 2005

4.2. Projects Specifically for Subject Authority Data Sharing Construction (not to be discussed here) Implementation Projects based on different types of structures Projects involving multiple languages FRBR Workshop, OCLC, 2005

Marcia Lei Zeng and Lois Mai Chan Trends and issues in establishing interoperability among knowledge organization systems. Journal of American Society for Information Science and Technology (JASIST) 55(5): 377 – 395 2004

KOS Vocabularies Bibliographic files Authority files

KOS Vocabularies Bibliographic files Authority files KOS Vocabularies Bibliographic files Authority files

Sharing at Vocabulary Level KOS Vocabularies KOS Vocabularies adaptation, extension, extraction, translation, etc. KOS Vocabularies

Sharing at Vocabulary Level National database "Merimee" about the French Heritage KOS Vocabularies The Thesaurus of Architecture (Le thésaurus de l'architecture) was created and mapped to the Art and Architecture Thesaurus (AAT) and the English Heritage Thesaurus (NMR) 1.Direct mapping KOS Vocabularies For the purpose of indexing complexes, buildings and built works described in the national database "Merimee" about the French Heritage, The Thesaurus of Architecture (Le thésaurus de l'architecture) was created and mapped to the Art and Architecture Thesaurus (AAT, http://www.getty.edu/research/tools/vocabulary/aat/, published by The J. Paul Getty Trust) and the English Heritage Thesaurus (http://www.rchme.gov.uk/nmr.html, published by The National Monuments Record (NMR) (http://www.culture.gouv.fr/documentation/thesarch/pres.htm). When mapping from Merimee’s Thesaurus of Architecture to the AAT and NMR, “AND” and “OR” post-coordination are used to indicate the equivalence, in addition to the conventional equivalence types (exact and partial). (See statistics reported in Doerr, 2001.)

Sharing at Vocabulary Level Renardus project “a cross-browsing feature based on the DDC and improved subject searching across distributed and heterogeneous European subject gateways.” KOS Vocabularies 2.Using a switching system KOS Vocabularies KOS Vocabularies Renardus is a EU project (coordinated by the National Library of the Netherlands with partners from Denmark, Finland, Germany, the Netherlands, Sweden, and the UK) with the purpose of producing “a cross-browsing feature based on the DDC and improved subject searching across distributed and heterogeneous European subject gateways.” The initial investigation included the use of classification systems by Renardus partners’ gateways, general mapping approaches and issues, the definition of mapping relationships, and information on technical solutions and the mapping tool. The approach adopted by the project is a harmonization process that maps local class schemes to a common scheme, thereby enabling users to browse a single subject hierarchy. DDC was chosen as the switching language and common browsing structure. Each DDC class in Renardus presents links to "related collections" which allow to jump to the mapped classes in the participating local gateways and to continue browsing there in the local classification structure. In addition, a virtual browsing feature allows the merging of all local related records from all mapped classes into one common Renardus result set. (Koch, Neuroth , and Day, 2001)

Sharing at Vocabulary Level UMLS® Metathesaurus ® KOS Vocabularies Over 1,000,000 concepts and 4.3 million concept names from more than 100 controlled vocabularies, some in multiple languages 3.Creating a superstructure KOS Vocabularies KOS Vocabularies http://www.nlm.nih.gov/pubs/factsheets/umlsmeta.html

Sharing at Vocabulary Level UCB Unfamiliar Metadata Vocabularies KOS Vocabularies Accepts query vocabularies and responds with a ranked list of the system’s entry vocabularies– which is an index to five controlled vocabularies. 4.Creating a superstructure (an index) KOS Vocabularies KOS Vocabularies The project "Mapping Entry Vocabulary to Unfamiliar Metadata Vocabularies" (http://metadata.sims.berkeley.edu/GrantSupported/unfamiliar.html) has been conducted at the University of California, Berkeley in recent years. As stated in the project website, the researchers plan to develop Entry Vocabulary Indexes that accept topical statements in the searcher's terms ("Query vocabularies") and respond with a ranked list of terms in the system's vocabulary ("Entry vocabularies"). The prototype Entry Vocabulary Modules included English language indexes to BIOSIS Concept Codes, INSPEC Thesaurus, U.S. Patent and Trademark Office Patent Classification, and the Standard Industrial Classification (SIC) codes, and a multilingual index (supporting queries in English, French, German, Russian, or Spanish) to the physical sciences sections of the Library of Congress Classification (LCC). When the Entry Vocabulary Module leads users to a promising term in the target metadata vocabulary, a search can be executed using the newly-found metadata against a remote database (Buckland et al., 1999).

Sharing at Vocabulary Level CAMed Cross-thesaurus searching Terms are linked in a temporary union list generated by the software in response to a query. KOS Vocabularies 5.Creating a superstructure (a virtual index) KOS Vocabularies KOS Vocabularies http://circe.slis.kent.edu/mzeng/tmshome.html

Sharing at Vocabulary Level UCSB Alexandria Digital Library KOS Vocabularies The Thesaurus Protocol is based on the ANSI/NISO (1993, R2003) Z39.19 thesaurus model and supports downloading, querying, and navigating thesauri. 6. Linking through a thesaurus server protocol KOS Vocabularies KOS Vocabularies ADL-2 http://www.alexandria.ucsb.edu/~gjanee/thesaurus/test-forms.shtml

Sharing at Subject Authority File Level KOS Vocabularies Bibliographic files Authority files Direct Mapping KOS Vocabularies Bibliographic files Authority files

Direct Mapping -- MACS (Multilingual Access to Subjects) SWD/RSWK (Schlagwortnormdatei / Regeln für den Schlagwortkatalog) for German, Rameau (Répertoire d'autorité-matière encyclopédique et alphabétique unifié) for French, and LCSH for English. The Swiss National Library (SNL) is the project leader, and its partners are: the Bibliothèque nationale de France (BnF), The British Library (BL), and Die Deutsche Bibliothek (DDB).

LCSH AND MeSH MAPPING PROJECT SAMPLE AUTHORITY RECORDS, Northwestern University Library http://www.library.northwestern.edu/public/lcshmesh/lchs-mesh_sample.txt

KOS Vocabularies Bibliographic files Authority files Co-occurrence mapping -- works at the application level, i.e., in metadata records, where the group of subject terms can actually result in loosely-mapped terms. Metadata Terms from thesaurus 1 Terms from thesaurus 2 S1 S2

So far, Functional Requirements for Authority Records (FRAR) Covers: Names for persons, families, corporate bodies (Group 2) Titles (Group 1) Projects for Authority Data Sharing focus mainly on Names: ONE Shared Authority Control (ONESAC, ppt) Virtual International Authority File (VIAF) Linking and Exploring Authority Files (LEAF) Hong Kong Chinese Authority (Name) (HKCAN) VIAF: The Virtual International Authority File A joint project with the Library of Congress and Die Deutsche Bibliothek, VIAF explores virtually combining the name authority files of both institutions into a single name authority service. LEAF develops a model architecture for a system that uploads distributed authorities (persons and corporate bodies) to a central system and automatically links those authorities that belong to the same entity. In January 1999, a group of academic libraries in Hong Kong agreed to set up among themselves the Hong Kong Chinese Authority (Name) (HKCAN) Workgroup for establishing a union database that would reflect the unique characteristics of the Chinese authors and organizational names. This project was spearheaded by both Lingnan and Chinese University Libraries.  FRBR Workshop, OCLC, 2005

FRSAR: Functional Requirements for Subject Authority Data Scope: focus on FRBR’s Group 3 entities FRSAR Working Group contact: Marcia Zeng mzeng@kent.edu Maja Zumer Athena Salaba asalaba@kent.edu FRBR Workshop, OCLC, 2005

FRBR Workshop, OCLC, 2005

FRSAR terms of reference build a conceptual model of Group 3 entities within the FRBR framework (Entities in Group 1 and Group 2 can be used as the subjects of works; but further inclusion of them will depend on the outcomes of the work of the FRANAR Working Group); provide a clearly defined, structured frame of reference for relating the data that are recorded in subject authority records to the needs of the users of those records; and assist in an assessment of the potential for international sharing and use of subject authority data both within the library sector and beyond. FRBR Workshop, OCLC, 2005