MARC Content Designation Use I mplications for indexing & interoperability William E. Moen School of Library and Information Sciences Texas Center for.

Slides:



Advertisements
Similar presentations
Dublin Core for Digital Video: Overview of the ViDe Application Profile.
Advertisements

Z39.50 Profiles The Bath Profile ZIG Meeting Leuven, Belgium July 2000 William E. Moen School of Library and Information Sciences University.
Barriers to Interoperability Technical and Not So Technical William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
5 th September 2003Diane Tough Content Creation at the NHM or The evolving catalogue!
Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
The MERIC Prototype A Proof of Concept for the MERIC Vision William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge.
IMLS NLG Collection Registry & Item-Level Metadata Repository at the University of Illinois Timothy W. Cole Mathematics Librarian &
Positioning Z39.50 in the Networked Library Standards for Building Sustainable Services William E. Moen School of Library and Information Sciences Texas.
Z39.50 for Finding It All William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North Texas Denton,
The 21 st Century Library Collaborative Services, Standards, and Interoperability William E. Moen School of Library and Information Sciences Texas Center.
‘The Universal Catalogue’ a cultural sector viewpoint David Dawson Senior Policy Adviser (Digital Futures) Museums, Libraries and archives Council.
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Interoperability: Where the irresistible force of flexibility meets the immovable.
Society of American Archivists Research Forum 18 August 2015 A Deep Dive into the Archival MARC Records in WorldCat (and ArchiveGrid) Jackie Dooley Program.
ODINCINDIO Marine Information Management Training Course February 2006 Cataloguing: Introduction Murari P Tapaswi National Institute of Oceanography,
Testing and Improving Interoperability The Z39.50 Interoperability Testbed William E. Moen School of Library and Information Sciences Texas Center for.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
An Alternative Approach to Interoperability Testing The Use of Special Diagnostic Records in the Context of Z39.50 and Online Library Catalogs William.
MARC Content Designation Utilization: Inquiry and Analysis Can Empirical Evidence Help Shape the Future of MARC? Amy Eklund, Research Asst., MCDU Project;
Optimizing Resource Discovery Service Interfaces in Statewide Virtual Libraries: The Library of Texas Challenge William E. Moen, Ph.D. Texas Center for.
MARC Content Designation and Utilization Future of MARC: Challenges and Opportunities of 21 st Century Cataloging William E. Moen School of Library and.
Rethinking What We Do The Library’s Diminishing Market Share William E. Moen Texas Center for Digital Knowledge School of Library and Information Sciences.
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
Roy Tennant Life After MARC A Metadata Infrastructure for the 21st Century.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
MARC Content Designation and Utilization Examining MARC Records as Artifacts Reflecting Metadata Utilization Decisions William E. Moen School of Library.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
1 Interoperability and the DNER Paul Miller Interoperability Focus UK Office for Library & Information Networking (U KOLN )
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
The physical parts of a computer are called hardware.
Radioactive Metadata Records An Interoperability Testing Approach Based on Metadata Utilization William E. Moen School of Library and Information Sciences.
Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.
MARC Content Designation and Utilization Learning from Artifacts: Metadata Utilization Analysis William E. Moen School of Library and Information Sciences.
Z39.50 & The Z Texas Profile William E. Moen School of Library and Information Sciences University of North Texas Denton, TX.
Improving Description through Collaboration: The Ethnomusicological Video for Instruction & Analysis Digital Archive Music Library Association, February.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata Interaction, Integration, and Interoperability MODS, MARC and Metadata Interoperability, ALA Conference, June 27, 2005, Chicago, IL William E.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Interoperability, Z39.50 Profiles & Testing William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of.
Metadata Interaction, Integration, and Interoperability NISO Workshop: Metadata Practices on the Cutting Edge, May 20, 2004, Washington, DC William E.
No Longer Under Our Control? The Nature and Role of Standards in the 21 st Century Library William E. Moen School of Library and Information Sciences Texas.
An Inquiry and Analysis of Metadata Utilization A Case Study of MARC 2005 ASIS&T Annual Meeting, November 1, 2005, Charlotte, North Carolina William E.
Renee Register Senior Product Manager OCLC Cataloging and Metadata Services Sandy Piver OCLC Publisher Services Consultant OCLC Services for the Publisher.
Research and Projects: Z, M, and Beyond! William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Users and Metasearch Applications: New Challenges for Usability Assessment William E. Moen, Ph.D. Texas Center for Digital Knowledge University of North.
Placing All Information Within Our Control? Standards, Information Organization, and the 21 st Century Library William E. Moen Texas Center for Digital.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The ___ is a global network of computer networks Internet.
An information retrieval system may include 3 categories of information:  Factual  Bibliographical  Institutional  Exchange and sharing of these categories.
OAI metadata: why and how Jenn Riley Metadata Librarian Indiana University.
A Complex Standard and Its Use Results from an empirical analysis of MARC 2004 Texas Library Association Annual Conference, March 18, 2004, San Antonio,
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
MICHAEL and the European Digital Library: promoting teaching, learning and research The MICHAEL Project is funded under the European Commission eTEN Programme.
From the old to the new… Towards better resource discoverability
Metadata Standards - Types
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Introduction to Metadata
Cataloging Tips and Tricks
MARC: Beyond the Basics 11/24/2018 (C) 2006, Tom Kaun.
Attributes and Values Describing Entities.
Presentation transcript:

MARC Content Designation Use I mplications for indexing & interoperability William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North Texas Denton, TX South Central Unicorn Users Group Annual Conference, October 17, 2003 Austin, Texas

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Overview Context for the analysis -- interoperability Findings from the analysis Indexing and MARC Discussion

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Context for the analysis Interoperability across library online catalogs Indexing of MARC records to support searching Richness of MARC content designation available Indexing guidelines prepared for the Z39.50 Interoperability Testbed (Z-Interop) Implications for indexing guidelines and policies

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Interoperability Systems and organizations will interoperate! One should actively be engaged in the ongoing process of ensuring that the systems, procedures and culture of an organisation are managed in such a way as to maximise opportunities for exchange and re-use of information, whether internally or externally. Paul Miller, 2000

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Factors affecting interoperability Multiple and disparate systems operating systems, information retrieval systems, etc. Multiple protocols Z39.50, HTTP, SOAP, etc. Multiple data formats, syntax, metadata schemes MARC 21, UNIMARC, XML, ISBD/AACR2-based, Dublin Core Multiple vocabularies, ontologies, disciplines LCSH, MESH, AAT Multiple languages and character sets Indexing, word normalization, and word extraction policies

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Information communities Community agreements exist (e.g., standards, rules, etc.) Interoperability factors reduced Interoperability more easily achieved Do we need additional agreements regarding indexing policies to improve interoperability? Libraries as Focal Community  Relative homogeneity of data and systems  Standards-based MARC records  Content and structure prescribed by AACR  Commonly understood access points  Use of controlled vocabularies

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Interoperability testbed project Realizing the Vision of Networked Access to Library Resources: An Applied Research and Demonstration Project to Establish and Operate a Z39.50 Interoperability Testbed A Institute of Museum and Library Services National Leadership Grant Goal: Improve Z39.50 semantic interoperability among libraries for information access and resource sharing FOR MORE INFORMATION, VISIT THE PROJECT WEBSITE…

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Threats to Z39.50 interoperability Differences in implementation of the standard Differences in local information retrieval systems Search functionality Indexing policies These threats can be addressed by Z39.50 specifications and configuration (i.e., profiles) Enhancing local information retrieval systems Recommendations for local indexing decisions

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Components of the testbed Test dataset 400,000+ MARC 21 records from OCLC’s WorldCat Z39.50 reference implementations Z-client (Bookwhere), Z-server & information retrieval system (Sirsi Unicorn) Test scenarios & searches Searches with known result records from dataset Benchmarks Results of test searches using reference implementations

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, MARC Record structure for encoding data for machine processing Standard structure (ANSI/NISO Z39.2/ISO 2709) Leader Directory map 3-digit tag to identify a field 2 indicator values to provide additional processing information 1 or more delimiters/codes to identify subfields Content designation: Semantics MARC $a [title] $h [format] : $b [subtitle] Rules Anglo-American Cataloguing Rules and others

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, MARC 21 content designation MARC 21 Field Groups Currently Defined ObsoleteTotalMARC 1972 (Books Format Only) 00x6173 0xx xx xx xx xx xx xx xx xx TOTAL

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Z-Interop test dataset Books: 91% Cartographic Materials: < 1% Electronic resources: < 1% Archival/Mixed Materials: <1% Sound recordings: 4% Visual Materials: 1% Serials: 3% Approximately 1% sample of MARC records from OCLC’s WorldCat database Weighted sampling based on number of libraries “holding” the object represented by the record 419,657 total MARC records 89% of records “full level” cataloging Formats represented in test dataset

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, MARC record LDR01019cam ^ 001 ocm ^ 003 OCoLC^ ^ s1963 nyu b eng ^ 010 $a ^ 040 $aDLC $cDLC ^ $aHV700.5 $b.N37 ^ $a362.7/3 ^ $aNational Study Service. ^ $aIllegitimacy and adoption in Maine : $breport of a study made for the Maine Committee on Children and Youth. ^ 260 $a[New York], $c1963. ^ 300 $a24 p. ; $c28 cm. ^ 500 $aCover title. ^ 504 $aBibliographical footnotes. ^ $aIllegitimacy $zMaine. ^ $aAdoption $zMaine. ^ $aMaine. $bCommittee on Children and Youth. ^

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Decomposing MARC Records OCLC # Tag1 st Ind 2 nd Ind SubFldFld Pos SubFld Pos Word Pos Word Ocm OCoLC 31102a1111 National 31102a1112 Study 31102a1113 Service a1211 Illegitimacy a1212 and a1213 Adoption b1221 Report 36500a1711 Illegitimacy 36500z1721 Maine 400,000 MARC21 records = 33 million decomposed records

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Content designation in dataset MARC 21 Field Groups Currently Defined ObsoleteUnlikely Used Total 00x6006 0xx xx xx xx xx xx xx xx xx TOTAL

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Summary frequency results Frequency# of Fields/Subfields% of All Occurrences > 600, % 500,000 > 599,99900% 400,000 > 499, % 300,000 > 399, % 200,000 > 299, % 100,000 > 199, % TOTAL3679.5% Total number of fields/subfields occurring in dataset = 13,849,499 Only 4% of all fields/subfields account for 80% of all occurrences or 96% of all fields/subfields account for 20% of all occurrences

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Characteristics of top 36 Most frequently occurring: 650 $a [Subject data] 2 nd most frequently occurring: 040 $d [Cataloging source] 3 rd & 4 th most frequently occurring: 260 $a & $b [Publication information] 5 th most frequently occurring: 245 $a [Title] Contain data useful to end users: 28 Contain control numbers, etc.: 5 Contain data useful to catalogers: 3

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Indexing & MARC Indexing Guidelines to Support Z39.50 Profile Searches Indexing Guidelines to Support Z39.50 Profile Searches Identified all MARC 21 fields/subfields that may contain author, title, or subject data Author-related fields/subfields : 119 AuthorTitle-related fields/subfields: 21 Title-related fields/subfields: 253 Subject-related fields/subfields: fields/subfields contain author, title, subject data Usefulness of indexing all possible fields?

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Occurrences in test dataset 381 occur one or more times in Z-Interop dataset Author, title, or subject fields/subfields in Z-Interop dataset Author-related fields/subfields : 86 AuthorTitle-related fields/subfields: 16 Title-related fields/subfields: 178 Subject-related fields/subfields: of the 381 (5%) account for 80% of all occurrences 9 of 19 are subject-related 5 of 19 are author-related 5 of 19 are title-related The 19 fields/subfields

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, Implications for indexing What difference does indexing decisions make? Preliminary testing using the 19 fields/subfields: 95% - 100% of correct records retrieved! How much time would be saved in setting up indexing policies? Is there a systematic method to identify the “best” fields/subfields to index? Per format of materials? Per user (librarians and end users) needs? Good enough search results?

Moen South Central Unicorn Users Group Annual Conference -- Austin, Texas -- October 17, References Z39.50 Interoperability Testbed  Indexing Guidelines to Support Z39.50 Profile Searches  delines1Feb2002.pdf delines1Feb2002.pdf