The world’s libraries. Connected. MARC & The Trouble With Online Or, Metadata Carnage and Where We Go From Here ALA Midwinter, January 2013 Roy Tennant.

Presentation transcript:

MARC & The Trouble With Online
Or, Metadata Carnage and Where We Go From Here
ALA Midwinter, January 2013
Roy Tennant, Senior Program Officer, OCLC

The Hierarchy of Desire
Online in full, open access
Online in full, licensed on my behalf
Online in full, easily acquirable
Online in part
Offline, but easily acquirable
Offline, but can be acquired through delivery (ILL)
The Line of Damage

Where the Confusion Lies
The 856 URL applies to:
A digital “version” of the item
“The item” (often a “born digital” item)
Often clear
Often unclear
Table of Contents? Sample Chapter? Full Text? Etc.


Two Main Questions
What is online in full?
Of that, what is openly accessible?*
No time to discuss this aspect today
* Initially, for a US audience

Initial Investigations
OMG. I mean, srsly.

Number of URLs per host (Oct 2010)
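
The transcript preserves only the title of this chart, not its data. As a rough sketch of how such a tally might be produced, the snippet below counts hosts across a list of URLs, assumed here to have been pulled from 856 $u. The function name and the sample URLs are hypothetical and are not taken from the talk.

```python
from collections import Counter
from urllib.parse import urlsplit


def urls_per_host(urls):
    """Count how many URLs point at each host (hypothetical helper)."""
    counts = Counter()
    for url in urls:
        try:
            host = urlsplit(url).hostname  # lowercased by urllib, None if absent
        except ValueError:
            host = None  # malformed URL, e.g. an unbalanced IPv6 bracket
        if host:
            counts[host] += 1
    return counts


# Made-up sample input, standing in for URLs extracted from 856 $u.
sample_urls = [
    "http://example.org/a",
    "http://EXAMPLE.org/b",
    "https://example.com/x",
    "not a url",
]
tally = urls_per_host(sample_urls)
```

Strings with no recognizable host are skipped rather than counted, which is one possible choice; a real survey might want to report them separately.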

Values from 856 $z (public note)

Values from the 856 $3 (materials specified)

Magic Happens Here
Sure thing. Whatever you say.


A Drafty Algorithm I Can’t Make This Shit Up. Oh, Wait, I Did.

Algorithm: Info and Caveats
Based on assigning scores for certain field and/or value occurrences and/or their contents
We determined the scoring was good enough for our purposes
We DID NOT evaluate each individual score for its relevance (that is, some may not matter in the end)
We DID NOT identify all relevant uncontrolled text strings, especially foreign language terms
We implemented a final check to catch false positives

Plus 2 Scores
245 subfield $h has any of the following strings: “website”, “graphic”, “digital”, “internet”, etc.
530 has any of the following: “world wide web”, “digital”, “internet”, “electronic”, “online”, etc.
538 has any of the following: “world wide web”, “acrobat”, “internet”, etc.
856 has any of the following: “full”, “online”, “pdf”, “free access”, “electronic version”, etc.
ALL case insensitive
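
The "plus 2" rules on this slide can be sketched roughly as follows. This is a minimal illustration, not the code used in the study. It assumes a hypothetical record shape (a plain dict mapping a field key to a list of text strings) and assumes each rule contributes 2 points at most once. It uses only the strings the slide actually names, since the "etc." entries are not recoverable.

```python
# Strings named on the slide; the slide's "etc." entries are unknown and omitted.
# Keys are a hypothetical record shape: "245h" stands for 245 subfield $h.
PLUS_TWO_STRINGS = {
    "245h": ["website", "graphic", "digital", "internet"],
    "530": ["world wide web", "digital", "internet", "electronic", "online"],
    "538": ["world wide web", "acrobat", "internet"],
    "856": ["full", "online", "pdf", "free access", "electronic version"],
}


def plus_two_score(record):
    """Add 2 for each rule whose field text contains a listed string.

    Assumption: a rule scores once per record, however many strings match.
    """
    score = 0
    for key, needles in PLUS_TWO_STRINGS.items():
        # Lowercase the field text so matching is case insensitive.
        text = " ".join(record.get(key, [])).lower()
        if any(needle in text for needle in needles):
            score += 2
    return score


# Made-up example record, not real catalog data.
example = {
    "530": ["Also available on the World Wide Web."],
    "538": ["Requires Adobe Acrobat Reader."],
}
example_score = plus_two_score(example)
```

Substring matching is deliberately loose here, so a short string like “full” will also match inside longer words; that looseness is part of why a later false-positive check is needed.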

Plus 1 Scores
Byte 6 of the leader or 006 is ‘m’
Byte 23 or byte 29 of the 008 is ‘o’ or ‘s’
245 $h has any of the following strings: “electronic”, “elektronische”, “elecktronisk”, etc.
533 has any of the following strings: “world wide web”, “acrobat”, “internet”, etc.
856 second indicator 0
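
A comparable sketch for the "plus 1" rules follows, again over a hypothetical dict-shaped record rather than any real MARC library. Three assumptions go beyond the slide: each rule adds 1 at most once, the 006 test is read as "the 006 begins with ‘m’", and the string tests are case insensitive. The search strings are copied as they appear on the slide.

```python
# Hypothetical record shape (not a real MARC library API):
#   "leader": str, "006": list of str, "008": str,
#   "245h": list of str, "533": list of str, "856_ind2": list of str
TITLE_MEDIUM_STRINGS = ["electronic", "elektronische", "elecktronisk"]
REPRODUCTION_STRINGS = ["world wide web", "acrobat", "internet"]
ONLINE_FORM_CODES = ("o", "s")


def contains_any(texts, needles):
    """Case-insensitive substring test across all field strings."""
    haystack = " ".join(texts).lower()
    return any(needle in haystack for needle in needles)


def plus_one_score(record):
    """Add 1 for each slide rule that matches (assumed: once per rule)."""
    score = 0
    leader = record.get("leader", "")
    # Leader byte 6 is 'm', or (assumed reading) an 006 begins with 'm'.
    if leader[6:7] == "m" or any(f.startswith("m") for f in record.get("006", [])):
        score += 1
    f008 = record.get("008", "")
    # Slicing avoids an IndexError on short or missing 008 strings.
    if f008[23:24] in ONLINE_FORM_CODES or f008[29:30] in ONLINE_FORM_CODES:
        score += 1
    if contains_any(record.get("245h", []), TITLE_MEDIUM_STRINGS):
        score += 1
    if contains_any(record.get("533", []), REPRODUCTION_STRINGS):
        score += 1
    if "0" in record.get("856_ind2", []):
        score += 1
    return score


# Made-up example record, not real catalog data.
example = {
    "leader": "00000nmm a2200000 a 4500",
    "008": " " * 23 + "o" + " " * 16,
    "245h": ["[Elektronische Ressource]"],
    "856_ind2": ["0"],
}
example_score = plus_one_score(example)
```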

Final Check
If the score is greater than or equal to 2:
If the 856 has any of the following strings: “table of contents”, “publisher description”, “biographical information”, “Inhaltsverzeichnis”, “sample text”, “book review”, “abstract”, etc., SET TO ZERO
Otherwise, declare the item to be ONLINE IN FULL
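
The final check might look something like the sketch below. It takes an already computed score and the text of the record's 856 fields, both hypothetical inputs. It assumes case-insensitive matching and uses only the strings the slide names, because the "etc." entries are unknown.

```python
# Strings named on the slide, lowercased for case-insensitive matching (assumed).
PARTIAL_CONTENT_STRINGS = [
    "table of contents",
    "publisher description",
    "biographical information",
    "inhaltsverzeichnis",
    "sample text",
    "book review",
    "abstract",
]


def is_online_in_full(score, field_856_texts):
    """Apply the final false-positive check to a precomputed score."""
    if score < 2:
        return False  # never reached the threshold
    text = " ".join(field_856_texts).lower()
    if any(marker in text for marker in PARTIAL_CONTENT_STRINGS):
        return False  # the slide's "SET TO ZERO" case
    return True  # declared ONLINE IN FULL


# Made-up inputs, not real catalog data.
full_text = is_online_in_full(3, ["Full text online"])
toc_only = is_online_in_full(3, ["Table of contents http://example.org/toc"])
low_score = is_online_in_full(1, ["Full text online"])
```

This sketch pools all 856 fields of a record, so one table-of-contents link zeroes the record even if another 856 points to full text. The slide does not say whether the check was applied per field or per record, so that choice is a guess.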

What Then?
There is no sanctioned method for encoding this information in a MARC record unambiguously and machine understandably
Our suggestions:
Short-term: We find an appropriate method to unambiguously record this information in MARC21
Long-term: Build into whatever replaces MARC the ability to unambiguously declare when an item is available in full, AND a set of unambiguous and controlled markers for varying levels of access

Main Take-Aways
We believe it is possible to algorithmically determine when a URL leads to the full item with roughly 80/20 accuracy
We also believe it is possible to determine open access vs. gated access at roughly the same rate
There is presently NO approved way to encode this unambiguously in MARC21
We MUST have the ability to encode these aspects now and into the future

Thank you for your time.
Roy Facebook.com/roytennant/