Toward a post-MARC view of bibliographic metadata Jean Godby, Senior Research Scientist Triangle Research Libraries Network workshop -- Chapel Hill, North.

Presentation transcript:

Toward a post-MARC view of bibliographic metadata
Jean Godby, Senior Research Scientist
Triangle Research Libraries Network workshop -- Chapel Hill, North Carolina
March 15, 2012

Post-MARC bibliographic metadata
Outline for today
1. How did I get to this place?
2. The Library of Congress Bibliographic Framework for Digital Resources
3. The OCLC Beyond MARC work agenda
4. Four guiding assumptions
5. Some questions

Translations in the Crosswalk service
Hub (center of the diagram): OCLC MARC
Inputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC
Outputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC
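One way to see why a translation service routes everything through a single hub format is to count the mappings that have to be maintained. The sketch below is illustrative only: the format list is read off the slide, and the function names are hypothetical, not part of the Crosswalk service.

```python
# Formats named on the slide; OCLC MARC is the hub shown in the diagram.
FORMATS = ["ONIX Books 2.1", "ONIX Books 3.0", "MODS", "Dublin Core",
           "DC-Qualified", "MARC", "OCLC MARC"]
HUB = "OCLC MARC"


def pairwise_mappings(formats):
    """Directed mappings needed if every format maps directly to every other."""
    n = len(formats)
    return n * (n - 1)


def hub_mappings(formats, hub):
    """Directed mappings needed if each non-hub format maps only to and from the hub."""
    spokes = [f for f in formats if f != hub]
    return 2 * len(spokes)


print(pairwise_mappings(FORMATS))  # 42
print(hub_mappings(FORMATS, HUB))  # 12
```

The hub design keeps the number of mappings small, but it also means that every quirk of the hub format is felt in every translation.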

Problems with mapping to and from MARC
Problem: In a MARC record, some critical information is represented redundantly.
Effect on the Crosswalk: This requires one-to-many mappings, which are semantically opaque and difficult to maintain.
Problem: Some MARC fields are ambiguous.
Effect on the Crosswalk: The distinctions are difficult to recover or may be lost.
Problem: Many MARC free-text fields have formatting requirements.
Effect on the Crosswalk: The formatting must be added in (and taken out).
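A familiar case of formatting that has to be "added in and taken out" is the ISBD punctuation that conventionally ends a MARC subfield, such as the " /" that closes a title before the statement of responsibility. The following is a minimal sketch under that assumption; the helper names and the set of separators are hypothetical and far from complete.

```python
import re

# Trailing ISBD separators that cataloging practice places at the end of a
# MARC subfield to introduce the next one. Illustrative subset only.
_TRAILING_ISBD = re.compile(r"\s*[/:;,=]\s*$")


def strip_isbd(value: str) -> str:
    """Remove one trailing ISBD separator when mapping out of MARC."""
    return _TRAILING_ISBD.sub("", value)


def add_isbd(value: str, separator: str) -> str:
    """Re-attach a separator (space-delimited) when mapping back into MARC."""
    return f"{value} {separator}"


title = strip_isbd("McBain's ladies /")
print(title)                 # McBain's ladies
print(add_isbd(title, "/"))  # McBain's ladies /
```

Even this toy version has to know which separator belongs after which subfield, which is knowledge that lives in the cataloging rules rather than in the record.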

And so forth… and so on
Problem: Many formatting requirements are explicitly stated only in cataloging rules, not in the data that is algorithmically processed.
Effect on the Crosswalk: Knowledge of the cataloging rules must be embedded in the translation software.
Problem: Some MARC fields are coded with hidden assumptions.
Effect on the Crosswalk: Knowledge of the hidden assumptions must be embedded in the translation software, which requires complex and brittle Boolean logic.
Problem: MARC has a long tail.
Effect on the Crosswalk: It is necessary to maintain a large number of mappings that are not used.
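To give a feel for the Boolean logic involved, here is a small sketch that guesses a coarse material type from two positions of the MARC 21 Leader. The code values (Leader/06 "a", "e", "i", "j" and Leader/07 "m", "s") follow the MARC 21 bibliographic format as commonly documented; the output labels and the function itself are hypothetical, and a real mapping would consult many more fields.

```python
def material_type(leader: str) -> str:
    """Guess a coarse material type from MARC 21 Leader positions 06 and 07.

    Only a handful of code combinations are handled; this is a sketch,
    not a complete or authoritative mapping.
    """
    record_type = leader[6]  # Leader/06: type of record
    bib_level = leader[7]    # Leader/07: bibliographic level

    if record_type == "a" and bib_level == "m":
        return "Book"                   # language material, monograph
    if record_type == "a" and bib_level == "s":
        return "Serial"                 # language material, serial
    if record_type == "j":
        return "Music recording"        # musical sound recording
    if record_type == "i":
        return "Spoken-word recording"  # nonmusical sound recording
    if record_type == "e":
        return "Map"                    # cartographic material
    return "Unknown"


print(material_type("00000nam a2200000 a 4500"))  # Book
print(material_type("00000njm a2200000 a 4500"))  # Music recording
```

Each new material type adds another branch, and each branch encodes an assumption that is nowhere visible in the record itself, which is why such logic tends to become brittle.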

MARC's complexity needs to be quarantined.
Hub (center of the diagram): RDA or other structured metadata vocabulary
Inputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC
Outputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC

In other words, with MARC in the center of our model…

Bibliographic framework is... an environment rather than a format
The new bibliographic framework we are aiming for will broaden participation in the network of resources, librarians will be able to do a much better job of linking their patrons to resources of all kinds (from the library and from many other sources), and costs can be better contained.
-- Library of Congress, A Bibliographic Framework for the Digital Age (October 31, 2011)

OCLC's Beyond MARC research agenda theme
[Word cloud: resource, relationship, manifestation, entity, object, data, abstract, library, RDA, service, format, linked, authority, MARC, carrier, groundtruthing, FRBR, semantic, beyond, content, transformation, RDF, instance, description, statement, schema, role, hadoop, property, UML, model, identifier, legacy, web]

The OCLC Beyond MARC research agenda: who's involved
Eric Childress, Consulting Product Manager
Jean Godby, Senior Research Scientist
Thom Hickey, Chief Scientist
Devon Smith, Consulting Software Engineer
Karen Smith-Yoshimura, Program Officer
Roy Tennant, Senior Program Officer
Diane Vizine-Goetz, Senior Research Scientist
Jeff Young, Software Architect

Assumption 1: There are many moving targets

The OCLC Research response: Some guiding principles
Don't add to the complexity.
Use publicly defined standards wherever possible.
Leverage the work of others.
Focus on data preparation, cleanup, and modeling that will support a variety of formats.

Data preparation: principles
Make your stuff available on the web.
Make it available as structured data…
…in a non-proprietary format.
Use URLs to identify things.
Link your data to other people's data.
Source: W3C

Data, not text
Identifiers, not strings
Statements, not records
Machine-readable schema
Machine-readable lists
Source: Karen Coyle

Assumption 2: Most bibliographic metadata will not be created by libraries

Why ONIX is interesting
ONIX record (element tags not preserved):
BB 01 McBain's Ladies A01 Hunter, Evan 02 Policewomen--Fiction.
MARC record (some tags and values not preserved):
Leader jm a g eng
020 $a $a Hunter, Evan
245 $a McBain's ladies
260 $b Mysterious Press $d $a 320 p.
650 #2 $a Policewomen -- Fiction
[Callout labels on the two records: identifier, text, record, string, data]

A hypothetical bibliographic description expressed as linked data
[Markup example not preserved; surviving fragments: Ladies, A01, Evan]
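As a rough illustration of "identifiers, not strings" and "statements, not records", the same McBain's Ladies description can be broken into independent subject-predicate-object statements. This is a sketch, not the description shown on the slide: every example.org URI is a hypothetical placeholder, and the choice of Dublin Core Terms and FOAF properties is an assumption made for the example.

```python
# All example.org URIs are hypothetical placeholders for real identifiers.
BOOK = "http://example.org/work/mcbains-ladies"
AUTHOR = "http://example.org/person/evan-hunter"
SUBJECT = "http://example.org/subject/policewomen-fiction"

DCT = "http://purl.org/dc/terms/"
FOAF = "http://xmlns.com/foaf/0.1/"

# Each statement stands alone as a (subject, predicate, object) triple.
# Objects are either identifiers (URIs) or plain string literals.
statements = [
    (BOOK, DCT + "title", "McBain's Ladies"),
    (BOOK, DCT + "creator", AUTHOR),
    (BOOK, DCT + "subject", SUBJECT),
    (AUTHOR, FOAF + "name", "Hunter, Evan"),
]


def objects(subject, predicate):
    """Return every object asserted for the given subject and predicate."""
    return [o for s, p, o in statements if s == subject and p == predicate]


print(objects(BOOK, DCT + "creator"))  # ['http://example.org/person/evan-hunter']
print(objects(AUTHOR, FOAF + "name"))  # ['Hunter, Evan']
```

Because the author is an identifier rather than a string, any other statement about that person, from any source, can attach to the same node without touching the book's description.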

This list is inadequate for describing the range of material types held by libraries.

Some proposed library extensions to Schema.org.

The extensions are derived from MARC data for the WorldCat search interface.

The WorldCat search interface terms reduce a complex MARC concept space to a list.

Assumption 3: MARC will be around for a while.
Assumption 4: Mapping is still necessary.

A publishing model
[Diagram labels: OCLC Abstract Model, model, map, Raw Data, Standard Vocabularies]
Hub (center of the diagram): RDA or other structured metadata vocabulary
Inputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC
Outputs: ONIX Books 2.1, ONIX Books 3.0, MODS, Dublin Core, DC-Qualified, MARC, OCLC MARC

It is not enough to RDF-ify MARC.
The concepts must be extracted.
They eventually emerge.

Some (perhaps uncomfortable) questions
1. How much work will be involved in building out the abstract model? What is the value proposition?
2. How can we engage communities of practice to contribute to the parts of the abstract model that describe their resources?
3. How will mappings be implemented in the post-MARC information landscape?
4. How much information in the MARC record will get lost?
5. What will content standards look like in post-MARC descriptions?
6. How many of the FRBR and RDA concepts are algorithmically recoverable from legacy data?
7. What happens if linked data does not live up to its promise or is not adopted quickly enough?

But maps from many MARC concepts look like this. Set-theoretic mappings can be implemented elegantly in RDF/OWL.

References
Coyle, Karen. MARC 21 as data: a start. Taking library data from here to there.
Godby, Carol Jean. From records to streams: merging library and publisher metadata.
Library of Congress. A bibliographic framework for the digital age.
Library Linked Data Incubator Group final report.
OCLC. FAST Linked Data.
Schema.org.
Smith-Yoshimura, Karen, et al. Implications of MARC tag usage on library metadata practices.

Thank you!