Transitioning from and Beyond MARC Karen Smith-Yoshimura 2010 RLG Partnership Annual Meeting Chicago, IL 10 June 2010.

Presentation transcript:

Transitioning from and Beyond MARC Karen Smith-Yoshimura 2010 RLG Partnership Annual Meeting Chicago, IL 10 June 2010

Where we are
Where we want to go
How do we get there?

Now: Managing MARC and non-MARC metadata
RLG Partners use same staff to create both MARC and non-MARC metadata? Yes: 64 (66%). No: 33 (34%).
RLG Partners create non-MARC metadata as part of routine workflows? Yes: 86 (80%). No: 22 (20%).
What We’ve Learned from the RLG Partners Metadata Creation Workflows Survey, 2009

Metadata Description Tools
RLG Programs Descriptive Metadata Practices Survey Results: Data Supplement 2007

What We’ve Learned from the RLG Partners Metadata Creation Workflows Survey, 2009

RLG Programs Descriptive Metadata Practices Survey Results: Data Supplement 2007

What We’ve Learned from the RLG Partners Metadata Creation Workflows Survey, 2009

Moving between old and new paradigms
Diagram labels: Subject; Publisher; Identifier; Contributor; Physical description; AACR2 encoding; ISBD punctuation; Non-MARC elements; MARC record

Example: Physical descriptions in ONIX and MARC
MARC: Leader jm; 007 sdfsngnnmmned; 245 $a #1 Puccini album $h [sound recording]
ONIX: AC; 01; #1 Puccini Album
Over-specified relationship
Redundant information
Maps between coded & textual information unreliable
Carol Jean Godby, “Mapping Bibliographic Metadata”, NETSL Annual Spring Conference,

Some problems with crosswalking MARC
Extra effort is required to add, validate, and dismantle ISBD and AACR2 rules.
The ISBD and AACR2 layers are not a worldwide standard.
Vocabulary and semantic concepts are different.
Differences in punctuation and formatting require crosswalks to peek at the data.
As a result:
- The mappings are brittle.
- Duplicate detection is difficult.
Carol Jean Godby, “Mapping Bibliographic Metadata”, NETSL Annual Spring Conference,
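To make the "peek at the data" problem concrete, here is a minimal sketch of a crosswalk step, not Godby's actual method. The tag-to-element pairings in FIELD_MAP, the function names, and the sample record are assumptions made up for illustration (only the "#1 Puccini album" title comes from the slides).

```python
import re

# Hypothetical mapping from (MARC tag, subfield) to Dublin Core-style
# element names. Illustrative only, not an official crosswalk.
FIELD_MAP = {
    ("245", "a"): "title",
    ("260", "b"): "publisher",
    ("300", "a"): "extent",
}

# ISBD separators that cataloguing practice leaves at the end of a subfield.
TRAILING_ISBD = re.compile(r"\s*[/:;,=]\s*$")


def strip_isbd(value: str) -> str:
    """Remove one trailing ISBD separator so the bare value can be reused."""
    return TRAILING_ISBD.sub("", value).strip()


def crosswalk(marc_fields):
    """Map (tag, subfield, value) triples to a flat element dictionary."""
    out = {}
    for tag, code, value in marc_fields:
        element = FIELD_MAP.get((tag, code))
        if element is not None:
            # The crosswalk has to inspect the value itself, not just the tag.
            out.setdefault(element, []).append(strip_isbd(value))
    return out


# Invented sample record for illustration.
record = [
    ("245", "a", "#1 Puccini album /"),
    ("260", "b", "Decca,"),
    ("300", "a", "1 sound disc :"),
]
result = crosswalk(record)
print(result)
```

The regular expression cannot tell an ISBD separator from punctuation that genuinely belongs to the value, which is one way such mappings end up brittle.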

39 tags (of 199 total) with 5% or more occurrences:
100%: 001, 008, 040,
% - 99%: 020, 100, 260, 300, 500, 650,
% - 19%: 007, 010, 016, 043, 050, 082, 250, 440, 490, 504, 710
5% - 9%: 015, 024, 041, 084, 110, 246, 502, 505, 520, 533, 600, 610, 651, 653, 830, 856, 880
Chart labels: 65%, 15%, 9%, 6%, 4%, 2%

Some MARC fields are more heavily used in specific formats than in WorldCat as a whole…
Mixed Materials: Greatest Variances
Fields compared (Mixed % vs. WorldCat %):
520 Summary, etc.
Index term - genre/form
Biographical or historical data
Cumulative index/finding aids note
Immediate source of acquisition note
Organization and arrangement of material
Preferred citation of described materials note
Action note
Linking entry complexity note
Ownership and custodial history
Implications of MARC Tag Usage on Library Metadata Practices Webinar

Catherine Argus (NLA): comparison of MARC fields indexed in Amicus, COPAC, Libraries Australia, WC.org and FirstSearch, for mixed material (3 records)
Colour key: Searching in all databases; Searching in 4 databases; Searching in 3 databases; Searching in 2 databases; Searching in 1 database; Searching in no databases; Limiting in any database
Implications of MARC Tag Usage on Library Metadata Practices Webinar

Some implications
MARC data cannot continue to exist in its own discrete environment. It will need to be leveraged and used in other domains to reach users in their own networked environments.
MARC is a niche data communication format approaching the end of its life cycle.
Future systems need to take advantage of linked data to meet users’ needs. MARC is not the solution.
Future encoding schemas will need to have a robust MARC crosswalk to ingest millions of legacy records.
Implications of MARC Tag Usage on Library Metadata Practices, 2010

We’re already repurposing the metadata we have

OCLC’s xISSN Web Service: xissn.worldcat.org/

OCLC Web Services’ Application Gallery: oclc.org/applicationgallery/

Where we want to go: The Semantic Web
“I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers.” (Tim Berners-Lee)

Where we are
- Creating MARC and non-MARC metadata, often redundantly.
- Limited reuse outside the library domain.
- Metadata created by libraries generally hidden or buried in Web results.
Where we want to go
- Create metadata once, and reuse in different contexts.
- Expanded reuse of metadata from a variety of sources for own context.
- Contribute own metadata to the Semantic Web for discovery and metadata creation.

How do we do it?
- Define data elements in an actionable way
- Define controlled lists in an actionable way
- Assign identifiers that will be unique on the web
- Create the data using these elements and lists
- Share the data
Enable users/machines to combine selected data elements as they need them.
Karen Coyle, “Directions in Metadata”, TechSource Webinar,

How we get there
- Move beyond “records” and converse with the rest of the networked world.
- Aggregate “records” from statements when we need them.
- “Statement-based” data can be managed and improved more easily than record-based data.
- Statement-based data can carry provenance for each statement.
Link data instead of copying it.
Diane Hillmann, “Application Profiles”, ALA ALCTS: CCDA
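One way to picture statement-based data is the rough sketch below, which stores individual statements with a provenance field and assembles a record-like view only on request. The Statement type, the "ex:" identifiers, and the source names are hypothetical and chosen only for illustration; this is not a description of any particular system.

```python
from collections import defaultdict
from typing import NamedTuple


class Statement(NamedTuple):
    subject: str    # identifier of the thing described
    predicate: str  # data element name
    value: str      # literal value, or identifier of another thing
    source: str     # provenance: who asserted this statement


# Hypothetical identifiers and sources, for illustration only.
statements = [
    Statement("ex:work/hamlet", "title", "Hamlet", "library-a"),
    Statement("ex:work/hamlet", "creator", "ex:person/shakespeare", "library-a"),
    Statement("ex:work/hamlet", "language", "English", "library-b"),
    Statement("ex:person/shakespeare", "name", "Shakespeare", "authority-file"),
]


def aggregate(subject, stmts, trusted=None):
    """Assemble a record-like view of one subject on demand.

    If `trusted` is given, only statements from those sources are used.
    """
    record = defaultdict(list)
    for s in stmts:
        if s.subject == subject and (trusted is None or s.source in trusted):
            record[s.predicate].append(s.value)
    return dict(record)


full = aggregate("ex:work/hamlet", statements)
only_a = aggregate("ex:work/hamlet", statements, trusted={"library-a"})
print(full)
print(only_a)
```

Because the creator is stored as an identifier rather than a copied name string, the work links to the person statement instead of duplicating it, and each statement can be corrected or filtered by source without touching the others.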

Linked data
“… a method of exposing, sharing, and connecting data via dereferenceable URIs on the Web.” (Wikipedia)
Bridges the gap between our technologies and the rest of the world’s

Why linked data?
Share data in a non-library-centered exchange format.
- MARC not popular with the Web community
- Dublin Core not semantically rich
Provide a framework for sharing semantically rich data in a Web-friendly way.
Participate in the Semantic Web.

Semantic Web Syntax: RDF
Resource Description Framework: Markup syntax exposing semantic richness of MARC21 and structural richness of AACR2
For everything you want to talk about
- Give it a URI (Uniform Resource Identifier)
- Provide useful information at that URI
Talk about things
- Not just descriptions of things
- Use structure (e.g. metadata)
- Link to other resources

Vocabularies available in RDF: dewey.info

id.loc.gov/authorities

Virtual International Authority File (VIAF) Application/RDF as xml:

Taking off? National Library of Sweden VIAF LCSH R|D|A

RDA Linked Data
Diagram labels: Hamlet; México, D.F; English; Spanish; French; German; Shakespeare; Library of Congress; Copy 1; Green leather binding; Romeo and Juliet; Stoppard; Rosencrantz & Guildenstern Are Dead; Text; Movies; …; Derivative works; Subject
Barbara Tillett, “Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web”, NETSL,

Switching Languages
Diagram labels: Hamlet; México, D.F; Inglés; Español; Francés; Alemán; Shakespeare; Library of Congress; Copia 1; Encuadernación en piel color verde; Romeo y Julieta; Stoppard; Rosencrantz & Guildenstern Are Dead; Texto; Películas; …; Obras derivadas; Materias
Barbara Tillett, “Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web”, NETSL,

Prototype from Europeana’s “Thought Lab” of a semantic search engine: eculture.cs.vu.nl/europeana/session/search

Europeana’s “Thought Lab” data cloud: version1.europeana.eu/web/europeana-project/whitepapers

Discussion
What ideas do you have for “next steps” to transition beyond MARC and make our metadata part of the Semantic Web?

Next up
3:30 Collections Futures
David Lewis, Indiana University-Purdue University Indianapolis
Buckingham