FRBR information exchange
  Thomas Hickey & Jenny Toves
  OCLC Research
Current FRBR information exchange
  Sets of MARC 21 records
    Both bibliographic and authority
    Sometimes extended
  pKeys
  Unique pKeys
  Lists of sets of control numbers
  xISBN web service
  superWork records
Some background
  Our FRBRization has been done primarily at the work level
  We have FRBRized
    OCLC WorldCat
      ~60,000,000 records
      ~1,000,000,000 holdings
      Used in Open WorldCat and FictionFinder now
      Will be visible in FirstSearch displays this fall
    Norwegian BIBSYS records
    Finnish national bibliography (now in WorldCat)
    Electronic thesis metadata
  Processing done on a 24-node Beowulf Linux cluster
MARC 21 bibliographic data
  Basic method of accepting information
  Other formats get mapped into it
  Fields we use:
    Author main entry
    Titles
    ISBN
    Personal name added entries
    Language
  Extensions
    BIBSYS use of 490 fields to indicate hierarchy
MARC 21 authority data
  Map personal names using cross references
  Map author-titles using cross references
  Fields we currently use
    008 fixed field
    100, 130, 400
  Extensions
    Files of additional cross references
    Common title patterns
    xISBN matching
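The mapping of personal names through cross references can be sketched roughly as follows. This is a minimal illustration, not the OCLC implementation: it assumes each authority record has already been reduced to a dictionary holding its established heading (field 100) and a list of variant headings (field 400). The function names and the record shape are hypothetical.

```python
def normalize(name):
    """Lowercase and collapse whitespace (an assumed, simplified normalization)."""
    return " ".join(name.lower().split())


def build_name_map(authority_records):
    """Map every variant heading (400) to its established heading (100)."""
    name_map = {}
    for record in authority_records:
        established = normalize(record["100"])
        name_map[established] = established
        for variant in record.get("400", []):
            name_map[normalize(variant)] = established
    return name_map


def resolve_name(name, name_map):
    """Return the established form if known, else the normalized input."""
    key = normalize(name)
    return name_map.get(key, key)


# Hypothetical authority record used only for illustration.
records = [{"100": "Twain, Mark", "400": ["Clemens, Samuel Langhorne"]}]
name_map = build_name_map(records)
```

With this map, a bibliographic record entered under a variant name resolves to the same heading as one entered under the established name, so both can end up in the same work-set.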
pKeys
  An author-title key for matching
  Derived from MARC-like records & authority data
    ocm shakespeare, william\ /hamlet
    ocm /hamlet/shakespeare, william\
    ocm hamlet motion picture 1948
    ocm /hamlet/ocm
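A rough sketch of how an author-title key of this shape might be derived is given below. The exact normalization rules are not stated on the slide, so the ones used here (strip accents, lowercase, drop punctuation other than commas) are assumptions, as are the function names. The layout `author\dates/title` is inferred from the examples shown, and the title-only form is simplified to `/title/`.

```python
import re
import unicodedata


def normalize(text):
    """Strip accents, lowercase, and drop punctuation other than commas."""
    decomposed = unicodedata.normalize("NFKD", text)
    unaccented = "".join(c for c in decomposed if not unicodedata.combining(c))
    cleaned = re.sub(r"[^a-z0-9, ]+", " ", unaccented.lower())
    # Collapse runs of whitespace and trim stray commas at either end.
    return " ".join(cleaned.split()).strip(", ")


def make_pkey(title, author=None, dates=""):
    """Build an author-title key, or a title-only key when no author is given."""
    title_key = normalize(title)
    if author:
        return normalize(author) + "\\" + normalize(dates) + "/" + title_key
    return "/" + title_key + "/"
```

For example, `make_pkey("Canterbury tales", "Chaucer, Geoffrey,", "d. 1400")` yields a key with the author, a backslash, the normalized dates, a slash, and the title, in the same pattern as the Chaucer key in the unique pKeys list.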
Unique pKeys
  pKeys that have been sorted and counted
    692 sw milton, john\ /poems
    691 sw puccini, giacomo\ /tosca
    690 sw chaucer, geoffrey\d 1400/canterbury tales
    688 sw melville, herman\ /moby dick
    682 sw china/laws etc
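Sorting and counting pKeys can be illustrated with a short sketch. It assumes the pKeys are already available as a list of strings, one per record, and it leaves out the work-set identifiers shown in the listing above. The function name is hypothetical.

```python
from collections import Counter


def unique_pkeys(pkeys):
    """Return (pkey, count) pairs, most frequent first, ties broken alphabetically."""
    counts = Counter(pkeys)
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Illustrative input only; real counts come from the full record set.
sample = [
    "melville, herman\\/moby dick",
    "puccini, giacomo\\/tosca",
    "melville, herman\\/moby dick",
]
ranked = unique_pkeys(sample)
```

Each distinct key then stands for one candidate work, and its count is the number of records that matched it.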
Lists of control numbers sw sw sw sw sw sw
xISBN web service
  Takes an ISBN as input
  Returns list of ISBNs in associated work
  Significant processing
    Starts with control-number list of work-sets
    Uses ISBNs to pull work-sets together
    Allows fuzzy-matching on author/title
    Ends up with consistent clusters
      In general larger than those in control-number list
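The step of using ISBNs to pull work-sets together can be sketched with a union-find structure: any two work-sets that share an ISBN are merged into one cluster. This is only an illustration under stated assumptions. The input shape (a dictionary from work-set id to a set of ISBN strings) and the function names are hypothetical, and the fuzzy author/title matching mentioned above is left out.

```python
def merge_work_sets(work_sets):
    """Merge work-sets that share at least one ISBN; return a list of ISBN sets."""
    parent = {work_id: work_id for work_id in work_sets}

    def find(work_id):
        # Follow parent links to the root, shortening the path as we go.
        while parent[work_id] != work_id:
            parent[work_id] = parent[parent[work_id]]
            work_id = parent[work_id]
        return work_id

    first_seen = {}  # ISBN -> first work-set in which it appeared
    for work_id, isbns in work_sets.items():
        for isbn in isbns:
            if isbn in first_seen:
                root_a, root_b = find(first_seen[isbn]), find(work_id)
                if root_a != root_b:
                    parent[root_b] = root_a
            else:
                first_seen[isbn] = work_id

    clusters = {}
    for work_id, isbns in work_sets.items():
        clusters.setdefault(find(work_id), set()).update(isbns)
    return list(clusters.values())


def xisbn_lookup(isbn, clusters):
    """Return the sorted ISBNs of the cluster containing isbn, or an empty list."""
    for cluster in clusters:
        if isbn in cluster:
            return sorted(cluster)
    return []


# Placeholder ISBN strings, for illustration only.
clusters = merge_work_sets({"w1": {"A", "B"}, "w2": {"B", "C"}, "w3": {"D"}})
```

Because merging is transitive, the resulting clusters can be larger than any single work-set in the control-number list, which matches the observation on the slide.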
xISBN examples
  [ , ]:
    sw barnea, amir/agency problems and financial contracting
    sw barnea, amir/agency problems on financial contracting
  [ x, , , , ]:
    x sw /collins new school dictionary/ocm
    sw /collins new school dictionary/ocm
    sw /collins new school dictionary/ocm
xISBN XML response
  [sample XML response not recoverable]
superWorks format
  Developed for FictionFinder
  XML format
  Includes expression-level information
    All the information needed
  We are adapting it to the Curioser project
superWork record layout
  pKey
  # manifestations, holdings, sw-id, control #s
  publication dates
  expressions
  expression classes
  language
  authors
  titles
  subjects
  components
  author, title, publication data
Summary
  Simpler when only work-level relationships are needed
  Even for work-level relationships, a number of different formats are useful
  Information needed for an interface gets much more complicated