Download presentation
Presentation is loading. Please wait.
Published byLuke Warner Modified over 8 years ago
1
OCLC Cluster Service Leiden March 28 2007 Discussion Session With KB & UVA Janifer Gatenby, Strategic Research
2
2 Agenda Welcome and Introductions Presentation –Clustering –Audience Level –Copyright / Rareness –FAST subject headings Discussion Lunch
3
3 Some slides from NCSU’s Endeca Test Catalog using OCLC work identifiers for Clustering
4
4
5
5
6
6
7
7 Some slides from PiCarta (Netherlands) Test Catalog using OCLC work identifiers for Clustering
8
8 Without clustering
9
9 With Clustering
10
10 Consolidation of Holdings The above example shows 2 holdings, one each per bibliographic record. The consolidation of holdings permits Reservations (holds) and Requests at work level
11
11 Dutch NCSU 6.7 million work identifiers / 7.7 million bib records Collapse rate of 13% –Av. 1.15 bibliographic records per work record Software adaptation less than 1 week 1.64 million work identifiers / 1.7 million bib records Collapse rate of 3% –Av. 1.03 bibliographic records per work record
12
12 Method OCLC # OCLC Work IDTitle 6564779420842726Goldene vliess 2792161230369321Goldene vliess 577323519885466Goldene vliess 3663814912019603Goldene vliess 3663814912019603Goldene vliess 3663814912019603Goldene vliess
13
13 Method PPNOCLC # OCLC Work IDTitleComments 806377606564779420842726 Goldene vliessnot in main group 1245948832792161230369321 Goldene vliessnot in main group 36330531577323519885466 Goldene vliessnot in main group 806262033663814912019603 Goldene vliessin main group 18113649x3663814912019603 Goldene vliessin main group 805403333663814912019603 Goldene vliessin main group
14
14 Fixing Mismatches Alternatives –Fix data at source –Apply name / title authority records –Enhance algorithm Eliminate foreign articles Convert “fünf”, “vijf”, “cinq” to “5” etc. At OCLC –Quality control –Office of Research
15
15 Authorities Ensure Matching Foreign union catalogue data –Non AACR2, not native MARC21, other language of cataloguing, non standard uniform titles –Requesting 1,000 name / title authority records per union catalogue Bib record for a translation without uniform title will match if there is a comprehensive author / title authority record
16
16 Bib Authority 100 …Rowling, J.K. 245 …La chambre secrète ……………. Rowling, J.K. The secret chamber De geheime kamer La chambre secrète Die geheime kammer ……………
17
17 FRBR – Divide and conquer Creation of works (38 million) Algorithm Authority records Cleaning bibliographic records where necessary No manual links created Improved user interfaces Harvesting Loading IDs & records Authority records Improved user interfaces Suggestions for the improvement of the algorithm and records
18
18 ALA Mid Winter Meeting Representatives 19 libraries with substantial holdings in WorldCat Clear Requirements –XML cluster record service –Minimum of daily update
19
19 Discussion
20
20 Phase 2 Phase 1 – table Phase 2 – work record with enriched data –Audience level –Rareness –Copyright –FAST headings for faceted search
21
21 Audience Level and Rareness
22
22 OpenURL Request Transfer Message
23
23 Faceted Search
24
24 FAST headings Fully formed concepts Suitable for faceted search –LCSH “sentences” – breaking into concepts is tricky http://www.oclc.org/research/projects/fast/
25
25 Discussion
26
26 Cluster Identifier Type Value Instance/s Identifier/s + type Copyright estimate Holdings count (rarity) Description Related Works WC Cluster Identifier Instance/s Relationship (sequel etc.) OCLC Number
27
27
28
28 Deployment CBS 3.2 ++ incorporating cluster record in test due Easter Installation in LBS OCLC Distribution service – dev. To start in April PSI modifications to use cluster record Looking for testing partners
29
29
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.