1 Maintaining the integrity of e-book titles in CityU library catalogue 7 th HKIUG, 12 Dec 2006, HKUST Joanna Pong, Philip Wong Run Run Shaw Library City.

Slides:



Advertisements
Similar presentations
BETH BRENNAN CHRISTINE MOULEN ELUNA 5/2/2014 Automating MARCit! for a single-record approach.
Advertisements

CatWork: Practical Experiences in Automation for Retrospective Conversion, Reclassification and Backlog Reduction LO Tin King The University Of Hong Kong.
Lisa Bradley Electronic Resources and Subscriptions Coordinator 27/9/12 Managing eBook Collections at the ANU.
1 JURO4C: Online Usage Reports for Consortia José Fernandes 12th October 2006.
Creation of an online catalog of dissertations using Access & ASP – slide 1 Creation of an online catalog of dissertations using Access & ASP: from Datatel.
1 Managing Serials Chaos Could “INNOPAC + SerialsSolutions” become Serials Solutions? Bill Tang Serials & E-Resources Librarian.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
Linking UML Resources to Google Scholar (and Beyond!) April 21 st, 2006 UML Reference Forum.
Using a Vendor’s System to Streamline Book Selection and Ordering Thomas Hung University of Hong Kong Libraries 3 rd HKIUG Meeting.
Experiences on services enabling integration of electronic resources: A to Z (EBSCO) and ScholarSFX (SFX Express with Google Scholar) Natalia Litvinova.
4th Annual HKIUG Meeting 9th December 2003 Developing an Online System for Book Selection and Hong Kong Baptist University Library & Lingnan.
HKUL & CityUL HKIUG – Dec03 InnReach Project in Hong Kong David Palmer Hong Kong University Libraries University of Hong Kong Eva Wong Run Run Shaw Library.
Enhancing bibliographic records in the INNOPAC by adding URLs of book reviews and other value-added information from online bookstores 10 th December 2002.
Staying Afloat in a Sea of E- journals An Automated Process for Cataloging Electronic Serials David Banush Nathan Rupp Cornell University EndUser April.
Swets Information Services SwetsWise Title Bank 13 th Panhellenic Libraries Conference th October Corfu.
Integration of Library Resources in the Development of the University Portal Eva Wong Run Run Shaw Library City University of Hong Kong HKIUG December.
Millennium Statistics : Beta Testing Experience Presented by Edward So, Run Run Shaw Library City University of Hong Kong HKIUG, Dec. 2002CityU Library.
HKALL Hong Kong Academic Library Link (HKALL) 香港高校圖書聯網 ( 港書網 ) An Accelerated Resource Sharing Project in Hong Kong.
Catalog: Batch delete old Patron Records How to conduct global/batch updates to records – patron Adding Faculty and Patron/Student Records Manually Standardizing.
Making sense of the data jumble Trinity College Library Dublin’s Discovery Solution Experience Arlene Healy & Charles Montague Digital Systems and Services.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
Batch-conversion of Non-standard Multiscript Records by XSLT Lucas Mak Metadata and Catalog Librarian Michigan State University Catalog Management Interest.
MANAGING E-BOOK ACQUISITION: THE COORDINATION OF "P" AND "E" PUBLICATION DATES Sarah Forzetting Collections Consultant Coutts Information Services Gabrielle.
E-books and consortia: business models, access issues, etc. Some ideas for a general discussion SELL meeting, Fiesole, May 23rd 2014 Session E.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. M I C R O S O F T ® Preparing for Electronic Distribution Lesson 14.
March 20, 2008Electronic Resources and Libraries College Center for Library Automation Tallahassee, FL Susan B. Campbell Susan.
Getting started on informaworld™ How do I register my institution with informaworld™? How is my institution’s online access activated? What do I do if.
What’s New in VRS? GUGM May 15, 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
The world’s libraries. Connected. WorldShare platform & Management Services Integrate all of your collections: print, licensed & digital Chris Thewlis.
Orbis/PORTALS E-journal Workshop Orbis/PORTALS E-journal Workshop May 9, 2002 Sarah Beasley - Portland State University and Bonnie Parks - Oregon State.
Managing Serials in an Electronic World the Stirling Experience Sonia Wilson University of Stirling Library 19 October 2004.
Integrating and managing your Engaging Networks data Top ten data features.
Slide 1 E-Books at OhioLINK : Expanding the Statewide Collection Dan Gottlieb, University of Cincinnati Karen Wilhoit, Wright State University.
Joyce Bell Catalog Division Coordinator Princeton University Bib Linking print and electronic records.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
CUFTS: Open-Source ERMS Andy Perry and Bill Drew SUNY New Paltz Tompkins Cortland Community College.
Let VRS Work for You! ELUNA Conference 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
The British Library and SUNCAT Brenda Young The British Library Bibliographic Development.
Relational Databases Melton, Beth “Databases: Access Terminology and Relational Database Concepts.” 09/LPMArticle.asp?ID=73http://pubs.logicalexpressions.com/Pub00.
The E-Book Dilemma: A Study of Aggregator and Publisher Options to Deliver Electronic Book Content.
Monograph Collection Development in an Age of Uncertainty: The University of Haifa Library Experience Cecilia Harel Head of Collection Development, Gifts.
Usage versus Cost Analytics for Selection Management and Informed Purchase Decisions MTA Budapest, October 2012.
5th SELL Meetting Lisboa, Activities report Government agreement to improve libraries 2.ILS change 3.ICOLC 4.Union catalogue 5.Digital.
ORBIS & PORTALS E-Journal Workshop Michael Markwith, TDNet Inc. Reed College Library May 9, 2002.
ICOLC Las Vegas March 28, 2003 TDNet E-Management Services for Consortia From E-Journals to E-Resources Michael Markwith President, TDNet Inc.
香港大學圖書館 Upstream Content Management in an ILS Downstream Integrated Access, Authentication, Portals & Statistics - Dr. Ku Kam-ming - David Palmer.
Informed decisions for Selection Support in Libraries 20th Pan-helenic Conference of Academic Libraries Thessaloniki, 14/11/2011 Núria Sauri Electronic.
MARCIt records for e-journals project to implement MARCIt service McGill University Library Feb
Vendor-Supplied Records Overview May 2015 Jemma Hazen MSC Technical Services Support.
CENDI/FLICC Workshop, June 21, 2000 Slide 1 of 24 The Impact of Reference Linking on the Creation and Use of References/Citations CENDI/FLICC Workshop.
Ebooks? John Akeroyd Milano March 7 th Ebook Readers.
Migration of Physical to Electronic (P2E) Resources in Alma
Once you acquire thousands e-books, then what? Shi Deng, UC San Diego OCLC CJK User Group Meeting March 24, 2007.
CSCI 6962: Server-side Design and Programming Shopping Carts and Databases.
Georgia Fujikawa and Bob McQuillan Electronic Resource Management: Getting a Running Start on Your Implementation May , 2009.
Caitlin Spears, Library Training Consultant Electronic Resource Management: Soup to Nuts April , 2008.
Walking a Tightrope in the Transition to Electronic Resources Debra G. Skinner Georgia Southern University GOLD/GALILEO Users Group Conference July 31,
Creating MARC Records for CLICnet Using E-journal Title Lists Amy Kreitzer College of St. Catherine Library.
Legal Digital Content through YBP Library Services Barbara Kawecki, Senior Manager, Digital Content (West) Beverley Geer, Collection Development Manager.
Give ‘em What They Want Patron-Driven Collection Development Karen Fischer, Collections Analysis & Planning Librarian Mike Wright, Head, Acquisitions &
Providing and Maintaining Access to Electronic Serials —from Consortium and Member University Library’s perspectives NASIG 31st Annual Conference June.
E-books in the Catalog: Managing MARC Records in Batches Bonnie Figgatt Sacred Heart University Library April 15 & 16, 2011.
Batchloading: Current Practices and Future Challenges Rebecca L. Mugridge Pennsylvania State University Libraries American Library Association January.
7th Annual Hong Kong Innovative Users Group Meeting
Creating Workflow Efficiencies in the E-book Ecosystem
Working the A to Z List enhance journal access in the OPAC
Ebooks in academic libraries: management and access issues
Vendor Records What to do?
Maintaining the integrity of e-book titles in CityU library catalogue
OpenURL: Pointing a Loaded Resolver
Presentation transcript:

1 Maintaining the integrity of e-book titles in CityU library catalogue 7 th HKIUG, 12 Dec 2006, HKUST Joanna Pong, Philip Wong Run Run Shaw Library City University of Hong Kong

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Table of Contents 1. Growth of e-books in CityU 2. Duplication problems 3. Attempted solutions 4. Effective Solutions 5. De-duplication jobs 6. Benefits and limitations

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU  E-book collection contains English e-books, Chinese e-books & e-theses  From 2001: NetLibrary (around 200 titles) To Oct 2006: > 200,000 titles English e-books: > 87,000 titles Chinese e-books: > 45,000 titles e-theses: > 70,000 titles

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d)  Acquisition of e-books from 2001 onwards English ebooks Chinese ebooks eThesesTotal > > > ,3001,40039,000>40, ,00044,00031,000>150, (Jul-Oct 06) >8,100 >87,000>45,000>70,000>200,000

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d) Total > 200,000 titles

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d)  Major e-book collections

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d)  E-theses

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d)  Consortial acquisition of e-books Digital Dissertation Consortium – since 2005 Apabi D-Lib Consortium – since 2006 NetLibrary Super E-book Consortium – since 2006  New consortia Electronic Resources Academic Library Link (ERALL), a JULAC project on collective e-book collection development

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Growth of e-books in CityU (cont’d)  Growth of e-book usages (from CGI Logs) -- showed an uprising trend eBooksYr 2004Yr 2005Yr 2006% Growth 05 to 06 Apabi % ebrary % netLibrary % Safari % Wiley InterScience % Digital Dissert. Con % ProQuest Dissert %

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems The variety of e-book collections and high number of titles created problems in cataloguing A major problem-> Title duplication We load records supplied by different vendors, resulted in title duplication More e-book titles, more title duplication  same title from different collections  same title from same collection

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems (cont’d)  Duplication from different collections

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems (cont’d)  Duplication from the same collection NetLibrary collection  Titles purchased by CityU since 2001  Titles acquired via Super-ebook Consortium

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems (cont’d) Same title from NetLibrary acquired in different period

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems (cont’d)  Duplication from the same collection (cont’d) UMI e-theses  Titles purchased by CityU since 2002  Titles acquired via Digital Dissertation Consortium  Titles in ProQuest Database

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Duplication problems (cont’d) Same UMI e-thesis title acquired in different period

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Attempted solutions  Single record approach in cataloguing We apply single record approach for all e-versions of the same title Applied to e-books and e-journals

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Attempted solutions (cont’d)  Duplication control in e-journals CityU applied and modified BU’s program to merge e- journal titles from aggregator databases

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Attempted solutions (cont’d)  Duplication control through manual methods For e-books, our previous solutions 1. Manual checking 2. Headings reports – duplicate call numbers 3. Loading through match field 001 – identify duplicate records 4. Encounter basis Okay when the number of titles remains small

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Attempted solutions (cont’d)  Duplication control through customized load profiles The first attempt to automate the procedure Utilized the local load profiles and translation table in INNOPAC to merge 2 sets of NetLibrary titles  Super E-book Consortium titles purchased in 2006  NetLibrary titles purchased since 2001  2,206 titles were found duplicated

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Attempted solutions (cont’d)  Duplication control through customized load profiles (cont’d) Using load profiles is not a complete solution  Cannot match multiple tags (cannot match tag 020 against tag 024)  Cannot match selected sets (cannot exclude print titles)  Cannot merge multiple records automatically; must output for manual checking to decide the master record

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions  Cataloguing worked with Systems to run de- duplication and merging of records  Prerequisite easy to apply able to fit in the existing workflow have flexibility to handle different sizes of e-book batches allow prompt or ad hoc loading of records if necessary

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Scope of de-duplication Include English e-books and e-theses  e-books: 88,000 records  e-theses: 70,000 records

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Scope of de-duplication (cont’d) Exclude Chinese e-books because  CityU so far only has one Chinese e-book collection, Apabi.  Vendor supplied unique records when we joined the Apabi D-Lib consortium (no duplication with previously purchased titles)  We will also handle Chinese e-books if we acquire other Chinese e-book collections in the future

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  What fields to match? E-books  Match ISBN – a relatively reliable tag  Match major MARC tags – 110 match key UMI e-theses  Use UMI number for matching

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  How to merge? Set the one with the earliest Create Date as the master record Add reproduction note (tag 533), name of book collection (tag 773) and URL link (tag 856) of the duplicate record(s) to the master record

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Matching algorithm of ISBN Print ISBN vs. e-book ISBN  Some records come with print ISBN, some with e- book ISBN, some with both  Both types are used for matching Different tags to store ISBN  020 $a, $z  024 (1 st indicator 3) $a, $z  776 $z  All the above are used for matching

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Matching algorithm of ISBN (cont’d) 13-digit ISBN vs. 10-digit ISBN  Starting on 1 Jan 2007, the ISBN is 13-digit  Some publishers already used 13-digit ISBN before that  Starting from 12 Nov 06, OCLC moves 13-digit ISBN to tag 020  13-digit ISBN with prefix “978” may have 10-digit equivalents, they are converted to 10-digit for matching

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Matching algorithm of ISBN (cont’d) ISBN with “noise”  Some ISBN include a note enclosed in parentheses  Do not use ISBN for matching if the text inside the parentheses indicates that the ISBN is for a set, a series, or a volume etc. e.g. “ (series : International library of psychology)”  Hints: look for keywords “set”, “series” and compare with Tag 440 and Tag 830

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Effective Solutions (cont’d)  Matching algorithm of the 110 Match Key To guarantee there is no mismatch by ISBN, construct additional match key based on INN- Reach 110 Match Key Title + Gen. Media + Pub. Year + Pagination + Edition + Publisher + Type of Record + Title Part + Title Number  Constructed the key and normalized  Refer to INN-Reach documentation for details

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs  Initial clean-up  Regular de-duplication

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  Initial clean-up One time -- to de-duplicate records that had been loaded 6,063 (7.2%) duplicate records were found, out of 84,756 English e-book titles Fine tune program after initial clean-up

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  Regular de-duplication Once every month Flexibility  Depends on no. of title loaded & urgency to load the records  Clean-up before loading vs. clean-up after loading

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  Regular de-duplication (cont’d) Procedures  Output e-book records from catalogue  Run de-duplication program to match with vendor records  Overlay records in catalogue with merged records  If vendor records have been loaded delete duplicate vendor records from catalogue  Else insert new vendor records into catalogue

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  Flow chart Match & Merge DeleteOverlay Master records Vendor records MergedDuplicatedNew INNOPAC Insert Vendor INNOPAC

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  De-duplication results Initial clean-up of e-books Total English e-book records % Records duplicated % Titles merged from 2 records % Titles merged from 3 records50.2% Titles merged from >= 4 records0 0.0%

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  De-duplication results Initial clean-up of e-books (cont’d) Books24x7ebrarynetLibrarySafariSpringerWileyTotal Books24x77 ebrary014 netLibrary Safari Springer Wiley Total (Misc)2 Distribution of titles merged from 2 records

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  De-duplication results Initial clean-up of e-books (cont’d)  We found that for the duplicated titles within the same collection, some will direct users to different e-books, this problem is more serious in ebrary.  Fine-tune program, add the condition: When two matched records have the same CGI scripts (i.e. belong to the same collection) but different book IDs, do not merge them, but flag for review

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  De-duplication results (cont’d) Initial clean-up of e-theses Total UMI e-thesis records % Records duplicated % Titles merged from 2 records % Titles merged from 3 records00.0% Titles merged from >= 4 records0 0.0%

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, De-duplication jobs (cont’d)  De-duplication results Initial clean-up of e-theses (cont’d) UMI (pdf)DDCProQuestTotal UMI (pdf)0 DDC2260 ProQuest2320 Total Distribution of titles merged from 2 records (DDC = Digital Dissertation Consortium) More than 4,000 DDC & ProQuest records had been de-duplicated with manual process (using 001 field) before the initial clean-up process.

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Benefits and limitations  Benefits Single record for all versions of the same e-book or e-thesis titles, maintain integrity in the library catalogue Save much staff time & manual effort Method applicable to other e-resources Management need – generate duplication statistics Can be applied to match existing e-book collections with e-book titles supplied by potential vendors – e-book collection development

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Benefits and limitations (cont’d)  Limitations Depends on data in vendor-supplied records  Incorrect match and merge in case of incorrect or incomplete data  Chinese e-book records  Brief bibliographic data  Lack of standardization in transcription  Difficult to construct reliable match-key  Sometimes lack of ISBNs

Maintaining the intergrity of e-book titles in CityU library catalogue, 7th HKIUG, Maintaining the integrity of e-book titles in CityU library catalogue Thank You! Joanna Pong Philip Wong