Build Better Data: Best Practices for Catalog Cleanup CT Library Association, April 23, 2018 Diane Napert, Interim Director Monographic Processing Services,

Build Better Data: Best Practices for Catalog Cleanup CT Library Association, April 23, 2018 Diane Napert, Interim Director Monographic Processing Services, Yale University Library

The Numbers
- Yale has three holdings symbols in OCLC, mainly due to Interlibrary Loan
- 10,430,569 active bib records; 674,819 suppressed records
- An estimated 3,534,654 authority records
- We imported over 300,000 records in 2017, including batch loads for e-resources and print material. That figure does not include the law library, which uses Millennium (Innovative)

The Yale Library Tale
- From Voyager 8.1 to 10 over the holiday break 2017/2018
- Hardware configuration began in August/September
- The underlying system also moved to Linux (was Oracle)
- Oracle to Workday, summer 2017

Upgrade Planning

Complications after upgrade
- 64 issues on the problem report

Tools
- SQL
- Oracle SQL Developer – free from website: http://www.oracle.com/technetwork/developer-tools/sql-developer/overview/index-097090.html
- Excel – Power Pivot, VLOOKUP function
- Access, MySQL (relational databases)
- MarcEdit - http://marcedit.reeset.net/
- Voyager Global Headings Change
- Cataloger’s Toolkit (Gary Strawn) – works with Voyager (authority clean-up in bib records)
- Voyager Global Data Change – more robust starting with Voyager 9
- PyMARC (Python report-writer for bibliographic records using MARC 21)
- Digital Information Research Specialist – Library GitHub https://github.com/younnoh/diff_marc
- LINQPad https://www.linqpad.net/
- OpenRefine - backlog searching, checking links in bib records
- BaseX http://basex.org/ – XML data clean-up
- Python, Perl, PHP scripting

Reports
- Bibs without holdings report is run periodically, as this clears out temporary ILL bib records which have no holdings
- Sub-divisions in records – hard to fix programmatically, so we write reports and correct manually. Yale original catalogers also correct them as they encounter them, as time permits
- Yale de-dupes bibliographic records during the load process for vendors which send bib records. The system also de-dupes
- Language codes missing, dates missing, in fixed fields. The Discovery Metadata Librarian wrote these reports in PyMARC, and there were thousands to correct manually (before QuickSearch, the discovery interface)
- Ran a report which listed On Order records: there were 3000+ old ones, records which never got overlaid when a book or item came in. This also points to a training issue
- Empty sub-fields report (Discovery Metadata Librarian report)
- Reports listing vouchers pending older than a certain date
- Old POs remaining unpaid
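As an illustration of the fixed-field and empty-subfield reports, here is a minimal sketch in plain Python. It is not the PyMARC code used at Yale. The record shape (a dict with "id", "008" and "fields"), the helper make_008 and the function names are all assumptions made for this example. The only MARC 21 facts relied on are that 008 positions 07-10 hold Date 1 and positions 35-37 hold the language code.

```python
# Hypothetical, simplified record shape (NOT pymarc's Record class):
# {"id": 1, "008": "<40 characters>", "fields": [(tag, [(code, value), ...]), ...]}

BLANK_LANG = {"   ", "|||"}    # blank or "no attempt to code"
BLANK_DATE = {"    ", "||||"}  # blank or "no attempt to code"


def make_008(date1="2018", lang="eng"):
    """Build a 40-character 008 for demo data (illustrative helper only)."""
    return "180423" + "s" + date1 + "    " + "ctu" + " " * 17 + lang + " " + "d"


def record_problems(record):
    """Return a list of problem labels for one record."""
    problems = []
    f008 = record.get("008", "")
    if len(f008) < 40:
        problems.append("008 missing or short")
    else:
        if f008[35:38] in BLANK_LANG:   # 008/35-37: language code
            problems.append("language code missing")
        if f008[7:11] in BLANK_DATE:    # 008/07-10: Date 1
            problems.append("date 1 missing")
    for tag, subfields in record.get("fields", []):
        for code, value in subfields:
            if not value.strip():
                problems.append(f"empty subfield {tag}${code}")
    return problems


def build_report(records):
    """Map record id to its problems, skipping clean records."""
    report = {}
    for record in records:
        problems = record_problems(record)
        if problems:
            report[record["id"]] = problems
    return report
```

A report like this only lists candidates. As noted above, the corrections themselves were made by hand.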

Considerations
- Timing - some of these are run at the end of the year to get all invoices paid
- Perhaps might not want to migrate some of this data into a new system
- Perhaps take into account the type of orders, firm vs ongoing/subscriptions
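One way to take order type into account when reporting on old purchase orders is to filter on it in the query. The sketch below uses Python's sqlite3 module with an in-memory database. The purchase_order table, its columns and the status and type values are hypothetical and are not Voyager's schema. A real report would run against the ILS's own tables, for example through Oracle SQL Developer.

```python
import sqlite3


def make_demo_db():
    """Create an in-memory database with a hypothetical purchase_order table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE purchase_order ("
        "po_id INTEGER PRIMARY KEY, po_type TEXT, status TEXT, order_date TEXT)"
    )
    conn.executemany(
        "INSERT INTO purchase_order VALUES (?, ?, ?, ?)",
        [
            (1, "firm", "open", "2015-03-01"),     # old firm order, still open
            (2, "ongoing", "open", "2010-01-15"),  # subscription: expected to stay open
            (3, "firm", "paid", "2014-06-30"),     # already paid
            (4, "firm", "open", "2018-02-10"),     # recent, not yet a concern
        ],
    )
    return conn


def old_open_firm_orders(conn, cutoff_date):
    """List open firm orders placed before cutoff_date (ISO yyyy-mm-dd text)."""
    return conn.execute(
        """
        SELECT po_id, order_date
        FROM purchase_order
        WHERE status = 'open'
          AND po_type = 'firm'
          AND order_date < ?
        ORDER BY order_date
        """,
        (cutoff_date,),
    ).fetchall()
```

Dates are stored as ISO text here so that a plain string comparison sorts them correctly. Excluding ongoing orders keeps subscriptions, which are expected to stay open, out of the cleanup list.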

Future
- E-book records – improving the quality of current e-book records – pre-processing in MarcEdit
- Automating more reports
- Backlog by language – would like it to run automatically, and compare month to month
- Statistics – now use a report which requires manual input
- Productivity reports
- Linked Data
  - Casalini SHARE-VDE Project - http://share-vde.org/sharevde/clusters?l=en
  - Sent 10,217,644 bib records and 3,496,471 authority records to Casalini for conversion to BIBFRAME 2.0
  - 870,400,000 triples! (185 gigabytes of uncompressed data)
  - Data cleanup – data which is not machine actionable, omission of data (empty fields), fields converted to the wrong level, local fields that don’t convert (690 field, Beinecke)
  - LD4P – Linked Data for Production Project (Stanford lead, Mellon grant)
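The month-to-month backlog comparison could be sketched as follows. This assumes each monthly snapshot has already been reduced to a list of language codes, one per backlog record. How the snapshot is extracted from the catalog is not shown, and the function name is made up for this example.

```python
from collections import Counter


def compare_backlog(previous, current):
    """Compare two snapshots of backlog language codes.

    Returns {language: (previous_count, current_count, change)} for every
    language that appears in either snapshot.
    """
    prev_counts = Counter(previous)
    cur_counts = Counter(current)
    languages = sorted(set(prev_counts) | set(cur_counts))
    # Counter returns 0 for languages missing from one of the snapshots.
    return {
        lang: (prev_counts[lang], cur_counts[lang], cur_counts[lang] - prev_counts[lang])
        for lang in languages
    }
```

For example, comparing ["eng", "eng", "fre", "ger"] with ["eng", "fre", "fre", "spa"] reports that English and German each dropped by one while French and Spanish each grew by one.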

Why?
- Productivity
- Informational
- Staffing
- Workflows
- Deleting data and enhancing record quality
- Future upgrades
- All for the users!!!

A little help from my friends
Thanks to:
- Éva Bolkovac, Assistant Catalog Management Librarian
- Debbie Falvey, Collection Procurement Librarian
- Lynette Robinson-Johnson, Acquisitions Assistant
- Angela Sidman, Director, E-Resources and Serials Management
- Steelsen Smith, Technical Lead