Reconciling OCLC and Orbis Managing a Bibliographic and Holdings Synchronization Between Yale University Library and WorldCat Melissa A. Wisner.

Slides:



Advertisements
Similar presentations
FROM RLIN TO OCLC CONNEXION DIFFERENT WORKFLOWS AND DIFFERENT PRACTICE Teresa Mei East Asian Catalog Librarian Cornell University Library.
Advertisements

Serials Acquisitions Workflow East Central University Dana Belcher, Asst Library Director Ashley Romans, Cataloging/Government Documents Librarian SIGALO.
ALEPH version 19.01/20.01 Cataloging & Acquisitions/Serials Updates South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD
Large Scale Digitization Workflow Yale University Library January 2008.
BETH BRENNAN CHRISTINE MOULEN ELUNA 5/2/2014 Automating MARCit! for a single-record approach.
ExportQ Yale University Library. What Is ExportQ ? Written by Library Systems Office Used with Voyager Cataloging Two main functions –Facilitates record.
RMIT University An early Alma implementer. RMIT University 3 campuses – Melbourne + 2 Vietnam Offer programs through partners in 6 other countries 74,000.
ERM Holdings: Different Strokes for Different Folks Janet Crum, OHSU, Tom Larsen, PSU, Terry.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Zack Lane ReCAP Coordinator July 2013 ReCAP Columbia University.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
Batch-conversion of Non-standard Multiscript Records by XSLT Lucas Mak Metadata and Catalog Librarian Michigan State University Catalog Management Interest.
David Whitehair Director, Metadata Management, OCLC Cooperative cataloging with WorldShare Collection Manager and the WorldCat knowledge base.
Reclaiming your Catalog: Benefits of Batch Reclamation Roman S. Panchyshyn, MLIS Kent State University ALCTS CCS Catalog Management Interest Group ALA.
FireRMS SQL Audit, Archiving & Purging Presented by Laura Small FireRMS Quality Assurance.
The world’s libraries. Connected. Batchload Process for Alberta Libraries Carol Ritzenthaler Customer Support OCLC July 2013.
Vended Authority Control --Procedures and issues.
What’s New in VRS? GUGM May 15, 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
OCLC Online Computer Library Center MFHD Local Holdings Project Status (a.k.a. UL Migration) Myrtle Myers Product Manager, Holdings and Local Data.
AQS Web Quick Reference Guide Changing Raw Data Values Using Maintenance 1. From Main Menu, click Maintenance, Sample Values, Raw Data 2. Enter monitor.
WorldCat Local and Voyager Z39.50 Challenges and Solutions Andy Kohler - UCLA Library IT - Voyager Developer Meeting - March 25-26,
Cataloging and Metadata at the University Library.
Project Overview Bibliographic merging, Endeca, and Web application.
Using Voyager Reports Linda Taylor Oklahoma State University Edmon Low Library February 21, 2007.
Joyce Bell Catalog Division Coordinator Princeton University Bib Linking print and electronic records.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Let VRS Work for You! ELUNA Conference 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
Data and its manifestations. Storage and Retrieval techniques.
Cataloging 12.3 to 14.2 Seminar. Cataloging 2 -New check routines -Cataloging authorizations -Other innovations -Fix and expand routines -Floating keyboard.
Digital Volcanoes and Data Flows Carol Hamilton 1VALA 2012.
Starting Ongoing MARCIVE Authorities Processing Without Doing a Backfile: History File Creation + Notification + Overnight Authorities Joan Chapa and Carol.
Authority Control and Bib Enhancement with Marcive Mark Sandford William Paterson University
Zack Lane ReCAP Coordinator April 2011 ReCAP Columbia University.
A worldwide library cooperative OCLC Online Computer Library Center OCLC CJK Users Group 2007 Annual Meeting March 24, 2007, Boston David Whitehair, OCLC.
Building User Services with OCLC’s WorldCat Local Washington State University Libraries Al Cornish, Head of Library Systems Lihong Zhu, Head of Technical.
OCLC Reclamation Project Summer 2010 CCC June 3, 2010 rev. 8/05/2010 8/5/20101.
MARCIVE - An Overview Part one of an authority workshop presented September 2001 by: Jenifer Marquardt Assistant Authorities Librarian University of Georgia.
MARCIt records for e-journals project to implement MARCIt service McGill University Library Feb
Web Z: A Non-Programmers Perspective Sandy Card State University of New York at Binghamton March 23, 1999.
A & M Libraries Voyager Training Bulk Export, Import, and Prebulk Processing February 21, 2007 Co-ming Chan Oklahoma State University, STW.
Migration of Physical to Electronic (P2E) Resources in Alma
Loading Bibliographic Records Online and in Batch Pat Riva Romance Languages Cataloguer/ Bibliographic Database Specialist McGill University
Cataloging/Acquisitions Workflow William Rainey Harper College James Edstrom Michele Ukleja.
Georgia Fujikawa and Bob McQuillan Electronic Resource Management: Getting a Running Start on Your Implementation May , 2009.
Easing into EJournals in Aleph Susan Camille Pyzynski Librarian for Everything and Everyone And Mary Agnes Moynihan Manager, For That Thing Called the.
ILL Inter-Library Loan. Inter-Library Loan Overview The ILL module is for the management of Inter-library loans received and sent by Your library.
A& M Libraries Voyager Training Basic Cataloging February 21, 2007 Janet H. Ahrberg Oklahoma State University Library.
Vendor-Supplied Authority Control -- What Can the Vendor Deliver? What Still Needs to Be Done Locally?
Item Records – Everything One Needs to Know – well almost.
Lihong Zhu Interim Cataloging Manager/Monographic Cataloging Librarian Washington State University Libraries
Richard Wisneski OVGTSL Conference May  Kelvin Smith Library works primarily with Ingram/Coutts  Cataloging services are through SkyRiver  Integrated.
Single Bib Pilot Project Florida Library Association Conference April 8, 2010 Jean Phillips Florida Center for Library Automation.
SILO File Upload & Feedback System By Marie Harms State Library of Iowa August 18 & 19, 2010.
Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston.
Automating Cataloging Workflows with OCLC and Alma APIs
Assimilating Music Resources from one Library to another
MARC extensions Yoel Kortick | Senior Librarian
CAT FLAG Communication
Cleaning up the catalog: getting your data in order
Importing Serial Prediction Patterns Via the Service Import 85X records (Serial-52) Yoel Kortick.
Decisions, Decisions: How to Determine the Appropriate Method of Cataloging Special Collections in the 21st Century Presented by Patricia Falk, Music Catalog/Metadata.
Journal separation anxiety
Build Better Data: Best Practices for Catalog Cleanup CT Library Association, April 23, 2018 Diane Napert, Interim Director Monographic Processing Services,
Vendor Records What to do?
CSU Millennium to Alma migration
The Off-site Request Button: How it works and when it appears
The Off-site Request Button: How it works and when it appears
Designing and Using Normalization Rules
Onboarding Webinar 13 April 2019 Presented by and.
‘Splitting’ the MUSIC format
Presentation transcript:

Reconciling OCLC and Orbis Managing a Bibliographic and Holdings Synchronization Between Yale University Library and WorldCat Melissa A. Wisner

Purpose of Presentation Describing the process What is involved? Staffing required Timeframe Programming required Are We Done Yet? No!

Why do you want to come to this talk? For any size collection a reconciliation is a detail oriented project, planning, pre-processing, OCLC processing, dealing with returned data, maintaining the data Why do this? Living with your own standards—good or bad What is your database of record?

YUL Background Voyager ILS since 2002 Approximately 8.5 million bibliographic records Member of (former) RLIN OCLC Participant—add pcc records, create IR records in WorldCat, weekly holdings update, some cataloging directly in Connexion, ILL lender Early 00’s YUL did retrospective conversion with OCLC

Standard Workflow between Voyager and OCLC Weekly export to OCLC (staff flag records to send as needed) Sporadic OCLC Batch Matches over the years Local program to identify “candidate” records by encoding level and “UNCAT” status; send out to OCLC as separate project; filter and reload any 1, 4, or 7 el returned records and overlay the original Run LC Match once a month-similar process against local copy of LCDB

Arcadia Grant Cultural Knowledge grant March 2009-March 2013 $5 million/$1 million per year Cambodian Newspapers, Khmer Rouge Genocide documentation, African language materials and more… Layoffs and re-staffing

What Records to Send or Exclude? Divided up by locations for staff review Uncovered some data problems we knew about and didn’t know about…e.g. locations with no holdings in them; locations that still had holdings we thought had been migrated to new locations Most significant…outdated MARC tags, outdated format codes, practice different from OCLC, dual script records

What Records to Send or Exclude? Sending approximately 6.7 million out of 8.5 million bibliographic records as UTF-8 Excluding: MARCIVE E-resource records Suppressed bibs Unsuppressed bibs with suppressed holdings records In Process/On Order records UNCAT records**

Tracking our Records MySQL database created Bib IDs Exlcude Project ID (local tracking) OCLC Project IDs Reload Dates

Tracking our Records Used this to QA the results of the queries run to identify all potential records Used this to push out files of bib ids by OCLC project ID to be used later to extract correct records to send to OCLC Tracking was/is a big effort of reconciliation!

Tracking our Records As records are prepared for loading back into Voyager this MySQL database will be updated with those date(s) OCLC will produce crossref reports and other processing reports per each file, but these are not concatenated into any form of a relational database

Building an 079 Index in Voyager Ex Libris contracted to generate and update Voyager indexes Created in both Production and Test environments--took less then a day each time; downtime required; $ for service Added 079|a and 079|z left anchored indexes

Building an 079 Index in Voyager Updated SYSN Composite Index to include new 079 indexes: 019|a 035|a |z 079|a |z Indexes were mostly to assist staff in searching, but also for bulk import profiles for ongoing loads Exploring how to use the new indexes in ongoing EOD or e-resource loads from vendors

OCLC Pre-Processing OCLC IBM Mainframe limitations Sending records in 100MB limit/90,000 records per file AND only 15 files per day Separating records with 880s from those without Additionally, OCLC is splitting out PCC records from the YUS files

OCLC Pre-Processing Each set of files sent as a “project” with unique ID Creating label files, tracking via spreadsheet Suspended weekly exports to OCLC (9/5/ /20/2010**)

OCLC Pre-Processing Deleting YUL IR records in WorldCat Why? Easier matching? 5.7 million removed total EBScan software process Match routines set: Example: match on this field and that or ….

Cross Ref Reports and Stats Sample Adding in prefixes of ocm and ocn Other statistical reports

Loading OCLC Numbers back into Orbis Basic Process: Retrieve crossref report to be used as input Script to de-dupe crossref reports by name* Extract MARC record using Voyager API BIB_RETRIEVE and crossref as input MARC4Java Open Source to parse and update the MARC record* Remove any version of *OCoLC* *ocm* *ocn* in 035|a Insert new IR number from crossref report with a prefix of (OCoLC)

Loading OCLC Numbers back into Orbis Basic Process: Comparing 079|a to crossref report—if same, move on, if new, just add and move on, if different, update with new one and report out old one Remove any 079|z and report out Prepare new file of MARC records for bulk import Report out log summary of process, errors encountered, discrepancies in 079|a See handouts!

Loading OCLC Numbers back into Orbis Will also be our new permanent workflow post-reconciliation—maintenance of these control numbers! Cornell, Columbia and Stanford all used similar processes… Original hope was to load 250,000 records per day 4 days a week=estimated 6 weeks to reload everything back into Orbis…

Loading OCLC Numbers back into Orbis All depends on timing…OCLC process 80K records in 1-2 days for 6.7 million bibs it is 1.2 million/month or 2.4/month or 3 to 6 months total to process our data! We can keep pace with loading updated MARC data, but waiting 6 months is a big deal Need to keep 1 day a week for all other load activity in Orbis

Loading OCLC Numbers back into Orbis Run a keyword regen once a week—even though keyword index not being updated Program to extract and update MARC records can process 80K records in 15 minutes Bulk import run no-key takes 2 hours to load 80K records Minimize the loss of any staff changes

Handling Errors Reports from OCLC with no match records (validation errors) Correcting anything in OCLC? Correcting records in Voyager then re- submitting post-reclamation? See handouts!

Processing a “Gap” File Suspended weekly exports to OCLC 9/5/2010 Extracted a version of the bib record between 9/8/10 and 9/10/2010 Identify and extract all changes and new records from 9/8/10, that have an 079|a and the last operator in History is not OCLCRECON Send to OCLC as another one-off project

What Staff Will Do During Reconciliation No processing of holdings in OCLC ILL OK Will not create IR records so as not to affect matching Work in Orbis as normal otherwise

Modifications Needed to Resume Weekly Exports to OCLC Two file streams needed-one for archival materials and one for everything else PCC records will be split off once at OCLC YUM records split off once at OCLC New process/program created

Lessons Learned So Far Consistent application of standards across cataloging units (Suppressed, Suppressed!!!, In Process records, etc.) What is your database of record? How much time to spend on fixing records so they can be sent? Maintenance of the control numbers long term

Questions? Thank you!