Automating Data Normalization and Clean-up.


Automated metadata enhancement teams

Building your Metadata Toolkit: MARC4J, OpenRefine

Dirty Data and the last mile problem
- Poor metadata doesn’t just affect search
- It complicates aggregations and collaborations like the DPLA
- It disincentivizes work to develop systems that make existing metadata more actionable
- It is a real impediment to innovative work being done in libraries and, more importantly, causes those outside of libraries to think hard about working with us

Working with MarcEdit
MarcEdit was created with three specific goals in mind:
- Place metadata decisions and clean-up in the hands of technical services professionals
- Provide easy-to-use, easy-to-learn tools that support both straightforward and advanced editing techniques
- Provide multiple methods to automate data clean-up, from point and click to user-generated scripts
Get Help:

Global Editing
Dedicated global editing functions (MARC agnostic):
- Add/Delete Fields with a wide range of conditional options
- Edit Field data
- Edit Indicators
- Edit Subfields
- Swap Data between fields
- General Replace function
- Generate Cutters
- Validate ISBNs
- Automatically Generate Call Numbers (LC or Dewey)
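To make the "Validate ISBNs" item concrete, here is a minimal Python sketch of the standard ISBN-10 and ISBN-13 check-digit tests. It is not MarcEdit's code; the function names (`clean_isbn`, `is_valid_isbn10`, `is_valid_isbn13`, `is_valid_isbn`) are hypothetical and chosen only for illustration.

```python
def clean_isbn(raw: str) -> str:
    """Strip hyphens and spaces and upper-case a trailing 'x'."""
    return raw.replace("-", "").replace(" ", "").upper()


def is_valid_isbn10(isbn: str) -> bool:
    """ISBN-10: weights 10..1, total must be divisible by 11; 'X' means 10."""
    if len(isbn) != 10 or not isbn.isascii():
        return False
    if not isbn[:9].isdigit() or isbn[9] not in "0123456789X":
        return False
    digits = [int(c) for c in isbn[:9]]
    digits.append(10 if isbn[9] == "X" else int(isbn[9]))
    total = sum(weight * d for weight, d in zip(range(10, 0, -1), digits))
    return total % 11 == 0


def is_valid_isbn13(isbn: str) -> bool:
    """ISBN-13: alternating weights 1 and 3, total must be divisible by 10."""
    if len(isbn) != 13 or not isbn.isascii() or not isbn.isdigit():
        return False
    total = sum(int(c) * (1 if i % 2 == 0 else 3) for i, c in enumerate(isbn))
    return total % 10 == 0


def is_valid_isbn(raw: str) -> bool:
    """Accept either form, with or without hyphens."""
    isbn = clean_isbn(raw)
    return is_valid_isbn10(isbn) or is_valid_isbn13(isbn)
```

A batch clean-up would typically run a check like this over every ISBN in a file and report the failures for review rather than delete them automatically.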

Global Editing
Specialized Edit Tools (MARC21 specific):
- RDA Helper
- Generate Links
- Case Management (to fix vendor titles in all upper case, etc.)
- Validation
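The case-management idea (fixing vendor titles supplied in all upper case) can be sketched in Python as below. This is an illustrative assumption about how such a fix might work, not MarcEdit's actual algorithm; `fix_upper_case_title` and the `keep_upper` acronym list are hypothetical.

```python
def fix_upper_case_title(title, keep_upper=frozenset({"USA", "DNA"})):
    """Convert an ALL-CAPS title to sentence case; leave other titles alone."""
    letters = [c for c in title if c.isalpha()]
    # Only touch titles whose letters are all upper case.
    if not letters or not all(c.isupper() for c in letters):
        return title

    fixed = []
    for i, word in enumerate(title.split(" ")):
        core = word.strip(".,;:!?()[]\"'")
        if core in keep_upper:
            fixed.append(word)               # known acronym: keep as supplied
        elif i == 0:
            fixed.append(word.capitalize())  # first word: initial capital
        else:
            fixed.append(word.lower())
    return " ".join(fixed)
```

Proper nouns still need review after a pass like this, which is why a short exception list (here `keep_upper`) is usually maintained alongside the rule.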

Automating Data Processing: MarcEdit Task Management
- MarcEdit’s tasks allow users to create “programs” that perform a sequence of edits
- Tasks can be shared, and linked to other tasks
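The task concept, a saved sequence of edits that can itself be linked into another task, can be modeled in a few lines of Python. This is a conceptual sketch only: the record is a simplified list of (tag, data) pairs rather than real MARC, and `delete_field`, `add_field`, `replace_text`, and `make_task` are hypothetical names, not MarcEdit functions.

```python
from typing import Callable, List, Tuple

Field = Tuple[str, str]            # (tag, field data): a simplified stand-in for a MARC field
Record = List[Field]
Edit = Callable[[Record], Record]  # every edit takes a record and returns a new one


def delete_field(tag: str) -> Edit:
    """Build an edit that drops every field with the given tag."""
    def edit(record: Record) -> Record:
        return [f for f in record if f[0] != tag]
    return edit


def add_field(tag: str, data: str) -> Edit:
    """Build an edit that adds a field and keeps fields in tag order."""
    def edit(record: Record) -> Record:
        return sorted(record + [(tag, data)], key=lambda f: f[0])
    return edit


def replace_text(tag: str, old: str, new: str) -> Edit:
    """Build an edit that replaces text inside fields with the given tag."""
    def edit(record: Record) -> Record:
        return [(t, d.replace(old, new)) if t == tag else (t, d) for t, d in record]
    return edit


def make_task(*steps: Edit) -> Edit:
    """A task runs its steps in order; it is itself an Edit, so tasks can be linked."""
    def task(record: Record) -> Record:
        for step in steps:
            record = step(record)
        return record
    return task
```

Because a task has the same shape as a single edit, a vendor-specific task such as `make_task(delete_field("999"), replace_text("856", "http://", "https://"))` can be passed as one step of a larger task.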

Building Scripts against the API
- MarcEdit’s processing engine is exposed via an API
- Template scripts can be generated via the Script Wizard (right)
- API documentation can be found at: marcedit-com-api-documents

Find what works for you


Contact Me: Terry Reese, Head of Digital Initiatives, University Libraries, 320F 18th Avenue Library, 175 West 18th Avenue, Columbus, OH