BATCH EDITING: software and regular expressions at the University of Kentucky Libraries Julene Jones ALA Catalog Management IG June.

Slides:



Advertisements
Similar presentations
John Espley and Robert Pillow ALA New Orleans 26 June 2011 The RDA Sandbox and RDA Implementation Scenario One.
Advertisements

Creative Create Lists Elizabeth B. Thomsen Member Services Manager North of Boston Library Exchange
OCLC Online Computer Library Center Connexion Client 1.30 for Multiscripts Cataloging CJK User Group Meeting, Chicago April 2, 2005 David Whitehair and.
Voyager Extracts and Batch Editing for Catalogers Ex Libris Northeast User Group 2010 Conference Mark Sandford Special Formats Cataloger William Paterson.
Getting Started with MarcEdit
Searching using regular expressions. A regular expression is also a ‘special text string’ for describing a search pattern. Regular expressions define.
Regular Expressions in Perl By Josue Vazquez. What are Regular Expressions? A template that either matches or doesn’t match a given string. Often called.
Kathryn Lybarger and Julene Jones May 20,  MARC = MAchine Readable Cataloging  File format for exchange of cataloging information  MARC has many.
Guiding Interface Design with Ethnographic Methods ALA Annual 2013 Usability, the User Experience & Interface Design The University of Chicago Library.
Voyager Interest Group Locations and Statuses February 27, 2008.
PERL Part 3 1.Subroutines 2.Pattern matching and regular expressions.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
Batch-conversion of Non-standard Multiscript Records by XSLT Lucas Mak Metadata and Catalog Librarian Michigan State University Catalog Management Interest.
Sage Library Consortium Cataloging-in-Publication MARC record conversion.
Regex Wildcards on steroids. Regular Expressions You’ve likely used the wildcard in windows search or coding (*), regular expressions take this to the.
Global Update with Confidence Mary M. Strouse Innovative Users Group May 19, 2009.
MarcEdit Basics and Beyond By Mary Aycock Head, Catalog Department Missouri University of Science and Technology MOBIUS 2012 Conference.
Overview of the grep Command Alex Dukhovny CS 265 Spring 2011.
Regular Expression Darby Tien-Hao Chang (a.k.a. dirty) Department of Electrical Engineering, National Cheng Kung University.
Pattern matching with regular expressions A common file processing requirement is to match strings within the file to a standard form, e.g. address.
Programming Perl in UNIX Course Number : CIT 370 Week 4 Prof. Daniel Chen.
JUNE 13-15, 2011  LANCASTER, PENNSYLVANIA Cataloging with MarcEdit Doreen Herold Lehigh University Symphony Sharon Scott Cumberland County Library System.
Writing macros and programs for Voyager cataloging Kathryn Lybarger ELUNA 2013 May 3, #ELUNA2013.
Regular Expressions Regular expressions are a language for string patterns. RegEx is integral to many programming languages:  Perl  Python  Javascript.
Perl and Regular Expressions Regular Expressions are available as part of the programming languages Java, JScript, Visual Basic and VBScript, JavaScript,
Robert Pillow, VTLS Inc. How Will RDA Impact Your System? A Forum of Vendors Discussing Implementation Plans Association for Library Collections & Technical.
Introduction to Regular Expressions Ben Brumfield THATCamp Texas 2011.
Searching Voyager: #2: Finding a Book by Its Title Zale Library at Paul Quinn College David Hamrick, 2012 “Now, voyager, sail thou forth to seek and find…”
Post-Module JavaScript BTM 395: Internet Programming.
Regular Expressions – An Overview Regular expressions are a way to describe a set of strings based on common characteristics shared by each string in.
Regular Expressions in PHP. Supported RE’s The most important set of regex functions start with preg. These functions are a PHP wrapper around the PCRE.
Instructor: Craig Duckett Lecture 08: Thursday, October 22 nd, 2015 Patterns, Order of Evaluation, Concatenation, Substrings, Trim, Position 1 BIT275:
Connexion Comparison Client or Browser? Fran Juergensmeyer Waukegan Public Library 2 nd Annual WILIUG Conference June 16, 2006 Cataloging from A (Authority)
Regular Expressions. Overview Regular expressions allow you to do complex searches within text documents. Examples: Search 8-K filings for restatements.
Module 6 – Generics Module 7 – Regular Expressions.
Batch-Load Points Counter (MARCEdit project) Amelia C. VanGundy The University of Virginia’s College at Wise Virginia SirsiDynix Library Users Group Meeting.
©Brooks/Cole, 2001 Chapter 9 Regular Expressions.
David Whitehair Product Manager OCLC Cataloging and Metadata Services Future Direction of Technical Processing Committee on Technical Processing, CEAL.
R EGULAR E XPRESSION IN P ERL (P ART 1) Thach Nguyen.
Regular Expressions CS 2204 Class meeting 6 Created by Doug Bowman, 2001 Modified by Mir Farooq Ali, 2002.
CIT 383: Administrative ScriptingSlide #1 CIT 383: Administrative Scripting Regular Expressions.
Unit 11 –Reglar Expressions Instructor: Brent Presley.
Prof. Alfred J Bird, Ph.D., NBCT Office – McCormick 3rd floor 607.
What are Regular Expressions?What are Regular Expressions?  Pattern to match text  Consists of two parts, atoms and operators  Atoms specifies what.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. ADVANCED.
Introduction to Programming the WWW I CMSC Winter 2004 Lecture 13.
OOP Tirgul 11. What We’ll Be Seeing Today  Regular Expressions Basics  Doing it in Java  Advanced Regular Expressions  Summary 2.
Catalogs, MARC and other metadata Kathryn Lybarger March 25, 2009.
Editing Tons of Text? RegEx to the Rescue! Eric Cressey Senior UX Content Writer Symantec Corporation.
Regular Expressions In Javascript cosc What Do They Do? Does pattern matching on text We use the term “string” to indicate the text that the regular.
Image from: Catalog Management in the Cloud: Two Years In Catalog Management IG Meeting, ALA Annual June 25, 2016 Stacie Traill and Betsy.
Creative Create Lists Elizabeth B. Thomsen Member Services Manager
Terry Reese Build your toolbox: In depth data manipulation with MarcEdit to prepare your data for the ANBD Terry Reese
RE Tutorial.
Regular Expressions ICCM 2017
MARCEdit TNUIG 2016.
Pause for Cleanup: MarcEdit in Real Life
Regular expressions for tech services
Advanced Find and Replace with Regular Expressions
Doug Williams, Campbell County Public Library, September 22, 2017
Software Support Features
Regular Expressions
Lesson 3: Find and Replace Tools
CPSC 203 Library session Jennifer Lee
CIT 383: Administrative Scripting
Onboarding Webinar 13 April 2019 Presented by and.
PolyAnalyst Web Report Training
A Metadata “Connexion” from SharePoint to WorldCat
ADVANCE FIND & REPLACE WITH REGULAR EXPRESSIONS
Perl Regular Expressions – Part 1
Presentation transcript:

BATCH EDITING: software and regular expressions at the University of Kentucky Libraries Julene Jones ALA Catalog Management IG June 2013

Ensuring quality metadata Catalog –Verify data against item, one-by-one Database management systems (DBMS) – Microsoft Office Access Batch editing –Macro programs: MacroExpress and AutoHotKey –MARC editor: MarcEdit –Voyager client: Global Data Change Support regular expressions!

MacroExpress available from

DBMS: Access - GUI

Marc Editor: MarcEdit

Marc Editor: Global Data Change

Regular Expressions “regex” A more general (and powerful!) search or find-and-replace function Searches for patterns of characters in data

Standard search Lots of standard searches are also regex Expression: Wil Matches: 3 William Faulkner Tennessee Williams twill

Regex: Anchors Expression: ^Wil: find what begins with “Wil” Matches: 1 William Faulkner  only matches this one Tennessee Williams twill

Regex: Anchors Expression: ill$ : find what ends with “ill” Matches: 1 William Faulkner Tennessee Williams twill  only matches this one

Special characters Metacharacters: [ \ ^ $. | ? * + ( ) Search for these by escaping them; use \ \$6 matches $650 2\^ matches 3 + 2^3 So how do you search for \ ?

Search for one of a string (a | b | c | d) : find a or b or c or d Example: (Bob | John | Dave) Smith Matches: Bob Smith Does NOT match: Robert Smith or David Smith

Search for any character To match any of several characters, use [ ] Example: [BR]ob(is case sensitive) Matches: Bob, Rob, Robert Does NOT match: Jacob, Job, Hobbes, lobster, cobbler, strobe, or noble

Search for not these characters use [^ ] : find anything other than bracketed Example: [^aeiou]a Matches: Chicago, library, cards, staff, travel, information, program, workplace Does NOT match: annual, early, colleague, area, specialist, goal

Match any character, repetitions. matches any charactergr.y * matches any number.* finds everything of what it follows ? matches 0 or 1 or what it followscolou?r + matches 1 or more of what it follows

A handy regex Find all subject headings with a second indicator other than 0 or 2 ^=6...[^02] Matches: =650 \4$aElectronic books =650 \6$aLittérature populaire$xHistoire et critique. =655 \7$aTourist maps.$2lcgft

Replacement strings Capture strings using ( ) Rearrange or replace them by using $0, $1, $2, etc. $1 contents of first parentheses $2 contents of second parentheses … Search (.*)(.*)Bob Smith Replace $2, $1Smith, Bob

Replacement strings Prepend a phrase by using $0 Example: add J before a call number Replace with J $0 QB641.R  J QB641.R

For more information: MacroExpress: and MarcEdit: and its listserv, Voyager Global Data Change: Regular Expressions: and the MarcEdit listserv

Thanks! Julene Jones