Kathryn Lybarger and Julene Jones May 20, 2010.  MARC = MAchine Readable Cataloging  File format for exchange of cataloging information  MARC has many.

Slides:



Advertisements
Similar presentations
E-books at CUNY LACUNY Cataloguing Roundtable November 5, 2009.
Advertisements

OCLC Online Computer Library Center OCLC Cataloging Update Connexion client 1.50 & more OCLC CJK Users Group Annual Meeting San Francisco, CA April 8,
4. Internet Programming ENG224 INFORMATION TECHNOLOGY – Part I
Batches, Buckets and Bookbags Elizabeth B. Thomsen NOBLE: North of Boston Library Exchange EGILS2014.
Kathryn Lybarger Fourth Friday February 22, 2013.
Getting Started with MarcEdit
MARCXML and MarcEdit Gordon Dunsire Depute Director Centre for Digital Library Research Presented to the seminar “Introduction to MARCXML”, National Library.
XHTML Basics.
Cataloging: Millennium Silver and Beyond Claudia Conrad Product Manager, Cataloging ALA Annual 2004.
Millennium Cataloging in Release 2005 Georgia Fujikawa Manager, Training Programs.
Linux+ Guide to Linux Certification, Second Edition
Guide To UNIX Using Linux Third Edition
ERIN STALBERG NCSU LIBRARIES SEPTEMBER 16, 2009 Cool Tools – More Connexion.
Fixed Fields Information Session 29 February 2012 Andrew Gloe Map Acquisitions & Cataloguing Team Australian Collections Management & Preservation Branch.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
Sage Library Consortium Cataloging-in-Publication MARC record conversion.
Advanced File Processing
Global Update with Confidence Mary M. Strouse Innovative Users Group May 19, 2009.
MarcEdit Basics and Beyond By Mary Aycock Head, Catalog Department Missouri University of Science and Technology MOBIUS 2012 Conference.
Chapter 6 Text and Multimedia Languages and Properties
JUNE 13-15, 2011  LANCASTER, PENNSYLVANIA Cataloging with MarcEdit Doreen Herold Lehigh University Symphony Sharon Scott Cumberland County Library System.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Unix Talk #2 (sed). 2 You have learned…  Regular expressions, grep, & egrep  grep & egrep are tools used to search for text in a file  AWK -- powerful.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Course Content - Chapter 2 Introduction to HTML Introduction to a Text Editor as a web authoring tool Instructional Activity: Creating a webpage using.
Linux+ Guide to Linux Certification, Third Edition
 Regular expressions are : › A language or syntax that lets you specify patterns for matching e.g. filenames or strings › Used to identify the files.
15.1 Fundamentals of HTML.
Kathryn ELUNA 2014 (Montreal) #ELUNA2014 May 1, 2014.
IS1811 Multimedia Development for Internet Applications Lecture 4: Introduction to HTML Rob Gleasure
Kathryn Lybarger ALA Midwinter Cataloging & Classification Research Interest Group January 22, 2012.
Writing macros and programs for Voyager cataloging Kathryn Lybarger ELUNA 2013 May 3, #ELUNA2013.
Week 3 Exploring Linux Filesystems. Objectives  Understand and navigate the Linux directory structure using relative and absolute pathnames  Describe.
Advanced File Processing. 2 Objectives Use the pipe operator to redirect the output of one command to another command Use the grep command to search for.
Chapter Five Advanced File Processing Guide To UNIX Using Linux Fourth Edition Chapter 5 Unix (34 slides)1 CTEC 110.
Kathryn Lybarger SLA Kentucky Chapter Program and Business Meeting November 2, 2012.
RDA Toolkit is an integrated, browser-based, online product that allow user to interact with a collection of cataloging-related documents and resources.
Operating Systems COMP 4850/CISG 5550 File Systems Files Dr. James Money.
Z-Books: Hunting Down Zombie Ebooks Hiding in your Catalog Kathryn OVGTSL 2013#ovgtsl2013 May 17, 2013.
Local Holdings Maintenance: The Basics. Agenda Defining Local Holdings Accessing Connexion Searching in Connexion Understanding an LHR Deriving LHR’s.
A worldwide library cooperative OCLC Online Computer Library Center OCLC CJK Users Group 2007 Annual Meeting March 24, 2007, Boston David Whitehair, OCLC.
Connexion Comparison Client or Browser? Fran Juergensmeyer Waukegan Public Library 2 nd Annual WILIUG Conference June 16, 2006 Cataloging from A (Authority)
Working with Forms and Regular Expressions Validating a Web Form with JavaScript.
Converting Millennium ILS Bibliographic records into Dublin- Core XML format for DSpace Alan Ng Hong Kong University Libraries PNC 2009 Annual Conference.
Kathryn EBUG 2014 (Morehead, Ky.) #EBUG2014 June 20, 2014.
Chapter Five Advanced File Processing. 2 Lesson A Selecting, Manipulating, and Formatting Information.
XP Tutorial 8 Adding Interactivity with ActionScript.
The physical parts of a computer are called hardware.
BATCH EDITING: software and regular expressions at the University of Kentucky Libraries Julene Jones ALA Catalog Management IG June.
Form Processing Week Four. Form Processing Concepts The principal tool used to process Web forms stored on UNIX servers is a CGI (Common Gateway Interface)
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Linux+ Guide to Linux Certification, Second Edition Chapter 4 Exploring Linux Filesystems.
ORAFACT Text Processing. ORAFACT Searching Inside Files grep - searches for patterns within files grep [options] [[-e] pattern] filename [...] -n shows.
Page 1 of 13 Beginner’s Tutorial – The Monalog Sanitizer What data does Monalog collect from you?  Monalog collects what you type on the command line.
The ___ is a global network of computer networks Internet.
NXT File System Just like we’re able to store multiple programs and sound files to the NXT, we can store text files that contain information we specify.
Creative Create Lists Elizabeth B. Thomsen Member Services Manager
Theory, Tools, History: A Brief Introduction August 17, 2016.
Regular Expressions Copyright Doug Maxwell (
Terry Reese Build your toolbox: In depth data manipulation with MarcEdit to prepare your data for the ANBD Terry Reese
Lesson 5-Exploring Utilities
Introduction to MarcEdit
MARCEdit TNUIG 2016.
Professional development training on cataloging at the University Wisconsin-Madison Memorial Library, USA 14th October -24th October, 2016 Aigerim Shurshenova.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
BIBFRAME at the Library of Congress
Cataloging Tips and Tricks
Guide To UNIX Using Linux Third Edition
MARC: Beyond the Basics 11/24/2018 (C) 2006, Tom Kaun.
Onboarding Webinar 13 April 2019 Presented by and.
Presentation transcript:

Kathryn Lybarger and Julene Jones May 20, 2010

 MARC = MAchine Readable Cataloging  File format for exchange of cataloging information  MARC has many advantages

 Created in 1960’s for Library of Congress  ANSI standard (1971)  ISO standard (1973)

 Large collection, only growing  OCLC has 183 million bibliographic records  New record added every ten seconds

 Bibliographic  Holdings  Authority

cam a a s2004 nyua b eng 010__ |a __ |a __ |a __ |a (OCoLC)ocm __ |a DLC |c DLC |d OCLCQ 049__ |a KUJY |a PS3545.E365 |b D |a 813/.52 | _ |a Webster, Jean, |d |a Daddy Long Legs ; |b and, Dear enemy / |c Jean Webster ; edited with an introduction and notes by Elaine Showalter.

_ ␢ (blank) # OPAC: OCLC Connexion Client (and actually in file): LC MARC Bibliographic Documentation: OCLC Bib Formats documentation:

|a ‡a ǂaǂa $a OPAC: OCLC Connexion Client: Voyager: OCLC Connexion Browser:

 Binary:  Decimal: 31  Hex: 1F

 MARC is often displayed as text  MARC is not plain text  You need a MARC editor to easily edit MARC

 Tags are stored in the directory of the file  Indicators and field contents are stored in the body of the record

   Field 001, 8 bytes long, starts at 0

 Fixed fields are always the same size  In a bibliographic record, there are always four bytes for illustration data:  ____ (four blanks): no illustrations  abcd : illustrations, maps, portraits, charts  abcd : illustrations, maps, portraits, charts, plans, plates and music

 Variable fields are not always the same size.  The title field may vary in length: The Iliad Alexander and the Terrible, Horrible, No Good, Very Bad Day

  $aThe nonsense novels ;$band, The dawn of Canadian history /$cby Stephen Leacock.  Field 245, 84 characters long, starts at position 203

 Variable fields make MARC flexible and compact  Some structural metadata is needed to make this happen  This book-keeping is usually (thankfully) invisible

 Binary characters represented as text  Data in a reasonable order  Structural metadata hidden

 Many MARC editors allow editing only one record at a time  You may want to do batch editing  You may want to do something nobody has ever considered before!

 How many records in my file?  Do they all have a field that I require?  Does that field contain what I require?  Are there any fields I don’t want?

 Remove a field in all records  Add a field to all records  Modify a field in all records  Swap data between fields  Change character encoding

 Create (one or more) holdings records based on bibliographic record content  Extract some fields to make an RSS feed of new books in your catalog

 Suite of tools for working with MARC  Developed by Terry Reese at Oregon State  Free download for Windows, Linux, Mac

 MarcBreaker – converts MARC to Mnemonic file format  MarcMaker – converts Mnemonic file format to MARC  MarcEditor – text editor for making common MARC edits  MARC Spy – hex editor, good for finding problems in corrupted MARC records

 “Regex” or “regexp”  A more general (and powerful) search or search-and-replace  A regular expression is a pattern which “matches” parts of your file

 grep  Powerful text editors: MarcEditor, vim, emacs  Programming languages: perl, php

 Expression: Mar  Matches: Mark Twain Steve Martin Telemarketing(case sensitive)

 Expression: ^Mar  Matches: Mark Twain  only this one Steve Martin Telemarketing

 Expression: tin  Matches: Mark Twain Steve Martin Telemarketing

 Expression: tin$  Matches: Mark Twain Steve Martin  only this one Telemarketing

 So how do you search for ^ or $ ?  Escape special characters with \ \$1matches$1.35 2\^matches2 + 2^4 = 18  So how do you search for \ ?

 You can modify searches just slightly for much more specific results  You can use multiple simpler expressions  You can ask other people (MARCEDIT-L)

^ $. + ? * ( ) [ ] { } | \

 You can search for one of several phrases (a|b|c|d)  Example: (Bob|John|Joe) Smith  Matches: Bob Smith  Does NOT match: Robert Smith

 Match not just one letter, but any of several  Surround with [ ]  Example:[BR]ob matches Bob, Rob, Robert Does NOT match: Toby, bobbing, robbery

 Match anything that is NOT in a specified list of characters  Surround with [^ ]  Example: [^aeiou]a matches scuba, Wildcat, dazzle, aardvark Does NOT match: each, toad, visual, antique

 A period matches any character  Example:..an  Matches:woman watchman  Does NOT match:man

 Find all subject headings with second indicator other than 0 or 2 ^=6...[^02]  Matches: =650 \7$aFilms$xMontage.$2 ram

 *any number of what it follows  ?0 or 1 of what it follows  +1 or more of what it follows  Example:.*

 *any number of what it follows  ?0 or 1 of what it follows  +1 or more of what it follows  Example: Joh?nathan

 Replace matched part with a static string OR  Capture parts of what you match with ( )  Use those captured parts in your replacement

 $1 – contents of first parentheses  $2 – contents of second parentheses  …  Search: (.*) (.*)  Replace:$2, $1  Bob Jones  Jones, Bob

 $0 – whole string matched  Example: make all URLs hyperlinks  Search: ]+  Replace: $0

 Make sure all 245 fields have $h [electronic resource] Search: ^=245.*\$h\[electronic resource\] Replace all: $0 This makes no real changes to your file, but lets you know how many matches it found

 Many books and websites  expressions.info/  Unix man pages: man perlretut