Metadata tools - working with MarcEdit and OpenRefine

Slides:



Advertisements
Similar presentations
E-books at CUNY LACUNY Cataloguing Roundtable November 5, 2009.
Advertisements

Creating a reading list in Moodle Learning Technologists, Centre for Learning Technology This work is licensed under a Creative Commons Attribution-ShareAlike.
Essential IT Skills - Video Tutorials Chisholm Institute Library Kim Bryce (Information Services Librarian) Colin Sutherland (Systems/Technical Services.
Easy, like an attachment. But can your doc stand on its own? Yes. Only teachers can upload files to course site. So definitely a push- tool. Maybe.
Getting Started with MarcEdit
SHARED COLLECTIONS, SHARED RECORDS? RESOURCE SHARING AT THE META-LEVEL Charley Pennell, NCSU - Natalie Sommerville, Duke TRLN Annual Meeting, 13 July 2012.
Elibrary.worldbank.org World Bank eLibrary User Guide Take full advantage of your eLibrary subscription!
MIT’s DSpace A good fit for ETDs Margret Branschofsky Keith Glavash MIT LIBRARIES.
Discovering Society: finding information November 2010 Ben Taylorson.
CS 255: Database System Principles slides: Variable length data and record By:- Arunesh Joshi( 107) Id: Cs257_107_ch13_13.7.
Database Design Concepts Info 1408 Lecture 2 An Introduction to Data Storage.
Introduction to databases Developed by Anna Feldman for the Association for Progressive Communications (APC)
Definitions Collaboration – working together on team projects and sharing information, often through ad-hoc processes, to accomplish project goals. Document.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
A centre of expertise in digital information managementwww.ukoln.ac.uk Podcasting: Transforming Society Or Overblown Hype? Brian Kelly UKOLN University.
Agenda Overview 2.What is SharePoint? 3.NCDOT Websites 4.Roles 5.Search 6.SharePoint Interface.
Global Update with Confidence Mary M. Strouse Innovative Users Group May 19, 2009.
MarcEdit Basics and Beyond By Mary Aycock Head, Catalog Department Missouri University of Science and Technology MOBIUS 2012 Conference.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
JUNE 13-15, 2011  LANCASTER, PENNSYLVANIA Cataloging with MarcEdit Doreen Herold Lehigh University Symphony Sharon Scott Cumberland County Library System.
1Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall. Exploring Microsoft Office Access 2010 by Robert Grauer, Keith Mast, and Mary Anne.
Vended Authority Control --Procedures and issues.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
My Pod or Yours? How Podcasting fits in with your library and your life Medical Library Association - Greater Midwestern Region Technology Forum 2006 Max.
A centre of expertise in digital information managementwww.ukoln.ac.uk Understanding And Exploiting Web 2.0: Podcasting Brian Kelly UKOLN University of.
Chapter 17 Creating a Database.
Moodle (Course Management Systems). Forums, Chats, and Messaging.
1 Chapter 2: Working with Data in a Project 2.1 Introduction to Tabular Data 2.2 Accessing Local Data 2.3 Accessing Remote Data 2.4 Importing Text Files.
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
Introduction to Database Tonga Institute of Higher Education NOS 215.
CCS – Mail Merge Mail Merge This presentation is incomplete without the associated discussion 1 Coloma Community Schools In-service 21 March 2014.
Mike Bolam Metadata Librarian Digital Scholarship Services University Library System //
Case study : creating a usable MARC file from a spreadsheet Thomas Meehan Head of Current Cataloguing UCL Library Services CILIP CIG Metadata.
1 Smart Searching Techniques Fall 2006 the Library.
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
Trial period demonstration Jenny Stephens, National Library of Australia July 2010, revised February 2011 This work is licensed under the Creative Commons.
Presentation Preparation Overview The format will be as follows for all of the sessions will be as follows – First third of the slot will be a presentation.
HTML Comprehensive Concepts and Techniques Second Edition Project 2 Creating a Web Site with Links.
The Cataloging Department  Creates and maintains the libraries’ online catalog of both physical and virtual collections  Describes, classifies, and.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
Using OpenRefine in Digital Collections: the Spencer Sheet Music Project Bruce J. Evans Cataloging & Metadata Unit Leader/Music and Fine Arts Catalog Librarian.
DRAFT Library Resources – Teaching and Learning Adapted from a presentation by Ruth Russell, NOTE: References to UCL have been replaced.
Database (Microsoft Access). Database A database is an organized collection of related data about a specific topic or purpose. Examples of databases include:
Normalizing Data for Migration Kyle Banerjee
E-books in the Catalog: Managing MARC Records in Batches Bonnie Figgatt Sacred Heart University Library April 15 & 16, 2011.
1 Midterm Examination. 2 General Observations Examination was too long! Most people submitted by .
... Using the OPAC of Koha. .. Features Search (simple, combination) Browse Filters Results view (normal, ISBD, MARC) Cross searching View linked content.
Terry Reese Build your toolbox: In depth data manipulation with MarcEdit to prepare your data for the ANBD Terry Reese
3. System Task Botton in Form (Uploader Function)
An Introduction to the Bibliographic Metadata Profile in Alma
Digital Stewardship Curriculum
ePartner Portal for A/C Managers -
Introduction to MarcEdit
Virtual Research Environments
Databases Chapter 16.
Managing Copyrights in Invenio
Create a data-connected Visio Services web part
Vendor Neutral Ebook Records from OCLC Collection Manager
Case Study: Fixing MARC data with MarcEdit and OpenRefine
Oracle Sales Cloud Sales campaign
System Analysis and Design
Vendor Records What to do?
Name authority control in an evolving landscape
OER Basics II Heather Dodge Kelsey Smith Head Librarian
This is the Sign In page for the Dashboard
Spreadsheets, Modelling & Databases
Getting Started Training Session for Students.
The Life-Changing Magic of OpenRefine
Mukurtu: Batch Upload, Roundtrip
Vancouver Public Library
Presentation transcript:

Metadata tools - working with MarcEdit and OpenRefine Owen Stephens CILIP Cataloguing and Indexing Group, 2015

These slides were developed by Owen Stephens (owen@ostephens.com). Using these slides These slides were developed by Owen Stephens (owen@ostephens.com). Unless otherwise stated, all images, audio or video content are separate works with their own licence, and should not be assumed to be CC-BY in their own right This work is licensed under a Creative Commons Attribution 4.0 International License http://creativecommons.org/licenses/by/4.0/. It is suggested when crediting this work, you include the phrase “Developed by Owen Stephens”

Programme 10:00-10:30 Introduction to MarcEdit and OpenRefine 10:30-11:00 Case study 1: MarcEdit and OpenRefine to fix MARC Records 11:00-11:15 Break 11:15-11:45 Case study 2: MarcEdit for eBook records 11:45-12:15 Case study 3: Creating a usable MARC file from a spreadsheet 12:15-12:45 Introduction to regular expressions 12:45-13:45 Lunch 13:45-14:45 Hands-on with MarcEdit 14:45-15:00 Hands-on with Open Refine part 1 15:00-15:15 Break 15:15-16:00 Hands-on with Open Refine part 2

a tool for working with MARC records MarcEdit is… a tool for working with MARC records

MarcEdit can help when… You want to create MARC records from some other format You want to convert MARC records to another format You want to make an edit to a MARC record You want to make a known set of edits to many MARC records

MarcEdit can help you… You want to automate aspects of a cataloguing workflow You want to report on errors or issues with MARC records You want to analyse a set of MARC records and more…

For example… Create MARC records from a csv file or spreadsheet Modify URLs in 856 fields to include proxy server information (e.g. EZProxy) Add/remove local fields from a large number of MARC records in one go Modify externally supplied MARC records to fit local cataloguing practice

Getting help Use the Help function in MarcEdit Email list: http://listserv.gmu.edu/cgi- bin/wa?A0=marcedit-l Ask Terry! @reese_terry Email address for questions from http://marcedit.reeset.net/help

“a tool for working with messy data” OpenRefine is… “a tool for working with messy data” OpenRefine is described as a tool for working with ‘messy’ data - but what does this mean? It is probably easiest to describe the kinds of data OpenRefine is good at working with and the sorts of problems it can help you solve. http://openrefine.org

OpenRefine can help when… you have data in a simple tabular format there are inconsistencies in how the data is formatted there are inconsistencies in where data appears there are inconsistencies in terminology used in the data OpenRefine is most useful where you have data in a simple tabular format but with internal inconsistencies either in data formats, or where data appears, or in terminology used. It can help you:

OpenRefine can help you… Get an overview of a data set Resolve inconsistencies in a data set Help you split data up into more granular parts Match local data up to other data sets Enhance a data set with data from other sources These are some of the things OpenRefine can help you with. Some common scenarios might be: 1. Where you want to know how many times a particular value appears in a column in your data 2. Where you want to know how values are distributed across your whole data set

For example… Data you have Desired data 1st January 2014 2014-01-01 01/01/2014 Jan 1 2014 Where you have a list of dates which are formatted in different ways, and want to change all the dates in the list to a single common date format:

For example… Data you have Desired data London London] London,] london Where you have a list of names or terms that differ from each other but refer to the same people, places or concepts:

For example… Data you have Desired data Institution Library name Address 1 Address 2 Town/City Region Country Postcode University of Wales, Llyfrgell Thomas Parry Library, Llanbadarn Fawr, ABERYSTWYTH, Ceredigion, SY23 3AS, United Kingdom University of Wales Llyfrgell Thomas Parry Library Llanbadarn Fawr Aberystwyth Ceredigion United Kingdom SY23 3AS University of Aberdeen, Queen Mother Library, Meston Walk, ABERDEEN, AB24 3UE, United Kingdom University of Abderdeen Queen Mother Library Meston Walk Aberdeen AB24 3UE University of Birmingham, Barnes Library, Medical School, Edgbaston, BIRMINGHAM, West Midlands, B15 2TT, United Kingdom University of Birmingham Barnes Library Medical School Edgbaston Birmingham West Midlands B15 2TT University of Warwick, Library, Gibbett Hill Road, COVENTRY, CV4 7AL, United Kingdom University of Warwick Library Gibbett Hill Road Coventry CV4 7AL Where you have several bits of data combined together in a single column, and you want to separate them out into individual bits of data with one column for each bit of the data:

For example… Data you have Date of Birth from VIAF (Virtual International Authority File) Date of Death from VIAF (Virtual International Authority File) Braddon, M. E. (Mary Elizabeth) 1835 1915 Rossetti, William Michael 1829 1919 Prest, Thomas Peckett 1810 1879 Where you want to add to your data from an external data source - in this example starting with information about authors, and adding dates of birth/death from the Virtual International Authority File

Getting help The OpenRefine Wiki https://github.com/OpenRefine/OpenRefine/wiki The ‘Free your metadata’ site http://freeyourmetadata.org/ and book http://book.freeyourmetadata.org The OpenRefine mailing list and forum http://groups.google.com/d/forum/openrefine