Using OpenRefine in Digital Collections: the Spencer Sheet Music Project Bruce J. Evans Cataloging & Metadata Unit Leader/Music and Fine Arts Catalog Librarian.

Presentation transcript:

Using OpenRefine in Digital Collections: the Spencer Sheet Music Project
Bruce J. Evans, Cataloging & Metadata Unit Leader/Music and Fine Arts Catalog Librarian, Baylor University
Kara Long, Metadata & Catalog Librarian, Baylor University

Frances G. Spencer Collection of American Sheet Music

Cataloging & Metadata Overview

Card Catalog; MARC Record; Dublin Core Metadata & digital object

OpenRefine
Interactive Data Transformation tool (IDT)
-Interactive like a spreadsheet, but more powerful
-Programmable like a database, but more exploratory
-Open source
-Runs locally in your browser
But what can it do?
-Import and export data
-Facet data
-Transform data
-Reconcile data to outside data sets

Importing the data and creating a new project…
-Rename and re-order MARC fields
-Join fields where data is separated
-Separate fields where data is joined
-Re-format dates
-Remove unnecessary punctuation
-Add fields that are required in the digital collection
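As an illustration of the "remove unnecessary punctuation" step, here is a minimal Python sketch. It is not the transformation used in the project (which was done inside OpenRefine); the function name and the set of characters treated as unnecessary are assumptions. It strips only trailing separator punctuation of the kind that often ends a MARC subfield, and deliberately leaves periods alone so that abbreviations are not damaged.

```python
import re

# Assumption: "unnecessary" punctuation means a trailing slash, colon,
# semicolon, or comma (with optional surrounding spaces) at the end of a value.
TRAILING_SEPARATORS = re.compile(r"\s*[/:;,]+\s*$")


def strip_trailing_punctuation(value: str) -> str:
    """Return the value without trailing separator punctuation."""
    return TRAILING_SEPARATORS.sub("", value).strip()
```

Punctuation in the middle of a value is untouched, so a title with an internal colon keeps it.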

Renaming Columns
MARC 100 → Composer
Columns are the primary units of interaction. The drop-down menu of functions at the column level allows us to rename, reorder, or transform columns. Column names must exactly match our CONTENTdm (CDM) field names in order to upload the metadata.

Re-ordering Columns
Columns must also exactly match the order in which the corresponding fields appear in our CONTENTdm collection. Once all the fields have been renamed, they can be re-ordered under the All columns menu.
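The renaming and re-ordering requirement can be sketched outside OpenRefine as well. The Python below is a rough illustration, not the project's workflow: only the MARC 100 to Composer pairing comes from the slides, while the other mapping entries, the field order, and the function name are hypothetical. It renames the keys of one record and then fails loudly if the result does not match the target field list exactly.

```python
# Only "100" -> "Composer" is taken from the slides; the rest is hypothetical.
FIELD_MAP = {"100": "Composer", "245": "Title", "260$c": "Date"}

# Hypothetical field order of the target CONTENTdm collection.
CDM_ORDER = ["Title", "Composer", "Date"]


def rename_and_reorder(row: dict, field_map: dict, order: list) -> dict:
    """Rename the keys of one record and return them in the target order."""
    renamed = {field_map.get(key, key): value for key, value in row.items()}
    missing = [name for name in order if name not in renamed]
    extra = [name for name in renamed if name not in order]
    if missing or extra:
        # Names must match the target fields exactly, so stop on any mismatch.
        raise ValueError(f"missing fields: {missing}; unexpected fields: {extra}")
    return {name: renamed[name] for name in order}
```

Raising an error on a mismatch mirrors the constraint described above: an upload expects every column name and position to line up with the collection's fields.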

Joining Values
Joining the 245$a and 245$b to create a Title field.
Transform data with Google Refine Expression Language (GREL).
[Screenshot label: Expected value]
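The slides perform this join with a GREL expression inside OpenRefine. As a rough Python analogue (not the expression shown in the presentation), the sketch below joins the two subfield values with a single space and skips a subfield that is empty or absent. The function name and the choice of separator are assumptions.

```python
from typing import Optional


def join_title(subfield_a: Optional[str], subfield_b: Optional[str], sep: str = " ") -> str:
    """Join 245$a and 245$b into one Title value, ignoring empty parts."""
    parts = [part.strip() for part in (subfield_a, subfield_b) if part and part.strip()]
    return sep.join(parts)
```

Skipping empty parts avoids a dangling separator when a record has no 245$b.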

Adding a New Column
Adding a column based on an existing column. Values from the 260$c populate the new Date Search field, with unnecessary data removed.
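One way to picture the "unnecessary data removed" step is to pull a four-digit year out of a 260$c value. The Python sketch below is an assumption about what the clean-up might look like, not the authors' transformation: the function name, the year range accepted, and the sample values in the usage note are all hypothetical.

```python
import re

# Assumption: the searchable date is the first four-digit year between
# 1500 and 2099 found in the 260$c value.
YEAR_PATTERN = re.compile(r"(1[5-9]\d{2}|20\d{2})")


def extract_year(date_value: str) -> str:
    """Return the first four-digit year in the value, or "" if none is found."""
    match = YEAR_PATTERN.search(date_value or "")
    return match.group(1) if match else ""
```

With this sketch, a value such as "[c1898]" yields "1898", and a value with no year, such as "[n.d.]", yields an empty string that can be reviewed by hand.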

Splitting Values
246 must be split into two or three separate fields:
-Alternative Title
-First line of verse
-First line of chorus

Splitting Values
The value in the First Line of Verse field always begins with the same phrase, “First line of text.” To create a new column containing only this portion, split the 246 value on the semicolon, then filter the resulting values by the leading phrase. The same method will also isolate the First Line of Chorus values. Using “not” as a Boolean operator will isolate the Alternative Title values, since they begin with neither phrase.
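The split-and-filter logic described here can be sketched in Python as follows. This is an illustration only, not the facet and filter steps used in OpenRefine. The verse phrase is the one quoted in the slides; the chorus phrase, the function name, and the output keys are assumptions.

```python
VERSE_PREFIX = "First line of text"      # phrase quoted in the slides
CHORUS_PREFIX = "First line of chorus"   # assumed; not given in the slides


def split_246(value: str) -> dict:
    """Split a 246 value on semicolons and sort the parts by leading phrase."""
    parts = [part.strip() for part in value.split(";") if part.strip()]
    verse = [p for p in parts if p.startswith(VERSE_PREFIX)]
    chorus = [p for p in parts if p.startswith(CHORUS_PREFIX)]
    # The "not" filter: whatever starts with neither phrase is an alternative title.
    alternative = [p for p in parts if not p.startswith((VERSE_PREFIX, CHORUS_PREFIX))]
    return {
        "Alternative Title": alternative,
        "First line of verse": verse,
        "First line of chorus": chorus,
    }
```

Each output list may be empty, which matches the note that a 246 splits into two or three fields depending on the record.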

Isn’t it all a little tedious?
Extract the Operation History to automate your data transformation and clean up. Extract and save! Apply to new data sets that need the same kind of clean up.
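An extracted operation history is a JSON list of steps, so a saved copy can be inspected before it is applied to a new data set. The Python sketch below assumes the usual shape of such an export (one object per step with an "op" key); the three sample steps are hypothetical, apart from the MARC 100 to Composer rename mentioned earlier, and the exact keys should be checked against a real export.

```python
import json
from collections import Counter

# Hypothetical excerpt of a saved operation history; check the keys
# against an actual export before relying on them.
HISTORY_JSON = """
[
  {"op": "core/column-rename", "oldColumnName": "100", "newColumnName": "Composer"},
  {"op": "core/column-rename", "oldColumnName": "260$c", "newColumnName": "Date"},
  {"op": "core/column-move", "columnName": "Composer", "index": 1}
]
"""


def summarize_history(history_text: str) -> Counter:
    """Count how many times each kind of operation appears in a saved history."""
    operations = json.loads(history_text)
    return Counter(step["op"] for step in operations)
```

A quick count like this makes it easier to confirm that a saved history contains the renames and moves you expect before pasting it into a new project.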

Invaluable resources
-Verborgh, Ruben, and Max De Wilde. Using OpenRefine. Birmingham: PACKT Publishing.
-Van Hooland, Seth, and Ruben Verborgh. Linked Data for Libraries, Archives, and Museums: How to Clean, Link, and Publish Your Metadata. Chicago: Neal-Schuman, 2014.