Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and Searching of an Online Photograph Collection Michelle Dalmau,

Slides:



Advertisements
Similar presentations
Sam Hastings University of North Texas School of Library and Information Sciences User Input into Image Retrieval Design.
Advertisements

Online sheet music Jenn Riley Metadata Librarian Indiana University.
Software Development in the Digital Library Program Digital Library Brown Bag Tamara Cameron David Jiao Oct. 22, 2004.
A partnership of Truman Presidential Museum & Library, Truman Institute, and the MU Design Team at CTIE Project Whistlestop.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Sakaibrary in 2.4: User Feedback Guides Development Jon Dunn and Mark Notess Digital Library Program Indiana University.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
1 Pathways to Library Resources David Lindahl Director of Digital Library Initiatives Jeff Suszczynski Lead Developer.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Automated Reference Assistance: Reference for a New Generation Denise Troll Covey Associate University Librarian Carnegie Mellon CNI Meeting – April 2002.
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1 Pathways to Library Resources David Lindahl Director of Digital Library Initiatives Jeff Suszczynski Lead Developer.
A Registry for controlled vocabularies at the Library of Congress
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Image searching on the Web Qunyan Mao SIMS, UC Berkeley.
What difference a good tool? using Endeca for a faceted catalog Emily Lynema NCSU Libraries ACRL Delaware Valley Chapter Fall Program November 3, 2006.
Marty Harris aka TEXT QUERY SYSTEM Marty Harris Mgr TRD.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Improving the Catalogue Interface using Endeca Tito Sierra NCSU Libraries.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Next generation library catalogs and the integration of gazetteer information for geographical research Julie Sweetkind-Singer Assistant Director of Geospatial,
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
The Western Waters Digital Library: Building a Resource Through Multi- State Collaboration and Technology Dawn Paschal Assistant Dean, Digital Library.
OCLC Research OCLC Online Computer Library Center Academic Library Association of Ohio, Technical Services IG 19 May 2006 OHIONET, Columbus, Ohio Web Services.
An introduction to metadata in digital projects Jenn Riley Metadata Librarian L566 Fall 2006.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 8.
NARA’s New Authority Sources: Authority Files and Thesauri in ARC C. Jerry Simmons Authority Team Leader, Lifecycle Coordination Staff National Archives.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Searching Sheet Music: IN Harmony Final Report Stacy Kowalczyk Digital Library Program Brownbag Spring Series February 13, 2008.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
NCSU Libraries Andrew Pace & Emily Lynema NCSU Libraries May 24, 2006.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Jenn Riley Metadata Librarian Digital Library Program.
EVIA Digital Archive New Tools William G. Cowan Mike Durbin Digital Library Program EVIA Digital Archive DLP Brown Bag 20 September 2006.
Successes and Growing Pains: The Indiana University Digital Library Program Jenn Riley Metadata Librarian Indiana University Digital Library Program January.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
January 31, 2007 DLP Brownbag IN Harmony Brownbag Series January 31, 2007 Stacy Kowalczyk, Jenn Riley, Nikki Roberg.
PACSCL Consortial Survey Initiative Group Training Session February 12, 2008 at The Historical Society of Pennsylvania.
Introduction to metadata
Merging Metadata from Multiple Traditions: IN Harmony Sheet Music from Libraries and Museums Jenn Riley Metadata Librarian Indiana University Digital Library.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
UCSD Libraries Portal Project: Building a Database-Driven Web Content Management System Sharecase, 3/28/2001 Esmé Cowles and Laura Galvan-Estrada.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
A Multi-Tiered Architecture for Distributed Data Collection and Centralized Data Delivery Stacy Kowalczyk and James Halliday April 28, 2008.
Improving Description through Collaboration: The Ethnomusicological Video for Instruction & Analysis Digital Archive Music Library Association, February.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Challenges in the Nursery: Linking a Finding Aid with Online Content Elizabeth Johnson, Lilly Library Jenn Riley, Digital Library Program DL Brown Bag,
Search Strategies & Catalog Instruction Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science, University of Iowa.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
ALA Annual Meeting Claire Cocco Global Product Manager CONTENTdm Users Group June 30th, 2008.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
Implementing (parts of) FRAD in a FRBR-based discovery system Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Image Discovery & Access ACRL Image Resources Interest Group ALA Annual, Saturday, June 26, 2010 Nicole Finzer, Visual Resources Librarian, Digital Collections,
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
EVIA Digital Archive Technical Overview EVIA Digital Archive DLP Brown Bag: 7 December 2005.
7th Annual Hong Kong Innovative Users Group Meeting
Using computers to search electronic databases
Next-Generation Subject Access for Music: Infrastructure Needs
Introduction to Metadata
Imagining the next generation of online Magic Lantern materials
Metadata supported full-text search in a web archive
Presentation transcript:

Cushman Exposed! Exploiting Controlled Vocabularies to Enhance Browsing and Searching of an Online Photograph Collection Michelle Dalmau, Jenn Riley, IU Digital Library Program Brown Bag Series

Indiana University Digital Library Program Brown Bag, May 7, 2004 Overview Introduction Metadata Research Overview Usability Findings Browse and Search Specifications Implementation Lessons Learned

Indiana University Digital Library Program Brown Bag, May 7, 2004 The Cushman Collection Funded with an Institute of Museum & Library Services (IMLS) grant ~14,500 color slides taken between Held at the IU University Archives Site launched October 2003 and March 2004launched

Indiana University Digital Library Program Brown Bag, May 7, 2004 Looking Back U.S. Steel Gary Works Photograph Collection ~2,200 Images Archival descriptions Assigned subject terms from CV Subject field search requires referencing the A-Z list of subjectsA-Z list Usability studies revealed n ot using the CV’s syndetic structure impacts searching

Indiana University Digital Library Program Brown Bag, May 7, 2004 Metadata for Image Collections Advantages to “free-text” descriptions: Preserve photographer’s notations Resembles the user’s language Advantages to CV descriptors: More access points Collocation Disambiguation Interoperability

Indiana University Digital Library Program Brown Bag, May 7, 2004 Metadata for the Cushman Collection Cushman’s description in notebooks and slide mountsnotebooksslide mounts Dates Location Names TGM I – LC Thesaurus for Graphic Materials: Subject Terms TGM II - LC Thesaurus for Graphic Materials: Genre & Physical Characteristics TGN – Getty Thesaurus of Geographic Names

Indiana University Digital Library Program Brown Bag, May 7, 2004 TGN: Getty Thesaurus of Geographic Names Online browser available Online browser Data available for licensing for incorporating into a local system Current and historical place names Hierarchically organized Useful as research tool and as structured CV Cushman cataloging Cushman display

Indiana University Digital Library Program Brown Bag, May 7, 2004 TGM II: Genre and Physical Characteristics Terms Online and free downloadable versions available Online Contains over 600 terms Poly-hierarchically organized We only used 24 TGM II terms Multiple genres assigned when appropriate More appropriate than AAT for our generalist users Cushman cataloging Cushman display

Indiana University Digital Library Program Brown Bag, May 7, 2004 TGM I: Subject Terms Online and free downloadable versions available Online Contains over 6,300 terms Hierarchically organized Includes terms for what picture is OF (eg dogs) plus what picture is ABOUT (eg democracy) Cushman cataloging Cushman display

Indiana University Digital Library Program Brown Bag, May 7, 2004 TGM I: Subject Terms Strengths and Weaknesses Strengths include: Pre-defined relationships between concepts Some lead-in vocabulary Weaknesses include: Complete syndetic relationships lacking, especially for new terms Language not user-friendly Not enough lead-in vocabulary Form and number of top-level categories not useful for a browse structure

Indiana University Digital Library Program Brown Bag, May 7, 2004 Searching Image Collections: Research Shows Complement free-text with controlled vocabulary searching (Fidel, 1991) Image retrieval is heavily based on textual labels (Choi & Rassmussen, 2003) Query expansion methods based on the CV relationship structures can increase access (Greenberg, 2001/2002) Automatic Expansion: Synonyms and Narrower terms are good candidates for automatic retrieval Interactive Expansion: Broader, Narrower and Related terms are good candidates for user-directed, “manual” retrieval Search assistants are helpful (Harping, Getty, 1999) Integration of Getty vocabularies (“a.k.a” and ARThur)

Indiana University Digital Library Program Brown Bag, May 7, 2004 Browsing Image Collections: Research Shows Browsing is exploratory – it fosters new connections, innovative use of resources and the ability to easily pursue new paths (Bawden, 1993) Browsing is a significant part of image discovery (Choi & Rasmussen, 2002) Guided, flexible browsing in context works (Flamenco and SI Art Image Browser projects)Flamenco

Indiana University Digital Library Program Brown Bag, May 7, 2004 Usability Methods Group Walkthrough (prototype excerpt)prototype excerpt Paper-based tasks and prototype evaluation 4 participants (mostly librarians) Individual Walkthrough Interview and prototype evaluation 2 participants (faculty) Task Scenarios (prototype excerpt 1 & 2)12 On-site task-based testing (14 tasks) 12 participants (staff, students and faculty image users)

Indiana University Digital Library Program Brown Bag, May 7, 2004 Usability Findings Show Searching Referencing an A-Z list with no lead-in terms for searching is NOT helpful at all Concerns about word choice (US, USA or America?) Iterative reformulation of queries in context is desired Iterative reformulation Relevant suggestions are helpful Relevant suggestions

Indiana University Digital Library Program Brown Bag, May 7, 2004 Usability Findings Show Browsing Structure is important Contents should be easily exposed Flexible and combinatorial browsing is desired Browsing cultivates searching

Indiana University Digital Library Program Brown Bag, May 7, 2004 Implementation Specifications Search Mapping from lead-in vocabulary Retrieval of all records with narrower terms Integrated search against BOTH “free-text” descriptions and thesaurus Integrated search User-initiated broadening and narrowing Browse Year Genre Subjects (hierarchical) Access via assigned headings with ability to move up and down (pending user studies) Location (hierarchical) Combination of facets

Indiana University Digital Library Program Brown Bag, May 7, 2004 Implementation of the Cushman Web site Java using Java Servlet and Java Server Pages (JSP) HTML / CSS for interface display Oracle 9i, Release 2 databasedatabase Oracle Text Tomcat and Apache HTTP servers JPEG images served from file system (PURLS)

Indiana University Digital Library Program Brown Bag, May 7, 2004 Thesaurus-Enhanced Browsing & Searching: Oracle Text Link to existing thesaurus or define custom thesaurus Preferred terms Broader terms Narrower terms Related terms SQL syntax for using thesaurus to expand database query SQL syntax PL/SQL stored procedures for getting information from thesaurus itself PL/SQL stored procedures

Indiana University Digital Library Program Brown Bag, May 7, 2004 Challenges Using Oracle Text Preferred term matches multiple lead-in terms Crops USE Farming; USE Plants Phrase matching Military finds Military officers, Military uniforms, etc. Qualifiers Cranes vs. Cranes (Birds) Punctuation used in TGM terms

Indiana University Digital Library Program Brown Bag, May 7, 2004 Lessons Learned Approach to metadata needs to be well-planned and flexible Metadata quality control is essential Need more data on how people use images This stuff is HARD!

Indiana University Digital Library Program Brown Bag, May 7, 2004 But It’s Worth the Effort! Enhanced discovery Innovative implementation for a production-level collection People love the Cushman Collection!love

Indiana University Digital Library Program Brown Bag, May 7, 2004 Looking Forward Strive to make our collections truly accessible even if only incrementally Sustainability of the Cushman approach Defining functionality for future image repository for all of our collections

Indiana University Digital Library Program Brown Bag, May 7, 2004 References Bawden, D. (1993). Browsing; theory and practice. Perspective in information management, 3 (1): Choi, Youngok and Rasmussen, Edie M. (2002). Users’ relevance criteria in image retrieval in American history. Information Processing and Management, 38: Choi, Youngok and Rasmussen, Edie M. (2003). Searching for Images: The Analysis of Users’ Queries for Image Retrieval in American History. Journal of the American Society for Information Science and Technology, 54 (6): Fidel, Raya. (1991). Searcher’s selection of search keys: Controlled vocabulary or free-text searching. Journal of the American Society for Information Science, 42 (7):

Indiana University Digital Library Program Brown Bag, May 7, 2004 References (con’t) Greenberg, J. (2001). Optimal QE Processing Methods with Semantically Encoded Structured Thesauri Terminology. Journal of the American Society for Information Science and Technology, 52 (6): Harpring, Patricia. (1999). How forcible are right words!: Overview of applications and interfaces incorporating the Getty vocabularies. Archives & Museum Informatics: Hearst, Marti et al. (2002). Finding the flow in web site search. Communications, 45: University of California, Berkeley: Flamenco Project University of Michigan: SI Art Image Browser

Indiana University Digital Library Program Brown Bag, May 7, 2004 Shout Out! Thanks to the Cushman Team comprised of Archives and DLP members especially... Randall Floyd (Database Guru) David Jiao (Java Genius)