Implementing a Faceted Search Framework Emily Lynema & Andrew K. Pace NC State University Libraries ASIS&T Seminar April 9, 2007.

Presentation transcript:


UNC School of Information & Library Science ASIS&T Student Chapter Seminar

Agenda
- The Context: Problem & motivation
- Local Implementation
- What and How?
- Challenges Encountered
- Outcomes
- Usage Statistics
- Future Opportunities

The Context

Online Catalogs
"Most integrated library systems, as they are currently configured and used, should be removed from public view." - Roy Tennant, CDL

What was the problem?
Existing catalogs are hard to use:
- known item searching works pretty well, but …
- users often do keyword searching on topics and get large result sets returned in system sort order
- catalogs are unforgiving on spelling errors, stemming

Catalog value is buried
- Subject headings are not leveraged in searching
  - they should be browsed or linked from, not searched
- Data from the item record is not leveraged
  - should be able to filter by item type, location, circulation status, popularity

What was the motivation?
- Unresponsive vendors ( )
- Some reading and writing
  - SUNY Buffalo XML OPAC (2004)
  - “My Kingdom for an OPAC” (Feb 2005)
- Some casual conversation (Jan 2005)
- Some formal conversation (Feb-June 2005)
- Organizational culture (all along)
- Fast implementation (July 2005-Jan 2006)

What’s the big picture?
- Improve the quality of the library catalog user experience
- Exploit our existing authority infrastructure (aka make MARC data work harder)
- Build a more flexible catalog tool that can be integrated with discovery tools of the future

What is Endeca?
- Software company based in Cambridge, MA
- Search and information access technology provider for a number of major e-commerce websites
- Developers of the Endeca Information Access Platform

Why Endeca?
- Customized relevance ranking of results
- Better subject access by leveraging available metadata (including item level data!) through facets
- Improved response time
- Enhanced natural language searching through spell correction, etc.
- Browse

Local Implementation

Demo

Relevance ranking
Based on locally customizable algorithm:
- Most relevant: query as entered
- For multi-term searches: phrase match
- Field match: title match more relevant than notes match
- Other factors:
  - number of fields matched
  - weighted frequency (tf/idf)
  - static ordering (publication date, circulation stats)
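The tiers on the relevance ranking slide can be sketched as a sort key. This is only an illustration, not Endeca's algorithm or NCSU's configuration: the `Record` class, the field names, and the weights in `FIELD_WEIGHTS` are assumptions, and the tf/idf factor is left out to keep the sketch short.

```python
from dataclasses import dataclass

# Hypothetical field weights: a title match outranks a notes match.
FIELD_WEIGHTS = {"title": 3, "subject": 2, "notes": 1}


@dataclass
class Record:
    title: str
    subject: str = ""
    notes: str = ""
    pub_year: int = 0


def rank_key(record, query):
    """Return a sort key; larger tuples rank higher."""
    q = query.lower().strip()
    terms = q.split()
    fields = {name: getattr(record, name).lower() for name in FIELD_WEIGHTS}
    # Tier 1: the query as entered appears as a phrase in some field.
    phrase = any(q in text for text in fields.values())
    # Tier 2: the best-weighted field in which every query term occurs.
    matched = [n for n, text in fields.items() if all(t in text for t in terms)]
    best_field = max((FIELD_WEIGHTS[n] for n in matched), default=0)
    # Tier 3: number of fields matched. Tier 4: static ordering by year.
    return (phrase, best_field, len(matched), record.pub_year)


def rank(records, query):
    """Sort records from most to least relevant for the query."""
    return sorted(records, key=lambda r: rank_key(r, query), reverse=True)
```

Because the key is a tuple, a later tier only breaks ties left by the earlier ones, which is one simple way to express "phrase match first, then field match, then other factors".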

Faceted navigation
- Combine search and browse in single interface (Guided Navigation™)
- Filter results across multiple facets
- Remove facets in any order

Facet refinements
- Availability
- Author
- Library
- Format
- Language
- New
- LC Classification
- Subject: Topic
- Subject: Genre
- Subject: Region
- Subject: Era
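A minimal sketch of how facet filtering, refinement counts, and order-independent removal fit together. It assumes each record is a plain dictionary mapping a facet name to a list of values; that data shape and the function names are illustrative and say nothing about how the MDEX Engine stores or evaluates facets.

```python
from collections import Counter


def apply_facets(records, selected):
    """Keep records that carry every selected (facet, value) pair."""
    return [
        r for r in records
        if all(value in r.get(facet, []) for facet, value in selected)
    ]


def facet_counts(records, facet):
    """Count how many records in the current result set carry each value."""
    counts = Counter()
    for r in records:
        counts.update(set(r.get(facet, [])))
    return counts


def remove_facet(selected, facet, value):
    """Drop one selection; selections can be removed in any order."""
    return [pair for pair in selected if pair != (facet, value)]
```

Since the selections are just a list of pairs that are all applied together, removing the first one chosen works the same as removing the last.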

Added search tools
- Automatic spell correction
- “Did you mean…” suggestions
- Automatic stemming
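The slides do not describe how Endeca produces its suggestions. As a stand-in, the idea behind a "Did you mean…" prompt can be sketched with Python's standard `difflib` module, comparing the search term against the vocabulary of indexed terms. The function name and the 0.8 similarity cutoff are assumptions.

```python
import difflib


def did_you_mean(term, vocabulary, cutoff=0.8):
    """Suggest the closest indexed term.

    Returns None when the term is already indexed or nothing is close enough.
    `vocabulary` is a list of lowercase indexed terms.
    """
    term = term.lower()
    if term in vocabulary:
        return None
    matches = difflib.get_close_matches(term, vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

For example, `did_you_mean("shakespear", ["shakespeare", "chemistry", "history"])` would suggest `"shakespeare"`, while a term with no near neighbour yields no suggestion.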

Implementation team
- Information Technology
  - Team chair and project manager
  - Technical lead
  - ILS Librarian
  - Technical manager
- Research and Information Services
  - Reference librarian
- Metadata and Cataloging
  - Cataloging librarian
- Digital Library Initiatives
  - Interface development

Implementation timeline
- License / negotiation: Spring 2005
- Acquire: Summer 2005
- Implementation:
  - August 2005: vendor training
  - September 2005: finalize requirements
  - October 2005 – January 2006: design and development
  - January 12, 2006: go-live date
It doesn’t have to be perfect!

The nitty gritty
- Endeca co-exists with SirsiDynix Unicorn ILS and Web2 online catalog
  - Endeca handles keyword search
  - Web2 handles authority search and detail page display
- Endeca indexes MARC records exported nightly from Unicorn
  - Index is refreshed nightly with records added/updated during previous day

Technical overview
[Diagram: Raw MARC data → NCSU exports and reformats → Flat text files → Data Foundry (parse text files) → Indices → MDEX Engine → HTTP → NCSU Web Application. The diagram appears on three consecutive slides, labeled in turn "Information Access Platform", "Offline - Nightly", and "Always Online".]

Challenges – System design
- Identifying appropriate facets
- Integrating 2 independent data systems
  - Unique identifiers are important!
- Designing the user interface
  - Search page
  - Results page

Too many boxes, lines, and shaded areas. Elements for a single record not visually grouped.

First version of results page wireframe (~8 total iterations). Ideas drawn from OPAC, RedLightGreen, Amazon, etc.

8th (and Final) Revision: Aggregate holdings information by library. Reduces complexity of continuing and online resources. Brief view vs. Full view gives user choice about displaying holdings.

Challenges - Data
- MARC data with MARC-8 encoding => Text data with UTF-8 encoding

Fun with MARC
- MARC → flat text file(s) for ingest by Endeca.
- Transformation accomplished with MARC4J.
- Opportunity to manipulate data on the back-end.
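The real transformation was done in Java with MARC4J, and the slides do not show its output format. The sketch below only illustrates the general idea in Python: already-parsed MARC fields are flattened into delimited text lines, with a small back-end cleanup along the way. The `PROPERTY_MAP` mapping, the `id|property|value` line layout, and the input shape are all hypothetical.

```python
# Hypothetical mapping from (MARC tag, subfield code) to a flat property name.
PROPERTY_MAP = {
    ("245", "a"): "Title",
    ("100", "a"): "Author",
    ("650", "a"): "Subject: Topic",
    ("650", "v"): "Subject: Genre",
    ("650", "z"): "Subject: Region",
    ("650", "y"): "Subject: Era",
}


def flatten(record_id, fields):
    """Turn parsed fields into 'id|property|value' lines.

    `fields` is a list of (tag, [(code, value), ...]) pairs, standing in
    for whatever a MARC parser would return.
    """
    lines = []
    for tag, subfields in fields:
        for code, value in subfields:
            prop = PROPERTY_MAP.get((tag, code))
            if prop:
                # Back-end cleanup: trim trailing spaces and " / : ; ," marks.
                lines.append(f"{record_id}|{prop}|{value.rstrip(' /:;,')}")
    return lines
```

Splitting the 650 subdivisions into separate properties is one way the Topic, Genre, Region, and Era facets could be fed from a single subject heading.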

Transformed data

Challenges - Data
- MARC data with MARC-8 encoding => Text data with UTF-8 encoding
- Data issues revealed by exposing metadata in facets
- Relevance ranking for bibliographic data

Maintenance
- Little ongoing work required after deployment
  - Quarterly data refresh from ILS
  - Version upgrades
- 6 member product team meets monthly
- Lots of development ideas (as time / library priorities afford)!
- Loosely coupled = making changes twice

Outcomes

Relevance
- Are search results in Endeca more likely to be relevant to a user’s query than search results in old OPAC?
- 100 topical user searches from 1 month in Fall 2005
- How many of top 5 results relevant?
  - 40% relevant in Web2 OPAC; 31 no hits
  - 68% relevant in Endeca catalog; 12 no hits
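The slide does not say exactly how the percentages were computed or how no-hit searches entered the average. One plausible reading, sketched here under that assumption, is a mean precision over the top 5 results, with no-hit searches counted separately. The function names and the input shape (one list of relevance judgments per search) are illustrative.

```python
def precision_at_k(judgments, k=5):
    """Fraction of the top-k results judged relevant for one search.

    `judgments` is a list of booleans in rank order; an empty list
    represents a no-hit search and yields None.
    """
    top = judgments[:k]
    return sum(top) / len(top) if top else None


def summarize(searches, k=5):
    """Return (mean precision over searches with hits, number of no-hit searches)."""
    scores = [precision_at_k(j, k) for j in searches]
    with_hits = [s for s in scores if s is not None]
    no_hits = len(scores) - len(with_hits)
    mean = sum(with_hits) / len(with_hits) if with_hits else 0.0
    return mean, no_hits
```

Running `summarize` once per catalog over the same set of logged searches would give a pair of figures comparable in form to the two lines on the slide.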

Usage statistics
[Five usage charts covering July 06 – Jan 07; one chart is labeled "19.4% Subj./Class".]

The Future

Future opportunities
- Integrate catalog w/other tools through web services
- Enrich catalog through external web services: book jackets, reviews, etc. – Amazon/OCLC
- Build cross-application shopping cart functionality

The catalog & web services
- Initial impetus – 2 requests
  - Can we have RSS feeds for the catalog?
  - Can we integrate catalog results into library website QuickSearch?
- Initial plan
  - Build RSS feeds and extend with OpenSearch for integration.
- Where did we end up?

Introducing CatalogWS
“A Web API for dynamically querying information from the NCSU Libraries Catalog”
- Generic XML layer provides same functionality as HTML interface
- REST web API: define HTTP GET requests via URL parameters
- Enables server-side user-defined XSL transformations
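To make the "HTTP GET requests via URL parameters" idea concrete, here is a small client-side sketch. The slides do not give the real CatalogWS endpoint, parameter names, or XML schema, so `BASE_URL`, the `query`, `count`, and `facet` parameters, and the `<results><item><title>` response layout are all placeholders.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical endpoint; the slides do not give the real one.
BASE_URL = "https://catalog.example.edu/catalogws/search"


def build_request(query, facets=None, count=10):
    """Build a GET URL; parameter names here are placeholders."""
    params = [("query", query), ("count", str(count))]
    for facet, value in (facets or []):
        params.append(("facet", f"{facet}:{value}"))
    return f"{BASE_URL}?{urlencode(params)}"


def titles_from_response(xml_text):
    """Pull titles out of a hypothetical <results><item><title> response."""
    root = ET.fromstring(xml_text)
    return [item.findtext("title") for item in root.iter("item")]
```

Because the whole request lives in the URL, the same call can back an RSS feed, a website search box, or a mobile page, each applying its own stylesheet to the XML.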

Why go there?
- More open access to the data available in our library catalog
- Core XML schema can be re-used and modified via stylesheets
- Enable other developers in the library to build applications using catalog data
- Reduce bottleneck (I don’t have to do everything)

RSS

QuickSearch

Mobile device searching

Thanks
NCSU project site:
Andrew K. Pace, Head, Information Technology
Emily Lynema, Systems Librarian for Digital Projects