Metadata Working Group Forum Cornell 2008-05-16 1 Metadata Normalization A Case Study in Primo -- and -- Linked Open Data In Libraries.

Slides:



Advertisements
Similar presentations
Metasearching: The Problem, Promise, Principles, Possibilities & Perils Roy Tennant California Digital Library.
Advertisements

Primo v.3 Highlights June What’s new in v. 3? Renewed user interface Changes to how resources are delivered to the user New searching and sorting.
2009 Annual ASERL Membership Meeting Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library
ICOLC October 4, 2001 OCLC Services. Purpose Libraries’ web-based information portal needs –Maximize consortia’s role in their members’ use of database.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Module 6: Preparing for RDA... Library of Congress RDA Seminar, University of Florence, May 29-June 2, 2011.
Opening the Door: using Endeca for a faceted catalog Emily Lynema NCSU Libraries MLC: Discovery & Access March 2, 2007.
OCLC Online Computer Library Center WorldCat Discovery to Delivery Jennifer Pearson Global Market Solutions OCLC
Next Generation OPAC Technologies and NEOS Looking into the Future Kenton Good, Web Development Librarian, University of Alberta Libraries Dan Mirau, Library.
EXtensible Catalog XC Drupal Toolkit. XC Software Overview User Interface for searching and browsing Library Website (on Drupal) VoyagerUR Research XC.
Rethinking the library catalogue: making search work for the library user Sally Chambers The European Library
Library as Community: reviews, ratings, feeds and the future Dinah Sanders, Product Manager.
Searching Without a Net:
Cambridge University Library New interfaces The future of the OPAC at Cambridge Ed Chamberlain – Systems Librarian, University Library.
Project Update David Lindahl University of Rochester Libraries.
AGent Demonstration Multi-Tier Solution Presented by Auto-Graphics Pomona, CA December 8-9, 2003 Version 2.0.
What difference a good tool? using Endeca for a faceted catalog Emily Lynema NCSU Libraries ACRL Delaware Valley Chapter Fall Program November 3, 2006.
The National Library of New Zealand (Te Puna Matauranga o Aotearoa) & OCLC established a Partnering Agreement for the supply of bibliographic services.
Why create a Gandhara What is it expected to do that the library catalog is not doing? What other benefits can it offer to users? Think of Gandhara as.
1  Ex Libris Ltd., Internal and Confidential Ex Libris Primo Sofia July 2013 Roman Piontek Key-Account Manager.
Envisioning an “eXtensible” Future Opportunities presented by the eXtensible Catalog (XC) Project Jennifer Bowen University of Rochester ACRL NY Annual.
UNDERSTANDING THE NEW DISCOVERY LANDSCAPE: Federated Search, Web-scale Discovery, Next- Generation Catalog and the rest Marshall Breeding Director for.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
Connecting users to Collections Collection Development/Resource Sharing Conference March 26, 2009 Jean Phillips Florida Center for Library Automation
Improving the Catalogue Interface using Endeca Tito Sierra NCSU Libraries.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
John Helmer Executive Director, Orbis Cascade Alliance Paul Cappuzzello Senior Library Services Consultant Cheryl Snowdon WorldCat Local Product Manager.
LIBRARY RESOURCE DISCOVERY PRODUCTS: COMMERCIAL AND OPEN SOURCE OPTIONS Web Manager’s Academy Marshall Breeding Director for Innovative Technology and.
VuFind at the University of Illinois LITA National Forum October 3, 2009.
Future of Cataloging RDA and other innovations pt.1.
Metadata Crosswalking/ Transforming and Federated Searching in Ex Libris Products Anthony Dellureficio Library Systems Manager The New School University.
LIBRARY RESOURCE DISCOVERY PRODUCTS AND SERVICES: OVERVIEW AND PERSPECTIVES Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
OPAC Review Catalog functions Inventory and control Locating known items Discovery tool.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
NCSU Libraries Andrew Pace & Emily Lynema NCSU Libraries May 24, 2006.
OPAC Search & Navigation. “OPAC Complainers” “There is certainly no dearth of OPAC complainers. You have Andrew Pace (OPACs suck), and Roy Tennant (You.
THE STATE OF LIBRARY SEARCH AND DISCOVERY Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder and Publisher,
Module 6: Preparing for RDA... LC RDA for Georgia Cataloging Summit Aug. 9-10, 2011.
1 Preparations for Implementing RDA in Ex Libris’ Products ALA Annual Conference | Anaheim, CA | 24 June 2012 Mike Dicus, Product Manager Ex Libris (USA),
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
DISCOVERY PRODUCTS AND SERVICES: Introduction and current trends Marshall Breeding Director for Innovative Technology and Research Vanderbilt University.
Endeca: a faceted search solution for the library catalog Kristin Antelman & Emily Lynema UNC University Library Advisory Council June 15, 2006.
Module 6: Preparing for RDA... LC RDA for NASIG - June 1, 2011.
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
A Management Tool David Goldsmith E-Matrix Oct 27 th, EPA Meeting NCSU Libraries.
MaNGO Mango is FCLA created application that uses the Solr/Lucene search engine and repository Begun in October 2007, live in August 2008, major overhaul.
What is an open source discover tool? is a standalone, open source software used as alternative interface to existing integrated library systems that may.
© 2010 Deep Web Technologies, Inc. Taking the Library Back from Google Abe Lederman, President and CTO Deep Web Technologies May 12, 2010.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Kevin Gilbertson - Web Services Librarian Jean-Paul Bessou - Systems Librarian Z. Smith Reynolds Library Wake Forest University Growing Your Own Next-Gen.
THE FUTURE OF THE LIBRARY CATALOG OPACS GIVE WAY TO DISCOVERY Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
Discovering Value : Discovery Services and ERM Systems Together Nancy Fleck Michigan State University Ted Fons Innovative Interfaces.
VuFind: Community & Code. vufind.org Overview Intro to VuFind Features & Technologies Community, Support, Sustainability …
Google Search Appliance (GSA) & HIP Feasibility Review October 29, 2008.
EVERY CONNECTION has a starting point. A compelling end user environment: OCLC’s view Marianne Klomp Product Manager OCLC EUSIDIC 2008 London, UK.
Webdiscovery Tools: the Future of Reference in Academic Libraries.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Matt Goldner Product & Technology Advocate Mela Kircher Product Manager WorldCat Local Metasearch 13 November 2009.
Taking the Library Back from Google Abe Lederman, President and CTO October 18-20, 2007.
Delivers local and global resources and OCLC e-Content in a single search Paul Cappuzzello Senior Library Services Consultant
DISCOVERY SYSTEMS: SOLUTIONS A USER COULD LOVE OVERVIEW OF DISCOVERY SYSTEMS Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
Delivers local and global resources in a single search The first, easy step toward the first cooperative library service on the Web WorldCat Local “quick.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
THE EVOLUTION OF LIBRARY COLLECTION DISCOVERY: Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder.
Remote Data Sources in Primo Ebsco API WorldCat API Local Content.
© 2015 Ex Libris | Confidential & Proprietary Yoel Kortick | Senior Librarian Primo Analytics.
Discovery of Library Resources
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Sophia Katsarska Eighth AMICAL Conference Beirut, April 2011
Presentation transcript:

Metadata Working Group Forum Cornell Metadata Normalization A Case Study in Primo -- and -- Linked Open Data In Libraries

Metadata Working Group Forum Cornell Topical Overview Non-OPAC Discovery Systems ILS - Discovery System Interoperability SFX Optimization Metadata Normalization Ex Libris’ Primo - A Case Study –Front end and System Overview –Primo - System Demo –Primo - NYU’s Implementation

Metadata Working Group Forum Cornell Topical Overview Primo (Cont) –Metadata and Data Analysis –Challenges and Possibilities Common Models and Linked Open Data – An Alternative Approach Data Harmonization Benefits –Authority Control –Application Profiles –A possible future for Bibliographic Data

Metadata Working Group Forum Cornell Not an OPAC Replacement Primo, Endeca, Encore, AquaBrowser, Library Find, VUFind, WorldCat Local Not OPAC Replacements –More seamless discovery –“Web 2.0” –Fewer Clicks - ease of use –Cross-depository discovery

Metadata Working Group Forum Cornell Encore by III “Encore goes beyond the online-catalog model to provide a better patron experience that leverages library content and patron- contributed information. Key features include: –Faceted search by multiple parameters –RightResult™ relevance-ranking –Real-time holdings and status information –Suggested links to content related to the user's search”

Metadata Working Group Forum Cornell AquaBrowser “Whatever it is, wherever it is, patrons can quickly and easily find it using a single interface for all types and formats of content. Visually represented and faceted search results allow your patrons to search and discover information faster and more effectively. Relevant search results help them find answers fast. Word clouds encourage exploration and discovery. Facets help to quickly focus the results.”

Metadata Working Group Forum Cornell Endeca “Endeca for Libraries is the most effective way for members of the library community to find the book or resource they need and to discover new information they didn't even know the library owned, which drives increased usage of the library's resources, usage of legacy library collections, and re- circulation.”

Metadata Working Group Forum Cornell Primo “Interfacing seamlessly with library applications from Ex Libris and other vendors … management of all types of library resources, regardless of format and location.” “Find it all. Find it Easily. Get it”

Metadata Working Group Forum Cornell WorldCat Local “A localized version of WorldCat with custom branding and relevancy ranking … interoperates with your existing ILS and fulfillment systems … Single-search, multilingual interface for all physical and electronic content held locally or in remote locations Integrated access to the most appropriate delivery options”

Metadata Working Group Forum Cornell One Problem, Many Solutions Users want a more seamless discovery experience Libraries get locked on the 2.0 buzz –Tagging, Reviews, Recommendations –Improved Relevancy Ranking Other goals may be more important –Fewer clicks to fulfillment –Cross-Depository Discovery

Metadata Working Group Forum Cornell More Intuitive Searching Less complicated initial searches Less pre-search limiting More post-search limits via faceting Appropriate Delivery bubbles up Trade-offs…

Metadata Working Group Forum Cornell Not in your OPAC DLF ILS Discovery Interface Task Force From the “Berkeley Accords”: –“participants agreed to support a set of essential functions through open protocols and technologies by deploying specific recommended standards” –Harvesting, Availability, Linking

Metadata Working Group Forum Cornell Availability and Access Links to full resource at the front Carefully considered SFX options Record FRBRization and Dedup Availibility Statements real time (or as close as possible)

Metadata Working Group Forum Cornell Open URL From Primo Aleph Record Aleph Record(s) and/or other data deduped Other Data Source Query: Other NYU Cat WorldCat Link both to ISBN/ISSN Search Query: Aleph (link to Holdings) Other NYU Cat WorldCat Link Others to ISBN/ISSN Search Query: Aleph (link to Holdings) Other NYU Cat WorldCat Link Others to ISBN/ISSN Search

Metadata Working Group Forum Cornell Primo: A Case Study Normalization Rules Delivery templates Tight SFX and MetaLib Integration “Pipes” for different data sources Hourly Availability Checking –(Real Time in Version 2.0)

Metadata Working Group Forum Cornell Harvesting Different Data Sources Different Normalization Rules All standardized on Primo Normalized XML (PNX) Record –Very Flat, sections corresponding to Primo Functionality

Metadata Working Group Forum Cornell The PNX Record Display Section Links Section Search Section Sort Section Facets Section Dedup Section FRBR Section Delivery Section

Metadata Working Group Forum Cornell Data Sources at NYU BobCat (Aleph) MarcIt! EAD Records (Archivists Toolkit) Preservation Repository Faculty Digital Archive (IR) Art Images (Luna Insight) MetaLib Resources Data in SOLR –Newspaper Index –Data Sets

Metadata Working Group Forum Cornell Issues and Challenges Managing Deduplication –Dedup Data only out of box for MARC –Writing for OAI-PMH sources (EAD) Consortial Environment(s) Appropriate Delivery Options “Interpreting” Metadata

Metadata Working Group Forum Cornell EAD Records Archivists Toolkit –Previously in Access, Notepad, Excel –Authority Control (sort of) OAI-PMH Overlay Multiple layers of Crosswalking Deduping

Metadata Working Group Forum Cornell EAD / Aleph Dedup Aleph Title: –James E. Jackson and Esther Cooper Jackson papers EAD Title: –Guide to the James E. Jackson and Esther Cooper Jackson papers (Bulk ) Tamiment 347

Metadata Working Group Forum Cornell MARC + EAD EAD Record Aleph Record Authority Records MARC Record w/ Auth Data OAI-DC Record w/ FT of EAD EAD PNX Aleph PNX Dedup PNX

Metadata Working Group Forum Cornell Value of Dedup Indexing the Best of Both Worlds EAD Records: –Inventory –Long Biographical / Historical Notes MARC Data: –Cross References for Access Points

Metadata Working Group Forum Cornell Why is it so hard? Continually Repetition of Effort

Metadata Working Group Forum Cornell

Metadata Working Group Forum Cornell A Distinction Metadata Harmonization: –the “ability to use serveral different metadata standards in a single software system.” Metadata Normalization: –mapping serveral different metadata standards to a single schema or structure for use in a single software system.

Metadata Working Group Forum Cornell MARC + EAD EAD Record Aleph Record Authority Records MARC Record w/ Auth Data OAI-DC Record w/ FT of EAD EAD PNX Aleph PNX Dedup PNX

Metadata Working Group Forum Cornell Linked Open Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things.

Metadata Working Group Forum Cornell Primo is NOT Linked Data List of nearly a dozen sources, some “normalized” more than once “Normalized” into another proprietary format, used by one system Additional Resources require additional pipes

Metadata Working Group Forum Cornell Linked Library Data Resources get URI’s early in lifecycle Properties get URI’s Vocabularies get URI’s Everything is dereferenceable as to it’s meaning

Metadata Working Group Forum Cornell Conclusions DCMI/RDA Work NSDL Registry Work LC Registry Work MODs as RDF (Simile & LC) OAI-ORE OAI2LOD

Metadata Working Group Forum Cornell Conclusions This stuff is happening We need to be playing with it We need to be applying lessons from projects like Primo to it Library Data is a key component!

Metadata Working Group Forum Cornell …and Library data is extremely complicated

Metadata Working Group Forum Cornell MARC Record Graph Does not include authority data Coins new URI’s any non-literal value Contains a few minor modeling errors <modsrdf:Place modsrdf:name="New York“ rdf:about=" ccountry/nyu"/>

Metadata Working Group Forum Cornell Thanks! Questions?