Web Scale Discovery What is it? …and how does it work?

Slides:



Advertisements
Similar presentations
New from Ex Libris: Primo Central. 2 Primo in a nutshell An end-user discovery and delivery solution Enables a library to offer its entire collection.
Advertisements

1 Advancing Discovery Services for Libraries Jenny Walker Fiesole Retreat April 2012 Jenny Walker Fiesole Retreat April 2012.
2009 Annual ASERL Membership Meeting Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library
1 The Changing Standards Landscape E-Books: Describe and Identify, Search and Discover, Comply and E- Books: Describe and Identify, Search and Discover,
LIBRARY RESOURCE DISCOVERY: Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
Trends and Advancements for Library Resource Discovery Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
Summon: Web-scale discovery. Agenda Web-scale Discovery Defined How Summon Works Summon User Experience (live demonstration) Additional Resources.
DISCOVERY SERVICES FOR LIBRARIES Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides
1 Introducing the Open Discovery Initiative Discovery and Delivery: Innovations and Challenges Marshall Breeding
1 Update on NISO’s Open Discovery Initiative Nettie Lagace, Associate Director for Programs, NISO David Lindahl, Director of Strategic Initiatives, UMKC.
BC Integration of Systems and Resources MetaLib at Boston College Theresa Lyman Digital Resources Reference Librarian Boston College Libraries.
Discovery Tools in Academic Libraries: why, what and how? Edith Falk Chef Librarian The Hebrew University Library Authority.
The Future of Library Resource Discovery Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
Lund Online 07/10/2009 Ingolf Kaspar, Regional Sales Manager EBSCO Publishing.
Web-scale discovery services: searching the library in the era of Google Katie Dunn Technology & Metadata Librarian Rensselaer Polytechnic Institute Slides:
Making sense of the data jumble Trinity College Library Dublin’s Discovery Solution Experience Arlene Healy & Charles Montague Digital Systems and Services.
Clive Wright Senior Director of Discovery Products for Europe and Latin America.
Conversation Starter at ALA Anaheim June 23, 2012 Discovery Here, Discovery There Ken Varnum University of Michigan Pros.
Current trends in library resource management, discovery, and resource sharing Marshall Breeding Independent Consultant, Founder and Publisher, Library.
OpenURL What all library staff should know Cybertour | March 16, 2005 | Cindi Trainor.
Primo Primer / MetaLib+ for Mere Mortals
Florida Center for Library Automation
ADVANCES IN LIBRARY DISCOVERY SERVICES The State of the Art in 2011 Marshall Breeding Director for Innovative Technology and Research Vanderbilt University.
HARVARD DISCOVERY DAY Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides
NEXT-GENERATION LIBRARY DISCOVERY: Recent trends and developments Marshall Breeding Director for Innovative Technology and Research Vanderbilt University.
UNDERSTANDING THE NEW DISCOVERY LANDSCAPE: Federated Search, Web-scale Discovery, Next- Generation Catalog and the rest Marshall Breeding Director for.
TOWARD A UNIFIED WEB PRESENCE FOR YOUR LIBRARY Web Manager’s Academy Marshall Breeding Vanderbilt University Library Founder and Publisher, Library Technology.
Link Resolvers: An Introduction for Reference Librarians Doris Munson Systems/Reference Librarian Eastern Washington University Innovative.
Link Resolvers: Serials Solutions Aine Finucane Acquisitions Librarian University of Limerick LIR Annual Seminar 2005.
Discovery services at the UAntwerpen VLIR-UOS Workshop December 8 th 2014.
LIBRARY RESOURCE DISCOVERY PRODUCTS: COMMERCIAL AND OPEN SOURCE OPTIONS Web Manager’s Academy Marshall Breeding Director for Innovative Technology and.
The role of knowledge bases in improving discoverability now and in the future- why national and international collaboration is key The role of knowledge.
LIBRARY RESOURCE DISCOVERY PRODUCTS AND SERVICES: OVERVIEW AND PERSPECTIVES Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
Discovery Overview Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
OpenURL Link Resolvers 101
NEXT-GENERATION LIBRARY DISCOVERY: Overview + Recent trends and developments Marshall Breeding Independent Consultand, Author, Founder and Publisher, Library.
1 Introducing the Open Discovery Initiative North American Serials Interest Group June 9, 2012 Marshall Breeding
THE STATE OF LIBRARY SEARCH AND DISCOVERY Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder and Publisher,
Current trends in library resource management, discovery, and resource sharing Marshall Breeding Independent Consult, Author, Founder and Publisher, Library.
DISCOVERY AND LIBRARY MANAGEMENT SYSTEMS Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides
Library Resource Discovery: Current landscape and trends Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
Christine Stohn SFX Product Manager Ex Libris January 8th, 2011 ALA Midwinter, San Diego.
1 Transparency in Discovery: Marshall Breeding April 7, 2014 Issues and progress in the.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Issues, Concerns and Suggestions for Chinese E-resources Susan Xue Chair, Committee on Chinese Materials.
DISCOVERY PRODUCTS AND SERVICES: Introduction and current trends Marshall Breeding Director for Innovative Technology and Research Vanderbilt University.
MetaLib and SFX: The Library Portal and Link Server from Ex Libris Tamar Sadeh Marketing Manager Tallinn, September 2005.
EBSCO Discovery Service. Discovery Background –Quickly –By small development teams –Using rudimentary relevance algorithms built around searching article.
Working out the Future of Library Resource Discovery Marshall Breeding Independent Consultant, Founder and Publisher, Library Technology Guides
MaNGO Mango is FCLA created application that uses the Solr/Lucene search engine and repository Begun in October 2007, live in August 2008, major overhaul.
© Ex Libris Ltd. All Rights Reserved. From Library Systems to Information SystemsMetaLib Jenny Walker ICOLC 2001.
THE FUTURE OF THE LIBRARY CATALOG OPACS GIVE WAY TO DISCOVERY Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
Webdiscovery Tools: the Future of Reference in Academic Libraries.
GET A BETTER CATALOG! NEXT-GENERATION DISCOVERY INTERFACES FOR LIBRARY OPACS Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
1 Introducing the Open Discovery Initiative John LawJamene Brooks-Kieffer Serials SolutionsKansas State University Electronic Resources & Libraries April.
Delivers local and global resources and OCLC e-Content in a single search Paul Cappuzzello Senior Library Services Consultant
DISCOVERY SYSTEMS: SOLUTIONS A USER COULD LOVE OVERVIEW OF DISCOVERY SYSTEMS Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
Choice in Discovery: Bundled or à la carte? History and market context Marshall Breeding Independent Consultant, Author, and Founder and Publisher, Library.
THE EVOLUTION OF LIBRARY COLLECTION DISCOVERY: Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library Founder.
Remote Data Sources in Primo Ebsco API WorldCat API Local Content.
ODI - Open Discovery Initiative Ken Varnum NISO Update at ALA Midwinter 10 January 2016.
Discovery of Library Resources
Director, Library Systems
Open Discovery Initiative (ODI)
Making Sense of the Alphabet Soup of Standards
Laura Morse & Amira Aaron ELUNA Steering Committee
Open Discovery Initiative (ODI)
Mango: Features and Functionality
Advancement of Discovery
Link servers for libraries
Presentation transcript:

Web Scale Discovery What is it? …and how does it work?

 Overview of WSD  Discovery at Harvard  Open Discovery Initiative  StackLife  Questions

discovery layer “next-gen” search interface index pre-harvested, indexed content what is web-scale discovery?

who:

pre-harvested index full-text articles, article citations, book chapters, ebooks, digital objects, open source content optional: local content harvested using FTP and OAI-PMH local content: catalog data, digital/special collections, etc. aggregators: Gale, LexisNexis, etc. publishers: Elsevier, Wiley, Springer, etc. open source: Hathi Trust, DOAJ, Medline, etc. A&I services: MLA, LLBA, PsycINFO, etc.

indexing full-text vs. metadata only

vendor’s discovery layer vendor index content and local content in single result set non-native discovery layer (Blacklight, Solr, Drupal, VuFind, etc.) article results (vendor index) book results (catalog) “blended results:”“bento box:”

Search results Harvard G:\LTS\Library Systems\Discovery Initiatives\Discovery Environment.pptx11/18/13 p. 1 of 1Prepared by Corinna Baksik HOLLIS Classic (Aleph OPAC) VIA HGL OASIS Dataverse Browse Phrase CJK Call # HOLLIS (Aquabrowser) Relevancy Facets Ebookplates Tables of Contents ($) TBD: Web-Scale Discovery Primo Central Index (PCI) E- resources A-Z Keyword Subject E-Research (Metalib)Metalib+ OpenURL resolver “right copy” check for FT / print hol Scan & Deliver, Aeon, ILLiad, ARES E-journals A-Z Citation Linker Find It (SFX) Federated Searching of remote resources Enables cross- search, custom widgets Publishers harvests Aggregators E-Resource Management Acquisitions (interop w/Aleph AP) Licensing Trials Subscriptions Statistics Verde Knowledge Base Largely harvested from SFX Local configuration (non-SFX ) bX recommender service Portal (Drupal) send searches E-journal MARCit service Hosted service Local service Key: Repositories Vendors who do not share data* November 2013 * E.g. EBSCO content can only be represented in Metalib+ via federated search of separately purchased EBSCO API DASH WAX queries for OA/peer indicators Knowledge Base Search targets (DBs) and parsers Local target configuration Knowledge Base Harvard journal subscriptions Journal coverage Package coverage Indicators for peer-reviewed, OA Knowledge Base Management / activation of packages within PCI

2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim= T12%3A44%3A25IST&url_ver=Z &url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/pri mo.exlibrisgroup.com:primo3-Article- wos&rft_val_fmt=info:ofi/fmt:kev:mtx:&rft.genre=article&rft. atitle=Syrian%20refugees%20and%20sexual%20violence&rft.j title=LANCET&rft.btitle=&rft.aulast=Ouyang&rft.auinit=&rft.a uinit1=&rft.auinitm=&rft.ausuffix=&rft.au=Ouyang%2C%20H& rft.aucorp=&rft.date= &rft.volume=381&rft.issue=98 84&rft.part=&rft.quarter=&rft.ssn=&rft.spage=2165&rft.epag e=2166&rft.pages=&rft.artnum=&rft.issn= &rft.eissn=&rft.isbn=&rft.sici=&rft.coden=&rft_id=info:d oi/ /S (13) X&rft.object_id=&svc_val_fmt=info:ofi/fmt:kev:mtx:sch_svc&s vc.fulltext=yes&rft.eisbn=&rft_dat=%3Cwos%3E %3C/wos%3E&rft_id=info:oai/%3E

v2 summon EDS

Web-Scale Discovery Systems now able to index large quantities of records Large result sets with potentially equivalent candidates returned for any given query – Relevancy – Faceting – Pre-search scoping/Advanced search So, is content king? – Features are built on a variety of elements (citation metadata, A&I metadata, full text/transcript)

Web-Scale Discovery In some ways, yes. – Desire for full collection (ILS data, special collections data, and licensed materials at the unit level (article, e-book, song, etc) to be searched in the system. – Easier to determine this for local collection data; harder to evaluate for the “mega-indexes” Competitive Marketplace – coverage often seen as key differentiator How can libraries understand the differences in coverage among competing services?

Web-Scale Discovery Difficult to evaluate coverage based on numbers of items indexed alone Breadth – Titles – Date Span Depth – Is content indexed at the citation or full-text level? – Value Added A&I Data

Web-Scale Discovery Important to know whether the discovery service favors the content of any given publisher, intentionally or not intentionally. For example, does the relevancy reflect bias or publisher preferences? New studies looking at impact on underlying content usage (example 1 and example 2).example 1example 2

Slide Credit – Marshall Breeding Many A&I and specialized resources do not contribute to Discovery services Two major players are both publishers and discovery service providers – EBSCO – ProQuest ProQuest does not provide content to other discovery services EBSCO does not provide content to other discovery services Issue currently being pressed by Orbis Cascade Alliance. Non-Cooperative Scenarios

Open Discovery Initiative NISO Work Group to Develop Standards and Recommended Practices for Library Discovery Services Based on Indexed Search Informal meeting called at ALA Annual 2011 Co-Chaired by Marshall Breeding and Jenny Walker Term: Dec 2011 – Dec / Slide Credit – Marshall Breeding

Balance of Constituents LibrariesPublishersService Providers 19 Marshall Breeding, Vanderbilt University Jamene Brooks-Kieffer, Kansas State University Laura Morse, Harvard University Ken Varnum, University of Michigan Sara Brownmiller, University of Oregon Lucy Harrison, College Center for Library Automation (D2D liaison/observer) Michele Newberry Lettie Conrad, SAGE Publications Roger Schonfeld, ITHAKA/JSTOR/Portico Jeff Lang, Thomson Reuters Linda Beebe, American Psychological Assoc Aaron Wood, Alexander Street Press Jenny Walker, Ex Libris Group John Law, Serials Solutions Michael Gorrell, EBSCO Information Services David Lindahl, University of Rochester (XC) Jeff Penka, OCLC (D2D liaison/observer) Slide Credit – Marshall Breeding

ODI Project Goals: Identify … needs and requirements of the three stakeholder groups in this area of work. Create recommendations and tools to streamline the process by which information providers, discovery service providers, and librarians work together to better serve libraries and their users. Provide effective means for librarians to assess the level of participation by information providers in discovery services, to evaluate the breadth and depth of content indexed and the degree to which this content is made available to the user. Slide Credit – Marshall Breeding

ODI Timeline MilestoneTarget DateStatus Appointment of working groupDec 2011 Approval of charge and initial work planMar 2012 Agreement on process and toolsJun 2012 Completion of information gatheringJan 2013 Completion of initial draftJun 2013 Completion of final draftSep 2013 Public Review Period commencesSep Slide Credit – Marshall Breeding

ODI Best Practice Highlights General – Continue the work with a SC or WG – Voluntary disclosure by CPs and DSPs to assert conformance with best practices Provide minimum set of metadata to all discovery services Provide library with statement of participation – Statement on non-bias – Usage reports – Content listings

ODI Best Practice Highlights Content Providers – Metadata elements to provide to DSPs (both minimum metadata, full text, and enhanced metadata) – Disclosure of this information to libraries – Technical format compliance

ODI Best Practice Highlights Discovery System Providers – Disclosure of information about content included in index to libraries – Guidelines on transparency for linking without bias – Technical data transfer guidelines – Guidelines on transparency for linking without bias – Usage statistics

ODI Best Practice Highlights Areas for further discussion – APIs – “Restricted” content access – Additional metrics/reporting – and much more

Further Reading NISO RP x, Open Discovery Initiative: Promoting Transparency in Discovery Orbis – Cascade Stakeholders Strive to Define Standards for Web-Scale Discovery Systems face-growing-need-for-best-practices / Discovering Reciprocity Discovery or Displacement?: A Large Scale Longitudinal Study of the Effect of Discovery Systems on Online Journal Usage Revisiting Plato's Cave, Bruce Heterick, JSTOR