NISO'S IOTA INITIATIVE: COMPLETENESS INDEX AND IMPROVING ELEMENT WEIGHTS Oliver Pesch EBSCO Information Services

Slides:



Advertisements
Similar presentations
Full Text Finder Overview Tutorial support.ebsco.com.
Advertisements

OvidSP Flexible. Innovative. Precise. Introducing OvidSP Resources.
LOCALIZED REFERENCE LINKING PROJECT Dale Flecker NFAIS/NISO Linking Workshop February 24, 2002 Philadelphia.
Localization and Extended Services NFAIS/NISO Linking Workshop February 24, 2002 Miriam Blake Los Alamos National Laboratory.
Implications of Release 3 of the COUNTER Code of Practice Vendor Usage Reports: Are we all on the same page now? Charleston Conference November 6, 2008.
LITA Electronic Resources Management Interest Group NISOs IOTA Working Group January 7, 2011 Oliver Pesch, EBSCO Information Services.
Holdings Information in Electronic Content Access: NISOs IOTA Working Group January 8, 2011 Oliver Pesch, EBSCO Information Services.
Jason Price, SCELC Ejournal Package Analyst
© 2008 EBSCO Information Services SUSHI, COUNTER and ERM Systems An Update on Usage Standards Ressources électroniques dans les bibliothèques électroniques.
NISOs IOTA Initiative Measuring the Quality of OpenURL Links NASIG Annual Conference St. Louis, MO June 2 – 5, 2011 Rafal Kasprowski, Rice University.
Usage Statistics in Context: related standards and tools Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services Usage Statistics and Publishers:
Library Electronic Resources in the EUI Library Veerle Deckmyn, Library Director Aimee Glassel, Electronic Resources Librarian 07 September
HINARI – Accessing Articles: Problems and Solutions.
Sirsi Resolver at the University of Leicester Jonathan Field Sept 3rd, 2004 EUUG, Amsterdam.
LinkSource – the sophisticated, powerful link resolver
UKOLN is supported by: An overview of the OpenURL UKOLN/JIBS OpenURL Meeting London, September 2003 Andy Powell, UKOLN, University of Bath
Freedom by design OL 2 Stephanie Taylor Project Manager.
EBSCO A-to-Z ® Electronic Resource Management Februar 2007.
CrossRef Linking and Library Users “The vast majority of scholarly journals are now online, and there have been a number of studies of what features scholars.
NISO’s IOTA Working Group Improving OpenURLs Through Analytics UKSG Conference Harrogate, United Kingdom April 4 – 6, 2011 Rafal Kasprowski, Rice University.
Progress towards a trouble-free knowledge base supply chain Charlie Rapple KBART co-chair UKSG, March 2009.
Information Retrieval (IR) on the Internet. Contents  Definition of IR  Performance Indicators of IR systems  Basics of an IR system  Some IR Techniques.
Metadata and presentation issues of Korean E-Resources relating to access and discovery ERMB Workshop presented by Erica Chang March 25, 2014 Philadelphia,
CINAHL Keyword Searching. This presentation will take you through the procedure of finding reliable information which can be used in your academic work.
OvidSP Medline: Search Techniques & Strategies Educational Programming by Sladen Library Developed by Gina Hug, JoAnn Krzeminski and Nandita Mani January.
KBART: improving the supply of data to link resolvers and knowledge bases Charlie Rapple KBART co-chair UKSG Annual Conference 5-7 April 2008.
Extended-Linking Services: towards a Quality Web Eric F. Van de Velde California Institute of Technology
Project COUNTER Trends in Statistical Standards for E- Resource Management March 2005 Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services.
Link Resolvers and Knowledge Bases – Why are they so important? Sarah Pearson University of Birmingham Co-Chair KBART Working Group.
1 Item-level linking & OpenURL: the perspective of a database provider ACRL Information Technology Interest Group, March 11, 2003 Oliver Pesch Chief Architect.
Open Linking and the OpenURL Standard Eric F. Van de Velde, Ph.D. Chair, NISO Committee AX Director of Library Information Technology California Institute.
New technologies in the libraries Stu Baker Library Management Systems Northwestern University Library.
Developments in Linking: OpenURL Eric F. Van de Velde California Institute of Technology
Information Literacy Summon Catalog Summon is the only discovery service designed around a single, unified index of content. Provides a Google-like search.
OpenURL What all library staff should know Cybertour | March 16, 2005 | Cindi Trainor.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
OpenURL: Linking LC’s E-Resources Ardie Bausenbach Automated Planning and Liaison Office Library of Congress November 24, 2003.
Link Resolvers: An Introduction for Reference Librarians Doris Munson Systems/Reference Librarian Eastern Washington University Innovative.
Linking resources Praha, June 2001 Ole Husby, BIBSYS
1 Use Measures for Electronic Resources: Theory and Practice A Vendor’s Perspective ALCTS June 27, 2005 Oliver Pesch.
1 CrossRef - a DOI Implementation for Journal Publishers January 29, 2003 CENDI Workshop.
The role of knowledge bases in improving discoverability now and in the future- why national and international collaboration is key The role of knowledge.
PubMed Overview From the HINARI Content page, we can access PubMed by clicking on Search inside HINARI full-text using PubMed. Note: If you do not properly.
OpenURL Link Resolvers 101
ZLOT Prototype Assessment John Carlo Bertot Associate Professor School of Information Studies Florida State University.
Linking electronic documents and standardisation of URL’s What can libraries do to enhance dynamic linking and bring related information within a distance.
Inherent Dependencies in the OpenURL Reference Linking Model Adam Chandler Cornell University Library NISO Discovery to Delivery: Creating a First-Class.
IOTA: Improving OpenURL Through Analytics Nettie Lagace NISO Associate Director for Programs CEAL Workshop on Electronic Resources Standards.
West Virginia State University Jenn Zuccaro Leadership Institute November 2012.
Christine Stohn SFX Product Manager Ex Libris January 8th, 2011 ALA Midwinter, San Diego.
Librarians Creating Solutions for Librarians
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
A Fund Allocation Process: Employing a Use Factor Lisa Barricella and Cindy Shirkey November 7, 2014.
CENDI/FLICC Workshop, June 21, 2000 Slide 1 of 24 The Impact of Reference Linking on the Creation and Use of References/Citations CENDI/FLICC Workshop.
Jenny Walker JOIN-UP 6 th March Enabling the delivery of localized extended services the OpenURL framework Agenda The delivery of localized extended.
Smart Linking With SFX SFX Training, Intranet Internet range of authorities, technologies A&I e-print FTXT OPAC FTXT A&I Electronic Scholarly Information.
Access to Electronic Journals and Articles in ARL Libraries By Dana M. Caudle Cecilia M. Schmitz.
Making Sense of the Alphabet Soup of Standards Practical Support for Managing Electronic Resources DDAKBARTTransfer Betty Landesman ER&L Conference February.
Alma Analytics Usage Yoel Kortick | Senior Librarian.
Taming the E-Chaos Through Standards and Best Practices An Update on Recent Developments Betty Landesman NC Serials Conference March 21, 2016.
Full Text Finder Publication Finder Overview
Making Sense of the Alphabet Soup of Standards
SCOTT BERNIER | Senior Vice President
Summon discovers contents from one search box!
Link Resolver and Knowledge Base in Discovery Services
Full-Text Links: Fish, or Fishing License?
Standards For Collection Management ALCTS Webinar – October 9, 2014
Deliver the Appropriate Links for Your Library
Presentation transcript:

NISO'S IOTA INITIATIVE: COMPLETENESS INDEX AND IMPROVING ELEMENT WEIGHTS Oliver Pesch EBSCO Information Services

Overview Premise for IOTA completeness score and element weights Proving the theory through real-life tests Using statistical approach to determine weights Test results Conclusions Next steps for IOTA

The premise behind IOTA Completeness Score is the measure of the “completeness” of a single OpenURL Completeness Index is attributed to the content provider as an overall measure of the completeness of their OpenURLs

The premise behind IOTA The Completeness Score is calculated by “weighing” the elements provided in the OpenURL based on their importance in target links Some elements are more important than others and will have a higher weight Completeness Score equals the sum of weights of elements found divided by the maximum score possible

The premise behind IOTA Simple example assuming equal element weights ElementDescriptionWeight This OpenURL ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication1 ISSN 1 IssueIssue number1 SPageStart page1 TitleJournal Title1 VolumeVolume number1 TOTAL 8

The premise behind IOTA Simple example assuming equal element weights ElementDescriptionWeight This OpenURL ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication1 ISSN 1 IssueIssue number1 SPageStart page1 TitleJournal Title1 VolumeVolume number1 TOTAL Completeness Score... (Total for This OpenURL) Total Weights 5 / 8 =.625 Completeness Score... (Total for This OpenURL) Total Weights 5 / 8 =.625

Determining the weights Initial approach Frequency of element occurrence in target link templates Combined with reasoning

Initial Weights OpenURL data elementDescriptionWeight ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication5 eISSNOnline ISSN3 ISSNPrint ISSN3 IssueIssue number3 JtitleJournal Title1 PmidPubMed ID8 SPageStart page3 TitleJournal Title1 VolumeVolume number3 DOIDigital Object Identifier8

Initial Weights OpenURL data elementDescriptionWeight ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication5 eISSNOnline ISSN3 ISSNPrint ISSN3 IssueIssue number3 JtitleJournal Title1 PmidPubMed ID8 SPageStart page3 TitleJournal Title1 VolumeVolume number3 DOIDigital Object Identifier8 Initial weights were somewhat subjective.

Initial Weights OpenURL data elementDescriptionWeight ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication5 eISSNOnline ISSN3 ISSNPrint ISSN3 IssueIssue number3 JtitleJournal Title1 PmidPubMed ID8 SPageStart page3 TitleJournal Title1 VolumeVolume number3 DOIDigital Object Identifier8 Most link resolver knowledge bases can handle look-ups by either Print ISSN or Online ISSN (both are not needed)

Initial Weights OpenURL data elementDescriptionWeight ATitleArticle title1 AuLastAuthor’s last name1 DateDate of publication5 eISSNOnline ISSN3 ISSNPrint ISSN3 IssueIssue number3 JtitleJournal Title1 PmidPubMed ID8 SPageStart page3 TitleJournal Title1 VolumeVolume number3 DOIDigital Object Identifier8 Most link resolvers will enhance identifiers like PubMed ID and DOI; therefore, having an identifier is like having all metadata elements.

Validating the Completeness Score Use real OpenURLs and a commercial link resolver. (tested with LinkSource and 360-Link) Remove institutional holdings as a limit to resolution Process each OpenURL through the link resolver to determine “Success” Score one point for finding at least one full text target Calculate the completeness score for each OpenURL Look for a statistical correlation between the completeness score and the success score

Results: Original Weights Correlation Coefficient.43 Tests conducted on sample of 15,000 OpenURLs randomly pulled from IOTA database

A Statistical Approach to Determining Element Weights Select a set of “perfect” OpenURLs include all key data elements and resolve to full text Perform step-wise regression Test failure rates for each element by removing that element Use failure rates as basis for weights Use new weights to test for correlation between weights and success for larger sample

Failure Rates from 1500 OpenURL test sample Element removed from the OpenURL DescriptionFailure Percentage ATitleArticle title.74% AuLastAuthor’s last name.07% DateDate of publication.4% ISSN ISSN (either online or print ISSN) 22.02% IssueIssue number20.27% SPageStart page33.27% Title Journal Title (either Title or Jtitle).61% VolumeVolume number74.14% Volume is most critical Author’s last name is least important Date is surprisingly low

Calculated Element Weights ElementDescriptionWeight* ATitleArticle title1.87 AuLastAuthor’s last name0.83 DateDate of publication 1.61 ISSN ISSN (either online or print ISSN) 3.34 IssueIssue number 3.31 SPageStart page 3.52 Title Journal Title (either Title or Jtitle) 1.78 VolumeVolume number3.87 *Element weight calculation: log10 (failure-rate-per-10,000 OpenURLs)

Results: New Weights Correlation Coefficient.80 Tests conducted on sample of 15,000 OpenURLs randomly pulled from IOTA database

Notes Testing the same OpenURLs on 360-Link results in different numbers but consistent trends. Differences may be attributed to: Variations in metadata enhancement techniques Strictness in target link rules (e.g. required elements before link shows – tied to level of forgiveness of target) Link syntax used for target

Notes 96.3 of OpenURLs in the test were able to populate a full text target of credible ILL form… Perception of high failure rate of OpenURL may be attributed to library holdings and user expectations Suggestion: set link text to control expectations Link to full text (for items in the online collection) Check library collection (for things in print collection) Request from library (for everything else)

Conclusions Step-wise regression approach to element weights works Completeness Index scores can be correlated to actual OpenURL “success” KB and resolver technology influence results and prevent a universal set of element weights The Completeness Index is a mechanism individual link resolver vendors can use to provide metrics to help improve their service quality

Other takeaways Several factors involved in perceived “link failure”: 1. Bad or missing metadata in the OpenURL link 2. Inaccurate holdings data within the resolver’s knowledge base 3. Flexibility of syntax to the target - e.g., target supports at least two: OpenURL syntax, DOI link, proprietary link structure 4. Flexibility of resolution logic at the target - i.e., target finds way to create link using available data when some data missing or wrong 5. User expectations - e. g., link resolver provided link to OPAC or ILL form, but user was expecting full text - IOTA focused on (1) - KBART working on (2) - Education of content providers could address (4) - Displaying OpenURL button only if full text available could address (5)

What’s next for IOTA Continue offering public access to reports on element frequency Publish technical report on work to date Publish recommended practice for calculation and use of completeness scores for link quality assessment by link resolver vendors Continue work as a NISO standing committee for at least one more year