University of North Carolina School of Information & Library Science: Reference Linking. INLS 110-98, Week 15. Herbert Van de Sompel, Cornell University, Computer Science, herbertv@cs.cornell.edu. Based on many presentations.
Part 1: general. enable the delivery of localized extended services: problem statement; the OpenURL framework
Part 2: specific. the delivery of localized extended services: the SFX software
electronic scholarly information: distributed across A&I (abstracting and indexing) databases, e-prints, full text (FTXT), and the OPAC
electronic scholarly information: range of authorities and technologies (A&I, e-print, FTXT, OPAC)
electronic scholarly information: challenges regarding integrated access (A&I, e-print, FTXT, OPAC)
linking: related information across A&I, e-print, FTXT, and OPAC resources
linking: how to interlink A&I, e-print, FTXT, and OPAC resources?
[figure labels: ERL, Z39.50, URL, THIN; session management, authentication, menu system, authorization, interlinking]
reference linking full text
extended services: citation databases, Ulrich’s, full text, Medline
extended services: citation databases, full text, Ulrich’s, PubMed
extended services: Who is Who, Books in Print, Amazon.com
linking frameworks: problems: static; not context-sensitive
linking frameworks, static: Biomednet, ISI Links, Ovid links, SilverLinker, IOP. via pre-computation; requires control of the information environment; foolproof links
static links. services: static & pre-computed & foolproof
[figure: static linking service. labels: link source (PO89); linking service on a server; link bundles (1234, 5678, 9100; M337, PO89, M654; AA66, AB12, AC44); extended services (5678, AB12)]
linking frameworks, static: DOI/CrossRef, PubMed/PubRef
identification across resources
central database: unique identifiers & metadata
look-up facility: input metadata; output identifier
insertion of actionable identifiers in resources; identifiers point at a central resolver
linking frameworks: problems: not context-sensitive
localize extended services: appropriate citation databases, full text, Inspec
default links: restricted in nature; action-radius restricted by business agreements; not context-sensitive. [figure: metadata plane with resource1, resource2, resource3 and the default links between them]
linking frameworks: static or dynamic; not context-sensitive or context-sensitive
Open Journals Project: links from a single resource to a single resource (dynamic, not context-sensitive)
links from citations in journal articles to the ISI citation database
dynamic linking: insertion of links when articles are being delivered
  send the PDF article through a proxy
  on-the-fly identification of references
  on-the-fly matching of references with the citation database
  on-the-fly insertion of a link in case of a match
just-in-case linking: foolproof
computational delay
not practical for provision of multiple services
Open Citation Project: links from a single resource to multiple resources (dynamic; context-sensitiveness can be implemented)
links from references in e-prints in the Los Alamos arXiv
uses Open Journals technology
  to extract identification from references (batch)
  to prepend a URL to the identification (dynamic)
see demo at http://arabica.ecs.soton.ac.uk/
links from a single resource to multiple resources PubMed LinkOut links from PubMed to other resources static context-sensitive linking: institutions provide links to PubMed batch insertion of links into PubMed institutional users register to see appropriate links scalable?: extension of model would require provision of links to all resources long-term and large-scale management of links is problematic links from a single resource to multiple resources static context sensitive herbert van de sompel
OpenURL framework: links from multiple resources to multiple resources (dynamic, context-sensitive)
[figure: metadata plane with resource1, resource2, resource3 and the default links between them]
[figure: extended services plane (service components, appropriate links) above the metadata plane (resource1, resource2, resource3, default links); the two planes are connected by the OpenURL]
the OpenURL framework: delivery of metadata on users’ request, by value or by reference, through an OpenURL to a local service component (linking server). [figure labels: A&I, FTXT, OPAC; describes context]
the Ex Libris SFX server: one possible service component. [figure labels: A&I, OPAC, FTXT; OpenURL]
the OpenURL framework. [figure labels: user; A&I search result; OpenURL; local service component; fetch metadata (from the A&I database); fetch metadata (from another database)]
OpenURL-awareness 1. implement a technique to make a resource understand the difference between a user who has access to a service component and one who does not
OpenURL-awareness 1. CookiePusher, user profiles, DLF certificate, ...
OpenURL-awareness 2. for users with access to a service component, provide an OpenURL for each transported metadata object
OpenURL-awareness 2. OpenURL: target = service component; content = elements of the metadata, in a public syntax. http://www.sfxit.com/OpenURL
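The two OpenURL-awareness steps can be sketched as follows. This is only an illustration under stated assumptions: the cookie name `service_component` and the function `openurl_for_record` are hypothetical, since the slides name CookiePusher but do not say what it stores.

```python
from urllib.parse import urlencode

# Hypothetical cookie name; the slides do not specify what CookiePusher sets.
SERVICE_COMPONENT_COOKIE = "service_component"

def openurl_for_record(cookies, record):
    """Return an OpenURL for one metadata object, or None for users
    who have no service component (step 1 of OpenURL-awareness)."""
    base_url = cookies.get(SERVICE_COMPONENT_COOKIE)
    if not base_url:
        return None  # no service component known: show default links only
    # Step 2: target = service component, content = metadata elements.
    return base_url + "?" + urlencode(record, safe=":/")

print(openurl_for_record({}, {"issn": "1234-5678"}))
print(openurl_for_record({"service_component": "http://sfx.aaa.edu/menu"},
                         {"sid": "ERL:BX", "pid": "987641"}))
```

A resource would call such a function once per record it displays, so that only users with a service component see the extra link.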
OpenURL examples
service component = http://sfx.aaa.edu/menu
record =
  AU Smith, Paul
  ISSN 1234-5678
  VOLUME 12
  ISSUE 3
  PAGES 1-8
  PY 1998
  DBASE ERL-BIOSIS
  DBASE-AN 987641
OpenURL examples
metadata by value: OpenURL = http://sfx.aaa.edu/menu?genre=article&issn=1234-5678&volume=12&issue=3&spage=1&epage=8&date=1998&aulast=Smith&aufirst=Paul
metadata by reference: OpenURL = http://sfx.aaa.edu/menu?sid=ERL:BX&pid=987641
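As a rough sketch, the two example OpenURLs can be derived from the example record like this. The function names and the tag-to-key mapping are assumptions made for illustration; the slides only show the input record and the resulting URLs. The `sid` value is passed in explicitly because the slides do not say how `ERL:BX` is derived from `ERL-BIOSIS`.

```python
from urllib.parse import urlencode

def record_to_openurl_by_value(base_url, record):
    """Carry the metadata elements themselves in the OpenURL."""
    # "Smith, Paul" -> last and first name; "1-8" -> start and end page.
    last, first = [part.strip() for part in record["AU"].split(",", 1)]
    spage, epage = record["PAGES"].split("-", 1)
    params = [
        ("genre", "article"),  # assumed: the record describes an article
        ("issn", record["ISSN"]),
        ("volume", record["VOLUME"]),
        ("issue", record["ISSUE"]),
        ("spage", spage),
        ("epage", epage),
        ("date", record["PY"]),
        ("aulast", last),
        ("aufirst", first),
    ]
    return base_url + "?" + urlencode(params)

def record_to_openurl_by_reference(base_url, sid, record):
    """Carry only a pointer: source identifier plus the record's accession number."""
    params = [("sid", sid), ("pid", record["DBASE-AN"])]
    return base_url + "?" + urlencode(params, safe=":")

record = {"AU": "Smith, Paul", "ISSN": "1234-5678", "VOLUME": "12",
          "ISSUE": "3", "PAGES": "1-8", "PY": "1998",
          "DBASE": "ERL-BIOSIS", "DBASE-AN": "987641"}
base = "http://sfx.aaa.edu/menu"
print(record_to_openurl_by_value(base, record))
print(record_to_openurl_by_reference(base, "ERL:BX", record))
```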
OpenURL examples
service component = http://sfx.aaa.edu/menu
link = http://dx.doi.org/10.1000/6382-1
OpenURL = http://sfx.aaa.edu/menu?id=doi:10.1000/6382-1
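The DOI example amounts to rewriting a link that points at the central resolver into one that points at the user's service component. A minimal sketch, assuming the DOI is simply the path of the dx.doi.org link (the function name is hypothetical):

```python
from urllib.parse import urlencode, urlparse

def doi_link_to_openurl(base_url, doi_link):
    """Turn a resolver link such as http://dx.doi.org/<doi> into an OpenURL."""
    # The DOI is taken to be everything after the host name.
    doi = urlparse(doi_link).path.lstrip("/")
    # Keep ':' and '/' readable, as in the slide's example.
    return base_url + "?" + urlencode({"id": "doi:" + doi}, safe=":/")

print(doi_link_to_openurl("http://sfx.aaa.edu/menu",
                          "http://dx.doi.org/10.1000/6382-1"))
```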
deployment of the OpenURL framework
draft OpenURL specs early 2000
lots of interest from libraries and information providers
adopted by important information providers
OpenURL on fast-track NISO standardization, 2001
SFX server marketed by Ex Libris
DOI/CrossRef prototype
the OpenURL framework. [figure labels: A&I and FTXT resources; OpenURL; OpenURL namespace; institutional service component]
[figure: IOP. labels: doi link, proxy, handle, url]
[figure: IOP, OpenURL-aware. labels: doi link, proxy, handle, metadata, doi]
Part 2: specific. the delivery of localized extended services: the SFX software
the SFX server: one of many possible service components. [figure labels: A&I, OPAC, FTXT; OpenURL]
the SFX server 1. accept OpenURL as input
the SFX server 2. collect object-metadata:
  from the OpenURL [OpenURL parser]
  via fetch from a resource [SourceParsers], per sid & per namespace:
    fetch over http, z39.50, OAI interface, ...
    parsing of the reply
    configuration via table
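Step 2 can be pictured with a small sketch. It is not the SFX implementation: the table `SOURCE_PARSERS`, the stand-in fetch function, and the rule that values in the OpenURL override fetched ones are all assumptions made for illustration.

```python
from urllib.parse import urlparse, parse_qs

def parse_openurl(openurl):
    """OpenURL parser: return the query parameters as a flat dict (first value wins)."""
    query = parse_qs(urlparse(openurl).query)
    return {key: values[0] for key, values in query.items()}

def fetch_from_erl_biosis(pid):
    """Stand-in SourceParser. A real one would fetch the record over
    http, z39.50, an OAI interface, ... and parse the reply."""
    records = {"987641": {"genre": "article", "issn": "1234-5678",
                          "volume": "12", "issue": "3", "spage": "1",
                          "epage": "8", "date": "1998",
                          "aulast": "Smith", "aufirst": "Paul"}}
    return records[pid]

# Configuration table: which SourceParser handles which sid (hypothetical).
SOURCE_PARSERS = {"ERL:BX": fetch_from_erl_biosis}

def collect_metadata(openurl):
    """Collect object-metadata by value, and by reference when sid/pid allow it."""
    fields = parse_openurl(openurl)
    sid, pid = fields.get("sid"), fields.get("pid")
    if sid in SOURCE_PARSERS and pid is not None:
        fetched = SOURCE_PARSERS[sid](pid)
        # Assumption: elements given in the OpenURL override fetched ones.
        return {**fetched, **fields}
    return fields

print(collect_metadata("http://sfx.aaa.edu/menu?sid=ERL:BX&pid=987641"))
```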
the SFX server 3. augment object-metadata
the SFX server 4. compute relevant localized extended services [SFX database]
SFX base: relation between sources. services: dynamic & on-the-fly & not foolproof
SFX database: definition of potential services; information about the local collection; rules supporting a decision on the relevance of services
[figure: SFX database]
SOURCES: BIOSIS, arXiv, APS/PROLA, Inspec, Compendex
SERVICES: get_holdings, get_abstract, get_full_text, cited_author
TARGETS: Medline, Inspec, Springer, Sci Cit Bas, OPAC
OBJECTS: 1234-5678, 3456-8765, 9876-4321, 2389-6734
THRESHOLDS
conceptual links that will be formed (for real) if certain requirements (Thresholds) are met
SOURCES
  MEDLINE requests: getFullTxt, getTOC, getAuthor, getCitedAuthor, getAbstract, getWebService, getCitedReference, getCitedJournal
TARGETS
  Springer provides: getFullTxt, getTOC
  BIOSIS provides: getAuthor, getAbstract
  Wiley provides: getFullTxt, getTOC
  WWW Search Engines provides: getWebService
  Web of Science provides: getCitedReference, getCitedAuthor, getAuthor
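One way to read this slide: a conceptual link exists wherever a service requested by a source is provided by a target. The sketch below uses the names from the slide; the data structures and the function `conceptual_links` are hypothetical, and threshold checking is deliberately left out.

```python
# What each source requests (from the slide).
SOURCE_REQUESTS = {
    "MEDLINE": {"getFullTxt", "getTOC", "getAuthor", "getCitedAuthor",
                "getAbstract", "getWebService", "getCitedReference",
                "getCitedJournal"},
}

# What each target provides (from the slide).
TARGET_PROVIDES = {
    "Springer": ["getFullTxt", "getTOC"],
    "BIOSIS": ["getAuthor", "getAbstract"],
    "Wiley": ["getFullTxt", "getTOC"],
    "WWW Search Engines": ["getWebService"],
    "Web of Science": ["getCitedReference", "getCitedAuthor", "getAuthor"],
}

def conceptual_links(source):
    """Return (target, service) pairs where the source's request meets a target's offer."""
    requested = SOURCE_REQUESTS.get(source, set())
    return [(target, service)
            for target, services in TARGET_PROVIDES.items()
            for service in services
            if service in requested]

for target, service in conceptual_links("MEDLINE"):
    print(target, service)
```

In this data, getCitedJournal is requested by MEDLINE but provided by no target, so it yields no conceptual link.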
the SFX server 5. compute service-links [TargetParsers], per Target/Service
tools for SFX-configuration
SFXCTRL: a graphical interface for configuring Sources, Targets, and Services [where Sources and Targets meet], and for making small-scale edits to the SFX-base
TEMPLATES: basic, generic, flexible pre-configurations
SFXLOADER: a tool for object administration
STATISTICAL measures
Sources
Where a user is coming from, beginning a search
Examples: OPAC; A&I database; online journal
Sources must be OpenURL-enabled!
Setting up a source involves decisions about which sources are available to your users and which targets and services you would like to provide to them. [List of possible, typical SFX services] The librarian must decide which Sources are available in the library environment and which Services users would want.
Targets & Target/Services
Targets: where a user “lands”
Services are the facilities:
  a getAuthor service would run a search in a target on an author’s name
  a getFullTxt service would link to the full text of a particular article from a Target publisher
Targets must have a link-to syntax that supports the Service
Targets have to be set up alongside Sources (either before or after is fine). They are configured with Target Services. The librarian determines which targets can be used by the library, and uses SFXCTRL to specify which services SFX should use. For example, Springer can provide full-text services for journals to which a library has a subscription, and table-of-contents services for journals to which it does not. So here we can use SFXCTRL to configure Springer as a target, with “getFullTxt” and “getTOC” as its services. Targets and their services do not necessarily have to support the OpenURL; however, they do need a discernible link-to syntax, so that a program (a “parser”) can be written for SFX to use to create a link.
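To illustrate what a "parser" built on a link-to syntax might look like, here is a minimal sketch. The URL templates are invented (the `.example` hosts are placeholders, not the real link-to syntaxes of Springer or BIOSIS), and `service_link` is a hypothetical name.

```python
from urllib.parse import quote

# Hypothetical link-to syntaxes, one per Target/Service pair.
LINK_TO_SYNTAX = {
    ("Springer", "getFullTxt"):
        "http://springer.example/fulltext?issn={issn}&volume={volume}"
        "&issue={issue}&spage={spage}",
    ("Springer", "getTOC"):
        "http://springer.example/toc?issn={issn}&volume={volume}&issue={issue}",
    ("BIOSIS", "getAuthor"):
        "http://biosis.example/search?author={aulast}",
}

def service_link(target, service, metadata):
    """Fill the target's link-to syntax with object-metadata.
    Returns None if the pair is not configured or an element is missing."""
    template = LINK_TO_SYNTAX.get((target, service))
    if template is None:
        return None
    escaped = {key: quote(str(value), safe="") for key, value in metadata.items()}
    try:
        return template.format(**escaped)
    except KeyError:
        return None  # the metadata lacks an element the syntax needs

metadata = {"issn": "1234-5678", "volume": "12", "issue": "3",
            "spage": "1", "aulast": "Smith"}
print(service_link("Springer", "getTOC", metadata))
print(service_link("BIOSIS", "getAuthor", metadata))
```

Keeping the syntax in a table rather than in code mirrors the idea that a librarian configures Target/Service pairs without writing a new program for each one.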
Threshold administration
Thresholds help evaluate whether it is appropriate to deliver a real link based on a conceptual link:
  Objects: ISSNs in an A&I database
  Rules: the metadata needs an ISBN
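A threshold check of this kind could be sketched as below. The pairing of thresholds with particular Target/Service pairs is invented for illustration, as is the choice that a pair without thresholds always passes; the ISSNs are taken from the OBJECTS shown on the SFX database slide.

```python
def issn_in_objects(objects):
    """Object threshold: the metadata's ISSN must be in a list of objects."""
    return lambda metadata: metadata.get("issn") in objects

def needs(field):
    """Rule threshold: the metadata must contain a non-empty element."""
    return lambda metadata: bool(metadata.get(field))

# Hypothetical threshold configuration per (target, service) pair.
THRESHOLDS = {
    ("Springer", "getFullTxt"): [issn_in_objects({"1234-5678", "3456-8765"})],
    ("OPAC", "get_holdings"): [needs("isbn")],
}

def real_links(conceptual, metadata):
    """Keep the conceptual links whose thresholds are all met.
    Assumption: a pair with no configured threshold always passes."""
    return [link for link in conceptual
            if all(rule(metadata) for rule in THRESHOLDS.get(link, []))]

conceptual = [("Springer", "getFullTxt"), ("OPAC", "get_holdings"),
              ("BIOSIS", "getAbstract")]
print(real_links(conceptual, {"issn": "1234-5678"}))
```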
Templates
Material to support typical or generic scenarios
Contain: services, parsers, objects, and thresholds
Constantly updated and distributed by Ex Libris
The subscribing library “localizes” them to match its own subscriptions and conditions
Localization work might include comparing local subscription information to template information and editing thresholds where they differ, adding or removing objects, and editing target parsers.