Federated Searching: The ABC’s of HSE, XML, & Z39.50 Harry Samuels Product Manager Linking & Searching August 27, 2004.

Topics
• The Challenge of Federated Searching
• Z39.50
• XML Gateways
• HTTP Searching
• So, Where Are We Now?
• The Future
  - SRW/SRU
  - NISO Metasearch Initiative
  - The Generic XML Gateway API

The Challenge of Federated Searching
• To execute a federated search, one needs a protocol or mechanism for searching each electronic resource of interest
• But one protocol does not fit all in the federated search environment: different electronic resources require different mechanisms
• The challenge is to figure out how each electronic resource can be searched and to have the right mechanism in place for each situation

Z39.50
• The protocol we love to hate
• Z39.50 is the oldest of the commonly used search mechanisms
• Almost every integrated library system can be searched using Z39.50
• Despite its issues, Z39.50 provides a fairly dependable mechanism for searching

Z39.50
• The main problem with Z39.50 is that very few content providers implemented it
• But it is the content of the commercial providers that we really want to search from our federated search systems

XML Gateways
• Enter the XML gateway
• But first of all, what does XML gateway mean?
• As in Z39.50, there must be an XML gateway client that transmits search queries and accepts results
  - This is the part of the XML gateway that is in the federated search system
• There must also be an XML gateway server that responds to search queries
  - This is the part of the XML gateway that is at the content provider site

XML Gateways
• An XML gateway client sends a search query over HTTP
• The query is (1) packaged into the query string of a URL or (2) packaged into an XML document that is posted to the resource
• Regardless of how the query is packaged, the results are sent back in an XML document over HTTP
• The use of XML in at least one of the steps gave rise to the name XML gateway
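
As a rough sketch of the two packaging options on this slide, the Python below builds the same query both ways and reads an XML result. Nothing here comes from a real gateway: the endpoint `http://gateway.example.org/search`, the parameter names `query` and `max`, and the element names `searchRequest`, `results`, `record` and `title` are all invented for illustration, since every XML gateway defines its own.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical gateway endpoint; each real gateway defines its own.
GATEWAY_URL = "http://gateway.example.org/search"


def build_get_url(terms, max_records=10):
    """Option 1: package the query into the query string of a URL."""
    return GATEWAY_URL + "?" + urlencode({"query": terms, "max": max_records})


def build_post_body(terms, max_records=10):
    """Option 2: package the query into an XML document to be POSTed."""
    root = ET.Element("searchRequest")  # element names are assumptions
    ET.SubElement(root, "query").text = terms
    ET.SubElement(root, "max").text = str(max_records)
    return ET.tostring(root, encoding="unicode")


def parse_titles(xml_text):
    """Either way, results come back as XML; pull out the record titles."""
    root = ET.fromstring(xml_text)
    return [rec.findtext("title", default="") for rec in root.iter("record")]


# Made-up response standing in for what a gateway server might return.
SAMPLE_RESPONSE = """<results>
  <record><title>Sample Title One</title></record>
  <record><title>Sample Title Two</title></record>
</results>"""

if __name__ == "__main__":
    print(build_get_url("federated searching"))
    print(build_post_body("federated searching"))
    print(parse_titles(SAMPLE_RESPONSE))
```

Only the request side differs between the two options; the same `parse_titles` step handles the response in both cases.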

XML Gateways
• XML gateways provide an alternative mechanism for searching an electronic resource
• Every XML gateway is different, and every XML gateway requires special programming or special configuration
• As electronic resource providers implement search mechanisms, they are implementing XML gateways and not Z39.50 servers
• XML gateways are the future; the world of electronic resources and federated searching just needs to catch up

HTTP Searching
• Z39.50 was implemented by very few content providers and XML gateways are just now catching on, so how do we search everything else?
• The same way a user does…
• The federated search system pretends to be a user sitting at a web browser: it simulates the actions of a human user by generating URLs that the electronic resource understands, and then extracts the information from the web pages that are returned
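
The idea of generating a URL and then extracting information from the returned page can be sketched as follows. This is a minimal illustration, not how the HSE is implemented: the address `http://resource.example.org/search.cgi`, the `q` parameter, and the `class="hit"` markup are all assumptions, and the HTML is a canned sample rather than a live page.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode

# Hypothetical search form address of an electronic resource.
SEARCH_PAGE = "http://resource.example.org/search.cgi"


def build_search_url(terms):
    """Generate the URL a browser would request when a user submits the form."""
    return SEARCH_PAGE + "?" + urlencode({"q": terms})


class HitExtractor(HTMLParser):
    """Collect the text of every <a class="hit"> link on a results page."""

    def __init__(self):
        super().__init__()
        self.hits = []
        self._in_hit = False

    def handle_starttag(self, tag, attrs):
        # The class name "hit" is an assumption about the page's markup.
        if tag == "a" and dict(attrs).get("class") == "hit":
            self._in_hit = True
            self.hits.append("")

    def handle_data(self, data):
        if self._in_hit:
            self.hits[-1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_hit = False


def extract_hits(html_text):
    """Screen-scrape the result titles out of a returned web page."""
    parser = HitExtractor()
    parser.feed(html_text)
    parser.close()
    return [hit.strip() for hit in parser.hits]


# Canned page standing in for what the resource might send back.
SAMPLE_PAGE = """<html><body>
<ul>
  <li><a class="hit" href="/rec/1">First sample record</a></li>
  <li><a class="hit" href="/rec/2">Second sample record</a></li>
</ul>
<a href="/help">Help</a>
</body></html>"""

if __name__ == "__main__":
    print(build_search_url("xml gateway"))
    print(extract_hits(SAMPLE_PAGE))
```

Note that the extractor depends entirely on the page's markup: if the site renamed the `hit` class, `extract_hits` would return an empty list without raising any error.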

HTTP Searching
• This is possible because almost all electronic resources are accessed over the web
• At Endeavor, we simply call the HTTP Search Engine the HSE
• It is capable of searching hundreds of web sites and databases that are inaccessible via Z39.50 or XML gateways
• Some federated search engines use HTTP searching as the preferred search mechanism

HTTP Searching
• Despite its reach, there are issues with HTTP searching
• It usually cannot retrieve a large set of metadata in its result sets
• If the user interface of an electronic resource changes, the HSE connector for that resource usually breaks, so HTTP searching is fragile and requires constant maintenance

So, Where Are We Now?
• Adoption of Z39.50 has stalled
• XML gateway adoption is in the early stages and many content providers do not yet have them
• HTTP searching can search far more resources than Z39.50 or XML gateways, but it is fragile and usually does not retrieve a robust set of metadata

The Future
• SRW/SRU
• NISO Metasearch Initiative
• The Generic XML Gateway API

SRW/SRU
• The next generation of Z39.50 over the web
• “Search and Retrieve Web Service (SRW) and Search and Retrieve URL Service (SRU) are Web Services-based protocols for querying databases and returning search results.” (Eric Lease Morgan)
• SRW/SRU is a kind of XML gateway that holds the promise of a standard XML gateway protocol
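
To give a feel for what an SRU request looks like, here is a small sketch that builds a `searchRetrieve` URL and reads the hit count from a response. The parameter names (`operation`, `version`, `query`, `startRecord`, `maximumRecords`) and the response namespace follow SRU 1.1; the server address `http://sru.example.org/catalog` is hypothetical, the sample response is hand-written, and the query assumes the server supports the `dc.title` index.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical SRU server address.
SRU_BASE = "http://sru.example.org/catalog"
# XML namespace used by SRW/SRU 1.1 responses.
SRW_NS = {"zs": "http://www.loc.gov/zing/srw/"}


def build_sru_url(cql_query, max_records=10, start_record=1):
    """Package a CQL query into an SRU searchRetrieve URL."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,
        "startRecord": start_record,
        "maximumRecords": max_records,
    }
    return SRU_BASE + "?" + urlencode(params)


def count_records(response_xml):
    """Read the total hit count from a searchRetrieveResponse document."""
    root = ET.fromstring(response_xml)
    return int(root.findtext("zs:numberOfRecords", namespaces=SRW_NS))


# Hand-written response fragment for illustration only.
SAMPLE_RESPONSE = """<zs:searchRetrieveResponse xmlns:zs="http://www.loc.gov/zing/srw/">
  <zs:version>1.1</zs:version>
  <zs:numberOfRecords>42</zs:numberOfRecords>
</zs:searchRetrieveResponse>"""

if __name__ == "__main__":
    print(build_sru_url('dc.title = "federated searching"'))
    print(count_records(SAMPLE_RESPONSE))
```

Because the parameter names and response structure are standardized, the same client code could in principle be pointed at any SRU server by changing only the base address.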

NISO Metasearch Initiative
“NISO's metasearch Initiative will identify, develop, and frame the standards and other common understandings that are needed to enable an efficient and robust information environment. The goal of NISO's Metasearch Initiative is to enable:
• metasearch service providers to offer more effective and responsive services
• content providers to deliver enhanced content and protect their intellectual property
• libraries to deliver services that distinguish their services from Google and other free web services.”

The Generic XML Gateway API
• We couldn’t wait…
• ENCompass already had an XML gateway search infrastructure
• From that infrastructure, we created a generic gateway and documented it
• It is freely available to Endeavor customers
• When content providers ask us how to build an XML gateway, we share the specification with them

Questions?