Search Web Services Ralph LeVan Senior Research Scientist.

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

Z39.50 as a Web Service Ralph LeVan Research Scientist.
SRW/U for DSpace Ralph LeVan Research Scientist. What is SRW/U A Pair of HTTP-based Text Query Protocols – SRW: Search and Retrieve Web Service – SRU:
Z39.50 as a Web Service Ralph LeVan Research Scientist.
A centre of expertise in digital information management UKOLN is supported by: SRU: An overview of the SRU protocol and how it can be used.
Delivering MARC/XML records from the Library of Congress catalogue using the open protocols SRW/U and Z39.50 Mike Taylor, Index Data
Distributed Service Registries Workshop, July 2005 Slide 1 NISO Metasearch Initiative Registries Robert Sanderson Dept. of Computer Science University.
Standardizing Usage Statistics Requests with SUSHI Theodore Fons Senior Product Manager Innovative Interfaces.
Web Service Architecture
WEB SERVICES. FIRST AND FOREMOST - LINKS Tomcat AXIS2 -
An Introduction to Web Services Sriram Krishnan, Ph.D.
1 Web Services Based partially on Sun Java Tutorial at Also, XML, Java and the Future of The Web, Jon Bosak. And WSDL.
Web Service Ahmed Gamal Ahmed Nile University Bioinformatics Group
Ray Denenberg Ralph LeVan Interoperability Standards & Searching Multiple Repositories Workshop 20 March 25, 2006; Washington.
1 Understanding Web Services Presented By: Woodas Lai.
RPC Robert Grimm New York University Remote Procedure Calls.
Topics Acronyms in Action SOAP 6 November 2008 CIS 340.
OCLC Research TAI CHI Webinar 5/27/2010 A Gentle Introduction to Linked Data Ralph LeVan Sr. Research Scientist OCLC Research.
Web Services Darshan R. Kapadia Gregor von Laszewski 1http://grid.rit.edu.
Web Services Nasrullah. Motivation about web service There are number of programms over the internet that need to communicate with other programms over.
TCP/IP Protocol Suite 1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 22 World Wide Web and HTTP.
Snejina Lazarova Senior QA Engineer, Team Lead CRMTeam Dimo Mitev Senior QA Engineer, Team Lead SystemIntegrationTeam Telerik QA Academy SOAP-based Web.
Ray Denenberg Ralph LeVan Workshop 20 March 25, 2006; Washington Metasearch - the NISO Initiative.
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
Understand Web Services
Federated Searching: The ABC’s of HSE, XML, & Z39.50 Harry Samuels Product Manager Linking & Searching August 27, 2004.
Processing of structured documents Spring 2003, Part 6 Helena Ahonen-Myka.
Web server and web browser It’s a take and give policy in between client and server through HTTP(Hyper Text Transport Protocol) Server takes a request.
Introduction SOAP History Technical Architecture SOAP in Industry Summary References.
J2EE Web Fundamentals Lesson 1 Introduction and Overview
Web Architecture Dr. Frank McCown Intro to Web Science Harding University This work is licensed under a Creative Commons Attribution-NonCommercial- ShareAlike.
Grid Computing, B. Wilkinson, 20043b.1 Web Services Part II.
Chapter 6 Introduction to Web Services. Objectives By study of the chapter, you will be able to: Describe what is Web services Describe what are differences.
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Open Data Protocol * Han Wang 11/30/2012 *
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
OCLC Online Computer Library Center Interoperability Standards & Searching Multiple Repositories Ralph LeVan/OCLC Ray Denenberg/Library of Congress.
1 Web Services Web and Database Management System.
Ray Denenberg Rob Sanderson “ Key Standards Updates ” SRU Project Briefing April 4, 2006; Washington.
CNI, 4th April 2006 Slide 1 Key Standards Update: SRU (“Technical” Details) Dr. Robert Sanderson Dept. of Computer Science University of Liverpool
Digital libraries and web- based information systems Mohsen Kamyar.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Kemal Baykal Rasim Ismayilov
SOAP-based Web Services Telerik Software Academy Software Quality Assurance.
Advanced Web Technologies Lecture #4 By: Faraz Ahmed.
SRW/U: Re-Introduction SRW is a Web Services based Information Retrieval Protocol Motivations: Create an easy to implement protocol with the power of Z39.50.
1 G52IWS: Web Services Chris Greenhalgh. 2 Contents The World Wide Web Web Services example scenario Motivations Basic Operational Model Supporting standards.
Stages to Services (1) Web processes perform work  Here a CGI script controls programs which analyse bibliographic data in a PDF document.
Web Services Martin Nečaský, Ph.D. Faculty of Mathematics and Physics Charles University in Prague, Czech Republic Summer 2014.
Web Technologies Lecture 10 Web services. From W3C – A software system designed to support interoperable machine-to-machine interaction over a network.
Chair: Kevin Gamiel Abstract: How are Z39.50 URLs (Z39.50s and Z39.50r) being used? What purposes should a Z39.50 url serve? Are these existing definitions.
Z39.50 and the ZING Initiatives: MAVIS Users Conference, 2003 November 6, 2003 Larry E. Dixson Library of Congress.
Digital libraries research IG Cataloging and metadata IG Web services and metadata switch February 2003 Web services and metadata switch February 2003.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
SOAP, Web Service, WSDL Week 14 Web site:
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Windows Communication Foundation and Web Services
Sabri Kızanlık Ural Emekçi
z/Ware 2.0 Technical Overview
WEB SERVICES.
Unit – 5 JAVA Web Services
CHAPTER 3 Architectures for Distributed Systems
Windows Communication Foundation and Web Services
WEB API.
Introduction to Web Services and SOA
QoS Metadata Status 106th OGC Technical Committee Orléans, France
Presentation transcript:

Search Web Services Ralph LeVan Senior Research Scientist

OASIS Search Web Services Technical Committee To define Search and Retrieval Web Services, combining various current and ongoing web service activities.

OASIS Search Web Services TC Ray DenenbergLibrary of CongressCo-Chair Matthew DoveyJISC ExecutiveCo-Chair Larry DixsonLibrary of CongressVoting Member Janifer GatenbyOCLCVoting Member Ralph LeVanOCLCVoting Member Ashley SandersUniv. of ManchesterVoting Member Robert SandersonUniv. of LiverpoolVoting Member Sri GopalanBooz Allen HamiltonMember MacKenzie SmithM.I.T. Member

Who is OASIS OASIS is a non-profit, international consortium that creates interoperable industry specifications based on public standards such as XML and SGML. The ebXML suite of standards is probably their most famous product

Why are we there? We were hoping to reach a broader audience than we normally see in NISO We were hoping that there would be synergies with the other XML-based standards groups. After all, most of them have searching requirements.

Where Weve Come From Pros, Cons and What Weve Learned Z39.50 SRW/U OpenSearch

Z39.50 Pros High Functionality High Interoperability Cons Complicated Binary encoding over raw tcp/ip Lesson Learned Theres a need for a high functionality interface If people are desperate enough, theyll do anything

SRU Pros XML-based web service High Interoperability Cons Still complicated (but much less than Z39.50!) Unheard of outside the library community Lesson Learned Theres still a need for a high functionality interface If people arent desperate, theyll live with what theyve got

OpenSearch Pros Simple Moderate Interoperability Cons Low Functionality Lesson Learned Theres a need for a simple low functionality interface Developers prefer to do as little as possible

What Were Doing CQL 1.2 SRU 2.0 Abstract Protocol Definition Binding to HTTP Get Binding to SRU 1.2 Binding to OpenSearch SWS Description Language

CQL 1.2 This is the path to actually standardize CQL Enhances a couple of features (sort and proximity and the CQL Context Set)

SRU 2.0? I wish I had something to say here, but its mostly on the todo list and the SWS Description Language has more traction in the committee.

Abstract Protocol Description (APD) This document is an abstract protocol definition for the Search Web Services (SWS) searchRetrieve operation. It presents the model for the SearchRetrieve operation and is also intended to serve as a guideline for the development of application protocol bindings (hereafter bindings, see definitional note).definitional note A binding describes the capabilities and general characteristic of a server or search engine, and how it is to be accessed. A binding may describe a class of servers via a human-readable document or a binding may be a machine-readable file describing a single server, provided by that server, according to the description language described at xxx, which is a fundamental component of the SWS standard

APD Data Model A server exposes a datastore for access by a remote client for purposes of search and retrieval. The datastore is a collection of units of data. Such a unit is referred to as an item in this model. For purposes of this model there is a single datastore at any given server. Associated with a datastore are one or more formats that may be used for the transfer of items from the server to the client. Such a format is referred to as an item type in this model. An item type represents a common understanding shared by the client and server of the information contained in the items of the datastore, to allow the transfer of that information. The item type identifies an abstract representation of the information. It does not represent nor does it constrain the internal representation or storage of that information at the server

APD Processing Model A client sends a searchRetrieve request to a server, which responds with a searchRetrieve response. The request includes a search query to be matched against the items at the servers datastore. The server processes the query, creating a result set (see Result Set Model) of items that match the query.Result Set Model The request also indicates the desired number of items to be included in the response and includes information about how the individual items in the response, as well as the response at large, are to be formatted. The response includes items from the result set, diagnostic information, and a result set identifier that the client may use in a subsequent request to retrieve additional items.

APD Result Set Model This is a logical model; support of result sets is not assumed nor required by this standard From the client's point of view, the result set is a set of items each referenced by an ordinal number, beginning with 1. The client may request a given item from a result set according to a specific format. For example the client may request item 1 in Dublin Core, and subsequently request item 1 in MODS. The format in which items are supplied is not a property of the result set, nor is it a property of the requested items as a member of the result set; the result set is simply the ordered list of items.

APD Request Parameters Abstract Parameter Name Description responseType e.g. 'text/html', application/atom+xml, application/x+sru queryThe search query of the request. startPositionThe position within the result set of the first item to be returned. maximumItemsThe number of items requested to be returned. itemTypee.g. string, jpeg, dc, iso2709. From list provided by server. sortOrderThe requested order of the result set.

APD Response Parameters Abstract Element NameDescription numberOfItems The number of items matched by the query. resultSetIdThe identifier for the result set created by the query. itemsa sequence of items. nextPositionThe next position within the result set following the final returned item. DiagnosticsError message and/or diagnostics. echoedSearchRetrieveRequestThe server may echo the request back to the client.

HTTP Get Binding Syntax The client sends a request via the HTTP GET method Specifically it is an HTTP URL of the form: ? Encoding Convert the value to UTF-8. Percent-encode characters as necessary within the value. Construct a URI from the parameter names and encoded values.

SRU 1.2 Binding The APD + the HTTP Get Binding + new request parameters (operation, version, recordPacking resultSetTTL stylesheet extraRequestData) – unused base parameters (responseType, sortOrder) + new response elements (version, resultSetIdleTime, extraResponseData) and a shiny XML encoding.

SWS Description Language What do we think weve learned? Developers are tired of being told how to do their business! Unless they have a business reason to worry about interoperability, they wont. Third party interoperability needs to be something they can add on when they do discover they need it. Better yet, let someone else add it on.

Prescriptive vs Descriptive Standards A prescriptive standard (Z39.50, SRU and the response part of OpenSearch) causes interoperability by telling you how to construct your interface, allowing for simple clients that know how to talk to you. The hard work of interface is done by the server. A descriptive standard (WSDL and the request part of OpenSearch) causes interoperability by allowing you to describe your interface in such a way that clients can be created dynamically to talk to you. The hard work of interface is done by the client.

Who Wants This? Anyone who wants access to content that doesnt adhere to any search standards: Web 2.0 and NISO Metasearch! Anyone with content to provide who doesnt know what clients might want to search that content

Essentially OpenSearch…

… On Steroids! <form action="/wzgw" method="get name="Copac Quick Search">

… On Steriods (cont.) <regexp regexp="<span id="num_hits">([0-9]+)<"/>

P.S., Bibliographic Context Set Anyone? SRU depends on context sets. The SRU Editorial Board recognizes the need for context set for bibliographic searching (equivalent to Bib-1 in the Z39.50 universe). But, they dont feel that they are the appropriate body. Anyone in the NISO community interested?

Questions?