Ralph LeVan Research Scientist

Slides:



Advertisements
Similar presentations
Terminology Services Ralph LeVan Senior Research Scientist OCLC.
Advertisements

WorldCat Registry Don Hamparian Portfolio Manager, OCLC Web Services OCLC.
Chungnam National University DataBase System Lab
Dublin Core for Digital Video: Overview of the ViDe Application Profile.
SSA Query Interface M. Dolensky, ESO Data Access Layer Working Group Interoperability Workshop, Pune, India 27-Sep-2004.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research code4lib 2006 Oregon State University, Corvallis, Oregon 15 February 2006.
Z39.50 as a Web Service Ralph LeVan Research Scientist.
Theo van Veen, Koninklijke Bibliotheek The European Library: opportunities for new services.
OCLC Research April 2008 Terminology Services Experimental Services for Controlled Vocabularies.
Implementing Effective Metadata Brian Lavoie Office of Research OCLC Online Computer Library Center, Inc. Intranets 99, San Francisco April 27, 1999.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research DLF Fall Forum Charlottesville, Virginia 8 November 2005.
SRW/U for DSpace Ralph LeVan Research Scientist. What is SRW/U A Pair of HTTP-based Text Query Protocols – SRW: Search and Retrieve Web Service – SRU:
Z39.50 as a Web Service Ralph LeVan Research Scientist.
WorldCat Registry Karen A. Coombs Product Manager, Developer Network WorldCat Mashathon UK Thursday, 13 May 2010 Liver & Mash.
Search Web Services Ralph LeVan Senior Research Scientist.
OCLC Online Computer Library Center SRW & DSpace Ralph LeVan OCLC Research.
OCLC Online Computer Library Center SRW & OAI Ralph LeVan OCLC Research.
SRU and CQL Ralph LeVan Senior Research Scientist OCLC.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research code4lib 2006 Oregon State University, Corvallis, Oregon 15 February 2006.
WorldCat Registry Don Hamparian Portfolio Manager, OCLC Web Services OCLC.
OCLC Online Computer Library Center Parallel Text Searching on a Beowulf Cluster using SRW Ralph LeVan OCLC Research.
A centre of expertise in digital information management UKOLN is supported by: SRU: An overview of the SRU protocol and how it can be used.
CQL – a Common Query LanguageMike Taylor Implementing SRW/U and CQL: Tools 1. Implementing a simple SRU client 2. Implementing serious SRW and SRU clients.
Delivering MARC/XML records from the Library of Congress catalogue using the open protocols SRW/U and Z39.50 Mike Taylor, Index Data
ZeeRex – an Explain Mechanism for SRW/UMike Taylor ZeeRex – an Explain Mechanism for SRW/U 1. What ZeeRex is 2. How we got where we.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research Distributed Service Registry Workshop Warwick, UK 14 July 2005.
ArrayExpress Query Interface Gonzalo Garc í a Lara January, / 24.
Ray Denenberg Ralph LeVan Interoperability Standards & Searching Multiple Repositories Workshop 20 March 25, 2006; Washington.
Z39.50 and the Web ZIG July 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
CIS 375—Web App Dev II SOAP.
XML & Data Structures for the Internet Yingcai Xiao.
Applications and Tools Jaci Mize NOAA/NESDIS/NODC/NCDDC Rob Dollison USGS.
Searching Digital Content via SRU Ryan Scherle Randall Floyd October 25, 2006.
ZML  ZNG  ZING “Z39.50 over XML”  “Z39.50 Next Generation”  “Z39.50-international: Next Generation”
Implementation of One Stop Search by XSLT By Dave Low University of Hong Kong 9-Dec-2003.
Z39 Gate DigiTool Version 3.0. Z39 Gate 2 z39 gate Introduction DigiTool contains a program called z39 gate which communicates with remote z39 targets/servers.
OCLC Research TAI CHI Webinar 7/1/2010 OCLC Open Source Linked Data Framework Ralph LeVan Sr. Research Scientist OCLC Research.
VxOware Progress Report August How to create a new section? Configure section –Create metadata structure (template) –Create elements map for web.
Z39.50, XML & RDF Applications ZIG Tutorial January 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
WaveMaker Visual AJAX Studio 4.0 Training Studio Overview.
Developing Interfaces and Interactivity for DSpace with Manakin Part 2: Technical and Conceptual Overview of Dspace and Manakin Eric Luhrs Digital Initiatives.
Access 2008 Using WorldCat Grid Services in Library Applications Roy Tennant Senior Program Officer OCLC Research.
Open Archives Iniative – Protocol for Metadata Harvesting Iztok Kavkler, University of Ljubljana Some slides by Stefaan Ternier, KUL Bram Vandenputte,
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
Z39.50 for Finding It All William E. Moen School of Library and Information Sciences Texas Center for Digital Knowledge University of North Texas Denton,
ELAG 2004  Work Shop on ZING Bill Oldroyd, Animator British Library Janifer Gatenby, Scribe OCLC PICA, Leiden, Netherlands.
Project Overview Bibliographic merging, Endeca, and Web application.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research Presented by Stu Weibel CERN OAI4 Geneva, Switzerland 20 October 2005.
A Web Services Approach for Search and Retrieve The Next Generation Z39.50 Access 2004, October 13-16, 2004, Halifax, Nova Scotia William E. Moen School.
Bulk Metadata Structures in CERA Frank Toussaint, Michael Lautenschlager Max-Planck-Institut für Meteorologie World Data Center for Climate.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Introduction to Web Services Eric Lease Morgan University Libraries of Notre Dame June 24, 2005.
1 JISC IE Metadata Schema Registry Technical Update 23 November 2004 Dave Beckett.
WDC-MARE – World Data Center for Marine Environmental Sciences Data portal based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler,
OCLC Online Computer Library Center Interoperability Standards & Searching Multiple Repositories Ralph LeVan/OCLC Ray Denenberg/Library of Congress.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
SRW/U: Re-Introduction SRW is a Web Services based Information Retrieval Protocol Motivations: Create an easy to implement protocol with the power of Z39.50.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Martin Kruliš by Martin Kruliš (v1.1)1.
Next Generation Z39.50 A Web Services Approach for Search and Retrieve 6 th Annual State GILS Conference, March 31 – April 3, 2004, Raleigh, NC William.
Z39.50 and the ZING Initiatives: MAVIS Users Conference, 2003 November 6, 2003 Larry E. Dixson Library of Congress.
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
JAFER Toolkit Project Oxford University 1 JAFER Java-based high level Z39.50 toolkit Matthew Dovey; Colin Tatham; Antony Corfield; Richard Mawby Oxford.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
Web Services Overview Thomas Hickey. 2 What are Web Services? Machine-to-machine communication Run over standard Web protocols –XML syntax, HTTP packaging.
XML in Web Technologies
How the VIAF Magic Happens
SRU & CQL Ralph LeVan Senior Research Scientist, OCLC
Presentation transcript:

Ralph LeVan Research Scientist levan@oclc.org SRU and Lucene Ralph LeVan Research Scientist levan@oclc.org

SRU Overview A Simple Web Service Supports Search and Retrieve Supports REST-ful and SOAP requests Responses are always XML records Supports Search and Retrieve Uses a Standard Query Grammar Supports Self-Configuring Clients A Gateway to Local Databases

SRU Features Explain Records CQL Query Grammar Persistent Result Sets XML Database Records Returned Index Browses Stylesheets

Explain Records serverInfo databaseInfo metaInfo indexInfo schemaInfo configInfo

serverInfo Generated Automatically host port database

databaseInfo From SRWDatabase.props Provided Automatically databaseInfo.title databaseInfo.description databaseInfo.contact Provided Automatically implementation

metaInfo From SRWDatabase.props metaInfo.dateModified metaInfo.aggregatedFrom metaInfo.dateAggregated

indexInfo Generated Automatically From SRWDatabase.props “local” index set and Lucene index names From SRWDatabase.props qualifier.<indexSet>.<indexName> = <LuceneIndexName> Used only if you want to map other index names to your Lucene indexes (e.g. qualifier.dc.identifier=id)

schemaInfo Generated Automatically From SRWDatabase.props LuceneDocument From SRWDatabase.props xmlSchemas=<list of name> <schemaName>.identifier= <schemaName>.location= <schemaName>.namespace= <schemaName>.title= [<schemaName>.transformer=] [<schemaName>.resolver=]

schemaInfo Example xmlSchemas=LuceneDocument, DC LuceneDocument.identifier=info:srw/schema/1/LuceneDocument LuceneDocument.location=http://www.oclc.org/standards/Lucene/schema/LuceneDocument.xsd LuceneDocument.namespace=http://www.oclc.org/LuceneDocument LuceneDocument.title=Lucene Demo Database records in their internal format

schemaInfo Example (cont.) DC.identifier=info:srw/schema/1/dc-v1.1 DC.location=http://www.loc.gov/zing/srw/dc-schema.xsd DC.title=DC: Dublin Core Elements DC.transformer=LuceneToDC.xsl

configInfo Generated Automatically From SRWDatabase.props maximumRecords (20) numberOfRecords (10) resultSetTTL (300) From SRWDatabase.props configInfo.maximumRecords configInfo.numberOfRecords configInfo.resultSetTTL

CQL Query Grammar Builtin: BasicLuceneQueryTranslator CqlQueryTranslator Query makeQuery(CQLNode cn); Term getTerm(); From SRWDatabase.props SRWLuceneDatabase. CqlToLuceneQueryTranslator= <ClassName>

Persistent Result Sets Builtin: LuceneQueryResult

XML Database Records Builtin: BasicLuceneRecordResolver RecordResolver Void init(Properties props); Record resolve(Document doc, String IdFieldName, ExtraDataType extraDataType) From SRWDatabase.props <schemaName>.resolver=<ClassName> SRWLuceneDatabase.idFieldName= <FieldName>

Index Browses Builtin: SRWLuceneDatabase.getTerms()

Stylesheets From SRWDatabase.props explainStyleSheet= /SRW/explainResponse.xsl scanStyleSheet=/SRW/scanResponse.xsl searchStyleSheet= /SRW/searchRetrieveResponse.xsl

Making the Magic Happen Drop the SRWLucene.war into your <tomcat>/webapps directory Restart Tomcat Edit <tomcat>/webapps/SRWLucene/WEB-INF/classes/SRWServer.props

Sample SRWServer.props db.LuceneDemoDB.class= ORG.oclc.os.SRW.Lucene.SRWLuceneDatabase db.LuceneDemoDB.home= f:/lucene-2.0.0 db.LuceneDemoDB.configuration= SRWDatabase.props

Sample SRWDatabase.props databaseInfo.title=Lucene Demo Database databaseInfo.description=An index of the source code for Lucene databaseInfo.contact=Ralph LeVan (levan@oclc.org) qualifier.cql.serverChoice=contents explainStyleSheet=/SRWLucene/explainResponse.xsl scanStyleSheet=/SRWLucene/scanResponse.xsl searchStyleSheet= /SRWLucene/searchRetrieveResponse.xsl

Resources http://www.oclc.org/research/software/srw http://staff.oclc.org/~levan/SRWLuceneSource.jar http://staff.oclc.org/~levan/SRWLucene.war http://staff.oclc.org/~levan/Implementing%20an%20SRWLuceneDatabase.doc http://staff.oclc.org/~levan/SRU%20and%20Lucene.ppt http://alcme.oclc.org/srw/SRUServerTester.html