Open Search David Wolber. Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P.


Open Search David Wolber

Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P knowledge Sharing Webtop Metasearch Clients

Contributors Michael Kepe Igor Ranitovic Iman Sadreddin Senior Team ’03 Ken Chong Rudd Stevens Colin Bean Tim Chan Julian Chan Pooja Garg

Information Source Explosion Google, Amazon APIs Internet Archive Technorati – The World Live Web Domain Specific: – ACM Digital Library for CS – Lexis-Nexis for law – MLA for literature

End-User Created Digital Libraries Personal Web (shared Google desktop) Personal Web Neighborhood Topic-Specific Personal Crawlers Ordinary people creating search engines as easily as web pages [Diagram labels: Personal Web, 1st Degree, 2nd Degree, Nth Degree]

Subsets of the Web

Motivation for Small, Independent Subsets of the Web Avoid information being channeled through a single portal: Googleopoly Google does no evil, but… – Censorship in China – Creeping level of commercialization – Unregulated manipulation of secret ranking algorithms (see SearchKing case) Other media are lost; this is the last frontier

Little support for using multiple search engines

Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P knowledge Sharing Webtop Metasearch Clients

Metasearch Help users discover and use digital libraries Send queries to multiple, selected search engines Filter, process, and unify results A9.com – Amazon’s metasearch
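The "send, filter, unify" idea on this slide can be sketched in a few lines. This is only an illustration, not the deck's implementation: the result shape (dicts with "url" and "title"), the `metasearch` function, and the two stand-in engines are all assumptions made for the example.

```python
from typing import Callable, Dict, List

# Assumed result shape: each engine returns dicts with "url" and "title".
Result = Dict[str, str]
Engine = Callable[[str], List[Result]]


def metasearch(query: str, engines: Dict[str, Engine]) -> List[Result]:
    """Send one query to every selected engine and unify the hits by URL."""
    unified: Dict[str, Result] = {}
    for name, search in engines.items():
        for hit in search(query):
            url = hit["url"]
            if url not in unified:
                # First time this URL is seen: keep the hit, note its source.
                unified[url] = {**hit, "sources": name}
            else:
                # Duplicate across engines: only record the extra source.
                unified[url]["sources"] += "," + name
    return list(unified.values())


# Two made-up engines standing in for real search services.
def engine_a(query: str) -> List[Result]:
    return [
        {"url": "http://a.example/1", "title": "A1"},
        {"url": "http://shared.example", "title": "Shared"},
    ]


def engine_b(query: str) -> List[Result]:
    return [{"url": "http://shared.example", "title": "Shared"}]


results = metasearch("digital libraries", {"a": engine_a, "b": engine_b})
```

Deduplicating by URL is just one possible way to "unify"; a real client would probably also rank or filter the merged list.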

Web Services Basis [Diagram: Web Page Model (server returns HTML) vs. Web Service Model (server returns XML to software)]

How does metasearch evolve? New Digital library

How does metasearch evolve? New Digital library Metasearch clients discover it

How does metasearch evolve? New Digital library Metasearch clients discover it Metasearch Programmers write adaptor/scraper

How does metasearch evolve? New Digital library Metasearch clients discover it Metasearch Programmers write adaptor/scraper User can access within metasearch SLOWLY…

Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P knowledge Sharing Webtop Metasearch Clients

Goal: Automate the Process Metasearch engines should provide users with up-to-date lists of existing digital libraries Digital libraries should be able to register and be made immediately available to all Metasearch clients. Metasearch and Library development are independent.

What is Necessary? Standard Search API – So Metasearch clients can use polymorphism to access sources:
for each source s in sourceList {
    searchEngine.endPointUrl = s.endPointUrl;
    resultList += searchEngine.keywordSearch(keywords);
}
Search API Registry – So Metasearch clients can get a dynamic list of sources
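The loop on this slide can be made runnable as a rough sketch. The names `Source`, `SearchEngineStub`, `search_all`, and `fake_transport` are hypothetical; a real client would use a stub generated from the shared service description and make a network call where the fake transport is used here.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Source:
    """One registry entry; field names are assumptions for this sketch."""
    name: str
    end_point_url: str


class SearchEngineStub:
    """Stand-in for a client stub generated from the shared search API."""

    def __init__(self, transport: Callable[[str, str], List[str]]) -> None:
        self.end_point_url = ""
        self._transport = transport

    def keyword_search(self, keywords: str) -> List[str]:
        # The same call works for every source; only the endpoint changes.
        return self._transport(self.end_point_url, keywords)


def search_all(sources: List[Source], stub: SearchEngineStub, keywords: str) -> List[str]:
    result_list: List[str] = []
    for s in sources:
        stub.end_point_url = s.end_point_url
        result_list += stub.keyword_search(keywords)
    return result_list


def fake_transport(url: str, keywords: str) -> List[str]:
    # Pretend network call: echoes where the query would have been sent.
    return [f"{url}?q={keywords}"]


sources = [
    Source("lib-one", "http://one.example/search"),
    Source("lib-two", "http://two.example/search"),
]
hits = search_all(sources, SearchEngineStub(fake_transport), "metasearch")
```

Because every source speaks the same API, adding a library means adding a row to the source list, not writing a new adaptor.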

Web Service Standards WSDL – Web Services Description Language SOAP – Simple Object Access Protocol UDDI – Universal Description, Discovery, and Integration

Standards on top of Web Services WSDL, SOAP, and UDDI are the basis for standards in many domains. – e.g., MS initiated for securities information providers Businesses agree on a standard, then client applications can use polymorphism and new businesses can register services. In this case, we want a cross-domain standard.

Open Search Architecture Open Search Protocol (OSP) – Cross-Domain: Search-related services – Not just keyword search, but citations, authorOf, etc. Open Search Registry – Based on UDDI – Can add customization, e.g., parsing to find out which search operations are implemented. – Web and web service access

Open Search Architecture [Diagram: OSP-conforming libraries register their services with the OS Registry; metasearch clients get the source list from the registry and query the libraries through OSP]

User Can Choose Sources

Open Search Protocol Keyword search Citations (inward links, outward links) AuthorOf and other associative operations… Metadata object results based on Dublin Core Restriction object for “advanced search” stuff
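One way to picture the operations and objects listed on this slide is as a small interface. Everything below is a hedged sketch, not the OSP specification: the method names, the `Restriction` fields, and the `InMemoryLibrary` class are invented for illustration. Only the `Metadata` field names (identifier, title, creator, date, description) are taken from Dublin Core elements.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Protocol, Tuple


@dataclass
class Metadata:
    """Result object; field names are a subset of the Dublin Core elements."""
    identifier: str
    title: str = ""
    creator: List[str] = field(default_factory=list)
    date: str = ""
    description: str = ""


@dataclass
class Restriction:
    """Hypothetical 'advanced search' constraints (not from the OSP spec)."""
    max_results: int = 10
    creator: Optional[str] = None


class OpenSearchLibrary(Protocol):
    """Assumed shape of the protocol's operations."""
    def keyword_search(self, keywords: str, restriction: Restriction) -> List[Metadata]: ...
    def citations(self, identifier: str, inward: bool) -> List[Metadata]: ...
    def author_of(self, creator: str) -> List[Metadata]: ...


class InMemoryLibrary:
    """Toy library that satisfies the interface above."""

    def __init__(self, docs: List[Metadata], links: List[Tuple[str, str]]) -> None:
        self.docs = {d.identifier: d for d in docs}
        self.links = links  # (citing_id, cited_id) pairs

    def keyword_search(self, keywords: str, restriction: Restriction) -> List[Metadata]:
        words = keywords.lower().split()
        hits = [
            d for d in self.docs.values()
            if all(w in (d.title + " " + d.description).lower() for w in words)
            and (restriction.creator is None or restriction.creator in d.creator)
        ]
        return hits[: restriction.max_results]

    def citations(self, identifier: str, inward: bool) -> List[Metadata]:
        if inward:   # documents that link to this one
            ids = [src for src, dst in self.links if dst == identifier]
        else:        # documents this one links to
            ids = [dst for src, dst in self.links if src == identifier]
        return [self.docs[i] for i in ids]

    def author_of(self, creator: str) -> List[Metadata]:
        return [d for d in self.docs.values() if creator in d.creator]
```

A metasearch client written against `OpenSearchLibrary` would not need to know whether it is talking to a large public engine or a personal library.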

Publishing a Library Access OSP WSDL Specification from webtop.cs.usfca.edu Generate code in language of choice Implement the search operations for the digital library Deploy the service Register with Open Search registry

Deploying an Open Search Lib. [Diagram: programmer, wsdl2java, Open Search Registry, Library server. Steps: 1. OS WSDL; 2. WSDL; 3. skeleton code; 4. deployed service; 5. registration info]

Wrapping a Library [Diagram: Metasearch Client, Open Search Wrapper (located on 3rd-party server), custom search API (e.g., Google API). Steps: 1. OSP query; 2. custom query; 3. custom result; 4. OSP result]
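The four steps in the wrapper diagram amount to an adapter. The sketch below assumes a made-up `CustomEngine` with its own method name and result shape; it does not reproduce any real vendor's API, and the OSP-side field names ("identifier", "title") are likewise assumptions.

```python
from typing import Dict, List


class CustomEngine:
    """Stand-in for a proprietary search API (not a real vendor interface)."""

    def do_search(self, q: str, start: int, count: int) -> Dict[str, List[Dict[str, str]]]:
        items = [
            {"link": f"http://example.org/{q}/{i}", "headline": f"{q} #{i}"}
            for i in range(start, start + count)
        ]
        return {"items": items}


class OpenSearchWrapper:
    """Accepts an OSP-style query and translates to and from the custom API."""

    def __init__(self, engine: CustomEngine, page_size: int = 3) -> None:
        self.engine = engine
        self.page_size = page_size

    def keyword_search(self, keywords: str) -> List[Dict[str, str]]:
        # Step 1: the OSP query arrives here from the metasearch client.
        # Steps 2 and 3: issue the custom query and receive the custom result.
        raw = self.engine.do_search(keywords, 0, self.page_size)
        # Step 4: map the custom fields onto the shared result shape.
        return [
            {"identifier": item["link"], "title": item["headline"]}
            for item in raw["items"]
        ]


wrapped = OpenSearchWrapper(CustomEngine())
osp_results = wrapped.keyword_search("wolves")
```

Since the wrapper can sit on a third-party server, the wrapped engine does not have to change or even know the protocol exists.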

Wrappers Developed at USF Google Amazon (sort of) Internet Archive Technorati Feedster

Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P knowledge Sharing Webtop Metasearch Clients

PublishMe Like Google Desktop, but shared. Periodically updates inverted index and linkbase on PC Deploys Web Service on User’s PC Auto-Registers with Open Search Registry

Metasearch with P2P Knowledge Sharing WEBTOP

Integrating Global and Personal Libraries

Motivation for Sharing Personal Webs People create knowledge every day when they bookmark, annotate, link, organize, and synthesize. Communication is a separate step which often doesn’t happen

Motivation for Sharing Personal Webs Experts Collaborative Work

Computers are designed using our brains for a model. Knowledge creation and dissemination separate. Explicit effort required to communicate. Just as we model our word processors on paper.

Additions to OSP for P2P GetFile OnLine(ip) – Handles user starting up – Dynamic IPs OffLine
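The OnLine/OffLine pair suggests a registry that tracks which personal libraries are currently reachable and at what address. A minimal sketch, assuming each peer has a stable identifier (the slide only shows `OnLine(ip)`, so `peer_id` and the `PeerRegistry` class are assumptions):

```python
from typing import Dict, Optional


class PeerRegistry:
    """Tracks the current address of each online peer."""

    def __init__(self) -> None:
        self._online: Dict[str, str] = {}

    def on_line(self, peer_id: str, ip: str) -> None:
        # Called when a user's machine starts up. Re-registering simply
        # overwrites the old address, which copes with dynamic IPs.
        self._online[peer_id] = ip

    def off_line(self, peer_id: str) -> None:
        # Removing an unknown peer is a harmless no-op.
        self._online.pop(peer_id, None)

    def end_point(self, peer_id: str) -> Optional[str]:
        return self._online.get(peer_id)


registry = PeerRegistry()
registry.on_line("alice-pc", "10.0.0.5")
registry.on_line("alice-pc", "10.0.0.9")  # same peer, new address
registry.off_line("bob-pc")               # never registered; ignored
```

A metasearch client would then query only peers for which `end_point` returns an address.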

But What About PRIVACY? The Big Question: How much of the information hidden within your personal web is hidden due to privacy concerns?

I Want you to be a Search Engine!

Overview Proliferation of Digital Libraries Metasearch and Fixed Lists of Sources Open Search Architecture PublishMe for P2P knowledge Sharing Metasearch Clients

Goal: Implement Vannevar Bush’s Association Trails View a document/thing in context History of an idea

Thinkmap-like Interface

Association Types Outward links Inward links Similar-Content links People Links – author, people referenced in paper Domain-Specific links – law citations – movie-actor Associations specified by Annotators

Webtop Tree View webtop.cs.usfca.edu

Expanding a Tree Bird’s Eye View Local/Web files integrated Follow different Associative Trails Ins of Outs of Ins, etc. Siblings Weird though, as ins and outs both expand right
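Following a trail such as "Ins of Outs of Ins" can be sketched as repeated expansion over a link list. This is an illustration only: the `follow_trail` function and the edge-list representation are assumptions, not Webtop's code. Steps are applied in list order, so "Ins of Outs of Ins" (read right to left) is written as `["in", "out", "in"]`.

```python
from typing import Iterable, List, Set, Tuple


def follow_trail(links: Iterable[Tuple[str, str]], start: str, trail: List[str]) -> Set[str]:
    """Expand from `start`, one association type per step.

    `links` holds (source, target) pairs. "out" follows links forward,
    "in" follows them backward.
    """
    links = list(links)
    frontier = {start}
    for step in trail:
        if step not in ("in", "out"):
            raise ValueError(f"unknown step: {step}")
        nxt: Set[str] = set()
        for src, dst in links:
            if step == "out" and src in frontier:
                nxt.add(dst)
            elif step == "in" and dst in frontier:
                nxt.add(src)
        frontier = nxt
    return frontier


# Tiny made-up link graph: a and c both link to b; c also links to d.
links = [("a", "b"), ("c", "b"), ("c", "d")]
siblings_of_b = follow_trail(links, "b", ["in", "out"])
```

The `["in", "out"]` trail is one reading of the slide's "Siblings": pages linked from the same pages that link to the start (the start itself is included here).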

Webtop Side Panel View

Project Status Too many bugs, Dad

Future Work Open Search Protocol – In-depth study of existing search APIs – Provide REST alternative to SOAP Metasearch development – Complete and refine existing clients – Dream up new ones Thinkmap Graph Automated Source Selection and Reputation System Page Ranking Initiate grass-roots involvement

Future Work: Documents and Things [Diagram labels: resource, associations, annotations, document (html, word, pdf), person, film, book, creative work]

Stop talking about Webtop daddy! webtop.cs.usfca.edu