DiGIR1 DiGIR Distributed Generic Information Retrieval Stan Blum, Dave Vieglais, P.J. Schwartz.

DiGIR 2 Project Goals  To define a protocol for retrieving structured data from multiple, heterogeneous databases  To build a reference implementation of said protocol

DiGIR 3 Design Goals  To use open protocols and standards, such as HTTP, XML, and UDDI, to leverage existing and emerging technologies  To de-couple the protocol, software, and semantics  To automate the establishment of a new data provider as much as possible

DiGIR 4 High-level Architecture  Protocol  Provider  Portal  Registry

DiGIR 5 Protocol  Defines request and response message formats for communication between Provider and Portal  Assumes Providers conform to a known federation schema  Remains flexible to allow for federation schema pluggability

DiGIR 6 Provider  Makes structured data available to portals  Communicates via protocol compliant messaging only  Complies with a known federation schema  Supplies meta-data to describe data classification and availability

DiGIR 7 Portal  The entry point for a “user”  Can make requests of any number of providers  Communicates via protocol compliant messaging only  Queries registry for available providers  Can determine, based on provider meta-data, whether a provider should be queried
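The last point, deciding from provider meta-data whether a provider should be queried, can be sketched as follows. This is a minimal illustration, not the DiGIR implementation: the `ProviderInfo` fields, the `select_providers` function, the schema label "darwin2", and the example.org endpoints are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ProviderInfo:
    """Hypothetical registry entry; the field names are illustrative only."""
    name: str
    endpoint: str
    schemas: set = field(default_factory=set)  # federation schemas the provider conforms to
    taxa: set = field(default_factory=set)     # coarse data classification (empty = unknown)


def select_providers(providers, schema, taxon=None):
    """Return only the providers worth querying for this request."""
    selected = []
    for p in providers:
        if schema not in p.schemas:
            continue  # provider does not conform to the requested federation schema
        if taxon is not None and p.taxa and taxon not in p.taxa:
            continue  # meta-data says the provider holds no matching data
        selected.append(p)
    return selected


registry = [
    ProviderInfo("herps", "http://example.org/herps", {"darwin2"}, {"Reptilia", "Amphibia"}),
    ProviderInfo("birds", "http://example.org/birds", {"darwin2"}, {"Aves"}),
    ProviderInfo("legacy", "http://example.org/legacy", {"other"}, set()),
]
chosen = select_providers(registry, "darwin2", taxon="Reptilia")
print([p.name for p in chosen])
```

A provider with no classification meta-data is still queried in this sketch, on the assumption that missing meta-data should not hide data from the user.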

DiGIR 8 Project Information  The DiGIR project is a collaborative effort  DiGIR is currently established as an open source project on SourceForge  Further documentation is available on the SourceForge site.  Please join us in collaborating!

DiGIR 9 Protocol Details

DiGIR 10 Protocol Details  Specified in an XML Schema (.xsd)  Intended to work in conjunction with federation schemas, also expressed as XML Schemas  Actual request and response documents are instance documents conforming to both the protocol schema and a federation schema

DiGIR 11 [Example search request; the XML markup was lost in extraction. Surviving values: search, myDiggableBipesDB, Bipes]

DiGIR 12 Request Explanation  Composed of elements from the protocol namespace (default) and the schema namespace  The header element contains information about the payload  The search element contains dbName, filter, and record specification (will also specify result format)  The filter element is effectively an XML representation of a SQL WHERE clause  This search request is for the first 50 specimen records that are genus Bipes and were found in the months of November or December.
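A request of the kind described above could be assembled roughly as follows. The element and attribute names (request, header, type, search, dbName, filter, and, equals, in, records, start, limit) are assumptions chosen to match the slide's wording, not the published protocol schema, and namespaces are omitted for brevity.

```python
import xml.etree.ElementTree as ET


def build_search_request(db_name, genus, months, start=0, limit=50):
    """Build an illustrative search request; all tag names are hypothetical."""
    request = ET.Element("request")

    # Header: information about the payload.
    header = ET.SubElement(request, "header")
    ET.SubElement(header, "type").text = "search"

    # Search: database name, filter, and record specification.
    search = ET.SubElement(request, "search")
    ET.SubElement(search, "dbName").text = db_name

    # Filter: Genus = genus AND Month IN (months), like a SQL WHERE clause.
    flt = ET.SubElement(search, "filter")
    both = ET.SubElement(flt, "and")
    equals = ET.SubElement(both, "equals")
    ET.SubElement(equals, "Genus").text = genus
    within = ET.SubElement(both, "in")
    for month in months:
        ET.SubElement(within, "Month").text = str(month)

    # Record specification: the first `limit` records.
    ET.SubElement(search, "records", start=str(start), limit=str(limit))
    return request


req = build_search_request("myDiggableBipesDB", "Bipes", [11, 12])
print(ET.tostring(req, encoding="unicode"))
```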

DiGIR 13 Filter Building LOPs (logical operators)   Can be nested COPs (comparison ops)   (multi value)

DiGIR 14 What “binds” the schemas?  The protocol schema defines various abstract types and elements: <xsd:element name="alphaSearchCondition" abstract="true" substitutionGroup="searchCondition">  A federation schema must define searchable concepts, or groups of them, as substitutable for these abstract elements or extensions of the abstract types <xsd:element name="Species" type="xsd:string" substitutionGroup="digir:alphaSearchCondition" />
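To make the binding concrete, the sketch below reads a small federation schema and groups its concepts by the abstract protocol element they substitute for. Only alphaSearchCondition and the Species declaration come from the slide; the Genus and Month declarations, the numericSearchCondition head, and the digir namespace URI are assumptions added for illustration.

```python
import xml.etree.ElementTree as ET

XSD_NS = "http://www.w3.org/2001/XMLSchema"

# Illustrative federation schema; only the Species line appears in the slides.
FEDERATION_XSD = """
<xsd:schema xmlns:xsd="http://www.w3.org/2001/XMLSchema"
            xmlns:digir="http://example.org/digir-protocol">
  <xsd:element name="Species" type="xsd:string"
               substitutionGroup="digir:alphaSearchCondition"/>
  <xsd:element name="Genus" type="xsd:string"
               substitutionGroup="digir:alphaSearchCondition"/>
  <xsd:element name="Month" type="xsd:integer"
               substitutionGroup="digir:numericSearchCondition"/>
</xsd:schema>
"""


def searchable_concepts(xsd_text):
    """Map each substitution-group head to the concepts declared for it."""
    root = ET.fromstring(xsd_text)
    groups = {}
    for element in root.findall(f"{{{XSD_NS}}}element"):
        head = element.get("substitutionGroup")
        if head is None:
            continue  # not bound to a protocol element, so not searchable
        local_name = head.split(":")[-1]  # drop the namespace prefix
        groups.setdefault(local_name, []).append(element.get("name"))
    return groups


print(searchable_concepts(FEDERATION_XSD))
```

Because the operators are declared against the abstract heads, a concept becomes usable inside them simply by naming the head in its substitutionGroup attribute.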

DiGIR 15 <xsd:complexType name="list

DiGIR 16 Why “bind” like this?  To provide data-typing (string, numeric, etc.) for various concepts within operators at an abstract level (e.g. LIKE only valid for string data; IN allows for multiples, but in a controlled fashion)  To allow for federation schemas to simply classify data as types without having to redefine/extend operators
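The typing rules above are enforced by XML Schema validation in the design described here. The same idea, expressed as a plain lookup table, might look like the sketch below; the operator names, the "alpha"/"numeric" labels, and the concept classification are assumptions for illustration.

```python
# Abstract data types each comparison operator accepts (illustrative table).
ALLOWED_TYPES = {
    "equals": {"alpha", "numeric"},
    "like": {"alpha"},             # LIKE is only valid for string data
    "in": {"alpha", "numeric"},    # IN accepts several values
}
MULTI_VALUE_OPS = {"in"}

# Hypothetical classification supplied by a federation schema.
CONCEPT_TYPES = {"Genus": "alpha", "Species": "alpha", "Month": "numeric"}


def check_condition(op, concept, values):
    """Return a list of problems; an empty list means the condition is acceptable."""
    if op not in ALLOWED_TYPES:
        return [f"unknown operator: {op}"]
    if concept not in CONCEPT_TYPES:
        return [f"unknown concept: {concept}"]
    problems = []
    kind = CONCEPT_TYPES[concept]
    if kind not in ALLOWED_TYPES[op]:
        problems.append(f"{op} is not valid for {kind} concept {concept}")
    if op in MULTI_VALUE_OPS:
        if not values:
            problems.append(f"{op} needs at least one value")
    elif len(values) != 1:
        problems.append(f"{op} takes exactly one value")
    return problems


print(check_condition("like", "Genus", ["Bip%"]))
print(check_condition("like", "Month", [11]))
print(check_condition("in", "Month", [11, 12]))
```

The federation schema only has to classify each concept; the operator rules live in one place and never need to be redefined per schema.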

DiGIR 17 Request Issues  Do we need another abstract element such as dateSearchCondition?  What information will be useful in the header?  How should we specify the format of the results? What standard formats should be offered (e.g., brief, full)?  Will tblName be part of the meta-data required of providers?  What concepts of Darwin Core 2 are searchable?

DiGIR 18 Response Prototype

DiGIR 19 Response Issues  How do we format and validate the response content?  What elements are needed for the, if any?  Do we always have diagnostics, or only if there is an error?  Should a finite set of diagnostics be created and maintained in its own XML Schema? Will there ever be a diagnostic that is specific to a federation schema?

DiGIR 20 Provider Details

DiGIR 21 Provider Details  Implemented as a web application that answers questions  Interface is not specific to a particular information domain  No state information is recorded  Each request is treated as unique and uninfluenced by previous requests  Must always generate a valid response  Consists of four key components: request handler, filter handler, result set cache, and response generator

DiGIR 22 Request Handler  Receives XML document  Validates document  Generates internal structures for further processing

DiGIR 23 Filter Handler  Holds an internal structural representation of the filter (query)  Responsible for generating a native query string for querying the database  Communicates with UDDI to obtain standard database definition  Custom configured to work with a specific database implementation
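Generating a native query string from a filter can be sketched as a recursive walk over the filter tree. The version below emits a parameterized SQL WHERE clause; the operator tags, the concept-to-column mapping in `COLUMNS`, and the `?` placeholder style are assumptions, and the UDDI lookup is left out.

```python
import xml.etree.ElementTree as ET

# Hypothetical provider configuration: federation concept -> local column name.
COLUMNS = {"Genus": "genus", "Month": "collect_month"}
LOGICAL = {"and": " AND ", "or": " OR "}


def to_where(node, params):
    """Translate one filter node into SQL text, collecting values in `params`."""
    tag = node.tag
    if tag in LOGICAL:
        # Logical operators can be nested, so recurse into each child.
        parts = [to_where(child, params) for child in node]
        return "(" + LOGICAL[tag].join(parts) + ")"
    if tag in ("equals", "like"):
        concept = node[0]
        params.append(concept.text)
        sql_op = "=" if tag == "equals" else "LIKE"
        return f"{COLUMNS[concept.tag]} {sql_op} ?"
    if tag == "in":
        # Assumes every child of <in> names the same concept.
        column = COLUMNS[node[0].tag]
        params.extend(child.text for child in node)
        marks = ", ".join("?" for _ in node)
        return f"{column} IN ({marks})"
    raise ValueError(f"unsupported operator: {tag}")


FILTER_XML = (
    "<filter><and>"
    "<equals><Genus>Bipes</Genus></equals>"
    "<in><Month>11</Month><Month>12</Month></in>"
    "</and></filter>"
)
params = []
where = to_where(ET.fromstring(FILTER_XML)[0], params)
print(where, params)
```

Keeping values out of the SQL text and in a separate parameter list avoids quoting problems when the clause is handed to a database driver.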

DiGIR 24 Result Set Cache  Contains the results of applying a query  Responsible for generating the response records in the requested format  Somewhat directly integrated with the response generator

DiGIR 25 Response Generator  Generates the response XML document  Serializes the response header information  Serializes diagnostic information  Serializes the requested subset of records
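The three serialization steps can be sketched as one function. The response layout used here (response, header, source, diagnostics, diagnostic, content, record) is an assumption; the slides leave the response format, and whether diagnostics always appear, as open issues. This sketch always emits the diagnostics element.

```python
import xml.etree.ElementTree as ET


def build_response(source, records, diagnostics, start=0, limit=50):
    """Serialize header, diagnostics, and the requested slice of records."""
    response = ET.Element("response")

    # 1. Response header information.
    header = ET.SubElement(response, "header")
    ET.SubElement(header, "source").text = source

    # 2. Diagnostic information as (code, message) pairs.
    diags = ET.SubElement(response, "diagnostics")
    for code, message in diagnostics:
        ET.SubElement(diags, "diagnostic", code=code).text = message

    # 3. The requested subset of records.
    content = ET.SubElement(response, "content")
    for row in records[start:start + limit]:
        record = ET.SubElement(content, "record")
        for concept, value in row.items():
            ET.SubElement(record, concept).text = str(value)

    return ET.tostring(response, encoding="unicode")


rows = [
    {"Genus": "Bipes", "Month": 11},
    {"Genus": "Bipes", "Month": 12},
]
xml_text = build_response(
    "http://example.org/provider", rows, [("RECORD_COUNT", "2")], limit=1
)
print(xml_text)
```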

DiGIR 26 Provider Configuration

DiGIR 27 Portal Details

DiGIR 28 Portal Details  Divided into two distinct components: a presentation layer and PortalServices  The presentation layer supports the UI and translates requests (HTTP requests from forms or links) into protocol compliant XML requests  The presentation layer also handles all display issues involving the responses, such as format, sorting, and collating  The presentation layer is envisioned to be an application server/web server implementation

DiGIR 29 Portal Details  PortalServices handles all external network activity (UDDI calls, provider calls, etc.)  PortalServices limits provider calls to those necessary based on provider meta-data  PortalServices threads provider calls for increased performance (i.e., response time)  PortalServices is envisioned to be a webapp and supporting classes running within an application server, such as Tomcat
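Threading the provider calls might look like the sketch below, written in Python for illustration even though the slides envision a Java webapp. The `query_all` function, the caller-supplied `send` callable, and the example.org endpoints are assumptions; `fake_send` stands in for a real HTTP call.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def query_all(endpoints, request_xml, send, max_workers=8):
    """Send one request to every provider concurrently.

    `send(endpoint, request_xml)` is a caller-supplied function that returns
    the provider's response text. Returns (responses, errors), both keyed
    by endpoint.
    """
    responses, errors = {}, {}
    if not endpoints:
        return responses, errors
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(send, ep, request_xml): ep for ep in endpoints}
        for future in as_completed(futures):
            endpoint = futures[future]
            try:
                responses[endpoint] = future.result()
            except Exception as exc:
                # One failing provider should not sink the whole request.
                errors[endpoint] = str(exc)
    return responses, errors


def fake_send(endpoint, request_xml):
    """Stand-in for an HTTP call, used only for this demonstration."""
    if "down" in endpoint:
        raise ConnectionError("provider unreachable")
    return f"<response from='{endpoint}'/>"


ok, failed = query_all(
    ["http://example.org/a", "http://example.org/down"], "<request/>", fake_send
)
print(sorted(ok), failed)
```

With the calls issued in parallel, the portal's response time is bounded by the slowest provider rather than by the sum of all provider latencies.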

DiGIR 30 PortalServices  RegistryAccess  ProviderCache  PortalConfig  PortalServlet  PortalRequestHandler  ProviderFilterer  Marshallers

DiGIR 31 Portal Issues  What information will be stored in UDDI about a provider?  What information will be known for communicating with a Provider (e.g., IP address, port)?  What meta-data will be provided and what are the rules for using such data for provider filtering?  What requirements are there for logging and monitoring?