Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.

Slides:



Advertisements
Similar presentations
TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
Advertisements

Progress Update Semantic Web, Ontology Integration, and Web Query Seminar Department of Computing David George.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
Interactive Systems Technical Design Seminar work: Web Services Janne Ojanaho.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Future Software Architectures Combining the Web 2.0 with the Semantic Web to realize future Web Communities Maarten Visser
1 HyCon Framework Overview Frank Allan Hansen and Bent Guldbjerg Christensen ! Run this presentation in presentation mode to watch animations.
Software Frameworks for Acquisition and Control European PhD – 2009 Horácio Fernandes.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Collections Management Museums EMu 3.1 / 3.2 – New Features EMu 3.1 / 3.2 New Features Bernard Marshall Chief Technology Officer KE Software.
Triple Stores.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Ricerca Distribuita Semantica Protocolli opensource per la condivisione di risorse online.
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
The Earth System Curator Metadata Representations Prototype Portal in Collaboration with ESMF and ESG Rocky Dunlap Spencer Rugaber Georgia Tech.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
A centre of expertise in digital information management The MEG Metadata Schemas Registry Pete Johnston, Research Officer (Interoperability),
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Information system of European Centre for Disease Prevention and Control Terminology Services László Balkányi, knowledge manager KM team / STAKS section.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Introduction to Web Services Eric Lease Morgan University Libraries of Notre Dame June 24, 2005.
© 2006 IBM Corporation Agile Planning Web UI. © 2006 IBM Corporation Agenda  Overview of APT Web UI  Current Issues  Required Infrastructure  API.
TAPIR 1.0 Renato De Giovanni, Markus Döring, Javier de la Torre October 2006.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Ricardo Pereira Software Engineer TDWG Infrastructure Project (TIP)
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
An introduction to data exchange protocols in TDWG Renato De Giovanni TDWG 2008.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
DSpace - Digital Library Software
P088; Presented in Canberra, 27 th March, 2008 GR000: Presented in Fremantle on 20 th October, 2008 GAIA RESOURCES Experiences in mobilizing biodiversity.
Web Services Martin Nečaský, Ph.D. Faculty of Mathematics and Physics Charles University in Prague, Czech Republic Summer 2014.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
Jens Hartmann York Sure Raphael Volz Rudi Studer The OntoWeb Portal.
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
Converting an Existing Taxonomic Data Resource to Employ an Ontology and LSIDS Jessie Kennedy Rob Gales, Robert Kukla.
TapirLink: Enabling the transition to TAPIR Renato De Giovanni TDWG 2007.
The overview How the open market works. Players and Bodies  The main players are –The component supplier  Document  Binary –The authorized supplier.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
VIVO architecture March 1, Major Components Vitro is a general-purpose Web-based application leveraging semantic standards VIVO is a customized.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
SysML v2 Model Interoperability & Standard API Requirements Axel Reichwein Consultant, Koneksys December 10, 2015.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
MEKON & HOBO Java Frameworks for building Ontology-Driven Applications Current use cases:  Almost (!) products:  Knowledge-driven clinical documentation.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Jessie Kennedy Rob Gales, Robert Kukla
Flanders Marine Institute (VLIZ)
Triple Stores.
SDMX IT Tools SDMX Registry
Presentation transcript:

Steven Perry Dave Vieglais

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for building scientific data networks based on RDF, OWL, and open data access protocols.

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Objective Build a data access network that… – Can handle many types objects – Is resilient to changes in data models – Refers to objects with GUIDs – Allows fast & efficient searches – Allows incremental harvesting – Simplifies creation of client software

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics RDF and OWL RDF described by OWL allows… – Machine readable controlled vocabularies – Distinction between classes and properties – Data objects as resources identified with globally unique LSIDs – Query languages to examine patterns of relationships between objects

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Framework Components Provides access to RDF data sets through multiple protocols

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Framework Components Provides access to RDF data sets through multiple protocols Libraries for building client applications

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Framework Components Provides access to RDF data sets through multiple protocols Libraries for building client applications Web-based client for accessing data on a wasabi network

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics A Simple Network

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Server Server – Stores a cached copy of source data in RDF format called a data set – Each data set is bound to one or more protocols handlers – Standard protocols include OAI, SimpleLSID, and SPARQL

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Server

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Loading Data

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Loading Data Loading RDF Data – RDF data can be loaded from one or more files directly into Wasabi – Wasabi will not assign new LSIDs – Wasabi checks to see if any data objects are new or have changed and can scan for deleted data objects

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Loading Data

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Loading Data Loading Non-RDF Data – Wasabi uses a synchronizer program to generate RDF from SQL output or delimited files – Synch program must know about your source data format – Wasabi can assign LSIDs if needed – Wasabi checks to see if any data objects are new or have changed and can scan for deleted data objects

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics OAI-PMH

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics OAI-PMH Open Archive Initiative Protocol for Metadata Harvesting – Wasabi implementation allows efficient harvesting – Supports incremental harvesting “What objects have changed since Oct ?” – Notifies clients about deletions

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics LSID Resolution

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics LSID Resolution Life Science Identifier Metadata Resolution – Wasabi supports a simple HTTP-GET LSID metadata resolution service – Supports metadata resolution “What is the RDF metadata for urn:lsid:auth.org:ns:23?” – Compliant LSID resolution through plug-in for IBM LSID resolver.

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics LSID Resolution

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics SPARQL

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics SPARQL SPARQL Protocol – SPARQL is the W3C candidate for querying RDF – SPARQL protocol bound to HTTP-GET – ASK and SELECT queries return SPARQL XML results – DESCRIBE and CONSTRUCT queries return RDF/XML results

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics SPARQL SPARQL Query Language Example – “What is urn:lsid:auth.org:person:3424?” DESCRIBE <rdf:RDF xmlns:j.0=“ xmlns:rdf:” Steven Perry

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics SPARQL SPARQL Query Language Example – “What is the genus of the specimen urn:lsid:auth.org:spec:657?” SELECT ?genus WHERE { ?txname ?txname ?genus } ?genus = “Heteractis”

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Server OAI, SPARQL, and LSID are standard protocols, so Wasabi services can be used by non-Wasabi clients.

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Client Library

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Client Library Client Library – Contains implementations of clients for protocols used by Wasabi – Can be included in projects that need to communicate with Wasabi servers – Programmatic access to services (hides XML messaging layer) – Provides status and progress listeners – Can be used to query non-Wasabi implementations of OAI or SPARQL

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Indexer

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Indexer Indexer – Harvests from 1 or more RDF sources – Sources can be Wasabi servers (via OAI) sets of RDF files, etc. – Multiple types of indices can be fed from a single set of descriptions – Indexers can filter by object type, etc. – Indexers should understand incremental updates and deletions

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Indexer

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Indexer

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Indexer

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Portal

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Portal Portal – Customizable human interface that allows access to 1 or more Wasabi servers – Default portal requires a Lucene index of harvested data. Most portal queries are against the index – To retrieve and display data objects, the portal makes repeated LSID resolution calls so servers can log access

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Wasabi Portal Portal – Portal automatically configures search forms and renderers based on downloaded OWL ontologies – Provides simple search, advanced search, ontology browsing, and export of downloaded data to CSV or RDF files

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics A More Realistic Network

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Implementation – – Java 1.5 with Spring, Jena, Lucene, and more – Server requires servlet container (Tomcat, WebLogic, etc.) – Server requires JDBC database (MySQL, PostgreSQL, etc.)

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Current State – Server, Client Library and Indexer components are feature complete – Portal is still under development – Using experimental OWL data models; awaiting TDWG ontology.

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Future Plans – Complete portal – Construct the FishNet2 network (25+ servers) – Construct the PlantCollections network (15+ servers)

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Conclusion WASABI is a framework for building scientific data networks based on RDF, OWL, and open data access protocols.

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Conclusion RDF allows us to share complex data models OWL allows machines to understand the data models and provides opportunities for extending models over time Standard protocols (OAI, LSID, & SPARQL) allow for integration across data networks and with the semantic web

W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Support Development of Wasabi is supported by the National Science Foundation as part of the Integrated Community Infrastructure (ICI) project.