June 2006Image LSID resolvers Image LSID Resolution Prototypes Hui Dong, Bob Morris UMass Boston.

Slides:



Advertisements
Similar presentations
© 2006 IBM Corporation Features of an Enterprise-ready Triple Store Ben Szekely June, 2006.
Advertisements

TDWG GUID-2 June 10, 2006Jessie Kennedy/Rob Gales LSID Resolution In SEEK Taxon.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
NGT Information Technology Technical Discussion Bob DeHoff Info Tech, Inc.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Databases Chapter Distinguish between the physical and logical view of data Describe how data is organized: characters, fields, records, tables,
Components and Architecture CS 543 – Data Warehousing.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 7: Planning a DNS Strategy.
Proxy Design Pattern Source: Design Patterns – Elements of Reusable Object- Oriented Software; Gamma, et. al.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
Triple Stores.
Distributed Computing COEN 317 DC2: Naming, part 1.
PHASE 3: SYSTEMS DESIGN Chapter 7 Data Design.
The NERC DataGrid Vocabulary Server: an operational system with distributed ontology potential Roy Lowry British Oceanographic Data Centre GO-ESSP 2008,
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
January, 23, 2006 Ilkay Altintas
2005 Adobe Systems Incorporated. All Rights Reserved. 1 Ontolog Forum Gunar Penikis Sr. Product Manager Adobe Systems.
IDs in and out of the database Entomological Collections Network (ECN) 2012 November 10 – 11, Knoxville, TN Debbie Paul, Greg Riccardi.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
Practical RDF Chapter 1. RDF: An Introduction
Ben Szekely, IBM Cambridge Adtech © 2006 IBM Corporation TDWG GUID WorkshopFebruary 1, 2006 LSID as a Technology Overview, Participation and Related Projects.
Architecture for Electronic Field Guides Robert A. Morris Robert D. Stevenson UMASS-Boston.
The GRIMOIRES Service Registry Weijian Fang and Luc Moreau School of Electronics and Computer Science University of Southampton.
October 8, 2015 University of Tulsa - Center for Information Security Microsoft Windows 2000 DNS October 8, 2015.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Joanne Luciano With Peter Fox and Li Ding CSCI Week 10, November.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Distributed Computing COEN 317 DC2: Naming, part 1.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
MET280: Computing for Bioinformatics Introduction to databases What is a database? Not a spreadsheet. Data types and uses DBMS (DataBase Management System)
Connecting Specimens, Images and Vocabulary Specify, Morphbank, Morphster Beach, Noble, Spears – KU Mast, Riccardi – FSU Miranker, Tirmizi UT.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Technical Team WITSML SIG Dubai - November 2008 John Shields / Gary Masters.
RDF and triplestores CMSC 461 Michael Wilson. Reasoning  Relational databases allow us to reason about data that is organized in a specific way  Data.
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Data Warehouses and OLAP Data Management Dennis Volemi D61/70384/2009 Judy Mwangoe D61/73260/2009 Jeremy Ndirangu D61/75216/2009.
DReSS Engineering a Replay Application Based on RDF and OWL Chris Greenhalgh, Andy French, Jan Humble, Paul Tennent School of Computer Science, University.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
Implementing an RDF Schema for Pathology Images, From the Association for Pathology Informatics Jules J. Berman, Ph.D., M.D. APIII, Pittsburgh, PA Monday,
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
© 2006 University of Kansas An LSID resolver for specimens and a digression into issues raised by the use of GUIDs Steve Perry
Triple Stores. What is a triple store? A specialized database for RDF triples Can ingest RDF in a variety of formats Supports a query language – SPARQL.
0 / Database Management. 1 / Identify file maintenance techniques Discuss the terms character, field, record, and table Describe characteristics.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Globally Unique Identifiers: What, why, when, which and what now? Dave Thau University of Kansas
Decentralized User Authentication in a Global File System CS294-4 Presentation Nikita Borisov October 6, 2003.
Author: Akiyoshi Matonoy, Toshiyuki Amagasay, Masatoshi Yoshikawaz, Shunsuke Uemuray.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Orion Contextbroker PROF. DR. SERGIO TAKEO KOFUJI PROF. MS. FÁBIO H. CABRINI PSI – 5120 – TÓPICOS EM COMPUTAÇÃO EM NUVEM
AA202: Performance Enhancers for Laserfiche Connie Anderson, Technical Writer.
PerfSONAR Schema and Topology Martin Swany. Schema Key Goals: Extensibility, Normalization, Readability Break representation of performance measurements.
Slug: A Semantic Web Crawler Leigh Dodds Engineering Manager, Ingenta Jena User Conference May 2006.
Suresh Krishnan Secure Proxy ND Suresh Krishnan
Apache Ignite Data Grid Research Corey Pentasuglia.
Triple Stores.
LSIDs in Taverna Daniele Turi University of Manchester
Building Search Systems for Digital Library Collections
Triple Stores.
Lecture 1: Multi-tier Architecture Overview
Prof. Bhavani Thuraisingham The University of Texas at Dallas
Triple Stores.
Triple Stores.
Presentation transcript:

June 2006Image LSID resolvers Image LSID Resolution Prototypes Hui Dong, Bob Morris UMass Boston

June 2006Image LSID resolvers Application Toy application at examples/MyJsp.jsp examples/MyJsp.jsp

June 2006Image LSID resolvers Image Lsid Resolution Servers –Projects UMB from local image store of 13,000 field photographs. (Morris, Haber, Dong) Morphbank (U. Florida) from project to document morphological characters (Rohnquist, Riccardi) U.T. Austin X-Ray CT facility to scans of paleo and very small vertebrates. (Humphrey, Mirenkar) –Huge variation in social and technical image and metadata acquisition

June 2006Image LSID resolvers Image Lsid Resolution Servers Known –UMB unedited images from skilled naturalist Metadata: exif, taxonomy, habitat, location, voucher number for type specimen of identified taxon, part imaged. 13,000 Images in folders. Mine file and folder names and correlate to checklist(s) then metadata into MySQL with generated LSID. cf. ENBI report on Imaging Type Specimens Data == ??? –Morphbank (U. Florida) from project to document morphological characters Metadata – Darwin Core plus local attributes. Automated by Contribution process –U.T. Austin X-Ray CT facility to document XRCT Metadata: automated by scan configuration

June 2006Image LSID resolvers Summary Resolution is easy. Acquiring metadata is hard.

June 2006Image LSID resolvers UMB details Services implemented on sourceforge.lsid.net Java suite: –Authority, Data, and Metadata interfaces exposed as separate web services –Omitted security service and assignment service (use adhoc assignment, not exposed. Would consider making assignment as part of the image deposit service).

June 2006Image LSID resolvers Implementation issues Triples on-the-fly were too slow. We cache them in MySQL. Could use native triple store but haven’t yet encountered any use case except that needs it in the face of a shadow SQL metadata store and a warehouse model. Most integrated apps might be easier to do with something that appears to the outside like a triple store though.

June 2006Image LSID resolvers Implementation issues Jena RDF serialization generated huge numbers of triples irrelevant to us (e.g. graph support). Result was intolerable performance so serialized with hibernate.org relational persistence framework. (Message from Mirenkar forcefully and us weakly: there are no standards for serializing naturally occurring RDBs to RDF).

June 2006Image LSID resolvers Warehousing vs distributed metadata stores Current resolution discovery scheme does not support multiple resolution services for a given LSID. Hence metadata cannot presently be distributed. Example: distributed annotation. Bill may not have authority to add annotation to Susan’s metadata store but might still have valuable annotation which should be keyed by the LSID.

June 2006Image LSID resolvers Warehousing vs distributed metadata Given some metadata values, how to find all the LSID’s that have that metadata value. Need entire metadata RDF store someplace (for each resolution service!) in order to make the query SELECT lsid WHERE metadataAttributeA(lsid) = value_b Reasonable image RDF is attributes. Reasonable personal image store is 10 5 images. This is not specific to RDF, but there is no history of supporting this kind of query at large scale.

June 2006Image LSID resolvers Interesting research problem Typical utility in applications will(?) arise from metadata containing other LSIDs. But there are no standards for querying this or for recursive resolution. That is, the embedded LSID is a proxy for more metadata + implied ontological relations. How to make resolvers accept ontological data, reason over it, and decide what recursive resolution should take place.

June 2006Image LSID resolvers Grumble LSID Launchpad doesn’t allow showing namespaces in the attribute-value pairs sourceforge.lsid.net framework does not support DDNS or some other magical multi-resolver discovery Jena rdf serialization doesn’t seem to be scalable.