Bioimage database architecture and infrastructure 2005, Bio-ITR, UCSB.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Interfacing processing and visualization tools: FIPS to Slicer3 and the QueryAtlas.
SWE 316: Software Design and Architecture
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
New Approaches to GIS and Atlas Production Infrastructure for spatial data integration: across scales and projects Ilya Zaslavsky David Valentine San Diego.
16 months…. The Visibility Information Exchange Web System is a database system and set of online tools originally designed to support the Regional Haze.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
CASE Tools CIS 376 Bruce R. Maxim UM-Dearborn. Prerequisites to Software Tool Use Collection of useful tools that help in every step of building a product.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Center for Bioimaging Informatics UCSB Bio-imaging Infrastructure December 2006 Center for Bioimaging Informatics Supported by NSF.
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
Overview of Search Engines
Development of mobile applications using PhoneGap and HTML 5
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Professional Informatics & Quality Assurance Software Lifecycle Manager „Tools that are more a help than a hindrance”
31 January 2007Craig E. Ward1 Large-Scale Simulation Experimentation and Analysis Database Programming Using Java.
Slide 1 Copyright © 2003 Encapsule Systems, Inc. Hyperworx Platform Brief Modeling and deploying component software services with the Hyperworx™ platform.
GIS technologies and Web Mapping Services
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
Fundamentals of Database Chapter 7 Database Technologies.
BIRN Update Carl Kesselman Professor of Industrial and Systems Engineering Information Sciences Institute Fellow Viterbi School of Engineering University.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
University of Illinois at Urbana-Champaign A Unified Platform for Archival Description and Access Christopher J. Prom, Christopher A. Rishel, Scott W.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Atlas Interoperablity I & II: progress to date, requirements gathering Session I: 8:30 – 10am Session II: 10:15 – 12pm.
FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
May 2003National Coastal Data Development Center Brief Introduction Two components Data Exchange Infrastructure (DEI) Spatial Data Model (SDM) Together,
INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Worldwide Protein Data Bank wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
WebDat: A Web-based Test Data Management System J.M.Nogiec January 2007 Overview.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Biomedical Informatics Research Network The BIRN Architecture: An Overview Jeffrey S. Grethe, BIRN-CC 10/9/02 BIRN All Hands Meeting 2002.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
 INDEX  Overview.  Introduction.  System Requirement.  Features Of SQL.  Development Process.  System Design (SDLC).  Implementation.  Future.
Chapter 13 Web Application Infrastructure
CIS 375 Bruce R. Maxim UM-Dearborn
Database System Concepts and Architecture
CMS High Level Trigger Configuration Management
VI-SEEM Data Discovery Service
The Re3gistry software and the INSPIRE Registry
Data, Databases, and DBMSs
Lecture 1: Multi-tier Architecture Overview
Lab 2: Information Retrieval
Presentation transcript:

Bioimage database architecture and infrastructure 2005, Bio-ITR, UCSB

Overview Current system (UCSB) –Status of collection –Capabilities –Architecture Current system (CMU) –Status of collection –Capabilities –Architecture Joint system under development –Capabilities –Architecture Future –Layered databases –Distributed databases

Current collection Retinal –Confocal microscope –EM (Electron micrograph) TypeCurrentBacklogRate/yExpected 4YrsTotal size Retinal EM ,00020GB Retinal confocal P ,00010GB Retinal confocal Z ,00065GB Microtubule light ,00012GB Microtubule AFM GB Microtubule DIC002.7M10M10TB Microtubule –Light –Atomic Force Microscopy –DIC/Nomarski

Current capabilities Import process Image and meta storage Web access and browsing Limited access by content

Screenshots (browsing)

Screenshots (search)

Screenshot (metadata edit)

Screenshot (retina meta)

Current architecture Metadata Database implementation Front end implementation Image import API Software and hardware infrastructure

Metadata Standard (image types, parameters) –File, size, type, tiff data, channel info, etc. Retinal –Visible cells –Antibody labeling –Experimental conditions –Researcher Microtubule –Track (hand captured) AFM –Machine parameters Metadata sources –Researcher –Annotated excel files –Proprietary image formats

Database implementation MySql First generation schema –image parameters File, size, type, tiff data, etc. –Metadata Experimenter, condition, antibodies, tissue, notes, etc.

Front end Apache, Php, Javascript Import proprietary image types Browse images Search by metadata Search by similarity Multi user and release protection

Image and metadata import Excel parser for metadata Image import library –Image Format API and C/C++ library for database and client applications were developed. –Currently supported proprietary image formats: Metamorph Stack, Fluoview TIFF, BioRad PIC, PSIA TIFF, Nanoscope, + common: JPEG, TIFF, BMP, PNG…

Hardware and software infrastructure Hardware –Dell Server with dual Intel Xeon cpu at 2.4Ghz –140GB scsi hard drive set up as RAID 1 –Gigabit network switch Software –Linux, version Fedora 2 –Apache Web server with PHP, PERL and graphical modules –MySQL Database server

Overview Current system (UCSB) –Status of collection –Capabilities –Architecture Current system (CMU) –Status of collection –Capabilities –Architecture Joint system under development –Capabilities –Architecture Future –Layered databases –Distributed databases

Overview Current system (UCSB) –Status of collection –Capabilities –Architecture Current system (CMU) –Status of collection –Capabilities –Architecture Joint system under development –Capabilities –Architecture Future –Layered databases –Distributed databases

Motivation Common schema between UCSB and CMU Support greater functionality –Analysis and interpretation tools –Ground truth –Semantics –Uncertainty –Complex features and distance metrics MPEG-7 features Other features –Querying and relevance feedback

Capabilities Image and metadata storage Web access and browsing Access and search by content Import/Export –Streamlined XML import/export for external tools Schema extensions –Image5d, semantic, uncertainty, analysis Image processing modules and tools

Infrastructure – Interchange XML Unified interchange XML format is being developed for database feeding and extraction procedures, external client application interaction and database intercommunication. DB XML External clients Image library External DB interchange Import/export remote access Ground truth tools Image processing tools

Ground truth acquisition tools Image processing and infrastructure teams are developing universal “ground truth” collection tools able to retrieve data from data-base and feed user defined information back to the database. The main communication vehicle is XML interchange format. At the current stage stand alone tools are being developed and tested that later on will be grouped in the universal application able to communicate directly to the data-base. +

Image processing API Fast development of image processing tools concentrated on problem solving. API provides simple access to multi-channel image and mask information. Allows progress output, acquisition of user defined parameters and automatically created filter preview. Example of API usage: Noise removal for Fluoview images result noise input

Semantic data modules Integration of current research in automatic image analysis: –Cell identification –Layer detection –Cell counting –Microtubule detection and tracking –Microtubule dynamicity and global characterization

Modeling uncertainty Uncertain identification/analysis –Simple probability (e.g., 0.8) –“Is this a rod bipolar cell?” Imprecise location/extent/count –90% accuracy in cell count –Line segment (single or sequence), polygon Identified by a sequence of points Each point Gaussian Store mean x, mean y, and standard deviation –Circle Center  Gaussian point, as above Radius  mean r and standard deviation

Schema Image5d Analysis and interpretation tools –Quantitative data generation –Semantic Labeling Experimental description Shape and geometry Domain knowledge –Ground truth –Semantic objects Uncertainty Features and distance metrics MPEG-7 features Other features Querying and relevance feedback

Schema (image5d) 5d images Image is a set of bit- planes Group planes by which dimensions vary Permits –Multiple formats –Caching

Schema (semantic objects) Capture semantics Capture uncertainty Type of object : confidence Position of object: Gaussian domain

Schema (analysis and features) Capture provenance Support type checking Support feature substitution

Hardware and software components Hardware requirements –Same as original system Software –Postgresql backend –JSP / JSF front end Migrate php/javascript current code into components

Architecture Web Page UI Generation View MenuTable Semantic Interface DB Storage Image Cell Dynamic JSF Components Programmable Image API Model API Object Relational (Postgresql) HTML XML

Overview Current system (UCSB) –Status of collection –Capabilities –Architecture Current system (CMU) –Status of collection –Capabilities –Architecture Joint system under development –Capabilities –Architecture Future –Layered databases –Integration with other databases BIRN OME metadata and schema exchange

Layered database Overlay model (interpretation) on image (raw) data Multiple interpretations of data URI references between databases Pro: Logical distinction, multiple interpretations, flexible implementation

BIRN (Biomedical Informatics Research Network) Goals: –Link multiple databases with different schemas, maintained at different research institutions 19 universities, 26 research groups Current collection –Three test beds centered around brain imaging of human neurological disorders and associated animal models: Functional BIRN Morphometry BIRN Mouse BIRN

Integration with BIRN Databases at UCSB/CMU Centers can be integrated into the BIRN federation UCSB/CMU infrastructure supports –Extensive metadata for images –Standard XML interchange format for 5d images –Computational tools to refine data Web based visualization and analysis tools We need to: –Translate UCSB/CMU Schema to F-logic (Knowledge-based mediation) –Link UCSB/CMU dataset to UMLS (Unified Medical Language System) ontology –Reference a common spatial framework Standard atlas coordinate system, e.g., SMART Atlas

OME Open Microscopy Environment –a set of software that interacts with a database to manage images, image meta data, image analysis and analysis results Designed to perform as a local system Integration with OME –Adapt OME XML image interchange mechanism –Adapt the database oriented modular analysis approach of OME

Conclusion Built prototype and collected ~4000 images –Being used internally Concurrent work on 2 nd generation system –Image loading –Integration of tools –New front end