Center for Bioimaging Informatics UCSB Bio-imaging Infrastructure December 2006 Center for Bioimaging Informatics www.bioimage.ucsb.edu Supported by NSF.

Slides:



Advertisements
Similar presentations
CACORE TOOLS FEATURES. caCORE SDK Features caCORE Workbench Plugin EA/ArgoUML Plug-in development Integrated support of semantic integration in the plugin.
Advertisements

Martin Wagner and Gudrun Klinker Augmented Reality Group Institut für Informatik Technische Universität München December 19, 2003.
Introduction to Databases
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1 Basic DB Terms Data: Meaningful facts, text, graphics, images, sound, video segments –A collection of individual responses from a marketing research.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
File Systems and Databases
CONTENT: A model for collaborative database building Trevor Bond Alan Cornish Washington State University Libraries.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
BUSINESS DRIVEN TECHNOLOGY
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Overview of Database Languages and Architectures.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Section 13.1 Add a hit counter to a Web page Identify the limitations of hit counters Describe the information gathered by tracking systems Create a guest.
Web 2.0: Concepts and Applications 2 Publishing Online.
A Survey on Context-Aware Computing Center for E-Business Technology Seoul National University Seoul, Korea 이상근, 이동주, 강승석, Babar Tareen Intelligent Database.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Week 1 Lecture MSCD 600 Database Architecture Samuel ConnSamuel Conn, Asst. Professor Suggestions for using the Lecture Slides.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
PUBLISHING ONLINE Chapter 2. Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
Archivists' Toolkit - CDL Presentation, October 17, 2005 The Archivists’ Toolkit Lee Mandell Brad Westbrook.
Storing Organizational Information - Databases
Bioimage database architecture and infrastructure 2005, Bio-ITR, UCSB.
Center for Bioimaging Informatics BISQUIK Internals Kristian Kvilekval.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
WebDat: A Web-based Test Data Management System J.M.Nogiec January 2007 Overview.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
A Scalable Distributed Datastore for BioImaging R. Cai, J. Curnutt, E. Gomez, G. Kaymaz, T. Kleffel, K. Schubert, J. Tafas {jcurnutt, egomez, keith,
Event-Based Model for Reconciling Digital Entities Ahmet Fatih Mustacoglu Ahmet E. Topcu Aurel Cami Geoffrey C. Fox Indiana University Computer Science.
Integrating Laserfiche and SharePoint PO108 Alex Wilson and Jessica Huang.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Wednesday NI Vision Sessions
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Architecture Review 10/11/2004
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Database System Concepts and Architecture
Database Management:.
VI-SEEM Data Discovery Service
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
File Systems and Databases
Knowledge Based Workflow Building Architecture
AMGA Web Interface Vincenzo Milazzo
Data Model.
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Presentation transcript:

Center for Bioimaging Informatics UCSB Bio-imaging Infrastructure December 2006 Center for Bioimaging Informatics Supported by NSF ITR # Supported by NSF ITR #

Center for Bioimaging Informatics December Projects Bisque/OME –In use at UCSB Bisquik – Next generation system (in construction)

Center for Bioimaging Informatics December Motivation Analysis creates knowledge –Image analysis –Querying –Mining Available Storage systems are growing Available Processing increasing

Center for Bioimaging Informatics December Challenges Diverse users –UCSB Neurosciences institute Retinal detachment : Fischer et al Microtubule dynamics : Feinstein and Wilson –CMU Murphy Lab –Flybase –Computable Plant Dataset challenges –Multiple types and collection techniques –Complex images (5D) and metadata –Acquisition and management of large sets –Security of private data –Sharing of experimental data

Center for Bioimaging Informatics December Challenges Metadata –Collection –Organization –Sharing/interpretation Dataset management –Personal collections –Multiple organizations –Privacy and security Analysis design Analysis integration

Center for Bioimaging Informatics December Block System diagram Image and metadata capture DB Storage Image Analysis Collection Analysis Semantic Analysis Browse Search Content + metadata Interactive Enhancement Knowledge Discovery Ground truth Collection

Center for Bioimaging Informatics December BISQUE Bio-Image Semantic Query User Environment Current dataset collection BISQUE functionality –Data management –Ground truth acquisition –Analysis Architecture

Center for Bioimaging Informatics December Current collections UCSB Retinal Objects (confocal, EM) UCSB Microtubule Objects(light, AFM) TypeCurrentBacklogRate/yExpected 4Yrs2D Images Retinal EM ,00023K Retinal confocal P ,00010K Retinal confocal Z ,00050K Microtubule light ,000400K Microtubule AFM K Flybase0125K0 Total estimated size in TBs and growing for 1 Lab Complexity and analysis are the main issues

Center for Bioimaging Informatics December Data management capabilities Digital notebook –Direct import process –Reconfigurable (confocal, 4 x microtubule, etc) Web –Web access and browsing –Organize images and metadata –Data sharing environment –Search by metadata or content –Integrated analysis

Center for Bioimaging Informatics December Screenshots (Digital Notebook)

Center for Bioimaging Informatics December Data management capabilities Digital notebook –Capture experimental/image parameters –Direct Import process –Reconfigurable (confocal, 4 x microtubule, etc) Web –Web access and browsing –Organize images and metadata –Data sharing environment –Search by metadata or content –Integrated analysis

Center for Bioimaging Informatics December Screenshots (browsing) Dataset Browsing

Center for Bioimaging Informatics December Screenshots (search)

Center for Bioimaging Informatics December Screenshots (search)

Center for Bioimaging Informatics December Screenshots (search similar)

Center for Bioimaging Informatics December Screenshots (search similar)

Center for Bioimaging Informatics December Screenshots (personal collection) Data sharing

Center for Bioimaging Informatics December Screenshots (5D Viewer)

Center for Bioimaging Informatics December Screenshot (5D with tracks)

Center for Bioimaging Informatics December Image analysis Integrated image analysis –Image enhancement –Cell counter (ImageJ) –Quantify microtubule dynamics –Microtubule tracker (manual, automatic) –Track identification Integration in progress –Segmentation and classification –Modeling microtubule dynamics –Relevance feedback improving search

Center for Bioimaging Informatics December Screenshots (Image Analysis) Image J Cell counter

Center for Bioimaging Informatics December System description overview Current dataset collection Current functionality –Data management –Ground truth acquisition –Analysis Architecture

Center for Bioimaging Informatics December Hardware/software infrastructure Hardware –16 node Cluster dual Intel Xeon 3GHZ Gigabit network switch 2 TB Storage Software –Bisque –OME (Apache, Postgresql) –Linux

Center for Bioimaging Informatics December Bisque Architecture Bio-Image Semantic Query User Environment Image/ metadata server WEB Digital Notebook image search external internal BISQUE XML Research in image processing and indexing Cell counter lug-in for ImageJ Research in biology features analysis Distributed computing cluster Cell counter lug-in for ImageJ Cell counter lug-in for ImageJ Cell counter plug-in for ImageJ metadata XML

Center for Bioimaging Informatics December OME Base Open Microscopy Environment –“OME is an open source software project to develop a database-driven system for the quantitative analysis of biological images. OME is a collaborative effort among academic labs and a number of commercial entities”. Provides base for image/metadata storage and analysis integration Boston: Sorger Lab Baltimore: Goldberg Lab Dundee: Swedlow Lab Madison: LOCI

Center for Bioimaging Informatics December Bisque extensions Ongoing extensions with OME –Content based search –Analysis integration with OME Segmentation Cell Counting MT identification and Analysis Etc. –Front end dataset and analysis support –Schema additions for image types and analysis –(Uncertainty modeling and queries)

Center for Bioimaging Informatics December Bisque Built 1 st generation w/ > D images –Integrated several useful analyses –Being used internally –External Interest Immediate future –Continued development analysis integration Dataset integration –Remote deployments –Integration with other projects Flybase : Integration of schema/datasets in progress Computableplant.org

Center for Bioimaging Informatics December Projects Bisque/OME –In use at UCSB – guest/bioimage Bisquik – Next generation system (in construction)

Center for Bioimaging Informatics December Bisque/OME lessons learned Getting the “correct” metadata is hard –Correct may need to change or be reinterpreted –Sql schema difficult to change Getting the “right” name is difficult –Different terms used in different labs make data integration painful Analysis integration needs to be easy –Researcher balk at learning complex software Users expect a rich experience –Google, flickr, etc have raised the bar

Center for Bioimaging Informatics December New project: Bisquik DoughDB: Flexible metadata Ontology support Rich user experience –Web 2.0 : Ajax, SVG –Web based tools: DN, Graphical Annotator Programming toolkit –Smaller is better Distributed data store Support for large scale computation Expected early 2007

Center for Bioimaging Informatics December Block Components Blob/Image Server Metadata Renderer DoughDB Metadata Annotation Analysis Engine Permission Ontology Remote Access Web UI

Center for Bioimaging Informatics December Tagging Screen Shot

Center for Bioimaging Informatics December Motivation: Current metadata model is inflexible Adding new experimental images requires: –Changes to Digital notebook –Changes to Bisque interface –Changes to OME/postgres Shouldn’t this be easier? –“Find images tagged with rod-opsin” –“Create a region and specify the object” –“Add experimental metadata to this dataset”

Center for Bioimaging Informatics December Bisquik requirements Add new tag/value pairs to any db object –(Foo,2) –(visible-cell, rod) Allow multiple tags with same value –(visible-cell, rod) –(visible-cell, muller) Support fine-grained tag permission/visibility –Tags have creators and access control Support update semantics & preserve history –Timestamp tags –No deletes (except under certain conditions)

Center for Bioimaging Informatics December Bisquik: DoughDB imageOID1 Feature f1[ … ] NameGH1020 Foo2 pixels[OID2 OID3] imageOID1 dataServer://… Pixel-typeraw OID1 OID4 OID2

Center for Bioimaging Informatics December Bisquik queries Find objects (images) with tag “foo” –Select a.foo Find images with name “GV100” –Select a.name = “gv100” Find images with cellcount < 100 –Select a.cellcount < 100 Find images with a region similar to r1 based on feature f1 –Select r.image = i and l2(r.f1, r1[ … ]) < 10

Center for Bioimaging Informatics December DoughDB key features No classes or types only tag/value pairs –Open ended data model Pair values have owner and acl Preserves history of annotations SQL like query language Simple keyword queries

Center for Bioimaging Informatics December Bisquik ontology support “ontology is a data model that represents a domain and is used to reason about the objects in that domain and the relations between them”data model domainreason Unstructured tag/value –Great for taggers –Unhappy searchers Different labs use different terms for the same object.

Center for Bioimaging Informatics December Bisquik ontology support Dictionary of terms and relations Require (or strongly suggest) that tags and value are defined before use Drop into ontology editor when new values and tags are encountered. Integrated into search system –Permit (or offer) ‘alias’ ‘part-of’ ‘related-to’ searches

Center for Bioimaging Informatics December Analysis Engine Analysis Components Analysis Applications Glue language: python Component connection library –Memory for single node –MPI for cluster based execution Execution engine –Automatic component placement –Resources and efficient communication

Center for Bioimaging Informatics December Analysis Engine Interfaced with DoughDB All input/output are Pairs or objects Application executions recorded for data provenance

Center for Bioimaging Informatics December Bisquik interface Some bisque functionality Flickr-like interface for region tagging Pair tagging and keyword tagging Metadata renderers –Textual (tag) –Graphical (regions, geometry, graph data) –Analysis oriented (histogram)

Center for Bioimaging Informatics December Region Tagging screenshot

Center for Bioimaging Informatics December Bisquik Metadata Annotation Unified offline (Digital Notebook) and online manipulation. Easy to build annotation forms Allow “schema” modification “in field” Permit annotation templates to be shared between DN and Bisquik Graphical geometry annotator

Center for Bioimaging Informatics December Blob Image server Extensible server for write-once objects –Pixels, Features Pluggable transforms/operations –Thumbnails, slices –pixel transforms (watermarks) –Graphical metadata renderers

Center for Bioimaging Informatics December Remote Access All basic services are web accessible: –Soap and WSDL –DoughDB, Image server, DoughDB pairs have unique 64 bit IDs –Split between machine ID/Pair ID –All pairs are addressable –Query engine processes foreign pairs

Center for Bioimaging Informatics December Bisquik new hardware 10 TB disk mirrored array (20TB) Image Server Database (backup) server 8-16 Query/Analysis nodes Image Server DB/Backup Server 10TB Query Index 1TB Query Index 1TB Query Index 1TB Query Index 1TB Query Index 1TB Query Index 1TB Query Index 1TB Query Index 1TB

Center for Bioimaging Informatics December Status Web UI –Uploading, Tagging, simple searches DoughDB –Single node storage and queries Analysis Engine –In design

Center for Bioimaging Informatics December Conclusion Bisque Team: August, Jiejun, Melissa,Kris Bisquik Team: –Interface: August Jiejun Jaechok –Analysis Engine: Kris, Mellisa, Dmitry –Ontology: David(summer), Verleen(summer), Kris –DoughDB: Kris,Vebjorn

Center for Bioimaging Informatics December Biowall

Center for Bioimaging Informatics December Biowall 5x4 1600x1200 displays –(8000x4800 pixels) –38 MegaPixels 1 Head node + 5 slaves Distributed multihead X (X proxy) New simplified viewer –OpenEV image server –Local client –Web/Database client 3D visualizations Working with very large screen areas