Computational Bioinformatics & Bioimaging Laboratory caBIG-ICR - VISDA VT –GU Developer Team: Huai Li, Jiajing Wang, Yue Wang, Jianhua Xuan, Robert Clarke.

Slides:



Advertisements
Similar presentations
Introduction The cancerGrid metadata registry (cgMDR) has proved effective as a lightweight, desktop solution, interoperable with caDSR, targeted at the.
Advertisements

TAILS: COBWEB 1 [1] Online Digital Learning Environment for Conceptual Clustering This material is based upon work supported by the National Science Foundation.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
EE 290A: Generalized Principal Component Analysis Lecture 6: Iterative Methods for Mixture-Model Segmentation Sastry & Yang © Spring, 2011EE 290A, University.
Agilent: The Company, The Myth, The Lengend. Agilent: Agilent Technologies Inc. (NYSE: A) is a world-wide, diverse technology company focused on expansion.
Introduction to Bioinformatics - Tutorial no. 12
Algorithm Animation for Bioinformatics Algorithms.
Patrick Kemmeren Using EP:NG.
By: Raul Rodriguez Walter Checefsky (Added later)
Midterm project Course: Statistics in Bioinformatics Date: 指導教授 : 陳光琦 學生 : 吳昱賢.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
How to do Fast Analytics on Massive Datasets Alexander Gray Georgia Institute of Technology Computational Science and Engineering College of Computing.
Synthesis of Incomplete and Qualified Data using the GCE Data Toolbox Wade Sheldon Georgia Coastal Ecosystems LTER University of Georgia.
This presentation will guide you though the initial stages of installation, through to producing your first report Click your mouse to advance the presentation.
OpenMDR: Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
INTRODUCTION GOAL: to provide novel types of interaction between classification systems and MIAME-compliant databases We present a prototype module aimed.
OpenMDR: Alternative Methods for Generating Semantically Annotated Grid Services Rakesh Dhaval Shannon Hastings.
Cluto – Clustering toolkit by G. Karypis, UMN
Web Applications for the School of NAS Group #3 09/18/2001.
BISQUE: Enabling Cloud and Grid Powered Image Analysis Ramona Walls iPlant Collaborative
Support for MAGE-TAB in caArray 2.0 Overview and feedback MAGE-TAB Workshop January 24, 2008.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
Department of Biomedical Informatics Service Oriented Bioscience Cluster at OSC Umit V. Catalyurek Associate Professor Dept. of Biomedical Informatics.
PN: TM rev:3 Administration Training. PN: TM rev:3 Course Overview Logging On Menu Screens »Edit Functions Options Configuration Users »View.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
GenePattern Overview for MAGE-TAB Workshop Ted Liefeld January 24, 2007.
A new way of seeing genomes Combining sequence- and signal-based genome analyses Maik Friedel, Thomas Wilhelm, Jürgen Sühnel FLI Introduction: So far,
NGS data analysis CCM Seminar series Michael Liang:
Using geWorkbench: Hierarchical & SOM Clustering Fan Lin, Ph. D Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of.
Building and Running caGrid Workflows in Taverna 1 Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA 2 Mathematics.
H Using the Open Metadata Registry (OpenMDR) to generate semantically annotated grid services Rakesh Dhaval, MS, Calixto Melean,
Visualization and analysis of microarray and gene ontology data with treemaps Eric H Baehrecke, Niem Dang, Ketan Babaria and Ben Shneiderman Presenter:
Enhancing Interactive Visual Data Analysis by Statistical Functionality Jürgen Platzer VRVis Research Center Vienna, Austria.
RecoDS Tuan Van Jack Herbert Anthony Vigil Beau Crawford.
Predicting MicroRNA Genes and Target Site using Structural and Sequence Features: Machine Learning Approach Malik Yousef Institute of Applied Research,
1 maxdLoad The maxd website: © 2002 Norman Morrison for Manchester Bioinformatics.
1 Advanced Database Dr Theodoros Manavis
Introduction to caArray caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
Welcome to DNA Subway Classroom-friendly Bioinformatics.
Copyright OpenHelix. No use or reproduction without express written consent1.
CS5604( Midterm Presentation) – October 13, 2010 Virginia Polytechnic Institute and State University Presented by: Team 4 (Sarosh, Sony, Sherif)
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
1 Outline Standardization - necessary components –what information should be exchanged –how the information should be exchanged –common terms (ontologies)
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Radial Basis Function ANN, an alternative to back propagation, uses clustering of examples in the training set.
12 Copyright © 2009, Oracle. All rights reserved. Managing Backups, Development Changes, and Security.
Using geWorkbench: Working with Sets of Data Fan Lin, Ph. D. Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Week 1 Lecture 1 Oracle Architecture Overview. Learning Objectives Learn about Oracle9i architecture and key Oracle9i software components Discover differences.
® IBM Software Group © 2009 IBM Corporation Essentials of Modeling with the IBM Rational Software Architect, V7.5 Module 15: Traceability and Static Analysis.
Patterns in caBIG Baris E. Suzek 12/21/2009. What is a Pattern? Design pattern “A general reusable solution to a commonly occurring problem in software.
Discovery Environment Tool Integration High Level Overview.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
WEKA's Knowledge Flow Interface Data Mining Knowledge Discovery in Databases ELIE TCHEIMEGNI Department of Computer Science Bowie State University, MD.
CaBIG™ Terminology Services Path to Grid Enablement Thomas Johnson 1, Scott Bauer 1, Kevin Peterson 1, Christopher Chute 1, Johnita Beasley 2, Frank Hartel.
1 5/18/2007ã 2007, Spencer Rugaber Architectural Documentation Graphical (views) and textual Kruchten's 4+1 Views –Philippe B. Kruchten. "The 4+1 View.
GenePattern Overview caBIG Silver Compatibility review Ted Liefeld Cancer Informatics Program The Broad Institute of MIT and.
Wednesday NI Vision Sessions
Galaxy for analyzing genome data Hardison October 05, 2010
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Canadian Bioinformatics Workshops
Fig. 1. proFIA approach for peak detection and quantification
Computer Science & Engineering Department University of Connecticut
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Presentation transcript:

Computational Bioinformatics & Bioimaging Laboratory caBIG-ICR - VISDA VT –GU Developer Team: Huai Li, Jiajing Wang, Yue Wang, Jianhua Xuan, Robert Clarke UPenn – The Wistar Institute Adopter Team: Louise Showe, Michael Showe, Malik Yousef, Hsiuan-Lin Wu, and Michael Nebozhyn

Computational Bioinformatics & Bioimaging Laboratory Model clustered data structure via hierarchical mixture of Gaussian kernels (adaptive with cluster validation) Visualize data clusters via discriminative projections incorporating human gift for pattern recognition Discover and display hidden data clusters via top- down “soft” clustering (hierarchical yet exploratory) (VIsual Statistical Data Analyzer)is a caBIG -TM analytical tool for cluster modeling, visualization, and discovery VISDA (VIsual Statistical Data Analyzer) is a caBIG -TM analytical tool for cluster modeling, visualization, and discovery VISDA Overview

Computational Bioinformatics & Bioimaging Laboratory VISDA-caBIG TM Key Features Supported Data File –Support data retrieving from caArray –Support local MAGE-ML file format –Support tab-delimited data file with multiple gene annotations as well as multiple sample annotations Analytical Algorithms –Support sample clustering and gene clustering –Has supervised/unsupervised feature selection –Has PCA and PPM projections –Has hierarchical statistical modeling and parameter estimation by EM algorithm –Has advanced options for DCA projection and MDL cluster validation

Computational Bioinformatics & Bioimaging Laboratory GUI –Main frame has a history tracking panel as well as a working view panel. Analysis node and Dataset node can be created and deleted from the tree. –Input file can also have multiple rows for different sample annotation. User has the option to choose one of the rows as label information for analysis. –Tables of the selected genes for phenotype clustering and their performances can be viewed, and saved. –Has the three types of 2D-projection visualization. –Figures can be viewed, zoomed, and saved as PNG or EPS format. –Support sample/gene annotation view. –Support cluster visualization by hierarchical display –History log can be viewed and saved VISDA-caBIG TM Key Features

Computational Bioinformatics & Bioimaging Laboratory Installation

Computational Bioinformatics & Bioimaging Laboratory VISDA GUI

Computational Bioinformatics & Bioimaging Laboratory VISDA GUI

Computational Bioinformatics & Bioimaging Laboratory VISDA GUI

Computational Bioinformatics & Bioimaging Laboratory VISDA Silver Compatibility VISDA can retrieve data from caArray (can be considered as one caGrid node) by utilizing MAGE-OM APIs. MAGE-OM CDEs (registered in caDSR) used in VISDA are documented. Analysis results got from VISDA are the selected feature set, the "soft-clustering" probabilities of the samples/genes in each cluster and the hierarchical "tree of phenotype" or "tree of gene module" plots which are retained locally. Well documented VISDA APIs. UML documentation of all VISDA components using Enterprise Architect (EA).

Computational Bioinformatics & Bioimaging Laboratory Retrieve Data from caArray

Computational Bioinformatics & Bioimaging Laboratory Retrieve Data from caArray

Computational Bioinformatics & Bioimaging Laboratory VISDA API Document

Computational Bioinformatics & Bioimaging Laboratory VISDA UML Use Case Diagram

Computational Bioinformatics & Bioimaging Laboratory VISDA UML Class Diagram o edu.vt.cbil.visda

Computational Bioinformatics & Bioimaging Laboratory VISDA UML Class Diagram o edu.vt.cbil.visda.data

Computational Bioinformatics & Bioimaging Laboratory VISDA UML Class Diagram o edu.vt.cbil.visda.comp

Computational Bioinformatics & Bioimaging Laboratory VISDA UML Class Diagram o edu.vt.cbil.visda.view