LOGO/ICON Keval Mehta School of Informatics Master of Science in Bioinformatics Andrews Dalkilic Team Dr. Mehmet Dalkilic, Dr. Justen Andrews, Dr. John.

Slides:



Advertisements
Similar presentations
Britain Southwick Nicole Anguiano March 29, 2014
Advertisements

The Last Procedure Before First Functional Prototype Grant Boomer, Brett Papineau, Tanis Lopez, Archana Shrestha CS 383.
Gene Ontology John Pinney
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
CoMPAS Pro: Comprehensive Meta Prediction and Annotation Services for Proteins Sebastian J. Schultheiß Christoph Malisi.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
BioPathways SIG, July Networks in Biology Molecular interaction and similarity networks are vital for understanding gene function.
Indiana University Bloomington, IN Junguk Hur Computational Omics Lab School of Informatics Differential location analysis A novel approach to detecting.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
Big data analytics with R and Hadoop Chapter 5 Learning Data Analytics with R and Hadoop 데이터마이닝연구실 김지연.
Department of Computer Science, University of California, Irvine Site Visit for UC Irvine KD-D Project, April 21 st 2004 The Java Universal Network/Graph.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Motif Discovery in Protein Sequences using Messy De Bruijn Graph Mehmet Dalkilic and Rupali Patwardhan.
INTRODUCTION GOAL: to provide novel types of interaction between classification systems and MIAME-compliant databases We present a prototype module aimed.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
CSE 6406: Bioinformatics Algorithms. Course Outline
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Rahul Raman, Ram Sasisekharan Bioinformatics Core Massachusetts Institute of Technology Glue Grants Bioinformatics Meeting April 22-23, 2004 San Diego,
Networks and Interactions Boo Virk v1.0.
© ABB Inc. - USETI All Rights Reserved 10/17/2015 Insert image here An Economic Analysis Development Framework for Distributed Resources Aaron F. Snyder.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Visualization and analysis of microarray and gene ontology data with treemaps Eric H Baehrecke, Niem Dang, Ketan Babaria and Ben Shneiderman Presenter:
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
Copyright OpenHelix. No use or reproduction without express written consent1.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
XML Standards for Proteomics Data Andrew Jones, Dr Jonathan Wastling and Dr Ela Hunt Department of Computing Science and the Institute of Biomedical and.
S. Shumilov – Zürich Analytical Visualization Framework - a visual data processing and knowledge discovery system Ivan Denisovich, Serge Shumilov Department.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
EB3233 Bioinformatics Introduction to Bioinformatics.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Visualization Four groups Design pattern for information visualization
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Importing KEGG pathway and mapping custom node graphics on Cytoscape Kozo Nishida Keiichiro Ono Cytoscape retreat 2010 University of Michigan Jul 18, 2010.
Welcome to the combined BLAST and Genome Browser Tutorial.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Canadian Bioinformatics Workshops
Spring Staff Lecturer: Prof. Sara Cohen Graders: Igor Lifshits, Arbel Moshe 2.
Collaboration Spotting: Visualisation of LHCb process data
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Hansheng Xue School of Computer Science and Technology
Indigene Project: Systems Biology Project
Session I Database & Data Mining Speaker: Mehmet M. Dalkilic
Anastasia Baryshnikova  Cell Systems 
Analyzing Massive Graphs - ParT I
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

LOGO/ICON Keval Mehta School of Informatics Master of Science in Bioinformatics Andrews Dalkilic Team Dr. Mehmet Dalkilic, Dr. Justen Andrews, Dr. John Colbourne, Dr. Brian Eads, James Costello, Rupali Patwardhan, Sumit Middha, Junguk Hur I ndigene - Data Mining & Visualization Component

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Problem Statement  Motivation  Data Mining & Integration  Visualization Software Features  Eye in the Future  Questions? Overview < Outline

LOGO/ICON Capstone Presentation Keval Mehta April 21, Classmates.com Face book Hi5 myYearbook Yahoo 360° Sally Jim Social Network Overview Problem Statement <  Connections based on:  Common Interests (Parameters)  Supplementary or complimentary work (Functionality)  The strength of the connection decided by how often you interact (weight)  People form groups of similar interests and motivation (clustering)  A second and a third level connection and new friendships can be made from friends of friends (similar questions can be asked in Gene Networks) Analogy

LOGO/ICON Capstone Presentation Keval Mehta April 21,  What does gene expression of say alpha- synuclein of value 0.9 in this condition mean with respect to other genes?  What can I do with large datasets of gene expression data and protein assays?  How can I make sense of so many disparate datasets from the experiments by scientists?  What can I know about a gene and how it acts in the presence of other genes? Problem Statement Overview Problem Statement <

LOGO/ICON Capstone Presentation Keval Mehta April 21, Motivation Overview Problem Statement Motivation < Abundance of high-throughput information from DNA, RNA and protein assays Decades of detailed genetic investigations linking to phenotypes Next big question: Next big question: Insights into functional relationships among genes Brazhnik et al., Trends in Biotechnology 20:

LOGO/ICON Capstone Presentation Keval Mehta April 21, Gene Expression  Time Course  Tissue Specific  Sex Specific  Developmental Protein-Protein Interaction Transcription Factor Binding Site Genetic Interaction Phenotypic Annotation Overview Problem Statement Motivation Data Mining & Integration < Data Mining & Integration

LOGO/ICON Capstone Presentation Keval Mehta April 21,  High-throughput Microarray Data  Arbeitman, Larval and Parisi datasets  Keval Mehta and Rupali Patwardhan  Protein-Protein Interaction  Junguk Hur  Genetic Interaction  James Costello  Allelic Phenotype  James Costello  Transcription Factor Binding Site  Sumit Middha Drosophila melanogaster Data Mining & Integration Overview Problem Statement Motivation Data Mining & Integration <

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Arbeitman (Life cycle of Drosophila)  159 slides (79 slides with one replicate each)  Larval tissue-specific transcripts  15 slides (no replicates across slides)  Parisi  14 slides (no replicates across slides)  Using OLIN (from R’s Bioconductor marray package) we generated the Normalized values  Dealing with replicates within the slide (spot replicates)  We averaged the M and A values and back-calculated the new merged intensity values  Pearson’s correlation is computed for all unique combinations of genes = [n * (n-1)] / 2 Gene Expression Overview Problem Statement Motivation Data Mining & Integration <

LOGO/ICON Capstone Presentation Keval Mehta April 21, … …0.2 G1 Slide 1….Slide n G2 Pearson’s Correlation Formula X = G1 Y = G2 Finding Pearson’s Correlation Overview Problem Statement Motivation Data Mining & Integration <

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Example1: Positive Correlation FBgn and FBgn Correlation Value = 0.94  Example2: Negative Correlation FBgn and FBgn Correlation Value = -0.9 Overview Problem Statement Motivation Data Mining & Integration <

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Normalized the Integration Overview Problem Statement Motivation Data Mining & Integration <

LOGO/ICON Capstone Presentation Keval Mehta April 21, How can I visualize all this data effectively and still at the same time keeping it perceptible? Inspiration: Visual Thesaurus Referred by Dr. Youn Lim Overview Problem Statement Motivation Data Mining & Integration Software Features < A Big Challenge

LOGO/ICON Capstone Presentation Keval Mehta April 21,  A user interface toolkit for interactive information visualization  built in Java using Java2D graphics library  data structures and algorithms  pipeline architecture featuring reusable, composable modules  animation and rendering support  architectural techniques for scalability Overview Problem Statement Motivation Data Mining & Integration Software Features < prefuse API

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User centered Interface < User Friendly Interface

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User centered Interface Our Website <  Querying this Integrated Database  Our website bin/upload.cgi  I/P: Upload a file with list of FBgn IDs of interest  O/P: Output’s an XML file Website

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network < Load Network XML input file

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture < FlyBase: online database Our Indigene DB Website: XML file Visualization Component Data Mining Query Architecture

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors < Highlight Immediate Neighbors

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click < Left Mouse Click

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors < Show Neighbors

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph < Move Graph

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes < Randomly place nodes

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes < Neighbors of more than one node

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN < Zoom OUT & IN

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN Node & Edge Info. < Node & Edge Information

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN Node & Edge Info. Load Details < Load Details

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN Node & Edge Info. Load Details Search < Search

LOGO/ICON Capstone Presentation Keval Mehta April 21, Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN Node & Edge Info. Load Details Search ReLoad Graph < ReLoad GraphDemo

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Cytoscape  Graphviz packages and various other packages  Advantage  Highly customizable to ones need and platform independent  Some exclusive features such as reload the graph and playing with cutoffs on run time  Searching and pruning the graph  Highly modular coding style allows extensibility to analysis of any graph network  Make the software talk with live data by connecting to database and querying it in real time  Apply machine learning algorithms that can throw light on possible paths that are otherwise not easy to perceive Eye in the Future & advantage over other tools Overview Problem Statement Motivation Data Mining & Integration Software Features User Centered Interface Our Website Load Network Architecture Highlight Immediate Neighbors Left Mouse Click Show Neighbors Move the Graph Randomly place nodes Select Nodes Zoom OUT & Zoom IN Node & Edge Info. Load Details Search ReLoad Graph

LOGO/ICON Capstone Presentation Keval Mehta April 21,  Advisors Dr. Mehmet Dalkilic and Dr. Justen Andrews  Research Team John Colbourne, Brian Eads, James Costello, Rupali Patwardhan, Sumit Middha, Junguk Hur  Visualization Expert Advice Ketan Mane, Ph.D student – SLIS  Computing Facilities Center for Genomics, School of Informatics & Department of Computer Science  Special Thanks Dr. Youn-Kyung Lim and Dean Marty Siegel  Thank you My Parents and my friends Acknowledgements

LOGO/ICON Capstone Presentation Keval Mehta April 21, Questions?