Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.

Slides:



Advertisements
Similar presentations
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Advertisements

Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
The STRING database Michael Kuhn EMBL Heidelberg.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Interoperation of Molecular Biology Databases Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International Menlo Park, CA
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Signaling pathways –SPIKE Network visualization.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
Pathways Database System: An Integrated System For Biological Pathways L. Krishnamurthy, J. Nadeau, G. Ozsoyoglu, M. Ozsoyoglu, G. Schaeffer, M. Tasan.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Ch10. Intermolecular Interactions and Biological Pathways
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Gene Set Enrichment Analysis (GSEA)
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Biological Pathways & Networks
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Networks and Interactions Boo Virk v1.0.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
Semantic Web for Life Sciences Workshop Session VII: Semantic Aggregation, Integration, and Inference Moderator: Joanne Luciano October, Cambridge,
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Copyright OpenHelix. No use or reproduction without express written consent1.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Network visualization –Cytoscape Cool.
GO-based tools for functional modeling TAMU GO Workshop 17 May 2010.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
A Biology Primer Part IV: Gene networks and systems biology Vasileios Hatzivassiloglou University of Texas at Dallas.
“My Experiment” and What I Want to Discover My experiment involved comparing the effect of long term ozone exposure on gene expression in a wild type and.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network problems Tamer Kahveci.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
A database of biological pathways and processes (borrowed from a presentation created by Steve Jupe)
Supplementary Figure 1: Comparison of the results obtained from three widely used databases (namely AmiGO, ArrayExpress and GeneCards) with that from HypoxiaDB.
GO based data analysis Iowa State Workshop 11 June 2009.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Network construction and exploration using CORNET and Cytoscape - Excercises SPICY WORKSHOP Wageningen, March 8 th 2012 Stefanie De Bodt.
Importing KEGG pathway and mapping custom node graphics on Cytoscape Kozo Nishida Keiichiro Ono Cytoscape retreat 2010 University of Michigan Jul 18, 2010.
Preliminary Exploratory Data Analysis for Ruth’s G2P Workflow Bernice Rogowitz 4/30/2010.
High throughput biology data management and data intensive computing drivers George Michaels.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Pathway Informatics 30 th March, 2016 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University.
Lecture 4.31 Protein Pathways and Pathway Databases Shan Sundararaj University of Alberta Edmonton, AB
Clench 2.0 A program for cluster enrichment analysis and integrated visualization of expression, annotation and transcription factor binding site data.
Biological networks CS 5263 Bioinformatics.
Tutorial 12 Biological networks.
Lettuce/Sunflower EST CGPDB project.
MDN MDN. (a) Construction of the MDN. (Upper) A local region of the glycolysis, where the catalytic enzymes are shown with red background and their corresponding.
Schedule for the Afternoon
Network biology An introduction to STRING and Cytoscape
The MultiOmics Explainer
Presentation transcript:

Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010

Ondex at a Glance Open source, Gnu Public Library License on SourceForge Large data sets (100,000’s of data objects) Parsers for multiple inputs Multiple data output formats Generalized data structure to link different biological networks Goal: Integrate, analyze and visualize data from heterogeneous sources Key focus: data that can be represented in graph form

Data Integration Import multiple data formats Tab delimited Fasta GO (OBO 1.2)GO PSI MI (version 2.5)PSI MI SBML Import from a wide array of databases (via parsers) AraCyc AtRegNet BioCyc BioGRID Brenda Cytoscape Multiple network data export formats and multiple image output formats SGD TAIR TIGR Transfac Transpath UniProt WordNet EcoCyc Gene Ontology GOA Gramene Grassius KEGG Medline MetaCyc O-GlycBase OMIM PDB Pfam Plant Ontology

Graph Construction Gene1 Gene 2 value = 1 Gene 1 Gene 3 value = 4 Gene 1 Gene 4 value= 5 Gene 2 Gene 3 value = 1 Gene 2 Gene 4 value = 2 Gene 3 Gene 4 value = 5 G1G2G3G4 G1145 G212 G35 G4 Similarity matrix Graph Table of Tuples Nodes are data entities– genes, proteins, etc Edges show connectivity and degree of connection Metadata can enhance the graph Node size, node color Edge width, edge color Any data that can be expressed in this form can be represented as a graph

Gene Expression Array Data and its associated Graph (dendogram) Various metrics can be used, e.g. different statistical correlation measures, noise- reduced measures The graph is re- computed on each set of values in the similarity table

Many Different Biological Networks Microarray expression level data – relationships between different genes being expressed Protein-protein interactions Signal transduction networks Metabolic pathways Gene regulatory networks

Semantics for Graph Based Representations Protein Interaction database –Proteins as nodes, interactions as edges Metabolic network –Metabolites, enzymes and reactions as different types of nodes, connected by directed edges Semantics– does a node represent a protein or a metabolite? Does the edge mean “binds to” “produced by” or consumed by? ONDEX Approach- define typed nodes as “concepts “ and edges as semantically well-defined relations Generic graph-based representation of different types of biological data, resulting in ontologies. Data and metadata imported in a way that fits into the graph template through various parsers

Generalized data structure in ONDEX

Ondex “Special Sauce” Data schema that combines graph-based methods and generalized data structure(GDS), making use of ontologies and metadata. Core idea: store biological networks as graphs; use GDS to store related information (metadata)

Example: Nitrogen uptake in Arabidopsis Protein-protein interaction (TAIR) combined with co- expression data (ATTEDII) Clusters of nodes show interaction

Example: Drought stress in three cultivars of durum wheat over time Metabolic pathways and functional protein annotations Identification of functional orthologs to genes on Affymetrix chip Enzymes are significantly up or down regulated within Jasmonic acid biosynthesis pathway after 3 days of stress. Key: Squares: target sequence s, size proportion al to expressio n Red- up- regulation Green- down regulation Blue squares: ortholog proteins Blue circles reactions Pink squares- pathways

References Jacob Köhler et al., Graph-based analysis and visualization of experimental results with Ondex, Bioinformatics 22(11), 2006.Graph-based analysis and visualization of experimental results with Ondex Köhler et al.,Linking experimental results, biological networks and sequence analysis methods using Ontologies and Generalised Data Structures Ontoogy Workshop, Gottingen, 2004Linking experimental results, biological networks and sequence analysis methods using Ontologies and Generalised Data Structures