Toward Automatically Drawn Metabolic Pathway Atlas with Peripheral Node Abstraction Algorithm Myungha Jang, Arang Rhie, and Hyun-Seok Park * Bioinformatics.

Slides:



Advertisements
Similar presentations
Predicting essential genes via impact degree on metabolic networks ISSSB’11 Takeyuki Tamura Bioinformatics Center, Institute for Chemical Research Kyoto.
Advertisements

MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Beyond Trilateration: On the Localizability of Wireless Ad Hoc Networks Reported by: 莫斌.
A New Force-Directed Graph Drawing Method Based on Edge- Edge Repulsion Chun-Cheng Lin and Hsu-Chen Yen Department of Electrical Engineering, National.
1 Balloon Views of Source Code and Their Multiscalable Font Modes Chun-Cheng Lin and Hsu-Chun Yen Dept. of Electrical Engineering, National Taiwan University.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Introduction to Bioinformatics - Tutorial no. 13 Probe Design Gene Networks.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
SIMS 247: Information Visualization and Presentation jeffrey heer
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Pathway databases Goto S, Bono H, Ogata H, Fujibuchi W, Nishioka T, Sato K, Kanehisa M. (1997) Organizing and computing metabolic pathway data in terms.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
ANALYSIS OF GENETIC NETWORKS USING ATTRIBUTED GRAPH MATCHING.
Application of Graph Theory to OO Software Engineering Alexander Chatzigeorgiou, Nikolaos Tsantalis, George Stephanides Department of Applied Informatics.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Ch10. Intermolecular Interactions and Biological Pathways
Metagenomic Analysis Using MEGAN4
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Lecture7 Topic1: Graph spectral analysis/Graph spectral clustering and its application to metabolic networks Topic 2: Concept of Line Graphs Topic 3: Introduction.
A hierarchical approach to building contig scaffolds Mihai Pop Dan Kosack Steven L. Salzberg Genome Research 14(1), pp , 2004.
The importance of enzymes and their occurrences: from the perspective of a network W.C. Liu 1, W.H. Lin 1, S.T. Yang 1, F. Jordan 2 and A.J. Davis 3, M.J.
Crossing Minimisation (1) Ronald Kieft. Global contents Specific 2-layer Crossing Minimisation techniques After the break, by Johan Crossing Minimisation.
Network Aware Resource Allocation in Distributed Clouds.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Lecture 12: Network Visualization Slides are modified from Lada Adamic, Adam Perer, Ben Shneiderman, and Aleks Aris.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1
Human liver rate-limiting enzymes influence metabolic flux via branch points and inhibitors Min Zhao Center for Bioinformatics Peking University.
Attributed Visualization of Collaborative Workspaces Mao Lin Huang, Quang Vinh Nguyen and Tom Hintz Faculty of Information Technology University of Technology,
Clustering Metabolic Networks Using Minimum Cut Trees Ryan Kellogg 1, Allison Heath 2, Lydia Kavraki 2,3 1 Carnegie Mellon University, Department of Electrical.
Metagenomic Analysis Using MEGAN4 Peter R. Hoyt Director, OSU Bioinformatics Graduate Certificate Program Matthew Vaughn iPlant, University of Texas Super.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Copyright Prof. Dr. Shuichiro Yamamoto Prof. Dr. Shuichiro Yamamoto Nagoya University.
An Efficient Linear Time Triple Patterning Solver Haitong Tian Hongbo Zhang Zigang Xiao Martin D.F. Wong ASP-DAC’15.
Greedy algorithm for obtaining Minimum Feedback vertex set MFVS delete degree 1/0 vertices from V and set remaining vertices to V’ MFVS←  while V’  
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
W e present BiNoM [1,2], a Cytoscape plugin, developed to facilitate the manipulation of biological networks represented in standard systems biology formats.
CS270 Project Overview Maximum Planar Subgraph Danyel Fisher Jason Hong Greg Lawrence Jimmy Lin.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Visualization Four groups Design pattern for information visualization
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A self-organizing map for adaptive processing of structured.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Importing KEGG pathway and mapping custom node graphics on Cytoscape Kozo Nishida Keiichiro Ono Cytoscape retreat 2010 University of Michigan Jul 18, 2010.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
7-1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Network Flow Models Chapter 7.
Canadian Bioinformatics Workshops
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Dr. Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
You can check broken videos in this slide here :
Video Summarization by Spatial-Temporal Graph Optimization
Functions and their Graphs
Network Biosignatures of Earth’s Atmosphere and Biosphere
Robustness of wireless ad hoc network topologies
Estimating Networks With Jumps
Robustness of wireless ad hoc network topologies
Anastasia Baryshnikova  Cell Systems 
Automated Layout and Phase Assignment for Dark Field PSM
Systematic analysis of host-pathogen interactions.
Network topology. Network topology. Nodes are linked by edges. Node size represents a quantifiable node property (e.g. fold-change in two different experimental.
Human-object interaction
Fig. 1. Generic metabolic pathway and its corresponding adjacency matrices. The graph of a static network (A) is ... Fig. 1. Generic metabolic pathway.
Presentation transcript:

Toward Automatically Drawn Metabolic Pathway Atlas with Peripheral Node Abstraction Algorithm Myungha Jang, Arang Rhie, and Hyun-Seok Park * Bioinformatics Laboratory, School of Engineering Ewha Womans University Seoul, Korea IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

Table of Contents IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University I.Introduction II. Topological Nature of Metabolic Networks at Peripheral Nodes III. Node Abstraction Featured Scale-free Algorithm IV. Experimental Results V. Discussion and Future Work

Abstract graph structure ⇒ visual representation Graphical diagrams are intuitively helpful to understand biochemical reaction networks - Node : compound, Edge : reactions Optimal solutions : NP-hard problems Automatic graph layout algorithms in systems biology IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University I. INTRODUCTION

A complete metabolic network indicates all the metabolic potential and capacity. The shift of research focus: single pathways to multiple pathways. Visualization serves an important role in understanding large scale metabolic network. KEGG Atlas( 2008http:// Terms : Global (metabolic) pathway, Multiple pathway, Atlas Focusing on Global Metabolic Pathway IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University I. INTRODUCTION

Our Efforts Toward Automatic Global Layout IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University I. INTRODUCTION Not enough to deal with the global pathway! How can we obtain a complete view? No attempts for automatic visualization for Atlas

Related work: KEGG Atlas The map integration process is carried out manually by curators. Based on curator’s experience However, that metabolic networks are dynamic in nature should not be disregarded  Systematic approach is necessary How To Deal With Large-scale Metabolic Pathway? I. INTRODUCTION IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

How To Deal With Large-scale Metabolic Pathway? (con’d) INTRODUCTION Our Strategy We provide a novel algorithmic approach in drawing multiple metabolic pathways by considering two properties: 1. Automatic abstraction criteria: by analyzing a topological nature of metabolic networks based on the graphical property of relation distance, linear reactions were abstracted as a unit reaction. 2. the consistency of highly connected nodes

We obtained 255 map data by parsing KEGG XML (KGML) documents of version 0.6 using our KGML Parser. Two terms were defined: 1. Relation degree the number of edges branching from a node 2. Relation distance a factor to measure the length between any two compounds encompassing nodes which all have relation degrees less than or equal to p (p = 2) KG ML + A dedicated analysis on peripheral nodes with low connectivity was performed. IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University II. T OPOLOGICAL N ATURE OF M ETABOLIC N ETWORKS AT P ERIPHERAL N ODES

Relation Distance Term Clarification II. T OPOLOGICAL N ATURE OF M ETABOLIC N ETWORKS AT P ERIPHERAL N ODES Definition: The length between any two compounds encompassing nodes which all have relation degrees equal to p Here, p = 2 IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

Relation Distance Example in Map II. T OPOLOGICAL N ATURE OF M ETABOLIC N ETWORKS AT P ERIPHERAL N ODES cpd:C01291 cpd:C01290 cpd:C16475 cpd:C16466 cpd:C16470 cpd:C16468 cpd:C16469 cpd:C16471 cpd:C00369 RD(C01290, C00369) = 7 IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

Layout Components according to High Connectivity Basic Motivation Observation: 66.83% of the total compounds within the complete metabolic pathways were of low connectivity, with less than relation degree of 3. The number of compounds with higher relation degree, i.e. more than 6 edges, was much less. Abstracting Compounds With Linear Interaction IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University III. N ODE A BSTRACTION F EATURED S CALE - FREE A LGORITHM

A. Abstracting Compounds With Linear Interaction We abstracted and hid all those compounds that appear within these linear interactions. This approach could be called “chain reduction”(M. Chimani et al) All green compounds in the figure will be hidden in the graph layout according to this approach. IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University III. N ODE A BSTRACTION F EATURED S CALE - FREE A LGORITHM

B. Layout Components according to High Connectivity Input : Metabolic Pathway Graph Output : coordinates of each node void LayoutPathway (Pathway graph) { IF highly connected nodes (N d ) exist in graph LayoutHighlyConnectedNode (graph, N d ); ELSE IF any cycle(N c ) exists in graph AND size of cycle ≥ 6 LayoutCircular (graph, N c ); ELSE LayoutHierarchic (graph); } Highly Connected Nodes: Nodes with relation degree bigger than 6 LayoutHighConnectedNode() Algorithm Steps 1.Find a highly Connected node N d 2.Each component connected to N d is decomposed into sub-graph 3.Each decomposed sub-graph is treated as a super node to apply the spring-embedding algorithm 6 3 IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University III. N ODE A BSTRACTION F EATURED S CALE - FREE A LGORITHM

IV. E XPERIMENTAL RESULTS Experiments : To compare compression rate of compounds, we obtained the number of abstracted compounds and edge crossings by applying two different layout algorithms: Result 1 Node compression rate performance Scope single metabolic pathways 2. 8 major categorized metabolic pathways 3. the global pathway Result 2 The number of edge crossing comparison between by 1. Conventional algorithm 2. Our Node abstraction featured scale-free layout algorithm IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University … single pathways … Categorized pathways Global pathway

III. E XPERIMENTAL RESULTS Pathway Number of Nodes Before Abstraction Number of Nodes After Abstraction Abstraction Rate Carbohydrate Metabolism % Lipid Metabolism % Nucleotide Metabolism % Amino Acid Metabolism % Metabolism of Other Amino Acid % Metabolism of Cofactor and Vitamins % Biosynthesis of Secondary Metabolism % Xenobiotics Biodegradation % Global Pathway (Atlas) % Result 1B The Number of Nodes Before and After Applying Node Abstraction IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University Peripheral path as supplementary nodes

III. E XPERIMENTAL RESULTS Results drawn with Cytoscape, using conventional spring embedding The red-colored edges represent the abstracted edges. (abstraction rate : 70%) Result 1A IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University Original Network Abstracted Network Peripheral path as super edges

III. E XPERIMENTAL RESULTS In single metabolic pathways, the node abstraction featured algorithm reduced edge crossings by 63.31%. In a global metabolic pathway, the number of edge crossings has reached a reduction of 58.08% in total. Our proposed algorithm with node abstraction resulted in 86,067 edge crossings, whereas the one without node abstraction resulted in 205,316 edge crossings. Result 2 : Edge Crossing Reduction IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

IV. D ISCUSSION Two approaches were used: 1. Abstracting compound pairs according to a consistent criteria 2. Layout components according to high connectivity Our experimental results show that node abstraction feature reduced the number of compounds by approximately 23% in global pathway. Further discussion is necessary regarding enzyme reactions IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University

IV. W HY IS OUR WORK IMPORTANT ? The first systematic approach for Atlas visualization focusing on peripheral nodes Fundamental to building a hierarchical structure of Atlas Our approach is flexible upon pathway database change that frequently updates It is a crucial preliminary step toward automatically drawn metabolic pathway Future research on individual biological meaning of each peripheral nodes and abstracted path IEEE BIBM, Dec 2010, Hong Kong Ewha Womans University