Greedy algorithm for obtaining Minimum Feedback vertex set MFVS delete degree 1/0 vertices from V and set remaining vertices to V’ MFVS←  while V’  

Slides:



Advertisements
Similar presentations
Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar
Advertisements

an incremental version of A*
Native-Conflict-Aware Wire Perturbation for Double Patterning Technology Szu-Yu Chen, Yao-Wen Chang ICCAD 2010.
Review Binary Search Trees Operations on Binary Search Tree
Greedy Algorithms Greed is good. (Some of the time)
FPGA Technology Mapping Dr. Philip Brisk Department of Computer Science and Engineering University of California, Riverside CS 223.
Optimization of Pearl’s Method of Conditioning and Greedy-Like Approximation Algorithm for the Vertex Feedback Set Problem Authors: Ann Becker and Dan.
Chapter 23 Minimum Spanning Trees
1 Spanning Trees Lecture 20 CS2110 – Spring
Graph Algorithms: Minimum Spanning Tree We are given a weighted, undirected graph G = (V, E), with weight function w:
Branch and Bound Similar to backtracking in generating a search tree and looking for one or more solutions Different in that the “objective” is constrained.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2001 Lectures 3 Tuesday, 9/25/01 Graph Algorithms: Part 1 Shortest.
University of CreteCS4831 The use of Minimum Spanning Trees in microarray expression data Gkirtzou Ekaterini.
Connected Components, Directed Graphs, Topological Sort Lecture 25 COMP171 Fall 2006.
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
2-Layer Crossing Minimisation Johan van Rooij. Overview Problem definitions NP-Hardness proof Heuristics & Performance Practical Computation One layer:
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
CS Data Structures Chapter 6 Graphs.
Two Discrete Optimization Problems Problem #2: The Minimum Cost Spanning Tree Problem.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Backtracking.
Approximate Labelled Subtree Homeomorphism Based on:  “Approximate Labelled Subtree Homeomorphism” R. Y. Pinter, O.Rokhlenko, D. Tsur, M. Ziv-Ukelson.
The Shortest Path Problem
Important Problem Types and Fundamental Data Structures
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
TECH Computer Science Graph Optimization Problems and Greedy Algorithms Greedy Algorithms  // Make the best choice now! Optimization Problems  Minimizing.
V. V. Vazirani. Approximation Algorithms Chapters 3 & 22
CS 146: Data Structures and Algorithms July 21 Class Meeting
Physical Mapping of DNA Shanna Terry March 2, 2004.
BIBM 2008 Qiong Cheng Georgia State University Joint work with Piotr Berman (Pennstate) Robert Harrison (GSU) Alexander Zelikovsky (GSU) Fast Alignments.
Network Aware Resource Allocation in Distributed Clouds.
CS 3343: Analysis of Algorithms Lecture 21: Introduction to Graphs.
Approximating Minimum Bounded Degree Spanning Tree (MBDST) Mohit Singh and Lap Chi Lau “Approximating Minimum Bounded DegreeApproximating Minimum Bounded.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
QNET: A tool for querying protein interaction networks Banu Dost +, Tomer Shlomi*, Nitin Gupta +, Eytan Ruppin*, Vineet Bafna +, Roded Sharan* + University.
COSC 2007 Data Structures II Chapter 14 Graphs III.
UNC Chapel Hill Lin/Foskey/Manocha Minimum Spanning Trees Problem: Connect a set of nodes by a network of minimal total length Some applications: –Communication.
Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
ReferencesReferences AcknowledgementsAcknowledgements TORQUE server DefinitionsDefinitions MethodsMethods IntroductionIntroduction Experiments & Results.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
Qiong Cheng, Robert Harrison, Alexander Zelikovsky Computer Science in Georgia State University Oct IEEE 7 th International Conference on BioInformatics.
Qiong Cheng, Dipendra Kaur, Robert Harrison, Alexander Zelikovsky Computer Science in Georgia State University Dec RECOMB Satellite Conference.
Clustering Metabolic Networks Using Minimum Cut Trees Ryan Kellogg 1, Allison Heath 2, Lydia Kavraki 2,3 1 Carnegie Mellon University, Department of Electrical.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
Introduction to Graph Theory
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Chapter 10 Graph Theory Eulerian Cycle and the property of graph theory 10.3 The important property of graph theory and its representation 10.4.
Minimum Spanning Trees CS 146 Prof. Sin-Min Lee Regina Wang.
Graphs A graphs is an abstract representation of a set of objects, called vertices or nodes, where some pairs of the objects are connected by links, called.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Ver Chapter 13: Graphs Data Abstraction & Problem Solving with C++
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
Trees Thm 2.1. (Cayley 1889) There are nn-2 different labeled trees
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Indexing and Mining Free Trees Yun Chi, Yirong Yang, Richard R. Muntz Department of Computer Science University of California, Los Angeles, CA {
::Network Optimization:: Minimum Spanning Trees and Clustering Taufik Djatna, Dr.Eng. 1.
CSCI2950-C Lecture 12 Networks
Depth-First Search.
CS 3343: Analysis of Algorithms
1.3 Modeling with exponentially many constr.
Graphs Graph transversals.
CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014
CSE 373 Data Structures and Algorithms
4-4 Graph Theory Trees.
Comparative RNA Structural Analysis
Spectral methods for Global Network Alignment
SEG5010 Presentation Zhou Lanjun.
Minimum Spanning Trees
Presentation transcript:

Greedy algorithm for obtaining Minimum Feedback vertex set MFVS delete degree 1/0 vertices from V and set remaining vertices to V’ MFVS←  while V’   do pick up the set S of maximal degree vertices MFVS ← MFVS U S Delete degree 1/0 vertices from V’  Random degree-conserved graph generation by : Reshuffle nodes Reshuffle edges  Randomized P-Value computation (P-Value cutoff : 0.01 for 100 randomized graphs) Qiong Robert Harrison, Alexander Zelikovsky Department of Computer Science, Georgia State University, Atlanta, GA * Partially supported by GSU Molecular Basis of Disease (MBD) and Brains & Behavior (B&B) Studies in mining conserved subgraph among metabolic pathways References :  Q. Cheng, D. Kaur, R. Harrison, and A. Zelikovsky,"Mapping and Filling Metabolic Pathways ", RECOMB Satellite Conference on Systems Biology 2007  Q. Cheng, R. Harrison, and A. Zelikovsky,"Homomorphisms of Multisource Trees into Networks with Applications to Metabolic Pathways", Proc. of IEEE 7-th International Symposium on BioInformatics and BioEngineering (BIBE'07)  Ron Y Pinter, Oleg Rokhlenko, Esti Yeger-Lotem, Michal Ziv-Ukelson: Alignment of metabolic pathways. Bioinformatics. LNCS Springer-Verlag.(Aug 2005)21(16):  ….. Abstract Graph is a popular theoretic formalism for modeling multidimensional data such as metabolic pathways. Mining conserved subgraph frequently includes comparison of unknown networks with well-understood networks accounting for the similarity of the vertices as well as their corresponding topological structures. Traditional graph comparisons are classified as two types of problems: subgraph isomophism and subgraph homeomorphism. Both problems are known to be NP-complete. However different application domains on graph comparison require different problem formulations and the corresponding solutions. Based on the specific application in metabolic pathways, we focused on mapping of multi-source tree onto multi-source tree, onto DAG(directed acyclic graph) and further mapping among arbitrary graphs. Arising from innate biochemical reaction observation property that several enzymes can catalyze the same reaction and a single enzyme can catalyze several reactions, we formulated the problem to be homeomorphism embedding and allowed the occurrence of different vertices in query graph to be mapped to a vertex in well-known target graph. Our experimental study finds significant conserved pairs in pairwise mapping of all pathways for four organisms (E.coli, S.cerevisiae, B. subtilis and T. thermophilus species). Δ(v i, u j ) if v i is a leaf in T Δ (v i, u j ) + ∑ l=1 to Adj(vi) Min j’=1 to |V T | C(i l, j’) if v i is a leaf in T DT[i, j] = i<|V P’ | j<|V T | C[i l, j l ] = DT[i l, j l ] + (h(j, j l ) - 1) is penalty for gaps h(j, j l ) = #(hops between u j and u j l in T) Homeomorphism embedding of metabolic pathways Fig 1. A portion of pentose phosphate pathway  Metabolic pathway model – a directed graph in which vertices correspond to enzymes and there is a directed edge between two enzymes if the product of the reaction catalyzed by the first enzyme is a substrate of the reaction catalyzed by the second. P T f Solution find minimum feedback vertex set MFVS of P construct a multi source tree P’ = for every possible fixed mapping f’ v : F(P)  V T do obtain min cost homomorphism of P’ to T under mapping f’ v choose min cost homomorphism for all possible fixed mappings  Optimal homeomorphism embedding algorithm  Best homeomorphism embedding of multisource tree to arbitrary graph  Preprocessing of text graph T by transitive closure  Ordering of constructed multisource tree P’ : 1. DFS traversal of P’ 2. Processing order in opposite way with the DFS traversal Key: Each edge e i in P’ is the unique edge connecting v i with the previous vertices in the order  Dynamic programming  Build DP table - DP = V P’ × V T* Each row and column corresponds to a vertex of V P’ and V T* respectively; The columns u 1, …, u |V T * | are sorted in arbitrary order; The rows v 1, …, v |V P’ | are sorted in the special order Every item is the min cost homomorphism from P’ subgraph induced by previous vertices in the order into T *  Filling DP table by recursive function Experiments results  Computing P-Value of homomorphism mapping a b cd a b cd Reshuffle edge  All-against-all mappings among 4 species :  Identifying conserved pathways 24 pathways that are conserved across all 4 species 18 more pathways that are conserved across at least three of these species  Resolving ambiguity (see figure 2)  Discovering pathways holes (see figure 3) Fig 2. Resolving ambiguity example: Mapping of glutamate degradation VII pathways from B.subtilis to T. thermophilus (p<0.01). The enlighted node reflects enzyme homology Fig 3. Pathway holes’ example: Mapping of formaldehyde oxidation V pathway in B. subtilis to formy1THF biosynthesis pathway in E. coli (p<0.01) (only vertices in the image of the pattern in the text are shown. Georgia Tech Poster/Reception session at SIAM Data Mining (2008)  Mapping metabolic pathways - should capture the similarities of enzymes represented by proteins as well as topological properties. We allow different enzymes to be mapped to the same enzyme.  Graph Homeomorphism embedding Let pattern P = and text T = be directed graphs, a homomorphism mapping f : P  G satisfies  f v : V P  V T : Different pattern vertices can be mapped to a single text vertex;  f e : E P  paths of T : An edge in the pattern can be mapped to a path in the text;  Homeomorphism embedding cost = vertex-to-vertex cost + edge-to- path cost Edge-to-path cost = l (|f e (e)|-1), which proportionally increase with the number of extra hops the images of edge Homomorphism cost =  v in V P Δ(v, f v (v)) +  e in E P (|f e (e)|-1) Problem Formulation Given: an arbitrary graph pattern P = and an arbitrary graph text T = Find: min cost homomorphism f : P  T minimize  v in V P Δ(v, f v (v)) +  e in E P (|f e (e)|-1) s.t.  v in V P,  f v (v) in V T  e in E p,  f e (e) in Path T  Runtime Transitive closure takes O(|V T ||E T |) Pattern graph ordering takes O(|V P | + |E P |) Dynamic programming : - Calculate min contribution of all child pairs of node pair (v i ∈ P,u j ∈ T*) takes t ij = deg P (v i )deg G* (u j ) - Filling DT takes  j=1 to |V G |  i=1 to |V T | t ij =  j=1 to |V G | deg G* (u j )  i=1 to |V T | deg T (v i ) = 2|E G* ||E T | The total runtime is O(|V G ||E G |+|V G* ||V T |).  Runtime for obtaining optimal homeomorphism embedding O(|V T | (1+|MFVS|) (|V G ||E G |+|V G* ||V T |)) Let t(v) denote the number of “reasonable” text images of v, total runtime = O(∏ v  MFVS t(v) (|VG||EG|+|VG*||VT|)) Given: an undirected graph G=(V,E) and a nonnegative weight function w on V Find: a minimum weight subset of V whose removal leaves an acyclic graph.  Minimum Feedback vertex set problem Previous work  Mapping : Linear pattern  Graph (Kelly et al 2004) ( o  V P    V T   ) )  Mapping : Tree  Tree (Pinter [2005] o(|V T | 2 |V P |/log|V T |+|V T ||V P |log|V P |) )  Exhaustively search : Sharan et al 2005 (  V P    V T    )  Yang et al 2007 (   V T   V T    )