How to reconstruct a large genetic network from n gene perturbations in fewer than easy steps Andreas Wagner 2001 Ren-Hua Chung Department of Computer.

Slides:



Advertisements
Similar presentations
Introduction to Algorithms Graph Algorithms
Advertisements

CS 336 March 19, 2012 Tandy Warnow.
Great Theoretical Ideas in Computer Science for Some.
Analysis of Algorithms CS 477/677
DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
1 Constructing Splits Graphs Author: Andreas W.M. Dress Daniel H. Huson Presented by: Bakhtiyar Uddin.
Bayesian Networks, Winter Yoav Haimovitch & Ariel Raviv 1.
Theory of Computing Lecture 18 MAS 714 Hartmut Klauck.
Great Theoretical Ideas in Computer Science for Some.
Topological Sort Topological sort is the list of vertices in the reverse order of their finishing times (post-order) of the depth-first search. Topological.
Lecture 16: DFS, DAG, and Strongly Connected Components Shang-Hua Teng.
More Graphs COL 106 Slides from Naveen. Some Terminology for Graph Search A vertex is white if it is undiscovered A vertex is gray if it has been discovered.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 CHAPTER 4 - PART 2 GRAPHS 1.
Noga Alon Institute for Advanced Study and Tel Aviv University
CS 206 Introduction to Computer Science II 11 / 11 / Veterans Day Instructor: Michael Eckmann.
Directed Graphs. Given vertices u and v of a digraph G , we say that u reaches v (and v is reachable from u ) if G  has a directed path from u to.
Chapter 23 Minimum Spanning Trees
On Computing Compression Trees for Data Collection in Wireless Sensor Networks Jian Li, Amol Deshpande and Samir Khuller Department of Computer Science,
What is the next line of the proof? a). Let G be a graph with k vertices. b). Assume the theorem holds for all graphs with k+1 vertices. c). Let G be a.
Discover a Network by Walking on it! Andrea Asztalos & Zoltán Toroczkai Department of Physics University of Notre Dame Department of Physics University.
1 Data Structures DFS, Topological Sort Dana Shapira.
CSE 780 Algorithms Advanced Algorithms Graph Alg. DFS Topological sort.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2001 Wednesday, 9/26/01 Graph Basics.
Computability and Complexity 20-1 Computability and Complexity Andrei Bulatov Class NL.
Chapter 9 Graph algorithms Lec 21 Dec 1, Sample Graph Problems Path problems. Connectedness problems. Spanning tree problems.
Goal: Reconstruct Cellular Networks Biocarta. Conditions Genes.
Is the following graph Hamiltonian- connected from vertex v? a). Yes b). No c). I have absolutely no idea v.
1 Brief Announcement: Distributed Broadcasting and Mapping Protocols in Directed Anonymous Networks Michael Langberg: Open University of Israel Moshe Schwartz:
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
Lecture 15: Depth First Search Shang-Hua Teng. Graphs G= (V,E) B E C F D A B E C F D A Directed Graph (digraph) –Degree: in/out Undirected Graph –Adjacency.
Bayesian Networks Alan Ritter.
CS2420: Lecture 36 Vladimir Kulyukin Computer Science Department Utah State University.
The Shortest Path Problem
Graph Algorithms Using Depth First Search Prepared by John Reif, Ph.D. Distinguished Professor of Computer Science Duke University Analysis of Algorithms.
TEDI: Efficient Shortest Path Query Answering on Graphs Author: Fang Wei SIGMOD 2010 Presentation: Dr. Greg Speegle.
Lecture 12-2: Introduction to Computer Algorithms beyond Search & Sort.
I/O-Efficient Graph Algorithms Norbert Zeh Duke University EEF Summer School on Massive Data Sets Århus, Denmark June 26 – July 1, 2002.
Spring 2015 Lecture 10: Elementary Graph Algorithms
Trees and Distance. 2.1 Basic properties Acyclic : a graph with no cycle Forest : acyclic graph Tree : connected acyclic graph Leaf : a vertex of degree.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Lecture 11 Algorithm Analysis Arne Kutzner Hanyang University / Seoul Korea.
Presenter : Kuang-Jui Hsu Date : 2011/3/24(Thur.).
The Structure of the Web. Getting to knowing the Web How big is the web and how do you measure it? How many people use the web? How many use search engines?
1 Use graphs and not pure logic Variables represented by nodes and dependencies by edges. Common in our language: “threads of thoughts”, “lines of reasoning”,
A randomized linear time algorithm for graph spanners Surender Baswana Postdoctoral Researcher Max Planck Institute for Computer Science Saarbruecken,
CSIE in National Chi-Nan University 1 How to Reconstruct a Large Genetic Network from n Gene Perturbations in fewer than n 2 Easy Steps Speaker: Chuang.
Great Theoretical Ideas in Computer Science for Some.
Lecture 19 Minimal Spanning Trees CSCI – 1900 Mathematics for Computer Science Fall 2014 Bill Pine.
Graph Concepts and Algorithms Using LEDA By Caroline Moore and Carmen Frerichs (252a-at and 252a-ao) each graph in the presentation was created using gw_basic_graph_algorithms.
1 GRAPHS – Definitions A graph G = (V, E) consists of –a set of vertices, V, and –a set of edges, E, where each edge is a pair (v,w) s.t. v,w  V Vertices.
Introduction to Algorithms
Algorithms and Networks
CS 201: Design and Analysis of Algorithms
Proof technique (pigeonhole principle)
Great Theoretical Ideas In Computer Science
Computing Connected Components on Parallel Computers
Michael Langberg: Open University of Israel
What is the next line of the proof?
Network Flow.
Graph Algorithms Using Depth First Search
Lecture 10 Algorithm Analysis
Reference based assembly
CS223 Advanced Data Structures and Algorithms
Graph Theory By Amy C. and John M..
Data Structures & Algorithms Digraphs and DAGs
Discrete Mathematics for Computer Science
Algorithms Lecture # 27 Dr. Sohail Aslam.
DAGs Longin Jan Latecki
Presentation transcript:

How to reconstruct a large genetic network from n gene perturbations in fewer than easy steps Andreas Wagner 2001 Ren-Hua Chung Department of Computer Science

Direct vs. Indirect Relationships How can we distinguish between direct and indirect relationships in a network based on microarray data? Additional Assumption needed Next: minimize # relationships

Perturbation Static Graph Model Motivation: perturb a gene network one gene at a time and use the effected genes in order to discriminate direct vs. indirect gene-gene relationships Perturbations: gene knockouts, over-expression, etc. Method: 1.For each gene,compare the control experiment to perturbed experiment and identify the differentially expressed genes 2.Use the most parsimonious graph that yields the graph of 1. as its reachable graph

Parsimony Assumptions The direct relationship graph: - is random (ER graphs) - is scale-free (Power law) - has the smallest number of edges This paper is based on the third assumption

An example (a) gene network (b) adjacency list (c) accessibility list Goal: (c) -> (a)

An accessibility list with three possible graphs Third assumption: choose the most parsimonious graph

Algorithm Step1: Graphs without cycles only (acyclic directed graph) Step2: Graphs with cycles

Step 1 Shortcut: e A shortcut-free graph compatible with an accessibility list is a unique graph with the fewest edges among all graphs compatible with the accessibility list, i.e, a shortcut-free graph is the most parsimonious graph.

Step1 A theorem: Let Acc(G) be the accessibility list and Adj(G) be the adjacency list at an acyclic directed graph, its most parsimonious graph, and V( ) the set of all nodes of. Then the following identity holds

An example Adj(1) = Acc(1) - (Acc(2) + Acc(3) + Acc(4) + Acc(5) +Acc(6)) = (2,3,4,5,6) – (3 (5,6) 6) = (2,4)

Step 1 A Corollary: Let i, j, and k be any three pairwise different nodes of an acyclic directed shortcut-free graph G. If j is accessible from i, then no node k accessible from j is adjacent to i. i j k

The algorithm of step 1

Step 2: How about graphs with cycles? Two different cycles have the same accessibility list

Step 2: Graphs with cycles Perturbations of any gene in the cycle influences the activity of all other genes in the same cycle. Can’t decide a unique graph if cycle happens Not an algorithmic but an experimental limitation.

The algorithm of step 2 Basic idea: Shrink each cycles (strongly connected components) into one node and apply the algorithm of step 1.

The algorithm of step 2 A corollary: Let i and j (i j) be two nodes of a directed graph G. i and j are in the same component iff i Acc(j) and j Acc(i). A graph after shrinking all the cycles into nodes is called a condensation graph.

The algorithm of step 2

Missing genes and messy data Some genes are difficult to perturb Problem: some information is missing for certain genes. How well does the algorithm perform in such cases? Simulation: Randomly generate graphs with pre- specified nodes and edges. Then eliminate pre- specified fraction of nodes from the accessibility list. Apply the algorithm to both graphs without elimination and with elimination.

Simulation result

Limitation of the algorithm Unable to resolve cycled graphs Require more data than conventional methods using gene expression correlations. There are many networks consistent with the given accessibility list. The algorithm construct the most parsimonious one.

Better algorithms? The same problem was proposed around 1980 which is called “transitive reduction”. The transitive reduction of a directed graph G is the directed graph G' with the smallest number of edges such for every path between vertices in G, G' has a path between those vertices. An O(V) algorithm for computing transitive reduction of a planar acyclic digraph was proposed by Sukhamay Kundu. (V is the number of nodes in G)