1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network construction from RNAi data Tamer Kahveci.

Slides:



Advertisements
Similar presentations
The Primal-Dual Method: Steiner Forest TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA A A A AA A A.
Advertisements

An iterative algorithm for metabolic network-based drug target identification Padmavati Sridhar, Tamer Kahveci, Sanjay Ranka Department of Computer and.
Novembro 2003 Tabu search heuristic for partition coloring1/29 XXXV SBPO XXXV SBPO Natal, 4-7 de novembro de 2003 A Tabu Search Heuristic for Partition.
GRAIL: Scalable Reachability Index for Large Graphs VLDB2010 Vineet Chaoji Mohammed J. Zaki.
5/5/20151 Mobile Ad hoc Networks COE 549 Transmission Scheduling II Tarek Sheltami KFUPM CCSE COE
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Does Topology Control Reduce Interference? Martin Burkhart Pascal von Rickenbach Roger Wattenhofer Aaron Zollinger.
1 Processing & Analysis of Geometric Shapes Shortest path problems Shortest path problems The discrete way © Alexander & Michael Bronstein, ©
Automated Layout and Phase Assignment for Dark Field PSM Andrew B. Kahng, Huijuan Wang, Alex Zelikovsky UCLA Computer Science Department
Region Segmentation. Find sets of pixels, such that All pixels in region i satisfy some constraint of similarity.
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
HCS Clustering Algorithm
1 DAOmap: A Depth-optimal Area Optimization Mapping Algorithm for FPGA Designs Deming Chen, Jacon Cong ICCAD 2004 Presented by: Wei Chen.
Network Optimization Problems: Models and Algorithms This handout: Minimum Spanning Tree Problem.
Close Lower and Upper Bounds for the Minimum Reticulate Network of Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
Technion 1 Generating minimum transitivity constraints in P-time for deciding Equality Logic Ofer Strichman and Mirron Rozanov Technion, Haifa, Israel.
2-Layer Crossing Minimisation Johan van Rooij. Overview Problem definitions NP-Hardness proof Heuristics & Performance Practical Computation One layer:
Jana van Greunen - 228a1 Analysis of Localization Algorithms for Sensor Networks Jana van Greunen.
S. Suri, M, Waldvogel, P. Warkhede CS University of Washington Profile-Based Routing: A New Framework for MPLS Traffic Engineering.
Minimum Spanning Trees. Subgraph A graph G is a subgraph of graph H if –The vertices of G are a subset of the vertices of H, and –The edges of G are a.
Timing-sync Protocol for Sensor Networks (TPSN) Presenter: Ke Gao Instructor: Yingshu Li.
Time-Constrained Flooding A.Mehta and E. Wagner. Time-Constrained Flooding: Problem Definition ●Devise an algorithm that provides a subgraph containing.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Introduction to Graph Theory
Genetic Algorithm for Multicast in WDM Networks Der-Rong Din.
Mehdi Kargar Aijun An York University, Toronto, Canada Discovering Top-k Teams of Experts with/without a Leader in Social Networks.
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
CS774. Markov Random Field : Theory and Application Lecture 13 Kyomin Jung KAIST Oct
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
Example Question on Linear Program, Dual and NP-Complete Proof COT5405 Spring 11.
Graph Theory Emily Belyea Paton Vinal Paul Friedman Emily Belyea Paton Vinal Paul Friedman.
The Colorful Traveling Salesman Problem Yupei Xiong, Goldman, Sachs & Co. Bruce Golden, University of Maryland Edward Wasil, American University Presented.
A New Hybrid Wireless Sensor Network Localization System Ahmed A. Ahmed, Hongchi Shi, and Yi Shang Department of Computer Science University of Missouri-Columbia.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
NP-COMPLETE PROBLEMS. Admin  Two more assignments…  No office hours on tomorrow.
Topology Control of Multihop Wireless Networks Using Transmit Power Adjustment Paper By : Ram Ramanathan, Regina Resales-Hain Slides adapted from R. Jayampathi.
University “Ss. Cyril and Methodus” SKOPJE Cluster-based MDS Algorithm for Nodes Localization in Wireless Sensor Networks Ass. Biljana Stojkoska.
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network problems Tamer Kahveci.
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network models Tamer Kahveci.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
1 EE5900 Advanced Embedded System For Smart Infrastructure Static Scheduling.
Bootstrapped Optimistic Algorithm for Tree Construction
Informatics tools in network science
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
6/11/20161 Graph models and efficient exact algorithms in studying cancer signaling pathways Songjian Lu, Lujia Chen, Chunhui Cai Department of Biomedical.
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Finding Dense and Connected Subgraphs in Dual Networks
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Computer Science cpsc322, Lecture 14
Alexander Zelikovsky Computer Science Department
SiS: Significant Subnetworks in Massive Number of Network Topologies
Robustness of wireless ad hoc network topologies
CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014
Graphs Chapter 11 Objectives Upon completion you will be able to:
CSE 373 Data Structures and Algorithms
Incremental Network Querying in Biological Networks
3.5 Minimum Cuts in Undirected Graphs
Topological Sort CSE 373 Data Structures Lecture 19.
Robustness of wireless ad hoc network topologies
5.4 T-joins and Postman Problems
Algorithms (2IL15) – Lecture 7
CSE 373: Data Structures and Algorithms
EE5900 Advanced Embedded System For Smart Infrastructure
Dynamic Graph Algorithms
CSE 373: Data Structures and Algorithms
Presentation transcript:

1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network construction from RNAi data Tamer Kahveci

Signaling Networks 2 MAPK network

Signal reachability 3 Receptor Reporter Luciferase

Signaling and RNA Interference 4 Receptor Reporter Luciferase X Not critical X Critical

Signaling Network Reconstruction from RNAi data 5 Receptor Reporter Not critical Critical

RNAi data and Reference Network 6 Receptor Reporter Not critical Critical Reference network Insert Delete Not consistent ! Consistent !

Overview 7 G R = (V R, E R ) Reference network Constraints G T = (V T, E T ) Target network 10 SiNeC (Signal Network Constructor) S-SiNeC (Scalable Signal Network Constructor) Given Find Goal: Minimize the number of edit operations to make the reference consistent. NP-Complete !

SiNeC algorithm Three steps 1.Order the critical genes left to right based on the topology of G R. [Sloan, 1986] – v 1, v 2, …, v c 2.Edge deletion phase 3.Edge insertion phase 8

Step 1: Order critical genes 9 ReceptorReporter Prioritize based on distance to the reporter + degree

Step 2: Edge deletion 10 Purpose: Eliminate detours around critical genes ReceptorReporter vivi vkvk vjvj Find all (undesirable) paths between non-consecutive critical genes. i.e., Paths which go through only noncritical genes Edges are weighted with the number of such paths they belong to. Remove greedily starting from the largest weight until al paths are disrupted. Bypassed !!!

Step 3: Edge insertion 11 Purpose: Make sure that critical are connected + noncritical genes are consistent ReceptorReporter v i-1 v i+1 vivi Insert an edge from v i-1 to v i if 1.There is no path from v i-1 to v i. 2.There is a noncritical gene on all paths from v i-1 to v i.

Overview 12 G R = (V R, E R ) Reference network Constraints G T = (V T, E T ) Target network 10 SiNeC (Signal Network Constructor) S-SiNeC (Scalable Signal Network Constructor) Given Find Finding all the paths can be too time consuming for large networks

S-SiNeC algorithm 13 Edge insertion 000None A1A1 100A 2 + A 3 + A 4 101A 2 + A 4 110A 3 + A 4 111A4A4 Critical Left reachable Right reachable Edge deletion Reference network vsvs vtvt vivi

S-SiNeC: Edge insertion (A1) 14 Reference network vsvs vtvt vivi L R Purpose: Make sure that noncritical genes are consistent

S-SiNeC: Edge insertion (A2) 15 Reference network vsvs vtvt vivi L R Purpose: Make sure that critical genes are left reachable

S-SiNeC: Edge insertion (A3) 16 Reference network vsvs vtvt vivi L R Purpose: Make sure that critical genes are right reachable

S-SiNeC: Edge insertion 17 L/Re1e1 e2e2 e3e3 1X 2XX 3XX 4XX 5XX 6XX 7X 8X

S-SiNeC: Edge deletion (A4) 18 Reference network vsvs vtvt vivi L R Purpose: Make sure that no detours exist around critical genes Solve minimum cut between L & R

Dataset 19 Reference networks are obtained by random edge shuffling at 5% to 40% mutation rates. 200 references per target network & per mutation rate.

Average distance to the true network 20

Accuracy based on edge class 21 vsvs vtvt Hot Cold

Running time results 22 SiNeC > 1 hour per reference network.

Success rate on constraints 23

24 Accuracy

25 Functional Enrichment of the Pathway

Last Remarks Constructing very large signaling networks from RNAi data is possible in practical running time. Both SiNeC and S-SiNeC are robust to errors in reference network. We recommend – S-SiNeC for very large OR dense networks. – SiNeC otherwise. 26

Acknowledgements 27 CCF IIS