Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.
Molecular Systems Biology 3; Article number 140; doi: /msb
Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
Putting genetic interactions in context through a global modular decomposition Jamal.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Chapter 9 Graph algorithms Lec 21 Dec 1, Sample Graph Problems Path problems. Connectedness problems. Spanning tree problems.
Clustering (Part II) 11/26/07. Spectral Clustering.
A scalable multilevel algorithm for community structure detection
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Dependency networks Sushmita Roy BMI/CS 576 Nov 26 th, 2013.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
Module 5 – Networks and Decision Mathematics Chapter 23 – Undirected Graphs.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Identifying Causal Genes and Dysregulated Pathways in Complex Diseases Discussion leader: Nafisah Islam Scribe: Matthew Computational Network Biology BMI.
Dependency networks Sushmita Roy BMI/CS 576 Nov 25 th, 2014.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
The Bi-Module problem: new algorithms and applications Group meeting January 2013 David Amar.
1 Latency-Bounded Minimum Influential Node Selection in Social Networks Incheol Shin
Data Structures & Algorithms Graphs Richard Newman based on book by R. Sedgewick and slides by S. Sahni.
Learning disjunctions in Geronimo’s regression trees Felix Sanchez Garcia supervised by Prof. Dana Pe’er.
Algorithms For Solving History Sensitive Cascade in Diffusion Networks Research Proposal Georgi Smilyanov, Maksim Tsikhanovich Advisor Dr Yu Zhang Trinity.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Random Geometric Graph Model Model for ad hoc/sensor networks n nodes placed in d-dim space Connectivity threshold r Two nodes u,v connected iff ||u-v||
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
A Combinatorial Approach to the Analysis of Differential Gene Expression Data The Use of Graph Algorithms for Disease Prediction and Screening.
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Seojin Bang. The goal of this review paper is.. To address problems and computational solutions that arise in analysis of omics data. To highlight fundamental.
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Graph clustering to detect network modules
Spectral clustering of graphs
Hierarchical Agglomerative Clustering on graphs
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Dynamics and context-specificity in biological networks
Network-based interpretation of perturbations to living systems
Graph Algorithm.
Approximating the Community Structure of the Long Tail
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Spectral methods for Global Network Alignment
Presentation transcript:

Network applications Sushmita Roy BMI/CS Dec 9 th, 2014

RECAP from last time We talked about modularity, degree distribution, and path-based measures to study properties of graphs Today we will touch briefly on the many different applications of networks.

Applications of Networks Differential subgraph identification – Given gene expression studies from disease and normal studies – Identify pathways that are most differentially altered between these conditions Module detection – Dense subgraph identification Interpretation of gene sets Identification of novel pathways – Set cover based methods Network information flow – Sparse subgraph identification Integration of different data sets Interpretation of gene sets Prioritization of genes Adapted from Network biology approaches to Complex Diseases.

Identifying differential networks Given – Gene expression data from two conditions: treatment and control Do – Find the subnetworks are significantly different between conditions

Motivating example Metastases is a major cause of death among breast cancer patients Gene expression levels of marker genes can be used to predict metastasis state, but are 60-70% accurate Because cancer is a complex disease, involving multiple genes, incorporating interactions among them might improve predictive power

Tasks for identifying differential networks Define a measure of activity for a subnetwork – Sum of activity of individual genes Define a measure of differential activity between two conditions – Chuang et al, used an information theoretic measure to information between the class variable and the discretized expression of a subgraph

Identifying differential subgraphs in cancer Chuang et al, 2007 Subnetworks were obtained using a greedy process

Improved predictive power using subnetworks as classification features AUC

Identification of modules dysregulated in diseases Cho D-Y, Kim Y-A, Przytycka TM (2012) Chapter 5: Network Biology Approach to Complex Diseases. PLoS Comput Biol 8(12): e doi: /journal.pcbi

Problem definition of identifying dysregulated modules Given – A network of possibly weighted gene interactions and a set of genetic alterations Do – Find a subgraph that best connects these genetic alterations Best can mean – A dense subgraph with large number of connections among the vertices (genes) – A sparse subgraph with few intermediate nodes

Some approaches for finding dense subgraphs HOTNET: A set cover approach – Best: use fewest possible external nodes NETBAG: Network Based Analysis of Genetic Associations

Set cover approach A gene is said to be covered by a disease if it is dysregulated in the disease sample Main idea: Each disease has a set of dysregulated genes Different cases have different covered gene sets Identify the smallest set that covers as many disease samples as possible

HOTNET optimizes two criteria Genes in a perturbed pathway must be – close to each other – Must have a few alternate paths between them

HOTNET heuristic Mutations in a linear chain are more “interesting” than in a star graph

Steps in HOTNET Obtain an influence score of vertex i on vertex j Weight each edge by the influence score Remove all edges with weight less than a threshold δ Find the subgraph of size k that covers as many samples as possible

Computing the influence between vertex pairs Assume we have two vertices u and v Influence can be defined based on how fast a signal at u reaches v Diffusion of this signal is governed by the connectivity of the graph based on what is called the Graph Laplacian

HOTNET algorithm Exploration Keep adding a neighbor that has the maximal coverage with fewest additional vertices

HOTNET recovers pathways relevant to cancer

Summary Given an existing interaction network, many network applications are based on overlaying information on these networks We talked about two applications – Identifying differential networks – Identifying a subnetwork that best explains a set of mutations in patient samples