Assessing Hierarchical Modularity in Protein Interaction Networks

Slides:



Advertisements
Similar presentations
Course Evaluation Form About The Course -Go more slowly (||) -More lectures (||) -Problem Sets, Class Projects (|||) -Software tools About The Instructor.
Advertisements

Cartography of complex networks: From organizations to the metabolism Cartography of complex networks: From organizations to the metabolism Roger Guimerà.
Network analysis Sushmita Roy BMI/CS 576
Social network partition Presenter: Xiaofei Cao Partick Berg.
Gene duplication models and reconstruction of gene regulatory network evolution from network structure Juris Viksna, David Gilbert Riga, IMCS,
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Emergence of Scaling in Random Networks Albert-Laszlo Barabsi & Reka Albert.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Small World Networks Lecture 5.
UC Davis, May 18 th 2006 Introduction to Biological Networks Eivind Almaas Microbial Systems Division.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
University at BuffaloThe State University of New York Young-Rae Cho Department of Computer Science and Engineering State University of New York at Buffalo.
Mining and Searching Massive Graphs (Networks)
Biological Networks Feng Luo.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Gene and Protein Networks II Monday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.
Global topological properties of biological networks.
1 Topology, Functionality and Evolution of Metabolic Networks Jing Zhao Shanghai Center for Bioinformation and Technology 28, September,
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
1 Hierarchical modularity of nested bow- ties in metabolic networks Jing Zhao Shanghai Jiao Tong University Shanghai Center for Bioinformation.
Chapter 3. Community Detection and Evaluation May 2013 Youn-Hee Han
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Neural Networks - Lecture 81 Unsupervised competitive learning Particularities of unsupervised learning Data clustering Neural networks for clustering.
Network Evolution Statistics of Networks Comparing Networks Networks in Cellular Biology A. Metabolic Pathways B. Regulatory Networks C. Signaling Pathways.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Class 19: Degree Correlations PartII Assortativity and hierarchy
Network resilience.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
Informatics tools in network science
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Gene and Protein Networks I Wednesday, April CSCI 4830: Algorithms for Molecular Biology Debra Goldberg.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Introduction to Machine Learning Nir Ailon Lecture 12: EM, Clustering and More.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Graph clustering to detect network modules
Structures of Networks
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Semi-Supervised Clustering
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Groups of vertices and Core-periphery structure
Biological networks CS 5263 Bioinformatics.
Frontiers of Network Science Class 15: Degree Correlations II
Applications of graph theory in complex systems research
Network biology : protein – protein interactions
Network Analysis of Protein-Protein Interactions
Community detection in graphs
Building and Analyzing Genome-Wide Gene Disruption Networks
Social Network Analysis
CSE572, CBS598: Data Mining by H. Liu
Peer-to-Peer and Social Networks Fall 2017
Department of Computer Science University of York
Artificial Intelligence Lecture No. 28
Microarray Data Set The microarray data set we are dealing with is represented as a 2d numerical array.
CSE572: Data Mining by H. Liu
Presentation transcript:

Assessing Hierarchical Modularity in Protein Interaction Networks Young-Rae Cho, Woochang Hwang, Murali Ramanathan and Aidong Zhang State University of New York at Buffalo

Scale-free & Modular Networks Scale-free Networks Power Law degree distribution4,1: P(k) ~ k –γ with 2 < γ < 3 Disassortativity4,1,5 Frequent connections between a hub and a peripheral node Infrequent connections between two hubs Small-world property11,1: Average geodesic path length l ~ log log N Modular Networks3,8,1 Here we map the samples and the genes into 2-dimensional space. As we can see, the genes has some dense area, if we remove the outliers and zoom in the dense area, we will find detailed dense area and some outliers. So the gene distribution has some hierarchical-dense structure. But the samples are very sparse in high-dimensional space. Even mapped into 2-dimensional space, there are no class structure can be detected. We can partition the sample by many hyperplane, but cannot judge which partition is better. So the the techniques that are effective for gene-based analysis are not adequate for analyzing samples. Effective and efficient sample-based analysis remains a challenging problem. Bridges Bridging Nodes

Bridge Measurement Global Measurement Local Measurement Betweenness Centrality7: Local Measurement Clustering Coefficient11: Neighbor Significance: , Similarity of Nodes: Combined Bridge Measurement Bridging Nodes: Bridges: The existing methods of selecting informative genes to cluster samples fall into two major categories: supervised analysis and unsupervised analysis. The supervised approach assumes that additional information is attached to some (or all) data, for example, that biological samples are labeled as diseased vs. normal. The most famous supervised method is the neighborhood analysis method which is a science paper published in 1999 and it stimulate the research of sample phenotype detection. Other supervised method include: tree harvesting, support vector machines, decision tree method, genetic algorithm, the artificial neural networks, and a variety of ranking based methods. The basic steps of these supervised methods is first select a subset of samples as the training set, using the phenotypes as a reference to select a small percent of informative genes which manifest the phenotype partition within the training samples. Finally, the whole set of samples are grouped according to the selected informative genes.

Hierarchical Modularization Constraint ∆C = C(G) – C(G’) ≥ 0 if v is a bridging node where V’ = V – {v} C(G): Average clustering coefficient of the nodes in graph G. C(G’): Average clustering coefficient of the nodes in the reduced graph G’, in which v with the highest BR(v) is removed. Algorithm Successive Removal of Bridging Nodes if ∆C < 0 Successive Removal of Bridges No if G is split into Gi’ G Gi’ > θsize Gi’ Therefore, it is natural to ask “Can we find both the phenotypes and the informative genes automatically at the same time?” For example, if the samples’ phenotypes are unknown, can we correctly distinguish Three classes of samples, as well as output gene1 ~gene3 as informative genes? Unsupervised sample clustering is much more complex than supervised manner. Yes if G is split into Gi’ Replace each Gi’ to G

Topological Analysis Data: Core protein interaction data of Saccharomyces cerevisiae from DIP10,2 Method: Remove v with the highest BR(v) and compute C(G), iteratively. Results: I & II (~ 30%): Bridging and interconnecting node removal zone. III (~ 30%): Core node removal zone. IV (~ 40%): Peripheral node removal zone.

Biological Analysis Data: Core protein interaction data of Saccharomyces cerevisiae from DIP10,2 Method: Remove a set S of nodes v with the highest BR(v) and compute the proportion of lethal proteins in S, iteratively. Results: Bridging and interconnecting nodes are less lethal than core nodes. Bridging and interconnecting nodes are more lethal than peripheral nodes.

Modularization Results 6,9

References Barabasi, A.-L. and Oltvai, Z. N., Nature Reviews: Genetics (2004) Dean, C. M., Salwinski, L., Xenarios, I. and Eisenberg, D., Molecular and Cellular Proteomics (2002) Hartwell, L. H., Hopfield, J. J., Leibler, S. and Murray, A. W., Nature (1999) Jeong, H., Mason, S. P., Barabasi, A.-L. and Oltvai, Z. N., Nature (2001) Maslov, S. and Sneppen, K., Science (2002) Mewes, H. W., at al., Nucleic Acid Research (2006) Newman, M. E. J., Physical Review E (2001) Ravasz, E., Somera, A. L., Mongru, D. A., Oltvai, Z. N. and Barabasi, A.-L., Science (2002) Ruepp, A., at al., Nucleic Acid Research (2004) Salwinski, L., at al., Nucleic Acid Research (2004) Watts, D. J. and Strogatz, S. H., Nature (1998)