1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.

Slides:



Advertisements
Similar presentations
Introduction to Algorithms Graph Algorithms
Advertisements

Social network partition Presenter: Xiaofei Cao Partick Berg.
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Introduction to Graph Cluster Analysis. Outline Introduction to Cluster Analysis Types of Graph Cluster Analysis Algorithms for Graph Clustering  k-Spanning.
PROCESS MODELLING AND MODEL ANALYSIS © CAPE Centre, The University of Queensland Hungarian Academy of Sciences Analysis of Dynamic Process Models C13.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Modularity and community structure in networks
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
The multi-layered organization of information in living systems
Graph Partitioning Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Information Networks Graph Clustering Lecture 14.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Clustering II CMPUT 466/551 Nilanjan Ray. Mean-shift Clustering Will show slides from:
Author: Jie chen and Yousef Saad IEEE transactions of knowledge and data engineering.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Lecture 21: Spectral Clustering
CS 584. Review n Systems of equations and finite element methods are related.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
HCS Clustering Algorithm
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Fast algorithm for detecting community structure in networks.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
A scalable multilevel algorithm for community structure detection
Data Mining Presentation Learning Patterns in the Dynamics of Biological Networks Chang hun You, Lawrence B. Holder, Diane J. Cook.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computer Science 112 Fundamentals of Programming II Graph Algorithms.
Information Networks Introduction to networks Lecture 1.
Community Detection by Modularity Optimization Jooyoung Lee
Computer Science 112 Fundamentals of Programming II Introduction to Graphs.
Finding dense components in weighted graphs Paul Horn
2015/10/111 DBconnect: Mining Research Community on DBLP Data Osmar R. Zaïane, Jiyang Chen, Randy Goebel Web Mining and Social Network Analysis Workshop.
Models and Algorithms for Complex Networks Graph Clustering and Network Communities.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Sharon Bruckner, Bastian Kayser, Tim Conrad Freie Uni. Berlin Finding Modules in Networks with Non-modular Regions.
Lecture 3 1.Different centrality measures of nodes 2.Hierarchical Clustering 3.Line graphs.
University at BuffaloThe State University of New York Detecting Community Structure in Networks.
Data Structures and Algorithms in Parallel Computing Lecture 7.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
- Murtuza Shareef Authoritative Sources in a Hyperlinked Environment More specifically “Link Analysis” using HITS Algorithm.
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
1 Microarray Clustering. 2 Outline Microarrays Hierarchical Clustering K-Means Clustering Corrupted Cliques Problem CAST Clustering Algorithm.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Network Theory and Dynamic Systems Link Prediction.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Modular organization.
Graph clustering to detect network modules
Cohesive Subgraph Computation over Large Graphs
by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72
June 2017 High Density Clusters.
IDENTIFICATION OF DENSE SUBGRAPHS FROM MASSIVE SPARSE GRAPHS
Community detection in graphs
Network Science: A Short Introduction i3 Workshop
Michael L. Nelson CS 495/595 Old Dominion University
Approximating the Community Structure of the Long Tail
Spectral methods for Global Network Alignment
3.3 Network-Centric Community Detection
Clustering.
Presentation transcript:

1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006

2 Introduction

3 Networks A network: presented by a graph G(V,E): V = nodes, E = edges (link node pairs) Examples of real-life networks: –social networks (V = people) –World Wide Web (V= webpages) –protein-protein interaction networks (V = proteins)

4 Protein-protein Interaction Networks Nodes – proteins (6K), edges – interactions (15K). Reflect the cell’s machinery and signaling pathways.

5 Communities (clusters) in a network A community (cluster) is a densely connected group of vertices, with only sparser connections to other groups.

6 Searching for communities in a network There are numerous algorithms with different "target-functions": –"Homogenity" - dense connectivity clusters –"Separation"- graph partitioning, min-cut approach Clustering is important for understanding the structure of the network –Provides an overview of the network

7 Distilling Modules from Networks Motivation: identifying protein complexes responsible for certain functions in the cell

8 Newman's network division algorithm

9 Important features of Newman's clustering algorithm The number and size of the clusters are determined by the algorithm Attempts to find a division that maximizes a modularity score Q –heuristic algorithm Notifies when the network is non-modular

10 Overview of the algorithm

11 Spectral 2-division algorithm Input: adjacency matrix A (n vertices) Output: a (  1)-vector of size n representing the 2-division –"-1" cluster (vertices whose corresponding entry is -1) and "+1" cluster (vertices whose corresponding entry is +1) Build a modularity matrix B from A Compute the leading eigen-pair (u 1,  1 ) of B –u 1 is the eigen-vector (size n),  1 is the eigen-value. leading eigen-pair: Bu 1 =  1 u 1.  1 is the maximal eigen value If (  1 == 0) => the network is indivisible Else (heuristic...) –Transform u 1 into vector (  1)-vector s –Q = s T Bs –if (Q > 0) return s, else return (+1,....,+1)

12 Dividing into more than 2 How to compute into more than 2? Idea: apply the algorithm recursively* on every group.  The algorithm should be generalized for a 2- division of a group in the network

13 Newman's clustering algorithm P* = {{1,....,n}} (*singleton nodes should be removed) For each group g in P –Remove g from P –Perform a spectral 2-division on g if g is divisible - improve the 2-division by additional heuristic. Add* each subgroup in the 2-division to P * if the subgroup has more than one element, and is different from g.