3.3 Network-Centric Community Detection

Slides:



Advertisements
Similar presentations
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Advertisements

Partitional Algorithms to Detect Complex Clusters
Community Detection with Edge Content in Social Media Networks Paper presented by Konstantinos Giannakopoulos.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Modularity and community structure in networks
Community Detection and Evaluation
Information Networks Graph Clustering Lecture 14.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Clustering II CMPUT 466/551 Nilanjan Ray. Mean-shift Clustering Will show slides from:
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Lecture 21: Spectral Clustering
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts Dhillon, Inderjit S., Yuqiang Guan, and Brian Kulis.
CS 584. Review n Systems of equations and finite element methods are related.
Normalized Cuts and Image Segmentation Jianbo Shi and Jitendra Malik, Presented by: Alireza Tavakkoli.
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
Fast algorithm for detecting community structure in networks.
Segmentation Graph-Theoretic Clustering.
Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.
EDA (CS286.5b) Day 6 Partitioning: Spectral + MinCut.
A scalable multilevel algorithm for community structure detection
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University COT 5410 – Spring 2004.
Image Segmentation Rob Atlas Nick Bridle Evan Radkoff.
Introduction to Graph Theory
Segmentation Techniques Luis E. Tirado PhD qualifying exam presentation Northeastern University.
Spectral coordinate of node u is its location in the k -dimensional spectral space: Spectral coordinates: The i ’th component of the spectral coordinate.
Models and Algorithms for Complex Networks Graph Clustering and Network Communities.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
Learning Spectral Clustering, With Application to Speech Separation F. R. Bach and M. I. Jordan, JMLR 2006.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University.
CS 590 Term Project Epidemic model on Facebook
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Spectral Clustering Shannon Quinn (with thanks to William Cohen of Carnegie Mellon University, and J. Leskovec, A. Rajaraman, and J. Ullman of Stanford.
Mesh Segmentation via Spectral Embedding and Contour Analysis Speaker: Min Meng
A Tutorial on Spectral Clustering Ulrike von Luxburg Max Planck Institute for Biological Cybernetics Statistics and Computing, Dec. 2007, Vol. 17, No.
Normalized Cuts and Image Segmentation Patrick Denis COSC 6121 York University Jianbo Shi and Jitendra Malik.
Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka
Analysis of Large Graphs Community Detection By: KIM HYEONGCHEOL WALEED ABDULWAHAB YAHYA AL-GOBI MUHAMMAD BURHAN HAFEZ SHANG XINDI HE RUIDAN 1.
Density of States for Graph Analysis
Random Walk for Similarity Testing in Complex Networks
Cohesive Subgraph Computation over Large Graphs
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
Minimum Spanning Tree 8/7/2018 4:26 AM
by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72
Greedy Algorithm for Community Detection
June 2017 High Density Clusters.
Network analysis.
Spectral Clustering.
TELCOM2125: Network Science and Analysis
Pattern Recognition PhD Course.
Degree and Eigenvector Centrality
Segmentation Graph-Theoretic Clustering.
Resolution Limit in Community Detection
Grouping.
Discovering Clusters in Graphs
Approximating the Community Structure of the Long Tail
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
Spectral Clustering Eric Xing Lecture 8, August 13, 2010
On Clusterings: Good, Bad, and Spectral
Spectral clustering methods
第八届全国复杂网络学术会议 Spectra of transition matrix for networks: Computation and applications 章 忠 志 复旦大学计算机科学技术学院 Homepage:
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
“Traditional” image segmentation
Presentation transcript:

3.3 Network-Centric Community Detection Block Model Approximation The adjacency matrix can be approximated by a block structure. Each block represents one community We approximate a given adjacency matrix A as follows: 𝑆 ∈ {0,1} 𝑛×𝑘 is block indicator matrix with 𝑆 𝑖𝑗 =1 if node 𝑖 belongs to 𝑗th block. Σ is 𝑘 ×𝑘 matrix indicating the block (group) interaction density 𝑘 is the number of blocks

3.3 Network-Centric Community Detection Block Model Approximation Natural objective is to minimize the following: The optimal S corresponds to the top k eigenvectors of the adjacency matrix A with maximum eigenvalues. K-means clustering can be applied to S to recover the community partition H .

3.3 Network-Centric Community Detection Spectral Clustering It is derived from the problem of graph partition. Graph partition aims to find out a partition such that the cut is minimized. Cut - the total number of edges between two disjoint sets of nodes The green cut between two sets of nodes {1,2,3,4} and {5,6,7,8,9} is 2 Community detection problem can be reduced to finding the minimum cut in a network. This minimum cut problem can be solved efficiently It often returns imbalanced communities, with one being trivial or a singleton, i.e., a community consisting of only one node. The minimum cut is 1, between {9} and {1, 2, 3, 4, 5, 6, 7, 8} Therefore, the group sizes of communities should be considered.

3.3 Network-Centric Community Detection Spectral Clustering Ratio cut & Normalized cut Graph partition: 𝜋=( 𝐶 1 , 𝐶 2 ,…, 𝐶 𝑘 ) s.t. 𝐶 1 ∩ 𝐶 2 =∅ and 𝑖=1 𝑘 𝐶 𝑖 =𝑉 𝐶 𝑖 : the complement of 𝐶 𝑖 𝑐𝑢𝑡 𝐶 𝑖 , 𝐶 𝑖 : the number of edges between 𝐶 𝑖 and 𝐶 𝑖 𝑣𝑜𝑙 𝐶 𝑖 = 𝑣∈ 𝐶 𝑖 𝑑 𝑣 The smaller ratio/normalized cut is preferable.

3.3 Network-Centric Community Detection Spectral Clustering Ratio cut & Normalized cut - Examples 𝜋 1 =( 𝐶 1 ={9}, 𝐶 2 ={1,2,3,4,5,6,7,8}) 𝑐𝑢𝑡 𝐶 1 , 𝐶 1 =1, 𝑐𝑢𝑡 𝐶 2 , 𝐶 2 =1 𝐶 1 =1, 𝐶 2 =8 𝑣𝑜𝑙 𝐶 1 =1, 𝑣𝑜𝑙 𝐶 2 =27 𝜋 2 =( 𝐶 1 ={1,2,3,4}, 𝐶 2 ={5,6,7,8,9}) 𝑐𝑢𝑡 𝐶 1 , 𝐶 1 =2, 𝑐𝑢𝑡 𝐶 2 , 𝐶 2 =2 𝐶 1 =4, 𝐶 2 =5 𝑣𝑜𝑙 𝐶 1 =12, 𝑣𝑜𝑙 𝐶 2 =16

3.3 Network-Centric Community Detection Spectral Clustering Finding the minimum ratio cut or normalized cut is NP-hard Both ratio cut and normalized cut can be formulated as a min-trace problem like below Graph Laplacian 𝐿 is defined as follows: 𝐷=𝑑𝑖𝑎𝑔( 𝑑 1 , 𝑑 2 , …, 𝑑 𝑛 ) Then, 𝑆 corresponds to the top eigenvectors of 𝐿 with the smallest eigenvalues.

3.3 Network-Centric Community Detection Modularity Maximization Modularity It measure the strength of a community partition for real-world networks by taking into account the degree distribution of nodes Given a network of 𝑛 nodes and 𝑚 edges, the expected number of edges between nodes 𝑣 𝑖 and 𝑣 𝑗 is 𝑑 𝑖 𝑑 𝑗 /2𝑚 9 nodes and 14 edges The expected number of edges between nodes 1 and 2 is 3×2/(2×14) = 3/14. So, 𝐴 𝑖𝑗 − 𝑑 𝑖 𝑑 𝑗 /2𝑚 measures how far the true network interaction between nodes 𝑖 and 𝑗 deviates from the expected random connections

3.3 Network-Centric Community Detection Modularity Maximization Given a group of nodes 𝐶, the strength of community effect is defined as If a network is partitioned into k groups, the overall community effect can be summed up as follows Therefore, Modularity is defined as where 1/2𝑚 is introduced to normalize the value between -1 and 1. Modularity calibrates the quality of community partitions thus can be used as an objective measure to maximize it

3.3 Network-Centric Community Detection Modularity Maximization Define a modularity matrix 𝐵 where 𝑑∈ 𝑅 𝑛×1 is a vector of each node’s degree. Let 𝑆∈ {0, 1} 𝑛×𝑘 be a community indicator matrix with 𝑆 𝑖𝑙 =1 if node 𝑖 belongs to community 𝐶 𝑙 , and 𝑠 𝑙 the 𝑙th column of 𝑆 Modularity can be reformulated as The optimal 𝑆 can be computed as the top k eigenvectors of the modularity matrix 𝐵 with the maximum eigenvalues.