Download presentation
Presentation is loading. Please wait.
1
3.3 Network-Centric Community Detection
Block Model Approximation The adjacency matrix can be approximated by a block structure. Each block represents one community We approximate a given adjacency matrix A as follows: π β {0,1} πΓπ is block indicator matrix with π ππ =1 if node π belongs to πth block. Ξ£ is π Γπ matrix indicating the block (group) interaction density π is the number of blocks
2
3.3 Network-Centric Community Detection
Block Model Approximation Natural objective is to minimize the following: The optimal S corresponds to the top k eigenvectors of the adjacency matrix A with maximum eigenvalues. K-means clustering can be applied to S to recover the community partition H .
3
3.3 Network-Centric Community Detection
Spectral Clustering It is derived from the problem of graph partition. Graph partition aims to find out a partition such that the cut is minimized. Cut - the total number of edges between two disjoint sets of nodes The green cut between two sets of nodes {1,2,3,4} and {5,6,7,8,9} is 2 Community detection problem can be reduced to finding the minimum cut in a network. This minimum cut problem can be solved efficiently It often returns imbalanced communities, with one being trivial or a singleton, i.e., a community consisting of only one node. The minimum cut is 1, between {9} and {1, 2, 3, 4, 5, 6, 7, 8} Therefore, the group sizes of communities should be considered.
4
3.3 Network-Centric Community Detection
Spectral Clustering Ratio cut & Normalized cut Graph partition: π=( πΆ 1 , πΆ 2 ,β¦, πΆ π ) s.t. πΆ 1 β© πΆ 2 =β
and π=1 π πΆ π =π πΆ π : the complement of πΆ π ππ’π‘ πΆ π , πΆ π : the number of edges between πΆ π and πΆ π π£ππ πΆ π = π£β πΆ π π π£ The smaller ratio/normalized cut is preferable.
5
3.3 Network-Centric Community Detection
Spectral Clustering Ratio cut & Normalized cut - Examples π 1 =( πΆ 1 ={9}, πΆ 2 ={1,2,3,4,5,6,7,8}) ππ’π‘ πΆ 1 , πΆ 1 =1, ππ’π‘ πΆ 2 , πΆ 2 =1 πΆ 1 =1, πΆ 2 =8 π£ππ πΆ 1 =1, π£ππ πΆ 2 =27 π 2 =( πΆ 1 ={1,2,3,4}, πΆ 2 ={5,6,7,8,9}) ππ’π‘ πΆ 1 , πΆ 1 =2, ππ’π‘ πΆ 2 , πΆ 2 =2 πΆ 1 =4, πΆ 2 =5 π£ππ πΆ 1 =12, π£ππ πΆ 2 =16
6
3.3 Network-Centric Community Detection
Spectral Clustering Finding the minimum ratio cut or normalized cut is NP-hard Both ratio cut and normalized cut can be formulated as a min-trace problem like below Graph Laplacian πΏ is defined as follows: π·=ππππ( π 1 , π 2 , β¦, π π ) Then, π corresponds to the top eigenvectors of πΏ with the smallest eigenvalues.
7
3.3 Network-Centric Community Detection
Modularity Maximization Modularity It measure the strength of a community partition for real-world networks by taking into account the degree distribution of nodes Given a network of π nodes and π edges, the expected number of edges between nodes π£ π and π£ π is π π π π /2π 9 nodes and 14 edges The expected number of edges between nodes 1 and 2 is 3Γ2/(2Γ14) = 3/14. So, π΄ ππ β π π π π /2π measures how far the true network interaction between nodes π and π deviates from the expected random connections
8
3.3 Network-Centric Community Detection
Modularity Maximization Given a group of nodes πΆ, the strength of community effect is defined as If a network is partitioned into k groups, the overall community effect can be summed up as follows Therefore, Modularity is defined as where 1/2π is introduced to normalize the value between -1 and 1. Modularity calibrates the quality of community partitions thus can be used as an objective measure to maximize it
9
3.3 Network-Centric Community Detection
Modularity Maximization Define a modularity matrix π΅ where πβ π
πΓ1 is a vector of each nodeβs degree. Let πβ {0, 1} πΓπ be a community indicator matrix with π ππ =1 if node π belongs to community πΆ π , and π π the πth column of π Modularity can be reformulated as The optimal π can be computed as the top k eigenvectors of the modularity matrix π΅ with the maximum eigenvalues.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.