Community Detection Definition: Community Detection Girwan Newman Approach Hierarchical Clustering
Community Detection
Community Detection: Girvan Newman Approach
Girwan Newman Approach: Modularity
Variations of Girwan Newman Algorithm
Hierarchical Clustering Algorithms Important: 1.Clustering obtained by cutting the dendo-gram at a desired level. 2.Each connected component forms a cluster. 3.Basically we use some similarity threshold to get the clusters of desired quality.
Hierarchical Clustering Algorithms
Variations of Agglomerative clustering-1
Variations of Agglomerative clustering-2
Variations of Agglomerative clustering-3
Strength & Weaknesses Strengths of Hierarchical Clustering: Do not require information regarding number of clusters. Any desired number of clusters can be obtained by ‘cutting’ the dendogram at the proper level. Weaknesses: Not efficient -- the complexity is O(n^2). Once a decision is made to combine two clusters, it cannot be undone. No objective function is directly minimized.
Pseudo code – Agglomerative Clustering Source: Walter et. al.; Fast Agglomerative Clustering for Rendering
Pseudo code Pseudo code for Agglomerative Clustering, based on KD-Tree
References P.-N. Tan, M. Steinbach, and V. Kumar, editors. "Introduction to Data Mining." Pearson Addison Wesley, B. Walter, K. Bala, M. Kulkarni, and K. Pingali. "Fast Agglomerative Clustering for Rendering." IEEE Symposium on Interactive Ray Tracing, Girvan M. and Newman M. E. J., Community structure in social and biological networks, Proc. Natl. Acad. Sci. USA 99, 7821– 7826 (2002) Andrea Lancichinetti and Santo Fortunato (2011). "Limits of modularity maximization in community detection". Physical Review E 84: arXiv: Survey article Communities in Networks (Notices of the American Mathematical Society.