An Efficient Algorithm Based on Self-adapted Fuzzy C-Means Clustering for Detecting Communities in Complex Networks Jianzhi Jin 1, Yuhua Liu 1, Kaihua.

Slides:



Advertisements
Similar presentations
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Advertisements

Fast algorithm for detecting community structure in networks M. E. J. Newman Department of Physics and Center for the Study of Complex Systems, University.
Partitional Algorithms to Detect Complex Clusters
An Interactive-Voting Based Map Matching Algorithm
Social network partition Presenter: Xiaofei Cao Partick Berg.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Bidding Protocols for Deploying Mobile Sensors Reporter: Po-Chung Shih Computer Science and Information Engineering Department Fu-Jen Catholic University.
Jure Leskovec (Stanford) Kevin Lang (Yahoo! Research) Michael Mahoney (Stanford)
Modularity and community structure in networks
Community Detection Algorithm and Community Quality Metric Mingming Chen & Boleslaw K. Szymanski Department of Computer Science Rensselaer Polytechnic.
Graph Partitioning Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Da Yan, Zhou Zhao and Wilfred Ng The Hong Kong University of Science and Technology.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
V4 Matrix algorithms and graph partitioning
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
MRI Image Segmentation for Brain Injury Quantification Lindsay Kulkin 1 and Bir Bhanu 2 1 Department of Biomedical Engineering, Syracuse University, Syracuse,
Fast algorithm for detecting community structure in networks.
Application of Graph Theory to OO Software Engineering Alexander Chatzigeorgiou, Nikolaos Tsantalis, George Stephanides Department of Applied Informatics.
An Energy-efficient Target Tracking Algorithm in Wireless Sensor Networks Wang Duoqiang, Lv Mingke, Qin Qi School of Computer Science and technology Huazhong.
CHURN PREDICTION MODEL IN RETAIL BANKING USING FUZZY C- MEANS CLUSTERING Džulijana Popović Consumer Finance, Zagrebačka banka d.d. Consumer Finance, Zagrebačka.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Community Structure in Social and Biological Network
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)
Sensor Positioning in Wireless Ad-hoc Sensor Networks Using Multidimensional Scaling Xiang Ji and Hongyuan Zha Dept. of Computer Science and Engineering,
Community Detection by Modularity Optimization Jooyoung Lee
WEMAREC: Accurate and Scalable Recommendation through Weighted and Ensemble Matrix Approximation Chao Chen ⨳ , Dongsheng Li
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Particle Competition and Cooperation for Uncovering Network Overlap Community Structure Fabricio A. Breve 1, Liang Zhao 1, Marcos G. Quiles 2, Witold Pedrycz.
Vladyslav Kolbasin Stable Clustering. Clustering data Clustering is part of exploratory process Standard definition:  Clustering - grouping a set of.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
The 5th annual UK Workshop on Computational Intelligence London, 5-7 September 2005 The 5th annual UK Workshop on Computational Intelligence London, 5-7.
Discovering Community Structure
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Batch kernel SOM and related Laplacian methods for social network analysis Presenter : Lin, Shu-Han Authors.
Uncovering Overlap Community Structure in Complex Networks using Particle Competition Fabricio A. Liang
1/52 Overlapping Community Search Graph Data Management Lab, School of Computer Science
A new Ad Hoc Positioning System 컴퓨터 공학과 오영준.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
1 Deploying Wireless Sensors to Achieve Both Coverage and Connectivity Xiaole Bai*, Santosh Kumar*, Dong Xuan*, Ziqiu Yun +, Ten H. Lai* * Computer Science.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Prepared by: Mahmoud Rafeek Al-Farra
Central China Normal University A Cluster-based and Range Free Multidimensional Scaling-MAP Localization Scheme in WSN 1 Ke Xu, Yuhua Liu ( ), Cui Xu School.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Manoranjan.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Fast accurate fuzzy clustering through data reduction Advisor.
University at BuffaloThe State University of New York Detecting Community Structure in Networks.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
An Effective Method to Improve the Resistance to Frangibility in Scale-free Networks Kaihua Xu HuaZhong Normal University.
Melbourne, Australia, Oct., 2015 gSparsify: Graph Motif Based Sparsification for Graph Clustering Peixiang Zhao Department of Computer Science Florida.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Cluster Analysis This lecture node is modified based on Lecture Notes for Chapter.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
1 An Efficient Optimal Leaf Ordering for Hierarchical Clustering in Microarray Gene Expression Data Analysis Jianting Zhang Le Gruenwald School of Computer.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Selected Topics in Data Networking Explore Social Networks:
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
Efficient Placement and Dispatch of Sensors in a Wireless Sensor Network You-Chiun Wang, Chun-Chi Hu, and Yu-Chee Tseng IEEE Transactions on Mobile Computing.
Community Detection based on Distance Dynamics Reporter: Yi Liu Student ID: Department of Computer Science and Engineering Shanghai Jiao Tong.
Introduction to Data Mining Clustering & Classification Reference: Tan et al: Introduction to data mining. Some slides are adopted from Tan et al.
Multi-index Evaluation Algorithm Based on Locally Linear Embedding for the Node importance in Complex Networks Fang Hu
Graph clustering to detect network modules
Discrete ABC Based on Similarity for GCP
Greedy Algorithm for Community Detection
June 2017 High Density Clusters.
Community detection in graphs
Department of Computer Science University of York
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
3.3 Network-Centric Community Detection
Presentation transcript:

An Efficient Algorithm Based on Self-adapted Fuzzy C-Means Clustering for Detecting Communities in Complex Networks Jianzhi Jin 1, Yuhua Liu 1, Kaihua Xu 2,Fang Hu 1 1 Department of Computer Science, HuaZhong Normal University Wuhan, , China 2 College of Physical Science and Technology, HuaZhong Normal University Wuhan, , China

Outline Introduction Self-adapted Fuzzy C-Means Clustering in Complex Networks Simulations and Analysis Conclusion and Future Works

Introduction(1/6) Many complex networked systems are found to divide naturally into modules or communities, groups of vertices with relatively dense connections within groups but sparser connections between them. Detecting Communities can provide invaluable help in understanding and visualizing the structure of networks

Introduction(2/6) Detecting Communities  Requirements: High efficiency and high accuracy Be based on sound theoretical principles Not allowed to be any cut-node or cut-link

Introduction(3/6) Detecting Communities  Validation Metrics Modularity Accuracy Density

Introduction(4/6) FCM in Complex Networks  Have been applied to detecting communities in recent years  The mainstream algorithm—AFCM, CFCM and NFCM etc.  All use the different variants of Laplacian matrix of the graph

Introduction(5/6) FCM in Complex Networks  Laplacian matrix (N=D-A) is used in AFCM  N=D -1 A is used in CFCM, and N=D -1/2 (D-A) D -1/2 is used in NFCM. D is the diagonal matrix consisting by the degree of all nodes in the whole network, and A is the adjacency matrix of the network.

Introduction(6/6) FCM in Complex Networks  Better clustering accuracy and running efficiency  The synthetic performance is well  Two deficiencies Cannot find the number of clusters to be explored voluntarily Easy to get stuck in a local extremum

Self-adapted Fuzzy C-Means Clustering in Complex Networks(1/5) SFCM in Complex Networks  A new algorithm based on FCM to detecting communities----Self-adapted FCM. Constructing a new validity function to find an optimal number of clusters voluntarily.

Self-adapted Fuzzy C-Means Clustering in Complex Networks(2/5) A New Validity Function  The inter-cluster distances should be as bigger as possible  The intra-cluster distances should be as smaller as possible.

Self-adapted Fuzzy C-Means Clustering in Complex Networks(3/5) Steps of the Algorithm  Step 1 Initialization : termination condition, cluster number,,.  Step 2 The partition matrix was constructed. If there exist j and r, so that, then and for.

Self-adapted Fuzzy C-Means Clustering in Complex Networks(4/5) Steps of the Algorithm  Step 3 The prototypes was calculated.  Step 4 If Then stop the iteration, else let,and go to Step 2.

Self-adapted Fuzzy C-Means Clustering in Complex Networks(5/5) Steps of the Algorithm  Step 5 was calculated under. If is the highest values, then stop the algorithm, else go to Step 2 with. Deficiency  The computable complexity is O(n 3 ).

Simulations and Analysis(1/7) Zachary’s Karate Club Network of American Football Games Tests on Computer-generated Networks

Simulations and Analysis(2/7) Zachary’s Karate Club Square nodes and circle nodes represent the instructor’s faction and the administrator’s faction, respectively. The squares also split into two communities, which are identified by blue and green, in accordance with the circles which are identified by red and yellow.

Simulations and Analysis(3/7) Zachary’s Karate Club  Modularity of all are not high.  Modularity in AFCM is declined substantially.  Modularity in CFCM is lower than NFCM and SFCM. AlgorithmCommunitiesModularityDensity AFCM CFCM NFCM SFCM

Simulations and Analysis(4/7) Network of American Football Games The algorithm can find ten communities, which contain ten conferences almost exactly voluntarily. A total of 11 nodes are unclassified or misclassified, with a red circle marked, and its Accuracy is 90.43%.

Simulations and Analysis(5/7) Network of American Football Games  The modularity calculated by SFCM is higher than others, so does the density. Likewise, the community number of the first three algorithms is pre-specified. AlgorithmCommunitiesModularityDensity AFCM CFCM NFCM SFCM

Simulations and Analysis(6/7) Tests on Computer-generated Networks  RN(c, m, k, p) Where c is the number of communities in the network, m is the number of nodes in each community, k is the degree of each node, and p is the density we presented.

Simulations and Analysis(7/7) Tests on Computer-generated Networks  p is increasing from 0 to 1, the community structure in the network becomes more cohesive.  All algorithms can correctly cluster all the nodes when p was no less than 0.5.  In the range of, the accuracy of SFCM is better than others.

Conclusion and Future Works A new validity function is defined in this algorithm to find an optimal cluster number voluntarily. The simulation results verify that the algorithm is more complete and accurate The higher computable complexity will influence its performance in the end In a further research, we will focus on improving the computability and complexity with less loss of precision, and getting the global optimal solution.

Please Ask Questions Thank you!