Topologically biased random walks with application for community finding Vinko Zlatić Dep. Of Physics, “Sapienza”, Roma, Italia Theoretical Physics Division,

Slides:



Advertisements
Similar presentations
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Advertisements

An Evaluation of Community Detection Algorithms on Large-Scale Traffic 1 An Evaluation of Community Detection Algorithms on Large-Scale Traffic.
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Nonlinear Dimension Reduction Presenter: Xingwei Yang The powerpoint is organized from: 1.Ronald R. Coifman et al. (Yale University) 2. Jieping Ye, (Arizona.
Modularity and community structure in networks
Community Detection Algorithm and Community Quality Metric Mingming Chen & Boleslaw K. Szymanski Department of Computer Science Rensselaer Polytechnic.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Xiaowei Ying Xintao Wu Univ. of North Carolina at Charlotte 2009 SIAM Conference on Data Mining, May 1, Sparks, Nevada Graph Generation with Prescribed.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Mining and Searching Massive Graphs (Networks)
Lecture 21: Spectral Clustering
Lecture 9 Measures and Metrics. Structural Metrics Degree distribution Average path length Centrality Degree, Eigenvector, Katz, Pagerank, Closeness,
Spectral Clustering 指導教授 : 王聖智 S. J. Wang 學生 : 羅介暐 Jie-Wei Luo.
Fast algorithm for detecting community structure in networks.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
A scalable multilevel algorithm for community structure detection
Three Algorithms for Nonlinear Dimensionality Reduction Haixuan Yang Group Meeting Jan. 011, 2005.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
1 Uniform Sampling from the Web via Random Walks Ziv Bar-Yossef Alexander Berg Steve Chien Jittat Fakcharoenphol Dror Weitz University of California at.
Communities and Clustering in some Social Networks Guido Caldarelli SMC CNR-INFM Rome.
The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.
Department of Engineering Science Department of Zoology Soft partitioning in networks via Bayesian nonnegative matrix factorization Ioannis Psorakis, Steve.
Structure Preserving Embedding Blake Shaw, Tony Jebara ICML 2009 (Best Student Paper nominee) Presented by Feng Chen.
Complex network geometry and navigation Dmitri Krioukov CAIDA/UCSD F. Papadopoulos, M. Kitsak, kc claffy, A. Vahdat M. Á. Serrano, M. Boguñá UCSD, December.
Community detection algorithms: a comparative analysis Santo Fortunato.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Discovering Community Structure
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Xiaowei Ying, Xintao Wu Dept. Software and Information Systems Univ. of N.C. – Charlotte 2008 SIAM Conference on Data Mining, April 25 th Atlanta, Georgia.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
Concept Switching Azadeh Shakery. Concept Switching: Problem Definition C1C2Ck …
Community Detection Algorithms: A Comparative Analysis Authors: A. Lancichinetti and S. Fortunato Presented by: Ravi Tiwari.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Mix networks with restricted routes PET 2003 Mix Networks with Restricted Routes George Danezis University of Cambridge Computer Laboratory Privacy Enhancing.
Phase Transitions of Complex Networks and related Xiaosong Chen Institute of Theoretical Physics Chinese Academy of Sciences CPOD-2011, Wuhan.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Network Science K. Borner A.Vespignani S. Wasserman.
Anonymous communication over social networks Shishir Nagaraja and Ross Anderson Security Group Computer Laboratory.
The Structure of Scientific Collaboration Networks by M. E. J. Newman CMSC 601 Paper Summary Marie desJardins January 27, 2009.
A Tutorial on Spectral Clustering Ulrike von Luxburg Max Planck Institute for Biological Cybernetics Statistics and Computing, Dec. 2007, Vol. 17, No.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Tutorial on "GRID Computing“ EMBnet Conference 2008 CNR - ITB GRID distribution supporting chaotic map clustering on large mixed microarray.
Properties and applications of spectra for networks 章 忠 志 复旦大学计算机科学技术学院 Homepage:
Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka
Graph clustering to detect network modules
Random Walk for Similarity Testing in Complex Networks
Cohesive Subgraph Computation over Large Graphs
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
Intrinsic Data Geometry from a Training Set
Groups of vertices and Core-periphery structure
by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72
Random walks on complex networks
Community detection in graphs
Jianping Fan Dept of CS UNC-Charlotte
Overcoming Resolution Limits in MDL Community Detection
Enumeration problems of networks
3.3 Network-Centric Community Detection
Properties and applications of spectra for networks
第八届全国复杂网络学术会议 Spectra of transition matrix for networks: Computation and applications 章 忠 志 复旦大学计算机科学技术学院 Homepage:
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
Presentation transcript:

Topologically biased random walks with application for community finding Vinko Zlatić Dep. Of Physics, “Sapienza”, Roma, Italia Theoretical Physics Division, Institute “Ruđer Bošković”, Zagreb, Croatia CNR-INFM Centro SMC Dipartimento di Fisica, Universita di Roma “Sapienza”, Italy Zlatić, Gabrielli, Caldarelli, arXiv: Andrea Gabrielli Guido Caldarelli

Community Finding Community: One of many (possibly overlapping) subgraphs 1.Has strong internal node-node connections 2.Weaker external connections Q:How to find communities in large networks? Santo Fortunato “Community detection in graphs”, Eprint arXiv: , Physics Reports 486, (2010)

Motivation Distribution of links in the network is locally heterogeneous 1. Finding sets of nodes with similar function 2. Visualization 3. Classification 4. Hierarchical organization Useful in: Systems Biology, Sociology, Computer sciences, Physics...

Community Finding(2) Algorithms: S. Fortunato, arXiv: , Physics Reports 486, (2010) pp.75 Newman-Girvan, removal of links with high beetweeness Different algorithms to maximise Newman modularity Radicchi et al.,removal of links based on local properties Cfinder, Markov cluster algorithm,Potts model...

Spectral Methods ● Idea: distance in the M dimensional space spanned by eigenvectors associated with random walks ● M corresponds to number of eigenvectors used ● Then we can apply standard clustering techniques Manhattan distance, angle distance, etc ● Graph Laplacian L. Donetti and M. A. Munoz, J. Stat. Mech. P10012 (2004).

Problem Community structure can be very hard to detect Mixing Parameter Different performances of algorithms (detectability, speed, size of networks) A. Lancichinetti, S. Fortunato, Phys. Rev. E, 80, (2009). Donetti and Munoz is one of better algorithms

Biased random walks IDEA ● New idea: Different topological quantities have different frequencies in between different communities Why not use this additional information to improve spectral methods??? Example: Edge multiplicity Other possibilities:Shortest path betweeness, subgraph frequencies, degree, clustering, even eigenvectors of nonbiased random walk.

Biased Random walks Unbiased transition operator (Frobenius-Perron operator) Biased transition operator (Frobenius-Perron operator) Exponential familly of transition probabilities Asymptotic probabilitiesDetailed balance condition

Symmetrization(1) Symmetrization leeds to hermitian operator. Orthogonal eigenvectors

Symmetrization(2) Parametric equations of motion D. A. Mazziotti, et al, Journal of Physical Chemistry 99, (1995).

Spectra Spectra contains important information on community structure N separate graphs considered as one have N-fold degeneracy of the first eigenvector Characteristic time to approach stationary distribution is related to spectral gap N communities should produce N-1 close eigenvalues!

Application to community finding(1) Tuning of biases in such a way to maximize the community gap Indeed!!! Close large eigenvalues form clear “community band” for modest values of mixing parameter There is a clear separation between N-1 eigenvalues associated with community structure and rest of the spectra Separation beetween Nth eigenvalue and N+1th eigenvalue we name “community gap”

Application to community finding(2) Every network has its own optimal parameters Tetrahedric structure – description with angles

Conclusion 1.This topic is still “hot” 2. Possibility to include variables related to dynamics on networks 3. Promissing preliminary results (outperform DM 0.60 vs at mixing parameter =0.5) To do: Test different topological variables as basis for biases Develop better clustering algorithm based on distances between nodes.

Thanks for your attention