by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72

Slides:



Advertisements
Similar presentations
Community Detection with Edge Content in Social Media Networks Paper presented by Konstantinos Giannakopoulos.
Advertisements

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Modularity and community structure in networks
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Community Detection Algorithm and Community Quality Metric Mingming Chen & Boleslaw K. Szymanski Department of Computer Science Rensselaer Polytechnic.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Introduction of Probabilistic Reasoning and Bayesian Networks
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Lecture 17 Introduction to Eigenvalue Problems
Overview of Markov chains David Gleich Purdue University Network & Matrix Computations Computer Science 15 Sept 2011.
More on Rankings. Query-independent LAR Have an a-priori ordering of the web pages Q: Set of pages that contain the keywords in the query q Present the.
Introduction to PageRank Algorithm and Programming Assignment 1 CSC4170 Web Intelligence and Social Computing Tutorial 4 Tutor: Tom Chao Zhou
Discovering Overlapping Groups in Social Media Xufei Wang, Lei Tang, Huiji Gao, and Huan Liu Arizona State University.
Chapter 4. Numerical Interpretation of Eigenvalues In terms of matrix arithmetic eigenvalues turn matrix multiplication into scalar multiplication. Numerically.
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Topologically biased random walks with application for community finding Vinko Zlatić Dep. Of Physics, “Sapienza”, Roma, Italia Theoretical Physics Division,
Link Analysis, PageRank and Search Engines on the Web
1 Markov Chains Algorithms in Computational Biology Spring 2006 Slides were edited by Itai Sharon from Dan Geiger and Ydo Wexler.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Department of Engineering Science Department of Zoology Soft partitioning in networks via Bayesian nonnegative matrix factorization Ioannis Psorakis, Steve.
School of Electronics Engineering and Computer Science Peking University Beijing, P.R. China Ziqi Wang, Yuwei Tan, Ming Zhang.
Lecture7 Topic1: Graph spectral analysis/Graph spectral clustering and its application to metabolic networks Topic 2: Concept of Line Graphs Topic 3: Introduction.
Spectral coordinate of node u is its location in the k -dimensional spectral space: Spectral coordinates: The i ’th component of the spectral coordinate.
Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
Markov Cluster (MCL) algorithm Stijn van Dongen.
Presenter : Lin, Shu-Han Authors : Jeen-Shing Wang, Jen-Chieh Chiang
Numerical Analysis – Eigenvalue and Eigenvector Hanyang University Jong-Il Park.
Community Detection Algorithms: A Comparative Analysis Authors: A. Lancichinetti and S. Fortunato Presented by: Ravi Tiwari.
Lecture 2: Statistical learning primer for biologists
Christopher M. Bishop, Pattern Recognition and Machine Learning 1.
CS 590 Term Project Epidemic model on Facebook
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Document Clustering with Prior Knowledge Xiang Ji et al. Document Clustering with Prior Knowledge. SIGIR 2006 Presenter: Suhan Yu.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Modular organization.
Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka
High Performance Computing Seminar
Graph clustering to detect network modules
Hierarchical Agglomerative Clustering on graphs
Discrete ABC Based on Similarity for GCP
Industrial Engineering Dep
Random Walks on Graphs.
Search Engines and Link Analysis on the Web
Markov Chains Mixing Times Lecture 5
Mutual inconsistency Due to the possible breakdown of the communication links, the system can be divided into several sub-systems (or sub-networks). There.
Geometric Model for Distributive Property
PageRank and Markov Chains
Greedy Algorithm for Community Detection
Network analysis.
Spectral Clustering.
Community detection in graphs
Jianping Fan Dept of CS UNC-Charlotte
Finding modules on graphs
Hidden Markov Models Part 2: Algorithms
Critical Issues with Respect to Clustering
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
Ilan Ben-Bassat Omri Weinstein
Matrix Algebra and Random Vectors
Graph-Based Anomaly Detection
3.3 Network-Centric Community Detection
Scale-Space Representation for Matching of 3D Models
Properties and applications of spectra for networks
Authors: Jinliang Fan and Mostafa H. Ammar
Analysis of Large Graphs: Overlapping Communities
Presentation transcript:

by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72 Dependence clustering, a method revealing community structure with group dependence by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72 2016.08.24 Kiburm Song, Ph.D. student Intelligent Data Systems Lab. Department of Industrial Engineering Hanyang University

Introduction Community structure detection in networks Most similarity based clustering algorithms require to set predefined number of clusters

Modularity-based clustering by Newman (2006)1 Introduction Modularity-based clustering by Newman (2006)1 Modularity measures how precise a division of the network is against a graph with edges placed at random Drawback assuming that each node can be linked to any other nodes of the network whether they are large or small regardless of the geometric structure of the network it cannot adjust the level of resolution or the scale on which the modularity measure relies 1Newman, M. E. (2006). Modularity and community structure in networks. Proceedings of the national academy of sciences, 103(23), 8577-8582.

Dependence clustering - dependence : an undirected graph Assuming the graph as a Markov chain the whole chain is ergodic and all transitions follow the Markovian property

Dependence clustering - dependence Figure 1 from Park & Lee (2014) Park, H. & Lee, K. (2014). Dependence clustering, a method revealing community structure with group dependence. Knowledge-Based Systems 60, 58-72

Dependence clustering - dependence Dependence captures how 𝑥 𝑚 in the initial state is inter-dependent with 𝑥 𝑖 at the 𝑡th step: Dep < 1: negative dependence Dep = 1: independence Dep > 1: positive dependence

Dependence clustering – group dependence captures how much data points in each group are dependent on each other measures overall coherence in terms of dependence from group assignments for the whole data set at the t-step transition 𝑠 𝑖 =1 if data point 𝑖 belongs to group1 𝑠 𝑖 =−1 if data point 𝑖 belongs to group2 1 2 ( 𝑠 𝑖 𝑠 𝑗 +1) is 1 if 𝑖 and 𝑗 are in the same group 𝒔= 𝑠 1 ,…, 𝑠 𝑛 : group assignment vector

Dependence clustering – group dependence To explain the simplified way to compute 𝐷 𝑡 need to describe transition matrix 𝑊 𝑖,𝑗 : adjacency matrix consists of nonnegative elements represent the closeness between point 𝑖 and point 𝑗 𝑃 𝑖,𝑗 : transition matrix ( 𝑃 𝑖,𝑗 =Pr 𝑋 1 =𝑗 𝑋 0 =𝑖), 𝑗 𝑃 𝑖,𝑗 =1 )

Dependence clustering – group dependence To overcome computational hurdle in computing 𝑃 𝑡 , we used one-time spectral decomposition eigenvector eigenvalues

Dependence clustering – group dependence

Dependence clustering – group dependence Division of data points is made based on the signs of the eigenvector corresponding to the largest positive eigenvalue of 𝐺

Dependence clustering – Illustrating example

Dependence clustering – Illustrating example

Dependence clustering – Multiple groups a dependence gain parameter 𝛿 𝑑 ∈ 0, 1 minimal dependence gain ∆ 𝑑 = 𝛿 𝑑 𝑔 𝑖𝑗 >0 𝐺 a set of points belonging to a candidate cluster Ω 𝐶 ⊆Ω The division into the sub-clusters proceeds if 𝐷 𝑡 𝑠𝑖gn( 𝐬 C ′ ) − 𝐷 𝑡 𝒔 𝐶 >𝑁 ∆ 𝑑 𝐬 C ′ : within cluster group after division of Ω 𝐶 𝒔 𝐶 : within cluster group before division of Ω 𝐶 𝑁: size of Ω 𝐶

Soft dependence clustering Dependence clustering provides only a hard clustering and is not designed for handling overlaps To extend DC scheme, SDC introduces a soft probability interval Interpreting eigenvector values as generated from two probability distributions one with a negative mean and the other with a positive mean 𝜹 𝑷 = 𝒑 𝒍 , 𝒑 𝒉 , 𝑝 𝑙 + 𝑝 ℎ =1 lower and upper bounds of the posterior probability that a data point should have in order to be assigned to both clusters.