C OMMUNITIES AND B ALANCE IN S IGNED N ETWORKS : S PECTRAL A PPROACH -Pranay Anchuri*, Malik Magdon Ismail Rensselaer Polytechnic Institute, NY.

Slides:



Advertisements
Similar presentations
Line Balancing Problem A B C 4.1mins D 1.7mins E 2.7 mins F 3.3 mins G 2.6 mins 2.2 mins 3.4 mins.
Advertisements

Community Detection with Edge Content in Social Media Networks Paper presented by Konstantinos Giannakopoulos.
Coverage by Directional Sensors Jing Ai and Alhussein A. Abouzeid Dept. of Electrical, Computer and Systems Engineering Rensselaer Polytechnic Institute.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Modularity and community structure in networks
Community Detection Algorithm and Community Quality Metric Mingming Chen & Boleslaw K. Szymanski Department of Computer Science Rensselaer Polytechnic.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
J. Leskovec, D. Huttenlocher, J. Kleinberg Paper Review by Rachel Katz S IGNED N ETWORKS IN S OCIAL M EDIA.
MATH 685/ CSI 700/ OR 682 Lecture Notes
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
V4 Matrix algorithms and graph partitioning
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
On the Construction of Energy- Efficient Broadcast Tree with Hitch-hiking in Wireless Networks Source: 2004 International Performance Computing and Communications.
Fast algorithm for detecting community structure in networks.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
MAE 552 – Heuristic Optimization Lecture 26 April 1, 2002 Topic:Branch and Bound.
A scalable multilevel algorithm for community structure detection
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 ECSE-4670: Computer Communications Netwoks Exam 2: Solutions Time: 75 min (strictly enforced)
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Application of Graph Theory to OO Software Engineering Alexander Chatzigeorgiou, Nikolaos Tsantalis, George Stephanides Department of Applied Informatics.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Introduction to Optimization (Part 1)
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
LINEAR PROGRAMMING SIMPLEX METHOD.
Numerical Computations in Linear Algebra. Mathematically posed problems that are to be solved, or whose solution is to be confirmed on a digital computer.
Steady and Fair Rate Allocation for Rechargeable Sensors in Perpetual Sensor Networks Zizhan Zheng Authors: Kai-Wei Fan, Zizhan Zheng and Prasun Sinha.
Internet Traffic Engineering by Optimizing OSPF Weights Bernard Fortz (Universit é Libre de Bruxelles) Mikkel Thorup (AT&T Labs-Research) Presented by.
1 11 Subcarrier Allocation and Bit Loading Algorithms for OFDMA-Based Wireless Networks Gautam Kulkarni, Sachin Adlakha, Mani Srivastava UCLA IEEE Transactions.
C LUSTERING NETWORKED DATA BASED ON LINK AND SIMILARITY IN A CTIVE LEARNING Advisor : Sing Ling Lee Student : Yi Ming Chang Speaker : Yi Ming Chang 1.
Efficient Identification of Overlapping Communities Jeffrey Baumes Mark Goldberg Malik Magdon-Ismail Rensselaer Polytechnic Institute, Troy, NY.
F INDING B ALANCE IN S OCIAL N ETWORKS What is BALANCE ? These are BALANCED TRIADS. They are considered less stressful arrangements. The first is balanced.
Anindya Bhattacharya and Rajat K. De Bioinformatics, 2008.
Hotspot Detection in a Service Oriented Architecture Pranay Anchuri,
1 Oblivious Routing in Wireless networks Costas Busch Rensselaer Polytechnic Institute Joint work with: Malik Magdon-Ismail and Jing Xi.
DATA MINING LECTURE 13 Pagerank, Absorbing Random Walks Coverage Problems.
Maximum Network Lifetime in Wireless Sensor Networks with Adjustable Sensing Ranges Cardei, M.; Jie Wu; Mingming Lu; Pervaiz, M.O.; Wireless And Mobile.
Li Wang Haorui Wu University of South Carolina 04/02/2015 A* with Pattern Databases.
Uncovering Overlap Community Structure in Complex Networks using Particle Competition Fabricio A. Liang
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
1 11 Channel Assignment for Maximum Throughput in Multi-Channel Access Point Networks Xiang Luo, Raj Iyengar and Koushik Kar Rensselaer Polytechnic Institute.
Detecting Communities Via Simultaneous Clustering of Graphs and Folksonomies Akshay Java Anupam Joshi Tim Finin University of Maryland, Baltimore County.
1 Near-Optimal Hot-Potato Routing on Trees Costas Busch Rensselaer Polytechnic Inst. Malik Magdon Ismail Rensselaer Polytechnic Inst. Marios Mavronicolas.
Mining Top-K Large Structural Patterns in a Massive Network Feida Zhu 1, Qiang Qu 2, David Lo 1, Xifeng Yan 3, Jiawei Han 4, and Philip S. Yu 5 1 Singapore.
Arithmetic Test Pattern Generation: A Bit Level Formulation of the Optimization Problem S. Manich, L. García and J. Figueras.
Data Structures and Algorithms in Parallel Computing Lecture 7.
CS 590 Term Project Epidemic model on Facebook
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
A Protocol for Tracking Mobile Targets using Sensor Networks H. Yang and B. Sikdar Department of Electrical, Computer and Systems Engineering Rensselaer.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
Semi-Supervised Clustering
The minimum cost flow problem
by Hyunwoo Park and Kichun Lee Knowledge-Based Systems 60 (2014) 58–72
Optimum Dispatch of Capacitors in Power Systems
DTMC Applications Ranking Web Pages & Slotted ALOHA
Greedy Algorithm for Community Detection
Finding Communities by Clustering a Graph into Overlapping Subgraphs
SDE: Graph Drawing Using Spectral Distance Embedding
Jon Purnell Heidi Jo Newberg Malik Magdon-Ismail
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
3.3 Network-Centric Community Detection
A Fundamental Bi-partition Algorithm of Kernighan-Lin
Malik Magdon-Ismail, Konstantin Mertsalov, Mark Goldberg
Approximate Graph Mining with Label Costs
Introduction to High Performance Computing Lecture 8
Presentation transcript:

C OMMUNITIES AND B ALANCE IN S IGNED N ETWORKS : S PECTRAL A PPROACH -Pranay Anchuri*, Malik Magdon Ismail Rensselaer Polytechnic Institute, NY.

O UTLINE Introduction Structural Balance Heuristic Spectral Methods Results Conclusion Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

S IGNED S OCIAL N ETWORKS Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

S TRUCTURAL B ALANCE Stable Unstable Network is strongly balanced if all triads are stable. Notation : Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute Positive Edge Negative Edge

W EAK S TRUCTURAL B ALANCE Stable Unstable Network is weakly balanced if all triads are stable. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

C OMMUNITIES IN B ALANCED N ETWORK Balanced network can be divided so that positive edges lie within communities negative edges between communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Real world networks are rarely structurally balanced. Frustration : Number of edges that disturb the balance. Positive edges between communities + Negative edges within communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Real world networks are rarely structurally balanced. Frustration : Number of edges that disturb the balance. Positive edges between communities + Negative edges within communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute Frustration = 1

Real world networks are rarely structurally balanced. Frustration : Number of edges that disturb the balance. Positive edges between communities + Negative edges within communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute Frustration = 1

Real world networks are rarely structurally balanced. Frustration : Number of edges that disturb the balance. Positive edges between communities + Negative edges within communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Community Detection

H EURISTIC Ignore the negative edges and cluster the remaining nodes. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

H EURISTIC Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

H EURISTIC Isolated nodes are added in such a way that minimizes the frustration. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

H EURISTIC Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Spectral Methods

M INIMIZING F RUSTRATION Community C divided into C1,C2 Positive edges between C1 and C2 increase frustration. Negative edges between C1 and C2 decrease frustration. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

M INIMIZING F RUSTRATION Community C divided into C1,C2 Positive edges between C1 and C2 increase frustration. Negative edges between C1 and C2 decrease frustration. C1 C2 Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute Frustration = 2

M INIMIZING F RUSTRATION Community C divided into C1,C2 Positive edges between C1 and C2 increase frustration. Negative edges between C1 and C2 decrease frustration. C1 Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute C2 Frustration = 1 Frustration = 2

M INIMIZING F RUSTRATION Community C divided into C1,C2 Positive edges between C1 and C2 increase frustration. Negative edges between C1 and C2 decrease frustration. C1 Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute C2

M ODULARITY Unsigned Modularity : Number of edges within communities – expected number if edges were randomly permuted. Measure of the “surprise” factor. Higher modularity is better. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

S IGNED M ODULARITY Signed Modularity Surprise factor due to positive edges within communities and negative edges between communities. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Minimizing Frustration Maximizing Modularity Both objectives reduce to maximizing S T M S Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

C OMPUTING THE M AXIMUM Maximizing f (M,S) = S T M S Optimum S : Eigen vector corresponding to maximum Eigen value of M. Eigen vector can be computed by Power Iteration. Requires sparse matrix vector multiplication which is efficient. S ε R n but we need S ε {-1,+1} n !! Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

B OOLEAN S OLUTION Rounding : Based on sign of s i, s i >= 0  1 and -1 o/w. Rounding w/ Improvement : Start with an initial Boolean solution and move the nodes one at a time. If there is a sequence of flips such that solution is closer optimum then retain the changes. Complexity : O(N^2). Rounding w/ Partial Improvement: Consider nodes whose magnitude is close to zero. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

NodeVal in Eigen Vector

Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute NodeVal in Eigen Vector

Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute NodeVal in Eigen Vector

M ULTIPLE C OMMUNITIES Communities can be further divided Until frustration cannot be reduced. Modularity cannot be increased. Change in the objective can be reduced to S T M S Also requires sparse matrix vector multiplication. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Results

M ODULARITY M AXIMIZATION Algorithm# CommunitiesLargestFrustration (% of –ve edges) Epinions.com Clustering ( 15 means) Clustering (40 means) Modularity Modularity w/ partial improvement Slashdot.com Clustering ( 15 means) Clustering (40 means) Modularity Modularity w/ partial improvement Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute Datasets obtained from

F RUSTRATION M INIMIZATION Algorithm# CommunitiesLargestFrustration ( % of – ve edges) Epinions.com Two Division Two Division w/ Partial Improvement Multiple Division Multiple Division w/ Partial Improvement Slashdot.com Two Division Two Division w/ Partial Improvement Multiple Division Multiple Division w/ Partial Improvement Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

S TRONG VS W EAK B ALANCE Minimum Frustration: = 1 when max # communities =2 = 0 when # communities = 3 ( each node in its own community) Minimum frustration with multiple communities implies weak balance. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

N EGATIVE I NCIDENT R ATIO NIR = 3/2 Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

C ONCLUSION Spectral algorithm to detect communities in signed communities. Objective Functions : Minimizing frustration, Maximizing frustration. Careful assignment of nodes leads to better communities. Structural balance (strong and weak) affects the communities detected. Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute

Thank You Questions ? Pranay Anchuri, Malik Magdon Ismail, Rensselaer Polytechnic Institute