Community Detection in a Large Real-World Social Network Karsten Steinhaeuser Nitesh V. Chawla DIAL Research Group www.nd.edu/~dial University of Notre.

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

ICDE 2014 LinkSCAN*: Overlapping Community Detection Using the Link-Space Transformation Sungsu Lim †, Seungwoo Ryu ‡, Sejeong Kwon§, Kyomin Jung ¶, and.
An Evaluation of Community Detection Algorithms on Large-Scale Traffic 1 An Evaluation of Community Detection Algorithms on Large-Scale Traffic.
Analysis and Modeling of Social Networks Foudalis Ilias.
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Xiaowei Ying, Xintao Wu, Daniel Barbara Spectrum based Fraud Detection in Social Networks 1.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
Are You moved by Your Social Network Application? Abderrahmen Mtibaa, Augustin Chaintreau, Jason LeBrun, Earl Oliver, Anna-Kaisa Pietilainen, Christophe.
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Global topological properties of biological networks.
Improvements in the Spatial and Temporal representation of the Model Owen Woodberry Bachelor of Computer Science, Honours.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
PageRank Identifying key users in social networks Student : Ivan Todorović, 3231/2014 Mentor : Prof. Dr Veljko Milutinović.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.
Models of Influence in Online Social Networks
Exploiting indirect neighbors and topological weight to predict protein function from protein– protein interactions Hon Nian Chua, Wing-Kin Sung and Limsoon.
CHAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Community Detection by Modularity Optimization Jooyoung Lee
Anomalous Node Detection in Time Series of Mobile Communication Graphs Leman Akoglu January 28, 2010.
Efficient Identification of Overlapping Communities Jeffrey Baumes Mark Goldberg Malik Magdon-Ismail Rensselaer Polytechnic Institute, Troy, NY.
By Chris Zachor.  Introduction  Background  Changes  Methodology  Data Collection  Network Topologies  Measures  Tools  Conclusion  Questions.
Automated Social Hierarchy Detection through Network Analysis (SNAKDD07) Ryan Rowe, Germ´an Creamer, Shlomo Hershkop, Salvatore J Stolfo 1 Advisor:
Clustering Spatial Data Using Random Walks Author : David Harel Yehuda Koren Graduate : Chien-Ming Hsiao.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Uncovering Overlap Community Structure in Complex Networks using Particle Competition Fabricio A. Liang
EVENT DETECTION IN TIME SERIES OF MOBILE COMMUNICATION GRAPHS
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
PCI th Panhellenic Conference in Informatics Clustering Documents using the 3-Gram Graph Representation Model 3 / 10 / 2014.
Network Community Behavior to Infer Human Activities.
Community-enhanced De-anonymization of Online Social Networks Shirin Nilizadeh, Apu Kapadia, Yong-Yeol Ahn Indiana University Bloomington CCS 2014.
Data Mining Cluster Analysis: Advanced Concepts and Algorithms
Comparison of Tarry’s Algorithm and Awerbuch’s Algorithm CS 6/73201 Advanced Operating System Presentation by: Sanjitkumar Patel.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Overlapping Community Detection in Networks
G LOBAL S IMILARITY B ETWEEN M ULTIPLE B IONETWORKS Yunkai Liu Computer Science Department University of South Dakota.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Project funded by the Future and Emerging Technologies arm of the IST Programme Search in Unstructured Networks Niloy Ganguly, Andreas Deutsch Center for.
Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
Community Detection based on Distance Dynamics Reporter: Yi Liu Student ID: Department of Computer Science and Engineering Shanghai Jiao Tong.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Graph clustering to detect network modules
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
A Black-Box Approach to Query Cardinality Estimation
Time-Course Network Enrichment
Community detection in graphs
Latent Space Model for Road Networks to Predict Time-Varying Traffic
Topological Ordering Algorithm: Example
Department of Computer Science University of York
Graph-Based Anomaly Detection
3.3 Network-Centric Community Detection
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Topological Ordering Algorithm: Example
Topological Ordering Algorithm: Example
Network topology. Network topology. Nodes are linked by edges. Node size represents a quantifiable node property (e.g. fold-change in two different experimental.
CSE572: Data Mining by H. Liu
Digital humanities Filtering.
Topological Ordering Algorithm: Example
Representing Higher-order Dependencies in Networks: Hands-on Tasks
Journal Club Physical Review E
Presentation transcript:

Community Detection in a Large Real-World Social Network Karsten Steinhaeuser Nitesh V. Chawla DIAL Research Group University of Notre Dame April 1, 2008

Cellular Phone Network Real social network Represents actual interactions between individuals Requires intent to communicate Network dimensions 1.3 million nodes (customers) 1.2 million edges (aggregate of voice and text) Contains a wealth of data Communication Links Customer Demographics Temporal Data Spatial Data

Community Detection with Random Walks No Weighting Topology-Based Attribute-Based Weight Clustering Using Random Walks Walk Agglomeration with EA Combine Input Graph Weighted Graph Co-Association Matrix Community Structure

Algorithm Comparison AlgorithmComplexityComments / Assessment Scalable Random Walks O(n) with EAFinds good divisions with high efficiency, still parameterized FastQO(n log 2 n)Computationally efficient, limited by modularity WalkTrapO(n 2 log n)Finds divisions similar to FastQ but at higher complexity MCLO(n 3 )Better divisions but matrix computations limit scalability

Experimental Results Edge weighting based on topology CCS = clustering coefficient similarity CNS = common neighbor similarity Real edge weights Call frequency Call duration NAS = edge weighting based on node attributes WeightingModularityTime (s) CCS (topological)< CNS (topological)< Frequency Duration NAS

Future Work Incorporate network dynamics Spatial data Temporal data