A Framework for Finding Communities in Dynamic Social Networks David Kempe University of Southern California Chayant Tantipathananandh, Tanya Berger-Wolf.

Slides:



Advertisements
Similar presentations
February 20, Spatio-Temporal Bandwidth Reuse: A Centralized Scheduling Mechanism for Wireless Mesh Networks Mahbub Alam Prof. Choong Seon Hong.
Advertisements

Cartography of complex networks: From organizations to the metabolism Cartography of complex networks: From organizations to the metabolism Roger Guimerà.
Efficient Computation of Close Upper and Lower Bounds on the Minimum Number of Recombinations in Biological Sequence Evolution Yun S. Song, Yufeng Wu,
Social network partition Presenter: Xiaofei Cao Partick Berg.
ICDE 2014 LinkSCAN*: Overlapping Community Detection Using the Link-Space Transformation Sungsu Lim †, Seungwoo Ryu ‡, Sejeong Kwon§, Kyomin Jung ¶, and.
Approximation Algorithms
22C:19 Discrete Math Graphs Fall 2010 Sukumar Ghosh.
B IPARTITE I NDEX C ODING Arash Saber Tehrani Alexandros G. Dimakis Michael J. Neely Department of Electrical Engineering University of Southern California.
1 Routing and Wavelength Assignment in Wavelength Routing Networks.
2 The NodeXL Project Team About Me Introductions Marc A. Smith Chief Social Scientist Connected Action Consulting Group
Visualizing the Evolution of Community Structures in Dynamic Social Networks Khairi Reda Department of Computer Science University of Illinois at Chicago.
Social and ecological factors influencing movement and organizational patterns in sheep Habiba, Caitlin Barale, Ipek Kulahci, Rajmonda Sulo and Khairi.
Finding Social Groups: A Meta-Analysis of the Southern Women Data Linton C. Freeman Photograph by Ben Shahn, Natchez, MS, October, 1935.
Introduction to Approximation Algorithms Lecture 12: Mar 1.
On Computing Compression Trees for Data Collection in Wireless Sensor Networks Jian Li, Amol Deshpande and Samir Khuller Department of Computer Science,
P RELIMINARIES –C OMPUTATIONAL P ROBLEM Given a set of real numbers, output a sequence, ( l 1, …, l i, …, l n ), where l i ≤ l i+1 for i = 1 … n-1. Naive.
The Statistical Significance of Max-gap Clusters Rose Hoberman David Sankoff Dannie Durand.
Balanced Graph Partitioning Konstantin Andreev Harald Räcke.
Zebras Dan Rubenstein, Siva Sandaresan, Ilya Fischhoff (Princeton) Movie credit: “Champions of the Wild”, Omni-Film Productions.
Sensor Placement September 4, 2003 Tanya Berger-Wolf University of New Mexico Discrete Sensor Placement Problem Tanya Y. Berger-Wolf With William E. Hart.
1 Nicholas Mancuso Department of Computer Science Georgia State University Joint work with Bassam Tork, GSU Pavel Skums, CDC Ion M ӑ ndoiu, UConn Alex.
Tantipathananandh Chayant Tantipathananandh with Tanya Berger-Wolf Constant-Factor Approximation Algorithms for Identifying Dynamic Communities.
Artificial Intelligence Genetic Algorithms and Applications of Genetic Algorithms in Compilers Prasad A. Kulkarni.
Evaluation of the Haplotype Motif Model using the Principle of Minimum Description Srinath Sridhar, Kedar Dhamdhere, Guy E. Blelloch, R. Ravi and Russell.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Calculation of Similarity (Chernoff et al. 1999, 2000, 2001) 1. Observed Similarity = mean of 200 random sub- samples of larger population at size of smaller.
A Framework For Community Identification in Dynamic Social Networks Chayant Tantipathananandh Tanya Berger-Wolf David Kempe Presented by Victor Lee.
BIONFORMATIC ALGORITHMS Ryan Tinsley Brandon Lile May 9th, 2014.
Optimizing pooling strategies for the massive next-generation sequencing of viral samples Pavel Skums 1 Joint work with Olga Glebova 2, Alex Zelikovsky.
Gene expression & Clustering (Chapter 10)
Algorithms for Network Optimization Problems This handout: Minimum Spanning Tree Problem Approximation Algorithms Traveling Salesman Problem.
Combinatorial Reconstruction of Sibling Relationships in Absence of Parental Data Tanya Y Berger-Wolf (DIMACS and UIC CS) Bhaskar DasGupta (UIC CS) Wanpracha.
MCS312: NP-completeness and Approximation Algorithms
Fixed Parameter Complexity Algorithms and Networks.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
A framework For Community Identification in Dynamic Social Networks Chayant, Tanya Berger-Wolf, David Kempe [KDD’07] Advisor : Dr. Koh Jia-Ling Advisor.
© 2006 Board of Trustees of the University of Illinois Authored by Tanya Berger-Wolf Analysis of Dynamic Social Networks Tanya Berger-Wolf Department.
An Introduction to Social Network Analysis Yi Li
Chapter 3. Community Detection and Evaluation May 2013 Youn-Hee Han
A Computational Framework for Analysis of Dynamic Social Networks Tanya Berger-Wolf University of Illinois at Chicago Joint work with Jared Saia University.
Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Graphs & Matrices Todd Cromedy & Bruce Nicometo March 30, 2004.
Computational Tools for Population Biology Tanya Berger-Wolf, Computer Science, UIC; Daniel Rubenstein, Ecology and Evolutionary Biology, Princeton; Jared.
Network Community Behavior to Infer Human Activities.
Dynamics of communities in two fission-fusion species, Grevy's zebra and onager Chayant Tantipathananandh 1, Tanya Y. Berger-Wolf 1, Siva R. Sundaresan.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Overlapping Community Detection in Networks
Tantipathananandh Chayant Tantipathananandh with Tanya Berger-Wolf Constant-Factor Approximation Algorithms for Identifying Dynamic Communities.
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
Community detection via random walk Draft slides.
A Simulation-Based Study of Overlay Routing Performance CS 268 Course Project Andrey Ermolinskiy, Hovig Bayandorian, Daniel Chen.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Graphs. Graph Definitions A graph G is denoted by G = (V, E) where  V is the set of vertices or nodes of the graph  E is the set of edges or arcs connecting.
On the Ability of Graph Coloring Heuristics to Find Substructures in Social Networks David Chalupa By, Tejaswini Nallagatla.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
A Graph Theoretic Approach to Cache-Conscious Placement of Data for Direct Mapped Caches Mirza Beg and Peter van Beek University of Waterloo June
Graph clustering to detect network modules
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Groups of vertices and Core-periphery structure
ISP and Egress Path Selection for Multihomed Networks
Finding Subgraphs with Maximum Total Density and Limited Overlap
Discovering Larger Network Motifs
Noémi Gaskó, Rodica Ioana Lung, Mihai Alexandru Suciu
SEG5010 Presentation Zhou Lanjun.
Network-Wide Routing Oblivious Heavy Hitters
Affiliation Network Models of Clusters in Networks
Presentation transcript:

A Framework for Finding Communities in Dynamic Social Networks David Kempe University of Southern California Chayant Tantipathananandh, Tanya Berger-Wolf University of Illinois at Chicago

Social Networks

History of Interactions t=1 History of interactions t=1 t=2 t=3 t=4 t= Assume discrete time and interactions in form of complete subgraphs. Aggregated network

Community Identification Centrality and betweenness [Girvan & Newman ‘01] Correlation clustering [Basal et al. ‘02] Overlapping cliques [Palla et al. ’05] What is community? “Cohesive subgroups are subsets of actors among whom there are relatively strong, direct, intense, frequent, or positive ties.” [Wasserman & Faust ‘97] Notions of communities: Static Dynamic Metagroups [Berger-Wolf & Saia ’06]

The Question: What is dynamic community? A dynamic community is a subset of individuals that stick together over time. NOTE: Communities ≠ Groups t=1 t=2 t=3 t=4 t=5

Approach: Graph Model t=1 t=2 t=3 t=4 t=

Approach: Assumptions Individuals and groups represent exactly one community at a time. Concurrent groups represent distinct communities. Desired Required Conservatism: community affiliation changes are rare. Group Loyalty: individuals observed in a group belong to the same community. Parsimony: few affiliations overall for each individual.

Approach: Color = Community Valid coloring: distinct color of groups in each time step

Approach: Assumptions Individuals and groups represent exactly one community at a time. Concurrent groups represent distinct communities. Desired Required Conservatism: community affiliation changes are rare. Group Loyalty: individuals observed in a group belong to the same community. Parsimony: few affiliations overall for each individual.

Costs Conservatism: switching cost (α) Group loyalty:  Being absent (β1)  Being different (β2) Parsimony: number of colors (γ)

Approach: Assumptions Individuals and groups represent exactly one community at a time. Concurrent groups represent distinct communities. Desired Required Conservatism: community affiliation changes are rare. Group Loyalty: individuals observed in a group belong to the same community. Parsimony: few affiliations overall for each individual.

Costs Conservatism: switching cost (α) Group loyalty:  Being absent (β1)  Being different (β2) Parsimony: number of colors (γ)

Approach: Assumptions Individuals and groups represent exactly one community at a time. Concurrent groups represent distinct communities. Desired Required Conservatism: community affiliation changes are rare. Group Loyalty: individuals observed in a group belong o the same community. Parsimony: few affiliations overall for each individual.

Costs Conservatism: switching cost (α) Group loyalty:  Being absent (β1)  Being different (β2) Parsimony: number of colors (γ)

Problem Definition Minimum Community Interpretation For a given cost setting, (α,β1,β2,γ), find vertex coloring that minimizes total cost. Color of group vertices = Community structure Color of individual vertices = Affiliation sequences Problem is NP-Complete and APX-Hard

Model Validation and Algorithms Model validation: exhaustive search for an exact minimum-cost coloring. Heuristic algorithms evaluation: compare heuristic results to OPT. Validation on data sets with known communities from simulation and social research  Southern Women data set (benchmark)

Southern Women Data Set by Davis, Gardner, and Gardner, 1941 Photograph by Ben Shaln, Natchez, MS, October; 1935 Aggregated network Event participation

Ethnography by Davis, Gardner, and Gardner, 1941 Core (1-4) Periphery (5-7) Core (13-15) Periphery (11-12)

An Optimal Coloring: (α,β1,β2,γ)=(1,1,3,1) Core Periphery Core

An Optimal Coloring: (α,β1,β2,γ)=(1,1,1,1) Core Periphery Core

Conclusions An optimization-based framework for finding communities in dynamic social networks. Finding an optimal solution is NP-Complete and APX-Hard. Model evaluation by exhaustive search. Heuristic algorithms for larger data sets. Heuristic results comparable to optimal.

Thank You Poster #6 this evening

Dan Rubenstein Princeton Siva Sundaresan Ilya Fischoff Simon Levin Princeton David Kempe USC Jared Saia UNM Muthu Google Habiba Mayank Lahiri Computational Population Biology Lab UIC compbio.cs.uic.edu Tanya Berger-Wolf Chayant Tantipathananandh Poster#6 this evening