Approximating the Community Structure of the Long Tail

Slides:



Advertisements
Similar presentations
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Advertisements

Partitional Algorithms to Detect Complex Clusters
Community Detection with Edge Content in Social Media Networks Paper presented by Konstantinos Giannakopoulos.
Spectral graph reduction for image and streaming video segmentation Fabio Galasso 1 Margret Keuper 2 Thomas Brox 2 Bernt Schiele 1 1 Max Planck Institute.
Leting Wu Xiaowei Ying, Xintao Wu Aidong Lu and Zhi-Hua Zhou PAKDD 2011 Spectral Analysis of k-balanced Signed Graphs 1.
Modularity and community structure in networks
Information Networks Graph Clustering Lecture 14.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
More on Rankings. Query-independent LAR Have an a-priori ordering of the web pages Q: Set of pages that contain the keywords in the query q Present the.
Lecture 21: Spectral Clustering
CS 584. Review n Systems of equations and finite element methods are related.
Normalized Cuts and Image Segmentation Jianbo Shi and Jitendra Malik, Presented by: Alireza Tavakkoli.
Fast, Multiscale Image Segmentation: From Pixels to Semantics Ronen Basri The Weizmann Institute of Science Joint work with Achi Brandt, Meirav Galun,
Using eigenvectors of a bigram-induced matrix to represent and infer syntactic behavior Mikhail Belkin and John Goldsmith The University of Chicago July.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Dominic Rizzo and Giota Stratou.
Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Fast algorithm for detecting community structure in networks.
Clustering Social Networks Isabelle Stanton, University of Virginia Joint work with Nina Mishra, Robert Schreiber, and Robert E. Tarjan.
Efficient Spatiotemporal Grouping Using the Nyström Method Charless Fowlkes, U.C. Berkeley Serge Belongie, U.C. San Diego Jitendra Malik, U.C. Berkeley.
A scalable multilevel algorithm for community structure detection
Malicious parties may employ (a) structure-based or (b) label-based attacks to re-identify users and thus learn sensitive information about their rating.
Kyle Heath, Natasha Gelfand, Maks Ovsjanikov, Mridul Aanjaneya, Leo Guibas Image Webs Computing and Exploiting Connectivity in Image Collections.
Mining Social Media Communities and Content Akshay Java Ph.D. Dissertation Defense October 16 th 2008.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University COT 5410 – Spring 2004.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Structure Preserving Embedding Blake Shaw, Tony Jebara ICML 2009 (Best Student Paper nominee) Presented by Feng Chen.
Segmentation using eigenvectors Papers: “Normalized Cuts and Image Segmentation”. Jianbo Shi and Jitendra Malik, IEEE, 2000 “Segmentation using eigenvectors:
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
LexPageRank: Prestige in Multi- Document Text Summarization Gunes Erkan and Dragomir R. Radev Department of EECS, School of Information University of Michigan.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
Detecting Communities Via Simultaneous Clustering of Graphs and Folksonomies Akshay Java Anupam Joshi Tim Finin University of Maryland, Baltimore County.
EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs Zhe Jin.
Domain decomposition in parallel computing Ashok Srinivasan Florida State University.
Data Structures and Algorithms in Parallel Computing Lecture 7.
CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Spectral Clustering Shannon Quinn (with thanks to William Cohen of Carnegie Mellon University, and J. Leskovec, A. Rajaraman, and J. Ullman of Stanford.
A Tutorial on Spectral Clustering Ulrike von Luxburg Max Planck Institute for Biological Cybernetics Statistics and Computing, Dec. 2007, Vol. 17, No.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Normalized Cuts and Image Segmentation Patrick Denis COSC 6121 York University Jianbo Shi and Jitendra Malik.
Laplacian Matrices of Graphs: Algorithms and Applications ICML, June 21, 2016 Daniel A. Spielman.
Document Clustering with Prior Knowledge Xiang Ji et al. Document Clustering with Prior Knowledge. SIGIR 2006 Presenter: Suhan Yu.
Analysis of Large Graphs Community Detection By: KIM HYEONGCHEOL WALEED ABDULWAHAB YAHYA AL-GOBI MUHAMMAD BURHAN HAFEZ SHANG XINDI HE RUIDAN 1.
Laplacian Matrices of Graphs: Algorithms and Applications ICML, June 21, 2016 Daniel A. Spielman.
High Performance Computing Seminar
Graph clustering to detect network modules
Random Walk for Similarity Testing in Complex Networks
Cohesive Subgraph Computation over Large Graphs
Modeling Influence Opinions and Structure in Social Media
Document Clustering Based on Non-negative Matrix Factorization
June 2017 High Density Clusters.
Graph and Tensor Mining for fun and profit
Trust on Blogosphere using Link Polarity Anubhav Kale, Akshay Java, Pranam Kolari, Dr Anupam Joshi, Dr Tim Finin Motivation Link Polarity Computation.
Generative Model To Construct Blog and Post Networks In Blogosphere
Segmentation Graph-Theoretic Clustering.
Discovering Clusters in Graphs
The likelihood of linking to a popular website is higher
Statistical properties of network community structure
Learning latent structure in complex networks 1 2
Spectral Clustering Eric Xing Lecture 8, August 13, 2010
3.3 Network-Centric Community Detection
Detecting Important Nodes to Community Structure
Affiliation Network Models of Clusters in Networks
Analysis of Large Graphs: Community Detection
Presentation transcript:

Approximating the Community Structure of the Long Tail Akshay Java, Anupam Joshi, Tim Finin Problem Statement Community Structure of the Long Tail Original Adjacency Matrix Sorted (degree) Adjacency Matrix “Approximate the membership of the blogs in the long tail using only a small portion of the entire graph” Long Tail Head Motivation It is expensive to compute the community structure for large networks. Blog graphs are large, but extremely sparse and exhibit the long tail/ core-periphery structure. CORE Intuition Communities emerge around the core (A). The membership of the blogs in the long tail (B) can be approximated by the structure of the core and how the long tail links to the core. PERIPHERY Detecting Communities Approximation Methods A community is a set of nodes that has more connections within the set than outside it. Normalized Cut The graph can be partitioned using the eigenspectrum of the Laplacian. (Shi and Malik) The second smallest eigenvector of the graph Laplacian is the Fiedler vector. Singular Value Decomposition (reduced rank) Sampling-based Technique By Nyström Approximation (Fowlkes et al.) Thus eigenvectors of L can be easily approximated by sampling. Heuristic Method Use the head of the distribution to find the initial communities. Approximate membership of the tail by using number of links to each community in the head. The graph can be recursively partitioned using the sign of the values in its Fielder vector. Evaluation Conclusions and Ongoing Research Conclusion Community structure of the entire network can be well approximated by sampling and heuristic methods. Advantage Achieve fast, accurate approximation using much smaller sample of the entire graph. Ongoing Research Applying the approximation on temporal graphs Clustering graph, tag information simultaneously Modularity Score A measure of the quality of community detection Q = (fraction of intra-community edges)-(expected fraction, disregarding communities)