Author: Jie chen and Yousef Saad IEEE transactions of knowledge and data engineering.

Slides:



Advertisements
Similar presentations
Section 2.5: Graphs and Trees
Advertisements

Review Binary Search Trees Operations on Binary Search Tree
13 May 2009Instructor: Tasneem Darwish1 University of Palestine Faculty of Applied Engineering and Urban Planning Software Engineering Department Introduction.
3.3 Spanning Trees Tucker, Applied Combinatorics, Section 3.3, by Patti Bodkin and Tamsen Hunter.
Chapter 8, Part I Graph Algorithms.
© 2006 Pearson Addison-Wesley. All rights reserved14 A-1 Chapter 14 excerpts Graphs (breadth-first-search)
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
 Graph Graph  Types of Graphs Types of Graphs  Data Structures to Store Graphs Data Structures to Store Graphs  Graph Definitions Graph Definitions.
1 Representing Graphs. 2 Adjacency Matrix Suppose we have a graph G with n nodes. The adjacency matrix is the n x n matrix A=[a ij ] with: a ij = 1 if.
Introduction to Graphs
Fast algorithm for detecting community structure in networks.
Segmentation Graph-Theoretic Clustering.
Chapter 9 Graph algorithms Lec 21 Dec 1, Sample Graph Problems Path problems. Connectedness problems. Spanning tree problems.
Graphs G = (V,E) V is the vertex set. Vertices are also called nodes and points. E is the edge set. Each edge connects two different vertices. Edges are.
Lecture 11. Matching A set of edges which do not share a vertex is a matching. Application: Wireless Networks may consist of nodes with single radios,
Clustering Unsupervised learning Generating “classes”
Theory of Computing Lecture 10 MAS 714 Hartmut Klauck.
Computer Science 112 Fundamentals of Programming II Introduction to Graphs.
Graph Partitioning and Clustering E={w ij } Set of weighted edges indicating pair-wise similarity between points Similarity Graph.
Perfect Gaussian Elimination and Chordality By Shashank Rao.
© 2006 Pearson Addison-Wesley. All rights reserved14 A-1 Chapter 14 Graphs.
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Greedy Approximation Algorithms for finding Dense Components in a Graph Paper by Moses Charikar Presentation by Paul Horn.
Graph Algorithms. Definitions and Representation An undirected graph G is a pair (V,E), where V is a finite set of points called vertices and E is a finite.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
Spectral Analysis based on the Adjacency Matrix of Network Data Leting Wu Fall 2009.
Fundamental Data Structures and Algorithms (Spring ’05) Recitation Notes: Graphs Slides prepared by Uri Dekel, Based on recitation.
Data Structures & Algorithms Graphs
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
Data Mining Cluster Analysis: Advanced Concepts and Algorithms
Graphs. Graphs Similar to the graphs you’ve known since the 5 th grade: line graphs, bar graphs, etc., but more general. Those mathematical graphs are.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
Data Structures & Algorithms Graphs Richard Newman based on book by R. Sedgewick and slides by S. Sahni.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
SPARSE CERTIFICATES AND SCAN-FIRST SEARCH FOR K-VERTEX CONNECTIVITY
 In the previews parts we have seen some kind of segmentation method.  In this lecture we will see graph cut, which is a another segmentation method.
Graphs. Contents Terminology Graphs as ADTs Applications of Graphs.
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
(CSC 102) Lecture 30 Discrete Structures. Graphs.
1 Lecture 5 (part 2) Graphs II (a) Circuits; (b) Representation Reading: Epp Chp 11.2, 11.3
Lecture 5 Graph Theory prepped by Lecturer ahmed AL tememe 1.
Midwestern State University Minimum Spanning Trees Definition of MST Generic MST algorithm Kruskal's algorithm Prim's algorithm 1.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
1 Data Structures and Algorithms Graphs. 2 Graphs Basic Definitions Paths and Cycles Connectivity Other Properties Representation Examples of Graph Algorithms:
Subject Four Graphs Data Structures. What is a graph? A data structure that consists of a set of nodes (vertices) and a set of edges that relate the nodes.
Spectral clustering of graphs
Matrix Representation of Graphs
Introduction to Graphs
Special Graphs By: Sandeep Tuli Astt. Prof. CSE.
Csc 2720 Instructor: Zhuojun Duan
Introduction to Graphs
Network analysis.
IDENTIFICATION OF DENSE SUBGRAPHS FROM MASSIVE SPARSE GRAPHS
Data Mining Cluster Analysis: Advanced Concepts and Algorithms
CS 3343: Analysis of Algorithms
Network Science: A Short Introduction i3 Workshop
Segmentation Graph-Theoretic Clustering.
Connected Components Minimum Spanning Tree
Graph Operations And Representation
Chapter 14 Graphs © 2006 Pearson Addison-Wesley. All rights reserved.
Graphs G = (V, E) V are the vertices; E are the edges.
Graph Operations And Representation
Applied Discrete Mathematics Week 13: Graphs
Chapter 9 Graph algorithms
Introduction to Graphs
For Friday Read chapter 9, sections 2-3 No homework
Introduction to Graphs
Minimum Spanning Trees
Presentation transcript:

Author: Jie chen and Yousef Saad IEEE transactions of knowledge and data engineering

 Introduction ◦ Assumption of proposed method  The types of graph  The method ◦ Undirected graph ◦ Directed graph ◦ Bipartite graph  Experiment

 A challenging problem in the analysis of graph structures is the dense subgraph problem, where given a sparse graph, the objective is to identify a set of meaningful dense subgraphs.  The dense subgraphs are often interpreted as “communities”, based on the basic assumption that a network system consists of a number of communities, among with the connections are much fewer than those inside the same community.

 The number k of partitions is mandatory input parameter, and the partitioning result is sensitive to the change of k.  Most of partition methods yield a complete clustering of the data.  Many graph partitioning techniques favor balancing, i.e., sizes of different partitions should not vary too much.

 The adjacency matrix A is a sparse matrix.  The entries of A are either 0 or 1, since the weights of the edges are not taken into account for the density of a graph.  The diagonal of A is empty, since it does not allow self-loops.

 Undirected graph-G(V,E) ◦ V is the vertex set and E is the edge set The definition of the undirected graph density is Adjacency matrix-symmetric

 Bipartite graph-G(V,E) ◦ Undirected graph ◦ V is the vertex set and E is the edge set Adjacency matrix The definition of the bipartite graph density is B BTBT

 Directed graph-G(V,E) ◦ V is the vertex set and E is the edge set Adjacency matrix-nonsymmetric The definition of the directed graph density is

1. We construct a adjacency matrix A with G(V,E). 2. We use to build matrix M that stores the cosines between any two columns of the adjacency matrix A. 3. Then we construct a weight graph G’(V,E’) whose weighted adjacency matrix M is defined as M(i,j).

4. A top-down hierarchical clustering of the vertex set V is performed by successively deleting the edges e’ ∈ E’, in ascending order of the edge weights. When G’ first becomes disconnected, V is partitioned in two subsets, each of which corresponds to a connected component of G’. 5. The termination will take place when the density of the partition passes a certain density threshold d min.

d(G s )=1 d(G t )=0.6 d(G t )=0.8 d min =0.75

 The adjacency matrix A of a directed graph is square but not symmetric. When Algorithm is applied to a nonsymmetric adjacency matrix, it will result in two different dendrograms, depending on whether M is computed as the cosines of the columns of A, or the rows of A.  We symmetrize the matrix A (i.e., replacing A by the pattern matrix of A+A T ) and use the resulting symmetric adjacency matrix to compute the similarity matrix M.

 Remove the direction of the edges and combine the duplicated resulting edges, then it yields an undirected graph  Or use the adjacency matrix A  A+A T

 Without any edge removal of the graph G’ (using M as the weighted adjacency matrix), the vertex set is already partitioned into two subsets: V1 and V2.  Any subsequent hierarchical partitioning will only further subdivide these two subsets separately.

 A reasonable strategy for this purpose is to augment the original bipartite graph by adding edges between some of the vertices that are connected by a path of length 2. is obtained by erasing the diagonal of M 1.

d min =0.5

 The graph contain 1490 vertices, among which the first 758 are liberal blogs, and remaining 732 are conservative.  The edge in the graph indicates the existence of citation between the two blogs.

 Comparisons with the Clauset, Newman, and Moore(CNM) approach.  CNM approach: bottom-up hierarchical clustering.  Dataset: foldoc-G(13356, ) ◦ It extracted from the online dictionary of computing