Liang Ge.  Introduction  Important Concepts in MCL Algorithm  MCL Algorithm  The Features of MCL Algorithm  Summary.

Slides:



Advertisements
Similar presentations
Social network partition Presenter: Xiaofei Cao Partick Berg.
Advertisements

Weighted graphs Example Consider the following graph, where nodes represent cities, and edges show if there is a direct flight between each pair of cities.
Introduction to Graph Theory Instructor: Dr. Chaudhary Department of Computer Science Millersville University Reading Assignment Chapter 1.
Graph-02.
13 May 2009Instructor: Tasneem Darwish1 University of Palestine Faculty of Applied Engineering and Urban Planning Software Engineering Department Introduction.
Graphs Graphs are the most general data structures we will study in this course. A graph is a more general version of connected nodes than the tree. Both.
Graphs By JJ Shepherd. Introduction Graphs are simply trees with looser restrictions – You can have cycles Historically hard to deal with in computers.
 Graph Graph  Types of Graphs Types of Graphs  Data Structures to Store Graphs Data Structures to Store Graphs  Graph Definitions Graph Definitions.
Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Entropy Rates of a Stochastic Process
From: Probabilistic Methods for Bioinformatics - With an Introduction to Bayesian Networks By: Rich Neapolitan.
1 Representing Graphs. 2 Adjacency Matrix Suppose we have a graph G with n nodes. The adjacency matrix is the n x n matrix A=[a ij ] with: a ij = 1 if.
Graph & BFS.
Graph COMP171 Fall Graph / Slide 2 Graphs * Extremely useful tool in modeling problems * Consist of: n Vertices n Edges D E A C F B Vertex Edge.
Graph & BFS Lecture 22 COMP171 Fall Graph & BFS / Slide 2 Graphs * Extremely useful tool in modeling problems * Consist of: n Vertices n Edges D.
Segmentation Graph-Theoretic Clustering.
Representing Graphs Wade Trappe. Lecture Overview Introduction Some Terminology –Paths Adjacency Matrix.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 60 Chapter 8 Markov Processes.
Markov Cluster Algorithm
Graphs, relations and matrices
DynaTraffic – Models and mathematical prognosis
Applied Discrete Mathematics Week 10: Equivalence Relations
Bump Hunting The objective PRIM algorithm Beam search References: Feelders, A.J. (2002). Rule induction by bump hunting. In J. Meij (Ed.), Dealing with.
Graph clustering Jin Chen CSE Fall 2012 MSU 1.
Graph mining in bioinformatics Laur Tooming. Graphs in biology Graphs are often used in bioinformatics for describing processes in the cell Vertices are.
GRAPH Learning Outcomes Students should be able to:
Entropy Rate of a Markov Chain
GRAPH THEORY.  A graph is a collection of vertices and edges.  An edge is a connection between two vertices (or nodes).  One can draw a graph by marking.
Random Walks and Semi-Supervised Learning Longin Jan Latecki Based on : Xiaojin Zhu. Semi-Supervised Learning with Graphs. PhD thesis. CMU-LTI ,
GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,
Undirected Models: Markov Networks David Page, Fall 2009 CS 731: Advanced Methods in Artificial Intelligence, with Biomedical Applications.
Lecture 14 Relations CSCI – 1900 Mathematics for Computer Science Fall 2014 Bill Pine.
Data Structures Week 9 Introduction to Graphs Consider the following problem. A river with an island and bridges. The problem is to see if there is a way.
Expanders via Random Spanning Trees R 許榮財 R 黃佳婷 R 黃怡嘉.
Nyhoff, ADTs, Data Structures and Problem Solving with C++, Second Edition, © 2005 Pearson Education, Inc. All rights reserved Graphs.
Week 11 - Monday.  What did we talk about last time?  Binomial theorem and Pascal's triangle  Conditional probability  Bayes’ theorem.
Markov Cluster (MCL) algorithm Stijn van Dongen.
Markov Chains and Random Walks. Def: A stochastic process X={X(t),t ∈ T} is a collection of random variables. If T is a countable set, say T={0,1,2, …
Data Structures & Algorithms Graphs
ITEC 2620A Introduction to Data Structures Instructor: Prof. Z. Yang Course Website: 2620a.htm Office: TEL 3049.
Relation. Combining Relations Because relations from A to B are subsets of A x B, two relations from A to B can be combined in any way two sets can be.
Seminar on random walks on graphs Lecture No. 2 Mille Gandelsman,
Representing Relations Using Matrices A relation between finite sets can be represented using a zero-one matrix Suppose R is a relation from A = {a 1,
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Introduction to NP Instructor: Neelima Gupta 1.
Goldstein/Schnieder/Lay: Finite Math & Its Applications, 9e 1 of 60 Chapter 8 Markov Processes.
Theory of Computational Complexity Probability and Computing Lee Minseon Iwama and Ito lab M1 1.
Section 9.3. Section Summary Representing Relations using Matrices Representing Relations using Digraphs.
1 GRAPH Learning Outcomes Students should be able to: Explain basic terminology of a graph Identify Euler and Hamiltonian cycle Represent graphs using.
Chapter 11. Chapter Summary  Introduction to trees (11.1)  Application of trees (11.2)  Tree traversal (11.3)  Spanning trees (11.4)
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
 Introduction  Important Concepts in MCL Algorithm  MCL Algorithm  The Features of MCL Algorithm  Summary.
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
Subject Four Graphs Data Structures. What is a graph? A data structure that consists of a set of nodes (vertices) and a set of edges that relate the nodes.
Chapter - 12 GRAPH MATRICES AND APPLICATIONS.
Random Walks on Graphs.
Network analysis.
Network Science: A Short Introduction i3 Workshop
Segmentation Graph-Theoretic Clustering.
Hidden Markov Models Part 2: Algorithms
Chapter 9: Graphs Basic Concepts
Graph Clustering based on Random Walk
Connectivity Section 10.4.
Graphs Part 2 Adjacency Matrix
ITEC 2620M Introduction to Data Structures
Graphs G = (V, E) V are the vertices; E are the edges.
Chapter 9: Graphs Basic Concepts
Presentation transcript:

Liang Ge

 Introduction  Important Concepts in MCL Algorithm  MCL Algorithm  The Features of MCL Algorithm  Summary

 Simualtion of Random Flow in graph  Two Operations: Expansion and Inflation  Intrinsic relationship between MCL process result and cluster structure

 Popular Description: partition into graph so that  Intra-partition similarity is the highest  Inter-partition similarity is the lowest

 Observation 1:  The number of Higher-Length paths in G is large for pairs of vertices lying in the same dense cluster  Small for pairs of vertices belonging to different clusters

 Oberservation 2:  A Random Walk in G that visits a dense cluster will likely not leave the cluster until many of its vertices have been visited

 Measure or Sample any of these—high-length paths, random walks and deduce the cluster structure from the behavior of the samples quantities.  Cluster structure will show itself as a peaked distribution of the quantities  A lack of cluster structure will result in a flat distribution

 Markov Chain  Random Walk on Graph  Some Definitions in MCL

 A Random Process with Markov Property  Markov Property: given the present state, future states are independent of the past states  At each step the process may change its state from the current state to another state, or remain in the same state, according to a certain probability distribution.

 A walker takes off on some arbitrary vertex  He successively visits new vertices by selecting arbitrarily one of outgoing edges  There is not much difference between random walk and finite Markov chain.

 Simple Graph  Simple graph is undirected graph in which every nonzero weight equals 1.

 Associated Matrix  The associated matrix of G, denoted M G,is defined by setting the entry (M G ) pq equal to w(v p,v q )

 Markov Matrix  The Markov matrix associated with a graph G is denoted by T G and is formally defined by letting its q th column be the q th column of M normalized

 The associate matrix and markov matrix is actually for matrix M+I  I denotes diagonal matrix with nonzero element equals 1  Adding a loop to every vertex of the graph because for a walker it is possible that he will stay in the same place in his next step

 Find Higher-Length Path  Start Point: In associated matrix that the quantity (M k ) pq has a straightforward interpretation as the number of paths of length k between v p and v q

(M G +I) 2 MGMG

MGMG

 Flow is easier with dense regions than across sparse boundaries,  However, in the long run, this effect disappears.  Power of matrix can be used to find higher- length path but the effect will diminish as the flow goes on.

 Idea: How can we change the distribution of transition probabilities such that prefered neighbours are further favoured and less popular neighbours are demoted.  MCL Solution: raise all the entries in a given column to a certain power greater than 1 (e.g. squaring) and rescaling the column to have the sum 1 again.

 Expansion Operation: power of matrix, expansion of dense region  Inflation Operation: mention aboved, elimination of unfavoured region

 animation.html animation.html

 Find attractor: the node a is an attractor if Maa is nonzero  Find attractor system: If a is an attractor then the set of its neighbours is called an attractor system.  If there is a node who has arc connected to any node of an attractor system, the node will belong to the same cluster as that attractor system.

Attractor Set={1,2,3,4,5,6,7,8,9,10} The Attractor System is {1,2,3},{4,5,6,7},{8,9},{10} The overlaping clusters are {1,2,3,11,12,15},{4,5,6,7,13},{8,9,12,13,14,15},{10,12,13}

 how many steps are requred begore the algorithm converges to a idempoent matrix?  The number is typically somewhere between 10 and 100  The effect of inflation on cluster granularity

R denotes the inflation operation constants. A denotes the loop weight.

 MCL stimulates random walk on graph to find cluster  Expansion promotes dense region;while Inflation demotes the less favoured region  There is intrinsic relationship between MCL result and cluster structure