1 Presented by: Yuchen Bian 4.16.2015 MRWC: Clustering based on Multiple Random Walks Chain.

Slides:



Advertisements
Similar presentations
Stationary Probability Vector of a Higher-order Markov Chain By Zhang Shixiao Supervisors: Prof. Chi-Kwong Li and Dr. Jor-Ting Chan.
Advertisements

Copyright 2011, Data Mining Research Laboratory Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining Xintian Yang, Srinivasan.
Weiren Yu 1, Jiajin Le 2, Xuemin Lin 1, Wenjie Zhang 1 On the Efficiency of Estimating Penetrating Rank on Large Graphs 1 University of New South Wales.
School of Computer Science Carnegie Mellon University Duke University DeltaCon: A Principled Massive- Graph Similarity Function Danai Koutra Joshua T.
CompLACS Composing Learning for Artificial Cognitive Systems Year 2: Specification of scenarios.
Spark: Cluster Computing with Working Sets
1 Efficient Subgraph Search over Large Uncertain Graphs Ye Yuan 1, Guoren Wang 1, Haixun Wang 2, Lei Chen 3 1. Northeastern University, China 2. Microsoft.
Absorbing Random walks Coverage
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
Introduction to PageRank Algorithm and Programming Assignment 1 CSC4170 Web Intelligence and Social Computing Tutorial 4 Tutor: Tom Chao Zhou
Distributed PageRank Computation Based on Iterative Aggregation- Disaggregation Methods Yangbo Zhu, Shaozhi Ye and Xing Li Tsinghua University, Beijing,
Dimensionality Reduction
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 3 April 2, 2006
Presented by Ozgur D. Sahin. Outline Introduction Neighborhood Functions ANF Algorithm Modifications Experimental Results Data Mining using ANF Conclusions.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
A Local Facility Location Algorithm Supervisor: Assaf Schuster Denis Krivitski Technion – Israel Institute of Technology.
Minas Gjoka, UC IrvineWalking in Facebook 1 Walking in Facebook: A Case Study of Unbiased Sampling of OSNs Minas Gjoka, Maciej Kurant ‡, Carter Butts,
1 COMP4332 Web Data Thanks for Raymond Wong’s slides.
Markov Cluster Algorithm
Factor Graphs Young Ki Baik Computer Vision Lab. Seoul National University.
GDG DevFest Central Italy Joint work with J. Feldman, S. Lattanzi, V. Mirrokni (Google Research), S. Leonardi (Sapienza U. Rome), H. Lynch (Google)
Vilalta&Eick: Informed Search Informed Search and Exploration Search Strategies Heuristic Functions Local Search Algorithms Vilalta&Eick: Informed Search.
Limits of Local Algorithms in Random Graphs
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Liang Ge.  Introduction  Important Concepts in MCL Algorithm  MCL Algorithm  The Features of MCL Algorithm  Summary.
Random Walk with Restart (RWR) for Image Segmentation
DATA MINING LECTURE 13 Absorbing Random walks Coverage.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Network Characterization via Random Walks B. Ribeiro, D. Towsley UMass-Amherst.
1 Random Walks on Graphs: An Overview Purnamrita Sarkar, CMU Shortened and modified by Longin Jan Latecki.
Clustering Spatial Data Using Random Walks Author : David Harel Yehuda Koren Graduate : Chien-Ming Hsiao.
Clustering Spatial Data Using Random Walk David Harel and Yehuda Koren KDD 2001.
Mining Multiple Private Databases Topk Queries Across Multiple Private Databases (2005) Li Xiong (Emory University) Subramanyam Chitti (GA Tech) Ling Liu.
The PageRank Citation Ranking: Bringing Order to the Web Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd Presented by Anca Leuca, Antonis Makropoulos.
A General Optimization Framework for Smoothing Language Models on Graph Structures Qiaozhu Mei, Duo Zhang, ChengXiang Zhai University of Illinois at Urbana-Champaign.
Uncovering Overlap Community Structure in Complex Networks using Particle Competition Fabricio A. Liang
KDD 2007, San Jose Fast Direction-Aware Proximity for Graph Mining Speaker: Hanghang Tong Joint work w/ Yehuda Koren, Christos Faloutsos.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 24 Nov 2, 2005 Nanjing University of Science & Technology.
CS654: Digital Image Analysis
Probabilistic Models for Discovering E-Communities Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha The Pennsylvania State University WWW.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
KDD 2007, San Jose Fast Direction-Aware Proximity for Graph Mining Speaker: Hanghang Tong Joint work w/ Yehuda Koren, Christos Faloutsos.
Lecture #9: Introduction to Markov Chain Monte Carlo, part 3
The Markov Chain Monte Carlo Method Isabelle Stanton May 8, 2008 Theory Lunch.
1 Authors: Glen Jeh, Jennifer Widom (Stanford University) KDD, 2002 Presented by: Yuchen Bian SimRank: a measure of structural-context similarity.
Gerhard Haßlinger Search Methods in Dynamic Wireless Networks  Challenges for search in wireless networks  Random walks and flooding for search with.
Information-Theoretic Co- Clustering Inderjit S. Dhillon et al. University of Texas, Austin presented by Xuanhui Wang.
Presented by Dajiang Zhu 11/1/2011.  Introduction of Markov chains Definition One example  Two problems as examples 2-SAT Algorithm (simply introduce.
Kijung Shin Jinhong Jung Lee Sael U Kang
Lecture 14, CS5671 Clustering Algorithms Density based clustering Self organizing feature maps Grid based clustering Markov clustering.
Community detection via random walk Draft slides.
Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions by S. Mahadevan & M. Maggioni Discussion led by Qi An ECE, Duke University.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Anonymous communication over social networks Shishir Nagaraja and Ross Anderson Security Group Computer Laboratory.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
A Tutorial on Spectral Clustering Ulrike von Luxburg Max Planck Institute for Biological Cybernetics Statistics and Computing, Dec. 2007, Vol. 17, No.
Of 17 Limits of Local Algorithms in Random Graphs Madhu Sudan MSR Joint work with David Gamarnik (MIT) 7/11/2013Local Algorithms on Random Graphs1.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Perfect recall: Every decision node observes all earlier decision nodes and their parents (along a “temporal” order) Sum-max-sum rule (dynamical programming):
Random Sampling Algorithms with Applications Kyomin Jung KAIST Aug ERC Workshop.
SimRank: A Measure of Structural-Context Similarity Glen Jeh and Jennifer Widom Stanford University ACM SIGKDD 2002 January 19, 2011 Taikyoung Kim SNU.
Monte Carlo Linear Algebra Techniques and Their Parallelization Ashok Srinivasan Computer Science Florida State University
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Haim Kaplan and Uri Zwick
Carlos Ordonez, Predrag T. Tosic
Graph Clustering based on Random Walk
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
GANG: Detecting Fraudulent Users in OSNs
Lecture 2-6 Complexity for Computing Influence Spread
Locality In Distributed Graph Algorithms
Presentation transcript:

1 Presented by: Yuchen Bian MRWC: Clustering based on Multiple Random Walks Chain

2 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

3 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

4 1. Introduction and Motivation Random Walk Model: a b c 1 1/2 1 t=0 a b c 1 1/2 1 t=1 a b c 1 1/2 1 t=2 a b c 1 1/2 1 t=3

5 5 x t+1 (i) = ∑ j (Probability of being at node j)*Pr(j->i) =∑ j x t (j)*P(j,i) x t+1 = P T x t Long time after… x t+1 ≈ x t x t = P T x t Converge to a stationary distribution π no matter what the initial distribution is. For each π i π i =d(i)/2m 1. Introduction and Motivation Random Walk Model:

6 1. Introduction and Motivation Random Walk Model: π i =d(i)/2m Query node: 8

7 7 x t = P T x t e i is a vector in which only the i-th (query node) element is 1, otherwise Restart c0≤c<1 1. Introduction and Motivation Random Walk with Restart Model: x t = (1-c)P T x t +ce i

8 1. Introduction and Motivation Query node bias: sharp peak Query node: 8 Random Walk with Restart Model:

9 1. Introduction and Motivation For large graph, convergence needs more time. Query node: 8 Local clustering: Find cluster before convergence, even the RW will not reach some nodes. In fact, a RW might be restricted in the cluster with high probability, HOWEVER, it is also hard to travel back if RW pass through boundary Targets: restricted in the cluster which contains the query nodes. What if the query node(s) send out a series of RWs, not a single RW, hopefully, this RWs group is harder than single RW to travel through boundary.

10 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

11 Intuition: 2. Multiple Random Walks Chain (MRWC) From each query node, send a series of RWs to explore the graph, all RWs walk one by one, but the next vertex the current RW will explore is not only follow its own “thought” but also decided by other RWs. Then all RWs constructs a RWs group and this group is harder than a single RW to travel through the boundary.

12 Definitions: 2. Multiple Random Walks Chain (MRWC)

13 Definitions: 2. Multiple Random Walks Chain (MRWC)

14 Definitions: 2. Multiple Random Walks Chain (MRWC)

15 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

16 3. Experiments Computation and Egs: A3, P Naïve Method: Iteratively computation

17 A2, P2

18 3. Experiments Fig 1. Basic RWFig 2. RWR Fig 3. MRWC (k=2) Fig 4. MRWC (k=3)

19 3. Experiments MRWC (k=2) RWs’ position for each iteration W1-B*, W2-Rs

20 3. Experiments Fig 1. RWR Fig 3. MRWC (k=3) Fig 2. MRWC (k=2) RWs’ position for each iteration (k=2)

21 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

22 4. Conclusion Motivation: restrict into the target cluster Advantages: Increase the number of features Sharpen the boundary: harder to pass through than single RW Group activity not single activity (sharp peak) Disadvantages: Convergence issue Naïve method Evaluation to MRWC:

23 1. Introduction and Motivation ----Background 2. Multiple Random Walks Chain (MRWC) ----Intuition ----Definitions 3. Experiments 4. Conclusion 5. Future Work Content

24 5. Future Work Model: formal and general model Mathematical Analysis: Convergence? How to sharpen the boundary? Algorithm: Efficient computation or approximation Compare with other methods

25 Yuchen Bian Thank you! Q & A