Analysis of Network Diffusion and Distributed Network Algorithms Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization.

Slides:



Advertisements
Similar presentations
Matroid Bases and Matrix Concentration
Advertisements

Gibbs sampler - simple properties It’s not hard to show that this MC chain is aperiodic. Often is reversible distribution. If in addition the chain is.
The Randomization Repertoire Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopThe Randomization Repertoire1.
Rumors and Routes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopRumors and Routes1.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
1 The Monte Carlo method. 2 (0,0) (1,1) (-1,-1) (-1,1) (1,-1) 1 Z= 1 If  X 2 +Y 2  1 0 o/w (X,Y) is a point chosen uniformly at random in a 2  2 square.
Approximation Algorithms for Unique Games Luca Trevisan Slides by Avi Eyal.
Absorbing Random walks Coverage
Random Walks Ben Hescott CS591a1 November 18, 2002.
Entropy Rates of a Stochastic Process
Parallel random walks Brian Moffat. Outline What are random walks What are Markov chains What are cover/hitting/mixing times Speed ups for different graphs.
Overview of Markov chains David Gleich Purdue University Network & Matrix Computations Computer Science 15 Sept 2011.
More on Rankings. Query-independent LAR Have an a-priori ordering of the web pages Q: Set of pages that contain the keywords in the query q Present the.
Complexity 16-1 Complexity Andrei Bulatov Non-Approximability.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Graph Clustering. Why graph clustering is useful? Distance matrices are graphs  as useful as any other clustering Identification of communities in social.
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Semidefinite Programming
1 Mazes In The Theory of Computer Science Dana Moshkovitz.
CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.
1 Optimization problems such as MAXSAT, MIN NODE COVER, MAX INDEPENDENT SET, MAX CLIQUE, MIN SET COVER, TSP, KNAPSACK, BINPACKING do not have a polynomial.
Randomized Algorithms and Randomized Rounding Lecture 21: April 13 G n 2 leaves
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
Geographic Gossip: Efficient Aggregations for Sensor Networks Author: Alex Dimakis, Anand Sarwate, Martin Wainwright University: UC Berkeley Venue: IPSN.
Undirected ST-Connectivity in Log-Space By Omer Reingold (Weizmann Institute) Year 2004 Presented by Maor Mishkin.
Zig-Zag Expanders Seminar in Theory and Algorithmic Research Sashka Davis UCSD, April 2005 “ Entropy Waves, the Zig-Zag Graph Product, and New Constant-
Expanders Eliyahu Kiperwasser. What is it? Expanders are graphs with no small cuts. The later gives several unique traits to such graph, such as: – High.
Complexity 1 Mazes And Random Walks. Complexity 2 Can You Solve This Maze?
Distributed Combinatorial Optimization
Undirected ST-Connectivity In Log Space
Undirected ST-Connectivity In Log Space Omer Reingold Slides by Sharon Bruckner.
cover times, blanket times, and majorizing measures Jian Ding U. C. Berkeley James R. Lee University of Washington Yuval Peres Microsoft Research TexPoint.
1 Entropy Waves, The Zigzag Graph Product, and New Constant-Degree Expanders Omer Reingold Salil Vadhan Avi Wigderson Lecturer: Oded Levy.
Randomized Algorithms Morteza ZadiMoghaddam Amin Sayedi.
Efficient and Robust Query Processing in Dynamic Environments Using Random Walk Techniques Chen Avin Carlos Brito.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
DATA MINING LECTURE 13 Absorbing Random walks Coverage.
Expanders via Random Spanning Trees R 許榮財 R 黃佳婷 R 黃怡嘉.
Distributed Algorithms Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopDistributed Algorithms1.
DATA MINING LECTURE 13 Pagerank, Absorbing Random Walks Coverage Problems.
Fan Chung University of California, San Diego The PageRank of a graph.
Many random walks are faster than one Noga AlonTel Aviv University Chen AvinBen Gurion University Michal KouckyCzech Academy of Sciences Gady KozmaWeizmann.
Spanning and Sparsifying Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopSpanning and Sparsifying1.
15-853:Algorithms in the Real World
Markov Chains and Random Walks. Def: A stochastic process X={X(t),t ∈ T} is a collection of random variables. If T is a countable set, say T={0,1,2, …
Percolation Processes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopPercolation Processes1.
Cover times, blanket times, and the GFF Jian Ding Berkeley-Stanford-Chicago James R. Lee University of Washington Yuval Peres Microsoft Research.
C&O 355 Mathematical Programming Fall 2010 Lecture 16 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
Random walks on undirected graphs and a little bit about Markov Chains Guy.
Approximation, Chance and Networks Lecture Notes BISS 2005, Bertinoro March Alessandro Panconesi University La Sapienza of Rome.
Seminar on random walks on graphs Lecture No. 2 Mille Gandelsman,
Artur Czumaj DIMAP DIMAP (Centre for Discrete Maths and it Applications) Computer Science & Department of Computer Science University of Warwick Testing.
Graph Partitioning using Single Commodity Flows
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Date: 2005/4/25 Advisor: Sy-Yen Kuo Speaker: Szu-Chi Wang.
PERTEMUAN 26. Markov Chains and Random Walks Fundamental Theorem of Markov Chains If M g is an irreducible, aperiodic Markov Chain: 1. All states are.
Presented by Alon Levin
S TOCHASTIC M ODELS L ECTURE 4 B ROWNIAN M OTIONS Nan Chen MSc Program in Financial Engineering The Chinese University of Hong Kong (Shenzhen) Nov 11,
Approximation Algorithms based on linear programming.
Theory of Computational Complexity Probability and Computing Lee Minseon Iwama and Ito lab M1 1.
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
1 “Hybrid Search Schemes for Unstructured Peer- to-Peer Networks” “Random Walks in Peer-to-Peer Networks” Christos Gkantsidis, Milena Mihail, Amin Saberi.
Markov Chains and Random Walks
Markov Chains Mixing Times Lecture 5
Path Coupling And Approximate Counting
Spectral Clustering.
Computability and Complexity
Randomized Algorithms Markov Chains and Random Walks
On the effect of randomness on planted 3-coloring models
3.3 Network-Centric Community Detection
Presentation transcript:

Analysis of Network Diffusion and Distributed Network Algorithms Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopAND and DNA1

Overview of the 4 Sessions Random walks Percolation processes – Branching processes, random graphs, and percolation phenomena Rumors & routes – Rumor spreading, small-world model, network navigability Distributed algorithms – Maximal independent set, dominating set, local balancing algorithms Chennai Network Optimization WorkshopAND and DNA2

Random Walks Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopRandom Walks3

Outline Basic definitions and notation Applications Two results: – Mixing time and convergence of random walks – Cover time of random walks Applications to clustering Techniques: – Probability theory – Spectral graph theory Chennai Network Optimization WorkshopRandom Walks4

What is a Random Walk? Let G be an arbitrary undirected graph A walk starts at an arbitrary vertex v 0 At the start of step t, the walker moves from v t-1 to vertex v t chosen uniformly at random from neighbors of v t-1 in G For all t > 0, v t is a random variable Chennai Network Optimization WorkshopRandom Walks5

Notation Let G be an arbitrary undirected graph and A be its adjacency matrix – A ij is 1 whenever there is an edge (i,j) Define the random walk matrix M – M ij is A ij /degree(i) Let x denote the initial probability distribution (row) vector After t steps, the probability distribution vector equals xM t Chennai Network Optimization WorkshopRandom Walks6

Definitions Stationary distribution – Probability vector π such that πM = π Hitting time h ij – Expected time for random walk starting from i to visit j Cover time C – Expected time for random walk starting from an arbitrary vertex to visit all nodes of G Mixing time – Time it takes for the random walk to converge to a stationary distribution Chennai Network Optimization WorkshopRandom Walks7

Questions of Interest Stationary distribution: – Do they always exist? – Is the stationary distribution unique? – Does a random walk always converge to a stationary distribution? If it does, what is the mixing time? Hitting time: – For a given graph G and vertices i,j, what is the hitting time h ij Cover time: – For a given graph G, what is its cover time C? Chennai Network Optimization WorkshopRandom Walks8

Applications Probabilistic process whose variants capture social and physical phenomena – Brownian motion in physics – Spread of epidemics in contact networks – Spread of innovation and influence in social networks – Connections to electrical networks – Markov chains arise in numerous scenarios Pseudo-random number generators – Random walk in an expander graph is an efficient way to generate pseudo-random bits from a small random seed Use in randomized algorithms Google’s PageRank – PageRank is the probability vector of the stationary distribution of an appropriately defined random walk Chennai Network Optimization WorkshopRandom Walks9

Stationary Distribution and Mixing Time Lemma 1: A stationary distribution always exists and is unique For d-regular undirected graphs G, let λ(G) denote the second largest eigenvalue of M Theorem 1: The random walk is within ε of the stationary distribution in – For non-regular graphs, replace M by a normalized version Chennai Network Optimization WorkshopRandom Walks10

Cover Time Matthews Bound: – Let h max be the maximum hitting time – The cover time is at most h max ln(n) Exercise: Prove that time for a random walk to cover every vertex is O(h max log(n)) whp Theorem 2: For any m-edge n-vertex undirected graph G, the cover time is O(mn) [Mitzenmacher-Upfal 04] Chennai Network Optimization WorkshopRandom Walks11

Lovasz-Simonovits Theorem Lazy random walk: – With probability ½, walk stays at current node; – With probability ½, does regular random walk Theorem 3: [LS 93] For any initial probability distribution and every t, we have Chennai Network Optimization WorkshopRandom Walks12

Sparsest Cut Conductance: Φ measures the “expansion” of a graph Finding the cut (S,V-S) that yields the above minimum ratio is the sparsest cut problem LP rounding: Yields O(log(n))-approximation [Leighton-Rao 88, Linial-London-Rabinovich 94] SDP rounding: Yields O(√log(n)) approximation [Arora-Rao-Vazirani 05] Chennai Network Optimization WorkshopRandom Walks13

Local Clustering Suppose you are given a massive graph and want to find a “good” cluster containing a given vertex v – Good means low conductance Approach: Solve the sparsest cut problem and return the cluster containing v – Too expensive Local clustering [Spielman-Teng 08, Andersen-Chung-Lang 08]: – Start a random walk from v, maintaining the probability vector for each vertex – Keep zeroing out vertices that have very low probability – LS Theorem helps in showing that in time nearly proportional to the size of the cluster, can achieve close to desired conductance Chennai Network Optimization WorkshopRandom Walks14

Take Away Messages Random walk and related processes – Arise in several scenarios – Are useful primitives for designing fast algorithms – Yield effective and practical pseudo-random sources Analysis tools for random walks – Basic probability (Markov’s inequality, Chernoff- type bounds, Martingales) – Spectral graph theory Chennai Network Optimization WorkshopRandom Walks15