Sequential Algorithms for Generating Random Graphs

Slides:



Advertisements
Similar presentations
Routing Complexity of Faulty Networks Omer Angel Itai Benjamini Eran Ofek Udi Wieder The Weizmann Institute of Science.
Advertisements

The Theory of Zeta Graphs with an Application to Random Networks Christopher Ré Stanford.
Gibbs sampler - simple properties It’s not hard to show that this MC chain is aperiodic. Often is reversible distribution. If in addition the chain is.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
1 The Monte Carlo method. 2 (0,0) (1,1) (-1,-1) (-1,1) (1,-1) 1 Z= 1 If  X 2 +Y 2  1 0 o/w (X,Y) is a point chosen uniformly at random in a 2  2 square.
Randomized Algorithms Kyomin Jung KAIST Applied Algorithm Lab Jan 12, WSAC
Markov Chains 1.
1 Finite-Length Scaling and Error Floors Abdelaziz Amraoui Andrea Montanari Ruediger Urbanke Tom Richardson.
Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.
On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.
CS774. Markov Random Field : Theory and Application Lecture 04 Kyomin Jung KAIST Sep
CS774. Markov Random Field : Theory and Application Lecture 16 Kyomin Jung KAIST Nov
1 Maximal Independent Set. 2 Independent Set (IS): In a graph, any set of nodes that are not adjacent.
Computational statistics 2009 Random walk. Computational statistics 2009 Random walk with absorbing barrier.
EXPANDER GRAPHS Properties & Applications. Things to cover ! Definitions Properties Combinatorial, Spectral properties Constructions “Explicit” constructions.
Advanced Topics in Data Mining Special focus: Social Networks.
Mixing Times of Markov Chains for Self-Organizing Lists and Biased Permutations Prateek Bhakta, Sarah Miracle, Dana Randall and Amanda Streib.
Neighbourhood Sampling for Local Properties on a Graph Stream A. Pavan, Iowa State University Kanat Tangwongsan, IBM Research Srikanta Tirthapura, Iowa.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
1 “Erdos and the Internet” Milena Mihail Georgia Tech. The Internet is a remarkable phenomenon that involves graph theory in a natural way and gives rise.
Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
10 December, 2008 CIMCA2008 (Vienna) 1 Statistical Inferences by Gaussian Markov Random Fields on Complex Networks Kazuyuki Tanaka, Takafumi Usui, Muneki.
15-853:Algorithms in the Real World
Markov Chains and Random Walks. Def: A stochastic process X={X(t),t ∈ T} is a collection of random variables. If T is a countable set, say T={0,1,2, …
M ONTE C ARLO SIMULATION Modeling and Simulation CS
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
Seminar on random walks on graphs Lecture No. 2 Mille Gandelsman,
De novo discovery of mutated driver pathways in cancer Discussion leader: Matthew Bernstein Scribe: Kun-Chieh Wang Computational Network Biology BMI 826/Computer.
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Date: 2005/4/25 Advisor: Sy-Yen Kuo Speaker: Szu-Chi Wang.
Spatial decay of correlations and efficient methods for computing partition functions. David Gamarnik Joint work with Antar Bandyopadhyay (U of Chalmers),
geometric representations of graphs
Sampling algorithms and Markov chains László Lovász Microsoft Research One Microsoft Way, Redmond, WA 98052
1 Distributed Vertex Coloring. 2 Vertex Coloring: each vertex is assigned a color.
1 Milena Mihail Georgia Tech. Algorithmic Performance in Complex Networks.
Random Sampling Algorithms with Applications Kyomin Jung KAIST Aug ERC Workshop.
Dynamic Network Analysis Case study of PageRank-based Rewiring Narjès Bellamine-BenSaoud Galen Wilkerson 2 nd Second Annual French Complex Systems Summer.
Krishnendu ChatterjeeFormal Methods Class1 MARKOV CHAINS.
Network (graph) Models
DOULION: Counting Triangles in Massive Graphs with a Coin
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Degree-based graph construction & Sampling
Search Engines and Link Analysis on the Web
Complex Networks: Connectivity and Functionality
Maximal Independent Set
From dense to sparse and back again: On testing graph properties (and some properties of Oded)
Path Coupling And Approximate Counting
Density Independent Algorithms for Sparsifying
Random Graph Models of large networks
Hidden Markov Models Part 2: Algorithms
Maximal Independent Set
Models of Network Formation
Models of Network Formation
Probabilistic existence of regular combinatorial objects
Models of Network Formation
Chapter 9: Graphs Basic Concepts
Models of Network Formation
geometric representations of graphs
Modelling and Searching Networks Lecture 6 – PA models
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Chapter 9: Graphs Basic Concepts
Network Models Michael Goodrich Some slides adapted from:
Independence and Counting
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
Planting trees in random graphs (and finding them back)
Presentation transcript:

Sequential Algorithms for Generating Random Graphs Amin Saberi Stanford University

Outline Generating random graphs with given degrees (joint work with M Bayati and JH Kim) Generating random graphs with large girth (joint work with M Bayati and A Montanari)

Problem Given integers Generate a simple graph with that degree sequence chosen uniformly at random Example After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018

Application Generating large sparse graphs, with sparse (typically power-law) degree sequence Simulating Internet topology (e.g. Inet) Biological Networks (motif detection) motif: sub-networks with higher frequency than random Coding theory: bipartite graphs with no small stopping sets After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 4

Existing methods (Theory) Markov Chain Monte Carlo method The switch chain It is ``Rapidly mixing’’: [Kannan-Tetali-Vampala 99] [Cooper-Dyer-GreenHill 05] [Feder-Guetz-S.-Mihail 07] Jerrum-Sinclair chain (Walk on the self-reducibility tree) Running time at least O(n7) Running time at least O(n4) A month on Pentium 2 for n = 1000 6/15/2018 5

Existing methods (Practice) Lots of heuristics: INET, PEG, … Example: Milo et al., Science 2002; Kashtan et al. 2004 on Network Motifs The heuristic used for generating random graphs has a substantial bias After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 6

New method: Sequential Importance Sampling Very successful in practice: Knuth’76: for counting self-avoiding random walks estimating the running time of heuristics More recently for random graph generation: Chen-Diaconis-Holmes-Liu’05, Blitzstein-Diaconis’05 No analysis! (with the exception of this work and Blanchet 06) Stanford Inspired by Chen et al and BD.. 6/15/2018 7 7

Our Algorithm failure Same calculation in 10 microseconds Repeat Add an edge between (i,j) with probab. Until edges are added or there are no valid choices left remaining degree degree 2 4 1 3 1 1 2 2 3 1 2 2 1 1 failure 1 1 2 6/15/2018 8

Analysis of the Algorithm Theorem 1 (Bayati-Kim-S. 07): The running time of the algorithm is Furthermore, if Or if the degree sequence is regular and Algorithm is successful with probability The probability of generating any graph is where L is the number of graphs A bit on referencing For regular output distribution is uniform in total variation distance. 6/15/2018 9 9

Basic Idea of The tree of execution the prob. of choosing a sub-tree should be proportional to the number of valid leaves The empty graph Choosing the k th edge After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 10

Basic Idea of The tree of execution the prob. of choosing a sub-tree should be proportional to the number of valid leaves The empty graph Choosing the k th edge After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 11

Basic Idea of The tree of execution the prob. of choosing a sub-tree should be proportional to the number of valid leaves Technical ingredient: concentration results on the distribution of leaves in each sub-tree (improving Kim-Vu 06, McKay-Wormald 91) The empty graph Choosing the k th edge After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 12

Analysis of the Algorithm Theorem 1 (Bayati-Kim-S. 07): The running time of the algorithm is Furthermore, if Or if the degree sequence is regular and Algorithm is successful with probability The probability of generating any graph is where L is the number of graphs A bit on referencing For regular output distribution is uniform in total variation distance. 6/15/2018 13 13

Sequential Importance Sampling Consider a run of the algorithm Let Pr be the probability of the edge chosen in step r Define Crucial observation: X is an unbiased estimator for the number of graphs A bit on referencing For regular output distribution is uniform in total variation distance. 6/15/2018 14 14

Using SIS to get an FPRAS By taking several samples of X, we can have a good estimate of L Then using the right rejection sampling: Theorem 2 (Bayati-Kim-S. ’07): Can generate any graph with probability of uniform. In time A bit on referencing For regular output distribution is uniform in total variation distance. An FPRAS for counting and random generation 6/15/2018 15 15

Outline Generating random graphs with given degrees (joint work with M Bayati and JH Kim) Generating random graphs with large girth (joint work with M Bayati and A Montanari)

Graphs with large girth Check bits (mod 2) Mod 2, For example x2 can be covered from second equation… What is degree ? Fast decoding-algorithm These are used on Mars Reconessance space craft. Challenge: Construct graphs with given degrees with no short cycles Amraoui-Montanari-Urbanke’06. 6/15/2018

Example: Triangle free graphs. Consider all graphs with n vertices and m edges. Let G be one such graph chosen uniformly at random. Can think of G as Erdös-Renyi graph where = number of triangle sub-graphs of G. Explain about Poisson approximation Same phase transition holds when we need graphs of girth k. 6/15/2018

Our Algorithm Initialize G by an empty graph with vertices Repeat Choose a pair from the set of suitable pairs and set Until m edges are added or there are no suitable pair available (failure) Explain what “this works” means. 6/15/2018

Theorem (Bayati, Montanari, S. 07) For a suitable Pij and the output distribution of our algorithm is asymptotically uniform. i.e. Furthermore, the algorithm is successful almost surely. Output dist. of algorithm Uniform probability Remark: can be extended to degree sequences applicable to LDPC codes… 6/15/2018

Coordinate-wise multiplication What is Pij Consider the partially constructed graph G with t edges. Let S be the n£n matrix of all suitable pairs. Let A, Ac be adjacency matrix of G and its complement. can be calculated quickly (e.g. with MATLAB) Coordinate-wise multiplication 6/15/2018

Where does h come from? The execution tree: Problem: estimate the number of valid leaves of each subtree In other words estimate the number of extensions of the partial graph that do not have short cycles After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 22

Where does h come from? add the remaining m - k edges uniformly at random, and compute the expected number of small cycles Y. Assuming the distribution of small cycles is Poisson, the probability of having no small cycles is e-Y After all the pictures Simple ? Labeled ? Uniform means all possible scenarios are equally likely 6/15/2018 23

Summary Random simple graphs with given degrees Random bipartite graphs with given degrees and large girth More extensive analysis of SIS?