1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.

Slides:



Advertisements
Similar presentations
Generating Random Spanning Trees Sourav Chatterji Sumit Gulwani EECS Department University of California, Berkeley.
Advertisements

Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Fundamental Relationship between Node Density and Delay in Wireless Ad Hoc Networks with Unreliable Links Shizhen Zhao, Luoyi Fu, Xinbing Wang Department.
Energy-Efficient Distributed Algorithms for Ad hoc Wireless Networks Gopal Pandurangan Department of Computer Science Purdue University.
0 - 0.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
Addition Facts
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Modeling Grid Job Time Properties Lovro.
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Complex Networks Advanced Computer Networks: Part1.
Scale Free Networks.
Routing and Congestion Problems in General Networks Presented by Jun Zou CAS 744.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Scalable and Dynamic Quorum Systems Moni Naor & Udi Wieder The Weizmann Institute of Science.
Addition 1’s to 20.
Test B, 100 Subtraction Facts
11 = This is the fact family. You say: 8+3=11 and 3+8=11
2 4 Theorem:Proof: What shall we do for an undirected graph?
Week 1.
The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
Small-world networks.
Collective Dynamics of ‘Small World’ Networks C+ Elegans: Ilhan Savut, Spencer Telford, Melody Lim 29/10/13.
Online Social Networks and Media Navigation in a small world.
Rumors and Routes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopRumors and Routes1.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Lecture 7 CS 728 Searchable Networks. Errata: Differences between Copying and Preferential Attachment In generative model: let p k be fraction of nodes.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
1 Analyzing Kleinberg’s (and other) Small-world Models Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Small World Networks Somsubhra Sharangi Computing Science, Simon Fraser University.
Advanced Topics in Data Mining Special focus: Social Networks.
The Small World Phenomenon: An Algorithmic Perspective by Anton Karatoun.
1 Analyzing Kleinberg’s (and other) Small-world Models Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
It’s a Small World After All Kim Dressel - The small world phenomenon Please hold applause until the end of the presentation. Angie Heimkes Eric Larson.
The Science of Networks 6.1 Overview Social Goal. Explain why information and disease spread so quickly in social networks. Mathematical Approach. Model.
Proof of Kleinberg’s small-world theorems
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Eric Horvitz, Michael Mahoney,
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Hot Topics in Peer-to-Peer Computing (HOT-P2P 2004) Volendam 08 October 2004 Non-uniform deterministic routing on F-Chord(  ) Gennaro Cordasco, Luisa.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Gennaro Cordasco - How Much Independent Should Individual Contacts be to Form a Small-World? - 19/12/2006 How Much Independent Should Individual Contacts.
Online Social Networks and Media
Navigation in small worlds Social Networks: Models and Applications Seminar Toronto, Fall 2007 (based on a presentation by Stratis Ioannidis)
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
1 Analyzing and Characterizing Small-World Graphs Van Nguyen and Chip Martel Computer Science, UC Davis.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Models and Algorithms for Complex Networks
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Navigation in Networks, Revisited Networked Life MKSE 112 Fall 2012 Prof. Michael Kearns.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Small-world phenomenon: An Algorithmic Perspective Jon Kleinberg.
Algorithms For Small World Networks Anurag singh.
Lecture 1: Complex Networks
Identity and Search in Social Networks
Peer-to-Peer and Social Networks
Milgram’s experiment really demonstrated two striking facts about large social networks: first, that short paths are there in abundance;
Department of Computer Science University of York
Proof of Kleinberg’s small-world theorems
Navigation and Propagation in Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis

2 Contents Small-world phenomenon & Models Kleinberg’s Model Greedy routing The diameter of Kleinberg’s grid

3 Small-world phenomenon Nebraska Boston Two strangers meet and discover they are connected by a short chain of acquaintances Milgram’s pioneering work (1967): “six degrees of separation between any two Americans” Source person in Nebraska, target person in Boston People forward to someone they know on a first-name basis Paths were typically quite short

4 Milgram’s result shows that not only do short chains exist but they can be found using only local knowledge of links (people sent letters only knowing their own friends). People also knew about the general geography of of the “network”: e.g. New York is close to Boston. How can we model such networks?

5 Small World Properties Small diameter: short path between all pairs (or almost all) Efficient Greedy routing: short paths can be found with local knowledge Clustering: If there is a link (u,v) and (u,w) then more likely to be a link (v,w)

6 Modeling Small-Worlds Many networks are Small-Worlds (e.g. WWW, Social Networks, Physical systems) Motivated models of small-worlds: (Watts-Strogatz, Kleinberg) New Analysis and Algorithms Applications peer-to-peer systems gossip protocols secure distributed protocols

7 Kleinberg’s Model Based on an n by n, 2-D grid, where each node has 4 local undirected links

8 Kleinberg’s Model Based on an n by n, 2-D grid, where each node has 4 local undirected links q=2 Add q directed random links per node

9 Kleinberg’s Model Based on an n by n, 2-D grid, where each node has 4 local undirected links Add q directed random links per each node where  Define d(u,v): lattice distance between u and v u v d(u,v)=2+5=7  Now, u has a link to v with probability proportional to d -r (u,v). Parameter r determines crucial behaviors of the model.

10 Increasng r favors near nodes r=0, Link to each other node equally likely r=1, inverse of distance If a node is twice as far away, 1/2 as likely r=2, inverse squared If a node is twice as far away, 1/4 as likely d -r (u,v) =1, Uniform Distribution

11 Normalization Constant For a fixed r and u sum the probabilities to each other node to get the normalization constant C. Thus Pr[u->v] = 1/C * d -r (u,v) r = 0, C = n 2 –1 So, Pr[u->v] = 1/(n 2 –1 ) For all v. r = 2, C = O(logn) Pr[u->v] = (1/logn) * d -2 (u,v)

12 Kleinberg’s SW network is Greedy Routable iff r=2 Greedy routing algorithm using local information only, find a short path from s to t  When u is the current node, choose next v: the closest to t (use lattice distance) with (u,v) a local or random edge. s u t v

13 Kleinberg’s SW network is Greedy Routable iff r=2 A greedy routing algorithm using local information only, find a short path from s to t u t v s This greedy routing achieves expected ` delivery time ’ of O(log 2 n), i.e. the s  t paths have expected length O(log 2 n).

14 Kleinberg’s SW network is Greedy Routable iff r=2 A greedy routing algorithm using local information only, find a short path from s to t u t v s This greedy routing achieves expected ` delivery time ’ of O(log 2 n), i.e. the s  t paths have expected length O(log 2 n). This does not work unless r=2 : for r  2,  >0 such that the expected delivery time of any decentralized algorithm is  (n  ).

15 Greedy Routing Analysis We say that the algorithm is in phase i If the current node u has 2 i  d(u,t) < 2 i+1 The initial phase has i  logn since d(s,t) < 2n for any pair. If we are in phase i, how likely to jump to state i-1 ? Pretty easy to show Pr[u->v| d(v,t) 1/ logn

16 Greedy Routing The initial phase has i  logn We jump to the next lower phase with probability about 1/logn So, expected O(logn) hops/ phase After at most logn phases done. total expected hops = logn * O(logn)= O(log 2 n).

17 Our Results An analysis of the expected diameter of Kleinberg's setting. For a 2D-grid, One random link/node (q=1) If 0  r  2 : diameter=  (logn) – PODC’04

18 Our Results An analysis of the expected diameter of Kleinberg's setting. For a 2D-grid, One random link/node (q=1) If 0  r  2: diameter=  (logn) – PODC’04 If 2< r <4: diameter < log c n SODA’05 If 4 n c for 0<c<1

19 Our Results An analysis of the expected diameter of Kleinberg's setting. For a 2D-grid, One random link/node (q=1) If 0  r  2: diameter=  (logn) – PODC’04 If 2 1 If 4 n c for 0<c<1 Can be generalized for k-D grid, say if k 1

20 Our new results: Routing For a k-dimensional lattice model 1.The expected length of Kleinberg’s greedy paths is  (log 2 n). Also, they are this long with constant probability. 2.With more local knowledge we can improve the path length to O(log 1+1/k n)

21 Prior work on similar (diameter) problems Diameter of a cycle plus a random matching: Bollobas & Chung, 88 Can be seen as a special case of Kleinberg’s grid setting where: 1-D lattice, undirected graph, r=0 (random links are uniform) Diameter of long-range percolation graphs Benjamini & Berger, 2001 Coppersmith et al., 2002 Biskup, 2004: similar to our approach

22 O(log n) Expected Diameter Proof for simple setting :  2D grid with wraparound  4 random links per node, with r=2  Extend to:  K-D grids, 1 random link,  No wraparound

23 The diameter bound: Intuition We construct neighbor trees from s and to t: is the nodes within logn of s in the grid is nodes at distance i (random links) from s

24 T-Tree is the nodes within logn of t in the grid is nodes at distance i (random links) to t

25 Small-worlds: Finding a short path from s to t S2S2 SjSj S1S1 TiTi T2T2 T1T1 t S0S0 T0T0 s t S i = nodes at distance i from S 0, an Initial neighbor set of sufficient size T i is nodes at distance i (random links) to Initial set T 0 We want: the {S i } and {T i } to grow exponentially until big enough, so the two subset chains intersect with high probability.

26 After O(logn) Growth steps and are almost surely of size nlogn  Thus the trees almost surely connect  Similar to Bollobas-Chung approach for a ring + random matching.  But new complications since non-uniform distribution and directed edges Subset chains

27 Proving Exponential Growth Growth rate depends on set size and shape We analyze using an artificial experiment

28 Links into or out of a ball Motivation Links to outside For set C, node u  C, a random link from u: How likely is this link to leave C ? Links into  Given: subset C, node u  C.  How likely is a link to u from outside C ? Worst shape for C: A ball (with same size)

29 Exponential Growth u C Neighbor sets should have exponential growth If a node u is surrounded by a moderate size set of vertices C, a random link from u is likely to “escape” from C.

30 Links into or out of a ball: the facts B L (u) ={nodes within distance L from u } For any 0<  <1, any integer 1  L  n , for n large The Pr[ u->v | v outside of B L ( u ) ] > 1-  -o(1) Similar for a random link to u from outside of B L ( u ) Note that B L ( u ) has about L 2 nodes. For a ball with radius n.51 a random link from the center leaves the ball with probability >.48 With 4 links, expect 4*.48 > 1.9 new nodes.

31 S-Tree growth By making the initial set larger than clogn, a growth step is exponential with probability: By choosing c large enough, we can make m large enough so our sets almost surely grow exponentially to size nlogn

32 The t-Tree Ball experiment for t-tree needs some extra care (links are conditioned) Still can show exponential growth Easy to show two  (nlogn) size sets of `fresh’ nodes intersect or a link from s-set hits t-set More care on constants leads to a diameter bound of 3logn + o(logn)

33 Reducing the Random Links To change from 1/node to 4/node: Collect nodes into super-nodes Each 2x2 square contracted to a super- node New graph has 4 random links /node and diameter differs from old by a constant factor

34 Diameter Results Thus, for a K-D grid with added link(s) from u to v proportional to The expected diameter is  (log n) for Now look at r > k.

35 The diameter of Kleinberg’s SW setting For simplicity, use a 1-D setting Define C(r,n) as an n-node cycle. Each node has 2 local links and One directed random-link: i is connected to j  i with Pr[i  j] ~ |i-j| -r For 0  r  1, we showed the diameter is  (logn) Now consider r> n i j

36 Upper bound for the diameter of C(r,n) when 1<r<2 We use a probabilistic recurrence approach Our approach is similar to Karp's (STOC’91) We establish a (probabilistic) relation between the diameter of a segment and that of a smaller one.

37 Upper bound for the diameter of C(r,n) when 1<r<2 We use a (probabilistic) relation between the diameter of a segment and a sub-segment.  We relate D(x), the diameter of a segment of length x, to D(y), where y=x a for some a  (0,1).  Intuitively, w.h.p, D(x) is bounded by a constant multiple of D(y).

38 Upper bound for the diameter of C(r,n) when 1<r<2  Iterating the relation, starting with x=n, standard recurrence techniques bound D(n) - the graph's expected diameter - based on D(x 0 ) for some x 0 small enough (a poly-log function of n ). D(n) D( n a ) D(n a 2 ) D(x 0 ) …

39 Partitioning: A segment of length x is divided into multiple sub-segments of length y=x a for a  (0,1). Partitioning Hierarchy

40 A partition is complete when every pair of sub-segments has two random directed edges connecting one to the other. Partitioning A segment of length x is divided into multiple sub-segments of length y=x a for some a  (0,1). Partitioning Hierarchy A B

41 We iterate this partitioning from x=n to some small x 0 ( for fixed a ). We need to specify y ( or a ) s.t. Small enough  # iterations is order loglog (n) Not too small  Almost surely, each phase’s partition is complete Partitioning Hierarchy D(n) D( n a ) D(n a 2 ) D(x 0 ) …

42 Supporting Facts Fact 1: For a fixed a s.t. r/2< a <1 and for x large enough, almost surely, all partitions of length x segments are complete Note: 0<r<1 and y=x a Implies that all sub-segments are large enough so can get to another by one link.

43 Fact 2: If a partition of a segment of length x is complete, then almost surely D(x) is at most twice the maximum diameter of a subsegment, plus 1. Basically, any shortest path s  t can be upper bounded by two shortest paths within a sub-segment plus 1 length(s  t)  length(s  v)+length(w  t)+1 for (v,w)  2 max D(y) +1 Supporting Facts A u u+x-1 s t v w * * B

44 Fact 2 still true if we redefine D(x) as the maximum value of the diameters of all segments of length x Supporting Facts Fact 2: If a partition of a segment of length x is complete, then almost surely D(x) is at most twice the maximum diameter of a sub-segment, plus 1.

45 Poly-log diameter for 1  r  2 Consider the sequence of maximum diameter values in our partitioning hierarchy D( n), D( n a ), …,D( x 0 ) Where almost surely, D( x)  2 D( x a ) +1 The # of terms is  (loglog n) D( x 0 )  x 0, bounded by a poly-log(n) So, D( n) = O(log c n) for c>0 depending on r only

46 The diameter of C(r,n) For r>2, C(r,n) is a ‘large’ world expected diameter  (n c ), c= r-1 / r  Random links tend to go to close nodes  Few long links

47 Higher dimensions We generalize to k-dimensional grids If 0  r  k: diameter=  (logn) If k 1 If 2k n c for 0<c<1 The case r=2k is still open.

48 Analyzing Greedy Routing For r=k (so r=2 for 2D grid), Kleinberg shows greedy routing is O((log 2 n). We show this bound is tight, and: With probability greater than1/2, Kleinberg’s algorithm uses at least clog 2 n steps.  Fraigniaud et. al also show tight bound, and  Suggested by Barriere et. al 1-D result.

49 Proof of the tight bound ( ideas ) How fast does a step reduce the remaining distance to the destination? We measure the ratio between the distance to t before and after each random trial: We reach t when the product of the ratios =d(s,t)

50 Rate of Progress To avoid a product of ratios, we transform to Z v, log of the ratio: d(v,t)/d(v’,t) where v’ is the next vertex. Done when sum of Z v totals log(d(s,t)) Show E[Z v ] = O(1/logn), so need  (log 2 n) steps to total log(d(s,t))= logn.

51 An important technical issue: Links to a k-D surface What is the probability to get to a given distance from t ?  Let B = {nodes within distance L from t } and S B - its surface  Given node v outside B and a random link from v, what is the chance for this link to get to S B ? v t m L

52 Part III: Future work The diameter for r=2k (poly-log or polynomial)? Improved algorithms for decentralized routing A routing decision would depend on:  the distance from the new node to the destination  neighborhood information. Better models for small-world graphs