Small-World Navigability

Slides:



Advertisements
Similar presentations
Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
Advertisements

Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Complex Networks Advanced Computer Networks: Part1.
By Jon Kleinberg Bo Young Kim Applied Algorithm Lab.
The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Milgram-Routing in Social Networks
Analysis and Modeling of Social Networks Foudalis Ilias.
Online Social Networks and Media Navigation in a small world.
Rumors and Routes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopRumors and Routes1.
Information Networks Small World Networks Lecture 5.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Lecture 7 CS 728 Searchable Networks. Errata: Differences between Copying and Preferential Attachment In generative model: let p k be fraction of nodes.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Small-World Graphs for High Performance Networking Reem Alshahrani Kent State University.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
1 Analyzing Kleinberg’s (and other) Small-world Models Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Advanced Topics in Data Mining Special focus: Social Networks.
The Small World Phenomenon: An Algorithmic Perspective by Anton Karatoun.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006

Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.
CCAN: Cache-based CAN Using the Small World Model Shanghai Jiaotong University Internet Computing R&D Center.
Gennaro Cordasco - How Much Independent Should Individual Contacts be to Form a Small-World? - 19/12/2006 How Much Independent Should Individual Contacts.
Online Social Networks and Media
School of Information University of Michigan SI 614 Livejournal Lecture 23.
Navigation in small worlds Social Networks: Models and Applications Seminar Toronto, Fall 2007 (based on a presentation by Stratis Ioannidis)
It's a small world but I wouldn't want to paint it - Stephen Wright.
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Lecture 17: Search in structured networks CS 765 Complex Networks Slides are modified from Lada Adamic.
How Do “Real” Networks Look?
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Models and Algorithms for Complex Networks
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Lecture 23: Structure of Networks
Connectivity and the Small World
Algorithms For Small World Networks Anurag singh.
Lecture 1: Complex Networks
Topics In Social Computing (67810)
Search in structured networks
Identity and Search in Social Networks
Peer-to-Peer and Social Networks
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
“It’s a small world after all!’
Milgram’s experiment really demonstrated two striking facts about large social networks: first, that short paths are there in abundance;
The Watts-Strogatz model
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Lecture 13 Network evolution
Small World Networks Scotty Smith February 7, 2007.
How Do “Real” Networks Look?
Peer-to-Peer Information Systems Week 6: Performance
Lecture 23: Structure of Networks
Lecture 21 Network evolution
Network Science: A Short Introduction i3 Workshop
Navigation and Propagation in Networks
Advanced Topics in Data Mining Special focus: Social Networks
Small-World Phenomena and Decentralized Search
Advanced Topics in Data Mining Special focus: Social Networks
What did we see in the last lecture?
Presentation transcript:

Small-World Navigability Alexandru Moga @ Seminar in Distributed Computing

Talk about a small world… Zurich, CH Hunedoara, RO Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

From cliché to social networks Milgram’s Experiment and The Small World Hypothesis Boston, MA Omaha, NE Wichita, KS Great distance in the US: socially and geographically. Human society is a small-world type network characterized by short length paths Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

From social networks to CS Models and Algorithms Experimental studies Impact in Computer Science? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Small-world phenomenon Six degrees of separation “We are all linked by short chains of acquaintance” Watts-Strogatz model Pervasive in networks arising in nature and technology Fundamental factor in the evolution of WWW Kleinberg: People can find short paths very effectively Can we put an algorithmic price on that? Six handshakes to the President/Six degrees of Kevin Bacon Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Small world characteristics Long-range edges (few random shortcuts) Local edges (many) 1 4 2 3 5 Superposition of structured subgraph (typical network structures: lattices, hierarchies, groups) with a random subgraph ~ geographical interpretation Network is generated using some probability distribution function (related to the long-range edges) Low network diameter (like uniform random networks) Clustering: neighbours of x are also neighbours What is a good network model that exhibits such characteristics? B A High clustering Short paths What is a good network model that exhibits such characteristics? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Navigation Estimated distance to target (global) Acquaintanceship/Friendship Source s z dzt x dyt Target t y dwt Estimated distance has a global character (may not be a real distance) Chain length has a local character (real distance) Pr(x,y)=f(d(x,y)) for long-range edges w Decentralized search (local) Greedy search dyt = min{x’s neighbours} Can we effectively navigate from s to t given a network model? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

The Watts-Strogatz model Re-wired ring lattice Long-range edges (probability ) Local edges (K-nearest neighbors) 2 1 9 Take a regular lattice, add random links, and you get a small-world Example K=4, beta=0.2 19 Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Kleinberg’s model N Local edges(p) AE := d(A,E) ≤ p E v D A B N C Two-dimensional grid network structure (n x n square) Set of nodes {(i,j) : i  {1,…,n}, j  {1,…,n}} N=6, p=1,q=0 w Z Long-range edges(q) Pr(AZ) ~ 1/[d(A,Z)] Inverse th-power distribution u t Lattice distance d(A,Z) = |t-u| + |w-v| Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Which  yields an effectively navigable network? Clustering exponent  Family of network models with parameter   = 0  > 0 Long-range contacts chosen independently of their position (~Watts-Strogatz model) Long-range contacts tend to cluster in the nodes’ vecinity Which  yields an effectively navigable network? Polylogarithmic: T grows slowly while N increases Expected delivery time T Expected number of steps to reach the destination Shortness (small T) of paths is defined as polylogarithmic Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Navigability in Kleinberg’s model T > N  = 0  = 2 Inverse-square distribution (1/d2) is the unique distribution that allows polylogarythmic T < log2N Alfa=2: Target area progressively closes as new nodes are added to the chain Generalization For a k-dimensional lattice, paths are polylogarithmic iff  = k Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Inverse-square distribution Last phase t At most logN steps 2j Phase j 2j+1 s  ~logN phases Initial phase Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Plausible social structures (Watts et al.) Individuals have identities World is partitioned hierarchically (cognitively) Group management is easier (typically 100 individuals) Similarity of individuals Branching factor Depth l.c.a.(i,j) Group size Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Plausible social structures Network structure Pr(acquaintance) decreases with decreasing similarity Choose i and a link distance with Pr(x) = ce-x Choose j that is in distance x from i Continue until individuals have an average of z friends x = 1 x = 2 x = 3  - shows homophily e- << 1: cliques e- = b: uniform random graph Uniform random graph: everybody is equally likely to link to anybody. Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Plausible social structures Social world is multi-dimensional (H) Each dimension corresponds to an independent hierarchical division (e.g. geography, occupation) Node identity: H-dimensional vector Perceived similarity yields “social distance” Minimum similarity across all dimensions xij = 4 yij = 1 yjk = 1 yik = 4 yij +yjk < yik !!! xij = 1 Same nodes, different hierarchies Graphs build according to each hierarchy (superposition of graphs) If i and j are very similar in a hierarchy, thery are also close socially Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Searchability with social distance N increases Searchable networks in the H- space Comparison to original Milgram experiment H=2, =1 Experimental setting similar to Milgram’s experiment q= the probability of a chain reaching the target Average z: avg no of friends in all dimensions (larger H implies fewer ties per dimension) Upper limit of H: random walk Individuals are basically homophilous Similarity is judged along more than 1 dimenations (2-3) L~6.5 (Milgram) vs. L~6.7 Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Experimental studies Real-world social networks Large-scale Geography and occupation are crucial Network structure alone may not be sufficient Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Geography in small-world networks (Nowell et al.) LiveJournal online community ~500.000 bloggers located in US Friendship-based network Global routing with GEOGREEDY What is the importance of geography in navigation? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

What is the relation between geography and friendship? GEOGREEDY simulation 80% of chains completed with avg. length of 16.74 13% of chains completed with avg. length of 4.12 Original experiment Chain fails if u has no other friend v that is close to the target Modified experiment If d(v,t)>d(u,t), u picks a random person from the same city to continue 80% of chains completed with avg. length of 16.74 Geography seems to be enough  surface similarity to Milgram’s experiment (18% of messages completed) 13 % < completion rate < 80% What is the relation between geography and friendship? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Geographic friendship probability PrKleinberg() ~ 1/ 2 +50.000 people Ithaca, NY P(delta) : proportion of pairs separated by delta/ or the probability of u,v within distance delta to be friends Ill-approximation using Kleinberg model with pure geographic distance/network is still searchable PrLiveJournal() ~1/  , ~1 LiveJournal network exhibits large variance in population density What is a good interpretation of geographic friendship? Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Rank-based friendship Rural Iowa Manhattan ranku(v) := |{w:d(u,w) < d{u,v}}| Pr[u  v] ~ 1/ranku(v) Figure: much better approximation of population distribution Theorem: Analytic evidence In a network formed by rank-based friendship, GEOGREEDY can find short paths (polylogarithmic) Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Navigability in global social networks (Dodds et al.) Routing in the LiveJournal community Geography and occupation are the most important factors in establishing short chains ~70% Source Destination geography-based non-geography-based Search often transitions from geography-based (global/coarse-grained) to non-geography-based (local/fine-grained) LiveJournal: only 2 thirds were geography-based friendships. Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

E-mail replication experiment Human participants (not simulated) ~100k individuals, 18 targets in 13 countries Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Geography vs. occupation Geography matters more in the early stages of the chain (3 steps) People were also asked why they considered their chosen friend as a good recipient. Occupation clearly takes over in the later stages Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Results of the study Without enough incentives, the small-world hypothesis may not hold E.g. Target 5 (university prof.) accounted for 44% of the completed chains  good reachability Network structure alone is not enough Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Case study: Freenet P2P system Freenet Collaborating group of Internet nodes Overlay special-purpose network Application-level routing Freenet Distributed anonymous information storage and retrieval Unstructured system Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Case study: Freenet File ids File caching on the return path Typical cache replacement policy: LRU Backtracking Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Case study: Freenet At low load: At high load: Freenet network shown to evolve into a “small-world” (high clustering + logarithmic paths) At high load: Frequent local caching actions Clusters may break  small-world hypothesis might not hold Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Case study: Freenet Enhanced-clustering cache replacement policy Preserve key clustering in the cache Each node chooses a seed s(x) randomly from the key space At node x (datastore full) key u arrives choose v which is farthest from the seed Distance(u, seed) ≤ Distance(v, seed): cache u, evict v, create entry for u Distance(u, seed) > Distance(v, seed): cache u, evict v, create entry for u with probability p (randomness) Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Case study: Freenet Empirical results Analytically f(d(x,y)) ~ 1/d(x,y) = 1/|sx-sy| Expected delivery time: O(log2n) * Analytical evidence Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Other applications Crawling the WWW On-line search in the unknown Supercomputing Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

Conclusion A small-world network is characterized by: High clustering of nodes “Short” paths Small-world phenomenon has two sides Existential and Algorithmic Take a structured network and add random links with probability depending on distance (position in the network) Unsupervised networks are generally small-worlds Alexandru Moga @ Seminar in Distributed Computing 9/17/2018

References Identity and Search in Social Networks Watts, D.J. and Dodds, P.S. and Newman, MEJ In Science 2002. Small-World Phenomena and the Dynamics of Information, Kleinberg, J. In NIPS 2002. Small-world Phenomena: An algorithmic perspective, Kleinberg, 2000 Navigation in a Small World, Kleinberg, Nature 406 (2000) Geographic routing in social networks Liben-Nowell, D. and Novak, J. and Kumar, R. and Raghavan, P. and Tomkins, A. PNAS 2005. An Experimental Study of Search in Global Social Networks Dodds, P.S. and Muhamad, R. and Watts, D.J. In Science 2003. The Small World Web, L. Adamic, 1999 Growing and Navigating the Small World Web by Local Content, F. Menczer, 2002 Using the Small-World Model to Improve Freenet Performance, Zhang et al. Alexandru Moga @ Seminar in Distributed Computing 9/17/2018