Connectivity and the Small World Overview Background: de Pool and Kochen: Random & Biased networks Rapoport’s work on diffusion Travers and Milgram Argument.

Slides:



Advertisements
Similar presentations
Complex Network Theory
Advertisements

Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
‘Small World’ Networks (An Introduction) Presenter : Vishal Asthana
Small-world networks.
Collective Dynamics of ‘Small World’ Networks C+ Elegans: Ilhan Savut, Spencer Telford, Melody Lim 29/10/13.
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Where we are Node level metrics Group level metrics Visualization
School of Information University of Michigan Network resilience Lecture 20.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Advanced Topics in Data Mining Special focus: Social Networks.
Complex Networks Structure and Dynamics Ying-Cheng Lai Department of Mathematics and Statistics Department of Electrical Engineering Arizona State University.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Models of Influence in Online Social Networks
Connectivity and the Small World Overview Background: de Pool and Kochen: Random & Biased networks Rapoport’s work on diffusion Travers and Milgram Argument.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Small World Problem Christopher McCarty. Small World Phenomenon You meet someone, seemingly randomly, who has a connection to someone you know – Person.
Diffusion 1)Structural Bases of Social Network Diffusion 2)Dynamic limitations on diffusion 3)Implications / Applications in the diffusion of Innovations.
Connectivity & Cohesion Overview Background: Small World = Connected What distinguishes simple connection from cohesion? Moody & White Bearman, Faris &
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Centrality in undirected networks These slides are by Prof. James Moody at Ohio State.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Online Social Networks and Media
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
"Social Networks, Cohesion and Epidemic Potential" James Moody Department of Sociology Department of Mathematics Undergraduate Recognition Ceremony May.
Neural Network of C. elegans is a Small-World Network Masroor Hossain Wednesday, February 29 th, 2012 Introduction to Complex Systems.
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Hierarchy Overview Background: Hierarchy surrounds us: what is it? Micro foundations of social stratification Ivan Chase: Structure from process Action.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
What Is A Network? (and why do we care?). An Introduction to Network Theory | Kyle Findlay | SAMRA 2010 | 2 “A collection of objects (nodes) connected.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
How Do “Real” Networks Look?
Structural Holes & Weak Ties
11 Network Level Indicators Bird’s eye view of network Image matrix example of network level Many network level measures Some would argue this is the most.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Informatics tools in network science
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Contagion in Networks Networked Life NETS 112 Fall 2015 Prof. Michael Kearns.
Connectivity and the Small World
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
How Do “Real” Networks Look?
Social Balance & Transitivity
Networked Life NETS 112 Fall 2018 Prof. Michael Kearns
Network Science: A Short Introduction i3 Workshop
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Networked Life NETS 112 Fall 2017 Prof. Michael Kearns
Small World Networks Scotty Smith February 7, 2007.
How Do “Real” Networks Look?
Networked Life NETS 112 Fall 2016 Prof. Michael Kearns
Network Science: A Short Introduction i3 Workshop
Local Clustering Coefficient
Advanced Topics in Data Mining Special focus: Social Networks
Networked Life NETS 112 Fall 2019 Prof. Michael Kearns
Presentation transcript:

Connectivity and the Small World Overview Background: de Pool and Kochen: Random & Biased networks Rapoport’s work on diffusion Travers and Milgram Argument Method Watts Argument Findings Methods: Biased Networks Reachability Curves Calculating L and C

Connectivity and the Small World Started by asking the probability than any two people would know each other. Extended to the probability that people could be connected through paths of 2, 3,…,k steps Linked to diffusion processes: If people can reach others, then their diseases can reach them as well, and we can use the structure of the network to model the disease. The reachability structure was captured by comparing curves with a random network, which we will do later today.

Connectivity and the Small World Travers and Milgram’s work on the small world is responsible for the standard belief that “everyone is connected by a chain of about 6 steps.” Two questions: Given what we know about networks, what is the longest path (defined by handshakes) that separates any two people? Is 6 steps a long distance or a short distance?

Longest Possible Path: Two Hermits on the opposite side of the country About steps. OH Hermit Mt. Hermit Store Owner Store Owner Truck Driver Truck Driver Manager Corporate Manager Corporate Manager Corporate President Corporate President Congress Rep. Congress Rep.

What if everyone maximized structural holes? Associates do not know each other: Results in an exponential growth curve. Reach entire planet quickly.

What if people know each other randomly? Random graph theory shows us that we could reach people quite quickly if ties were random.

Random Reachability: By number of close friends

Milgram’s test: Send a packet from sets of randomly selected people to a stockbroker in Boston. Experimental Setup: Arbitrarily select people from 3 pools: a) People in Boston b) Random in Nebraska c) Stockholders in Nebraska

Milgram’s Findings: Distance to target person, by sending group.

Most chains found their way through a small number of intermediaries. What do these two findings tell us of the global structure of social relations?

Milgram’s Findings: Length of completed chains

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon Asks why we see the small world pattern and what implications it has for the dynamical properties of social systems. His contribution is to show that globally significant changes can result from locally insignificant network change.

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon Watts says there are 4 conditions that make the small world phenomenon interesting: 1) The network is large - O(Billions) 2) The network is sparse - people are connected to a small fraction of the total network 3) The network is decentralized -- no single (or small #) of stars 4) The network is highly clustered -- most friendship circles are overlapping

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon Formally, we can characterize a graph through 2 statistics. 1) The characteristic path length, L The average length of the shortest paths connecting any two actors. (note this only works for connected graphs) 2) The clustering coefficient, C Version 1: the average local density. That is, C v = ego-network density, and C = C v /n Version 2: transitivity ratio. Number of closed triads divided by the number of closed and open triads. A small world graph is any graph with a relatively small L and a relatively large C.

The most clustered graph is Watt’s “Caveman” graph:

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon Degree (k) Clustering Coefficient Characteristic Path Length C and L as functions of k for a Caveman graph of n=1000

Compared to random graphs, C is large and L is long. The intuition, then, is that clustered graphs tend to have (relatively) long characteristic path lengths. But the small world phenomenon rests on just the opposite: high clustering and short path distances. How is this so? Duncan Watts: Networks, Dynamics and the Small-World Phenomenon

A model for pair formation, as a function of mutual contacts formations. Duncan Watts: Networks, Dynamics and the Small-World Phenomenon Using this equation,  produces networks that range from completely ordered to completely random. (M ij is the number of friends in common, p is a baseline probability of a tie, and k is the average degree of the graph)

A model for pair formation, as a function of mutual contacts formations. Duncan Watts: Networks, Dynamics and the Small-World Phenomenon

C=Large, L is Small = SW Graphs

Why does this work? Key is fraction of shortcuts in the network In a highly clustered, ordered network, a single random connection will create a shortcut that lowers L dramatically Watts demonstrates that Small world graphs occur in graphs with a small number of shortcuts

Empirical Examples 1) Movie network: Actors through Movies L o /L r = 1.22C o /C r = ) Western Power Grid: L o /L r = 1.50C o /C r = 16 3) C. elegans L o /L r = 1.17C o /C r = 5.6

What are the substantive implications? Return to the initial interest in connectivity: disease diffusion 1) Diseases move more slowly in highly clustered graphs (fig. 11) - not a new finding. 2) The dynamics are very non-linear -- with no clear pattern based on local connectivity. Implication: small local changes (shortcuts) can have dramatic global outcomes (disease diffusion)

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon How do we know if an observed graph fits the SW model? Random expectations: For basic one-mode networks (such as acquaintance nets), we can get approximate random values for L and C as: L random ~ ln(n) / ln(k) C random ~ k / n As k and n get large. Note that C essentially approaches zero as N increases, and K is assumed fixed. This formula uses the density-based measure of C.

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon How do we know if an observed graph fits the SW model? One problem with using the simple formulas for most extant data on large graphs is that, because the data result from people overlapping in groups/movies/publications, necessary clustering results from the assignment to groups. G1 G2 G3 G4 G5 Amy Billy Charlie Debbie Elaine Frank George LINES CUT..... William Xavier Yolanda Zanfir

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon How do we know if an observed graph fits the SW model? Newman, M. E. J.; Strogatz, S. J., and Watts, D. J. “Random Graphs with arbitrary degree distributions and their applications” Phys. Rev. E This paper extends the formulas for expected clustering and path length using a generating functions approach, making it possible to calculate E(C,L) for graphs with any degree distribution. Importantly, this procedure also makes it possible to account for clustering in a two-mode graph caused by the distribution of assignment to groups.

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon How do we know if an observed graph fits the SW model? Newman, M. E. J.; Strogatz, S. J., and Watts, D. J. “Random Graphs with arbitrary degree distributions and their applications” Phys. Rev. E Where N is the size of the graph, Z 1 is the average number of people 1 step away (degree) and z2 is the average number of people 2 steps away. Theoretically, these formulas can be used to calculate many properties of the network – including largest component size, based on degree distributions. A word of warning: The math in these papers is not simple, sharpen your calculus pencil before reading the paper…

Duncan Watts: Networks, Dynamics and the Small-World Phenomenon How do we know if an observed graph fits the SW model? Since C is just the transitivity ratio, there are a number of good formulas for calculating the expected value. Using the ratio of complete to (incomplete + complete) triads, we can use the expected values from the triad distribution in PAJEK for a simple graph or we can use the expected value conditional on the dyad types (if we have directed data) using the formulas in SPAN and W&F.

The line of work most closely related to the small world is that on biased and random networks. Recall the reachability curves in a random graph: 0 20% 40% 60% 80% 100% Percent Contacted Remove Degree = 4 Degree = 3 Degree = 2 Random Reachability: By number of close friends

For a random network, we can estimate the trace curves with the following equation: Where P i is the proportion of the population newly contacted at step i, X i is the cumulative number contacted by step i, and a is the mean number of contacts people have. This model describes the reach curves for a random network. The model is based on a, which (essentially) tells us how many new people we will reach from the new people we just contacted. This is based on the assumption that people’s friends know each other at a simple random rate.

For a real network, people’s friends are not random, but clustered. We can modify the random equation by adjusting a, such that some portion of the contacts are random, the rest not. This adjustment is a ‘bias’ - I.e. a non-random element in the model -- that gives rise to the notion of ‘biased networks’. People have studied (mathematically) biases associated with: Race (and categorical homophily more generally) Transitivity (Friends of friends are friends) Reciprocity (i--> j, j--> i) There is still a great deal of work to be done in this area empirically, and it promises to be a good way of studying the structure of very large networks.

Figure 1. Connectivity Distribution for a large Jr. High School (Add Health data) Random graph Observed

How useful are C & L for characterizing a network? These two graphs both have high C

To calculate Average Path Length and Clustering in UCINET 1)Load the network 2)To keep w. Watts, make the network symmetric Transform > Symmetrasize > Maximum Note what you saved the graph as 3)Calculate clustering coefficient Network > Network Properties > Clustering Coefficient The local density version is the “overall clustering coefficient” The transitivity version is the “weighted clustering coefficient” Clustering Coefficient

To calculate Average Path Length and Clustering in UCINET 1)Load the network 2)To keep w. Watts, make the network symmetric Transform > Symmetrasize > Maximum Note what you saved the graph as 3)Calculate Distance Network  cohesion  Distance Tools  Statistics  Univariate  Matrix Average Length