Class 5: Random Graphs – Small Worlds Network Science: Random Graphs 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino.

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
It’s a Small World by Jamie Luo. Introduction Small World Networks and their place in Network Theory An application of a 1D small world network to model.
Week 4 – Random Graphs Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Random Networks Network Science: Graph Theory 2012.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Advanced Topics in Data Mining Special focus: Social Networks.
SDSC, skitter (July 1998) A random graph model for massive graphs William Aiello Fan Chung Graham Lincoln Lu.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
TELCOM2125: Network Science and Analysis
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
Six Degrees of Seperation. Budapest Frigyes Karinthy (Writer) Collections of poems and prose „Láncszemek“ or „Chains“ -A worker in Ford‘s factory and.
Peer-to-Peer and Social Networks Introduction. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Random Graph Models of Social Networks Paper Authors: M.E. Newman, D.J. Watts, S.H. Strogatz Presentation presented by Jessie Riposo.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Class 3: Random Graphs Network Science: Random Graphs 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino.
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.
Online Social Networks and Media
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Application of statistical physics to random graph models of networks Sameet Sreenivasan Advisor: H. Eugene Stanley.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
How Do “Real” Networks Look?
Properties of Growing Networks Geoff Rodgers School of Information Systems, Computing and Mathematics.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Random Network IST402 – Network Science Acknowledgement: Laszlo Barabasi.
School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2013 Figures are taken.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Connectivity and the Small World
Discussion of six degrees of separation theory using induction
Social Networks Analysis
Topics In Social Computing (67810)
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Peer-to-Peer and Social Networks Fall 2017
How Do “Real” Networks Look?
Lecture 9: Network models CS 765: Complex Networks
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Class 5: Random Graphs – Small Worlds Network Science: Random Graphs 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr. Mauro Martino

SIX DEGREES small worlds Frigyes Karinthy, 1929 Stanley Milgram, 1967 Peter Jane Sarah Ralph Network Science: Random Graphs 2012

SIX DEGREES 1929: Frigyes Kartinthy Frigyes Karinthy ( ) Hungarian Writer “Look, Selma Lagerlöf just won the Nobel Prize for Literature, thus she is bound to know King Gustav of Sweden, after all he is the one who handed her the Prize, as required by tradition. King Gustav, to be sure, is a passionate tennis player, who always participates in international tournaments. He is known to have played Mr. Kehrling, whom he must therefore know for sure, and as it happens I myself know Mr. Kehrling quite well.” "The worker knows the manager in the shop, who knows Ford; Ford is on friendly terms with the general director of Hearst Publications, who last year became good friends with Arpad Pasztor, someone I not only know, but to the best of my knowledge a good friend of mine. So I could easily ask him to send a telegram via the general director telling Ford that he should talk to the manager and have the worker in the shop quickly hammer together a car for me, as I happen to need one." 1929: Minden másképpen van (Everything is Different) Láncszemek (Chains) Network Science: Random Graphs 2012

SIX DEGREES 1967: Stanley Milgram HOW TO TAKE PART IN THIS STUDY 1.ADD YOUR NAME TO THE ROSTER AT THE BOTTOM OF THIS SHEET, so that the next person who receives this letter will know who it came from. 2.DETACH ONE POSTCARD. FILL IT AND RETURN IT TO HARVARD UNIVERSITY. No stamp is needed. The postcard is very important. It allows us to keep track of the progress of the folder as it moves toward the target person. 3.IF YOU KNOW THE TARGET PERSON ON A PERSONAL BASIS, MAIL THIS FOLDER DIRECTLY TO HIM (HER). Do this only if you have previously met the target person and know each other on a first name basis. 4.IF YOU DO NOT KNOW THE TARGET PERSON ON A PERSONAL BASIS, DO NOT TRY TO CONTACT HIM DIRECTLY. INSTEAD, MAIL THIS FOLDER (POST CARDS AND ALL) TO A PERSONAL ACQUAINTANCE WHO IS MORE LIKELY THAN YOU TO KNOW THE TARGET PERSON. You may send the folder to a friend, relative or acquaintance, but it must be someone you know on a first name basis. Network Science: Random Graphs 2012

SIX DEGREES 1991: John Guare "Everybody on this planet is separated by only six other people. Six degrees of separation. Between us and everybody else on this planet. The president of the United States. A gondolier in Venice…. It's not just the big names. It's anyone. A native in a rain forest. A Tierra del Fuegan. An Eskimo. I am bound to everyone on this planet by a trail of six people. It's a profound thought. How every person is a new door, opening up into other worlds." Network Science: Random Graphs 2012

WWW: 19 DEGREES OF SEPARATION Image by Matthew Hurst Blogosphere Network Science: Random Graphs 2012

DISTANCES IN RANDOM GRAPHS Random graphs tend to have a tree-like topology with almost constant node degrees. nr. of first neighbors: nr. of second neighbors: nr. of neighbours at distance d: estimate maximum distance: Network Science: Random Graphs 2012

Given the huge differences in scope, size, and average degree, the agreement is excellent. DISTANCES IN RANDOM GRAPHS compare with real data Network Science: Random Graphs 2012

EVOLUTION OF A RANDOM GRAPH Network Science: Random Graphs 2012

Until now we focused on the static properties of a random graph with fixes p value. What happens when vary the parameter p? EVOLUTION OF A RANDOM NETWORK GOTO Choose Nodes=100. Note that the p goes up in increments of 0.001, which, for N=100, L=pN(N-1)/2~p*50,000, i.e. each increment is really about 50 new lines. Network Science: Random Graphs 2012

EVOLUTION OF A RANDOM NETWORK disconnected nodes  NETWORK. How does this transition happen? Network Science: Random Graphs 2012

Let us denote with u=1-N g /N, i.e. the fraction of nodes that are NOT part of the giant component (GC) N g. For a node i to be part of the GC, it needs to connect to it via another node j. If i is NOT part of the GC, that could happen for two reasons: Case A: node i does not connect to node j, Probability: 1-p Case B: node i connects to j, but j is not connected to the GC: Probability: pu Total probability that i is not part of the GC via node j is: 1-p+pu The probability that i is not linked to the GC via any other node is (1-p+pu) N-1 Hence: For any p and N this equation provides the size of the giant component as N GC =N(1-u ) THE PHASE TRANSITION TAKES PLACE AT =1 Network Science: Random Graphs 2012

Using p= /(N-1) and taking the log of both sides and using <<N we obtain: Taking an exponential of both sides we obtain Or, if we denote with S the fraction of nodes in the giant component, S=N GC /N, i.e. S=1-u Erdos and Renyi, 1959 EVOLUTION OF A RANDOM GRAPH Network Science: Random Graphs 2012

(a)(b) THE PHASE TRANSTION IN A RN TAKES PLACE AT =1 S: the fraction of nodes in the giant component, S=N g /N after Newman, 2010 Phase transition point: Set S=0, we obtain a phase transition at =1

EVOLUTION OF A RANDOM NETWORK disconnected nodes  NETWORK. How does this transition happen? Network Science: Random Graphs 2012

Analytical result Numerical result EVOLUTION OF A RANDOM GRAPH Network Science: Random Graphs 2012

CLUSTER SIZE DISTRIBUTION Probability that a randomly selected node belongs to a cluster of size s: At the critical point =1 The distribution of cluster sizes at the critical point, displayed in a log-log plot. The data represent an average over 1000 systems of sizes The dashed line has a slope of Derivation in Newman, 2010 Network Science: Random Graphs 2012

I: Subcritical < 1 III: Supercritical > 1 IV: Connected > ln N II: Critical = 1 =0.5 =1 =3 =5 N=100

I: Subcritical < 1 p < p c =1/N No giant component. N-L isolated clusters, cluster size distribution is exponential The largest cluster is a tree, its size ~ ln N

II: Critical = 1 p=p c =1/N Unique giant component: N G ~ N 2/3  contains a vanishing fraction of all nodes, N G /N~N -1/3  Small components are trees, GC has loops. Cluster size distribution: p(s)~s -3/2 A jump in the cluster size: N=1,000  ln N~ 6.9; N 2/3 ~95 N=  ln N~ 22; N 2/3 ~3,659,250

=3 Unique giant component: N G ~ (p-p c )N  GC has loops. Cluster size distribution: exponential III: Supercritical > 1 p > p c =1/N

IV: Connected > ln N p > (ln N)/N =5 Only one cluster: N G =N  GC is dense. Cluster size distribution: None Network Science: Random Graphs 2012

IV: Connected > ln N p > (ln N)/N The probability that a node does not connect to the giant component is (1-p) N G ~(1-p) N The expected number of such nodes is: For a sufficiently large p we are left with only one disconnected node, i.e. C=1. Network Science: Random Graphs 2012

I: Subcritical < 1 III: Supercritical > 1 IV: Connected > ln N II: Critical = 1 =0.5 =1 =3 =5 N=100

Since edges are independent and have the same probability p, The clustering coefficient of random graphs is small. For fixed degree C decreases with the system size N. CLUSTERING COEFFICIENT from Newman 2010 This is valid for random networks only, with arbitrary degree distribution Network Science: Random Graphs 2012

Degree distribution Binomial, Poisson (exponential tails) Clustering coefficient Vanishing for large network sizes Average distance among nodes Logarithmically small Erdös-Rényi MODEL (1960) Network Science: Random Graphs 2012

HISTORICAL NOTE Network Science: Random Graphs 2012

1951, Rapoport and Solomonoff:  first systematic study of a random graph.  demonstrates the phase transition.  natural systems: neural networks; the social networks of physical contacts (epidemics); genetics. Why do we call it the Erdos-Renyi random model? HISTORICAL NOTE Anatol Rapoport Edgar N. Gilbert (b.1923) 1959: G(N,p) Network Science: Random Graphs 2012

Erdös-Rényi MODEL (1960) Network Science: Random Graphs 2012

Are real networks like random graphs? Network Science: Random Graphs 2012

As quantitative data about real networks became available, we can compare their topology with the predictions of random graph theory. Note that once we have N and for a random network, from it we can derive every measurable property. Indeed, we have: Average path length: Clustering Coefficient: Degree Distribution: ARE REAL NETWORKS LIKE RANDOM GRAPHS? Network Science: Random Graphs 2012

Real networks have short distances like random graphs. Prediction:Data: PATH LENGTHS IN REAL NETWORKS Network Science: Random Graphs 2012

Prediction:Data: C rand underestimates with orders of magnitudes the clustering coefficient of real networks. CLUSTERING COEFFICIENT Network Science: Random Graphs 2012

Prediction: Data: (a)Internet; (b) Movie Actors; (c)Coauthorship, high energy physics; (d) Coauthorship, neuroscience THE DEGREE DISTRIBUTION Network Science: Random Graphs 2012

As quantitative data about real networks became available, we can compare their topology with the predictions of random graph theory. Note that once we have N and for a random network, from it we can derive every measurable property. Indeed, we have: Average path length: Clustering Coefficient: Degree Distribution: ARE REAL NETWORKS LIKE RANDOM GRAPHS? Network Science: Random Graphs 2012

(A) Problems with the random network model: -the degree distribution differs from that of real networks -the giant component in most real network does NOT emerge through a phase transition -the clustering coefficient in most systems will now vanish, as predicted by the model. (B) Most important: we need to ask ourselves, are real networks random? The answer is simply: NO Hence it is IRRELEVANT. There is no network in nature that we know of that would be described by the random network model. A  WRONG B  IRRELEVANT IS THE RANDOM GRAPH MODEL RELEVANT TO REAL SYSTEMS? Network Science: Random Graphs 2012

It is the reference model for the rest of the class. It will help us calculate many quantities, that can then be compared to the real data, understanding to what degree is a particular property the result of some random process. Organizing principles: patterns in real networks that are shared by a large number of real networks, yet which deviate from the predictions of the random network model. In order to identify these, we need to understand how would a particular property look like if it is driven entirely by random processes. While WRONG and IRRELEVANT, it will turn out to be extremly USEFUL! IF IT IS WRONG AND IRRELEVANT, WHY DID WE DEVOT TO IT A FULL CLASS? Network Science: Random Graphs 2012

NETWORK DATA: SCIENCE COLLABORATION NETWORKS Erdos: 1,400 papers 507 coauthors Einstein: EN=2 Paul Samuelson EN=5 …. ALB: EN: 3 Network Science: Random Graphs 2012

NETWORK DATA: SCIENCE COLLABORATION NETWORKS Collaboration Network: Nodes: Scientists Links: Joint publications Physical Review: 1893 – N=449,673 L=4,707,958 See also Stanford Large Network database Network Science: Random Graphs 2012