Eurecom, Sophia-Antipolis Thrasyvoulos Spyropoulos / Part II: Complex Networks Empirical Properties and Metrics.

Slides:



Advertisements
Similar presentations
Complex Networks Advanced Computer Networks: Part1.
Advertisements

Scale Free Networks.
Analysis and Modeling of Social Networks Foudalis Ilias.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Network Measurements.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
CS728 Lecture 5 Generative Graph Models and the Web.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Eurecom, Sophia-Antipolis Thrasyvoulos Spyropoulos / Random Graph Models: Create/Explain Complex Network Properties.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
Mining and Searching Massive Graphs (Networks)
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
CS Lecture 6 Generative Graph Models Part II.
Advanced Topics in Data Mining Special focus: Social Networks.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Data Mining for Complex Network Introduction and Background.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.
Online Social Networks and Media
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
How Do “Real” Networks Look?
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Network (graph) Models
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Lecture 1: Complex Networks
Topics In Social Computing (67810)
How Do “Real” Networks Look?
Network Science: A Short Introduction i3 Workshop
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Models of Network Formation
Models of Network Formation
Peer-to-Peer and Social Networks Fall 2017
Models of Network Formation
How Do “Real” Networks Look?
Models of Network Formation
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Eurecom, Sophia-Antipolis Thrasyvoulos Spyropoulos / Part II: Complex Networks Empirical Properties and Metrics

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 2 “Networks, Crowds, and Markets: Reasoning About a Highly Connected World” by D. Easley and T. Kleinberg (“NCM”: publicly available online) · “Networks: An Introduction” by M. Newman – (“Networks”: shared copies in library) Networked Life: 20 Questions and Answers by M.Chiang (some chapters - shared copies in library)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  A set of “nodes”  Humans, routers, web pages, telephone switches, airports, proteins, scientific articles …  Relations between these nodes  humans: friendship/relation or online friendship  routers, switches: connected by a communication link  web pages: hyperlinks from one to other  airports: direct flights between them  articles: one citing the other  proteins: link if chemically interacting  Network often represented as a graph:  vertex = node  link  relation (weight  strength) 3

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 4 The social network of friendships within a 34-person karate club provides clues to the fault lines that eventually split the club apart (Zachary, 1977)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 5 High school dating Peter S. Bearman, James Moody and Katherine Stovel Chains of affection: The structure of adolescent romantic and sexual networks American Journal of Sociology (2004) Image drawn by Mark Newman

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Mostly done by Social Scientists  Interested in Human (Social) Networks  Spread of Diseases, Influence, etc.  Methodology: Questionnaires  cumbersome, (lots of) bias  Network Size: 10s or at most 100s 6

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 7 flows amongst a large project team. Colors denote each participant’s department

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 8

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 9

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The study of large networks coming from all sorts of diverse areas  We will focus on technological (e.g. Internet) and information networks (e.g. Web, Facebook)  Cannot visually observe such networks (as in the case of old social networks of few 10s of nodes)  need ways to measure them, and quantify their properties  The field is often called Social Networks or Network Science or Network Theory  Question 1: What are the statistical properties of real networks?  Connectivity, paths lengths, degree distributions  How do we measure such huge networks  sampling  Question 2: Why do these properties arise?  Models of large networks: random graphs  Deterministic ways too complex/restrictive  Question 3: How can we take advantage of these properties?  Connectivity (epidemiology, resilience)  Spread (information, disease)  Search (Web page, person) 10

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  There are a lot of different properties we might be interested in  also depends on application  But there are some commonly studied properties for 2 reasons: 1. These properties are important for key applications 2. The majority of networks exhibit surprising similarities with respect to these properties. 1. Degree distribution (“scale free structure”) 2. Path length (“small world phenomena”) 3. Clustering (“community structure”) 11

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Problem: find the probability distribution that best fits the observed data degree frequency k fkfk f k = fraction of nodes with degree k = probability of a randomly selected node to have degree k

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Probability of having k neighbors  Identified by a line in the log-linear plot p(k) = λe -λk log p(k) = - λk + log λ degree log frequency λ

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Right-skewed/Heavy-tail distribution  there is a non-negligible fraction of nodes that has very high degree (hubs)  scale-free: f(ax) = bf(x), no characteristic scale, average is not informative p(k) = Ck -α  Power-law distribution gives a line in the log-log plot  α : power-law exponent (typically 2 ≤ α ≤ 3) log p(k) = -α logk + logC

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis loglog This difference is particularly obvious if we plot them on a log vertical scale: for large x there are orders of magnitude differences between the two functions. Network Science: Scale-Free Property February 7, 2011

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 16 Internet backbone and regional connectivity Multi-tier AS topology Gateway Routers inside ASs

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 17  Holds for both AS and Router topologies

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 18

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 19 α : power-law exponent (typically 2 ≤ α ≤ 3)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  d ij = shortest path between i and j  Diameter:  Average path length:  Also of interest: distribution of all shortest paths

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  A total of n nodes arranged in a grid  Only neighbors (up,down,left,right) connected Q: What is the diameter of the network? A: 2 -1 Q: What is the avg. distance?  i.e. picking two nodes randomly A: It is in the order of (i.e. c ) 21

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  n wireless nodes in an area of 1x1  Each transmits at distance R  R must be at least for connectivity Q: Choose two random nodes: What is the expected hop count (distance) between them? A: 22

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Letters were handed out to people in Nebraska to be sent to a target in Boston  People were instructed to pass on the letters to someone they knew on first-name basis  ~60 letters, only about 35% delivered  The letters that reached the destination followed paths of length around 6  Six degrees of separation: (play of John Guare)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  In 2001, Duncan Watts, a professor at Columbia University, recreated Milgram's experiment using an message as the “package" that needed to be delivered.  Surprisingly, after reviewing the data collected by 48,000 senders and 19 targets in 157 different countries, Watts found that again the average number of intermediaries was 6.

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis A Few Good Men Robert Wagner Austin Powers: The spy who shagged me Wild Things Let’s make it legal Barry Norton What Price Glory Monsieur Verdoux

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis (statistics from IMDB)  ~ linkable actors  Average (path length) = 3  99% of actors less than 6 hops  Try your own actor here: 26

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Legendary mathematician Paul Erdos,  around 1500 papers and 509 collaborators  Collaboration Graph: link between two authors who wrote a paper together  Erdos number of X: hop count between Erdos and author X in collaboration graph  ~260,000 in connected component 27 Kostas Psounis T. Spyropoulos

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 28 Number of AS traversed by an message ~35000 nodes Avg. path ~ 5! Number of routers traversed by an message > Avg. path ~ 15 plots taken from R. V. Hofstad

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 29

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Milgram’s experiment => Small World Phenomenon  Short paths exist between most nodes: Path length l << total nodes N (e.g line network: path length l = O(N)) 30 “Small world” = avg. path length l is at most logN

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Measures the density of triangles (local clusters) in the graph  Two different ways to measure it:  The ratio of the means

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Clustering coefficient for node i  The mean of the ratios

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The two clustering coefficients give different measures  C (2) increases with nodes with low degree

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 36

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Most real networks have… 1. Short paths between nodes (“small world”) 2. Transitivity/Clustering coefficient that is finite > 0 3. Degree distribution that follows a power law 37 Q1. Can we design graph models that exhibit similar characteristics? Q2. Can we explain how/why these phenomena occur in the first place? Q3. Can we take advantage of these properties (e.g. searching, advertising, viral infection/immunization, etc.)?

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Graph G=(V,E)  V = set of vertices  E = set of edges undirected graph E={(1,2),(1,3),(2,3),(3,4),(4,5)}

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Graph G=(V,E)  V = set of vertices  E = set of edges directed graph E={‹1,2›, ‹2,1› ‹1,3›, ‹3,2›, ‹3,4›, ‹4,5›}

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Edges have / do not have a weight associated with them weightedunweighted

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  degree d(i) of node i  number of edges incident on node i  degree distribution  1 node with degree 1  3 nodes with degree 2  1 node with degree 3  P(1) = 1/5, P(2) = 3/5, P(3) = 1/ degree 12 3

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis k P(k) Network Science: Graph Theory January 24, 2011

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  in-degree d in (i) of node i  number of edges pointing to node i  out-degree d out (i) of node i  number of edges leaving node i  in-degree sequence  [1,2,1,1,1]  out-degree sequence  [2,1,2,1,0]

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Path from node i to node j: a sequence of edges (directed or undirected from node i to node j)  path length: number of edges on the path  nodes i and j are connected  cycle: a path that starts and ends at the same node

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Shortest Path from node i to node j  also known as BFS path, or geodesic path

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The longest shortest path in the graph

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Connected graph: a graph where every pair of nodes is connected  Disconnected graph: a graph that is not connected  Connected Components: subsets of vertices that are connected

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Clique K n  A graph that has all possible n(n-1)/2 edges

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Strongly connected graph: there exists a path from every i to every j  Weakly connected graph: If edges are made to be undirected the graph is connected

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Adjacency Matrix  symmetric matrix for undirected graphs

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Adjacency Matrix  non-symmetric matrix for undirected graphs

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis G1G1 G2G2 G3G symmetric undirected: n 2 /2 directed: n 2

Eurecom, Sophia-Antipolis Thrasyvoulos Spyropoulos / Random Graph Models: Create/Explain Complex Network Properties

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The networks discussed are quite large!  Impossible to describe or visualize explicitly.  Consider this example:  You have a new Internet routing algorithm  You want to evaluate it, but do not have a trace of the Internet topology  You decide to create an “Internet-like” graph on which you will run your algorithm  How do you describe/create this graph??  Random graphs: local and probabilistic rules by which vertices are connected  Goal: from simple probabilistic rules to observed complexity Q: Which rules gives us (most of) the observed properties? 54

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis 55

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  This is “Conway’s game of life” (many other automata)  ed (demo) ed  (try your own) 56 Local Rules  Each cell either white or blue  Each cell interacts with its 8 neighbors  Time is discrete (rounds) 1. Any blue cell with fewer than two live neighbors  becomes white 2. Any blue cell with two or three blue neighbors lives on to the round 3. Any blue cell with more than three blue neighbors  becomes white 4. Any white cell with exactly three blue neighbors  become blue

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  A very (very!) simple local rule: (any) two vertices are connected with probability p  Only inputs: number of vertices n and probability p  Denote this class of graphs as G(n,p) 57 Erdös-Rényi model (1960) Connect with probability p p=1/6 N=10  k  ~ 1.5

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis N and p do not uniquely define the network– we can have many different realizations of it. How many? G(10,1/6) N=10 p=1/6 G(N,L): a graph with N nodes and L links The probability to form a particular graph G(N,L) is That is, each graph G(N,L) appears with probability P(G(N,L)).

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis P(L): the probability to have exactly L links in a network of N nodes and probability p: The maximum number of links in a network of N nodes. Number of different ways we can choose L links among all potential links. Binomial distribution...

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis P(L): the probability to have a network of exactly L links  The average number of links in a random graph  The standard deviation  Average node degree

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis As the network size increases, the distribution becomes increasingly narrow—which means that we are increasingly confident that the number of links the graph has is in the vicinity of.

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The degree distribution follows a binomial  average degree is = p(N-1)  variance σ 2 = p(1-p)(N-1)  Assuming z=Np is fixed, as N → ∞, B(N,k,p) is approximated by a Poisson distribution  As N → ∞  Highly concentrated around the mean  Probability of very high node degrees is exponentially small  Very different from power law! 62

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis The secret behind the small world effect – Looking at the network volume

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis The secret behind the small world effect – Looking at the network volume Polynomial growth

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis The secret behind the small world effect – Looking at the network volume Polynomial growth

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis The secret behind the small world effect – Looking at the network volume Polynomial growthExponential growth

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Given the huge differences in scope, size, and average degree, the agreement is excellent!

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Consider a random graph G(n,p) Q: What is the probability that two of your neighbors are also neighbors? A: It is equal to p, independent of local structure  clustering coefficient C = p  when z is fixed (sparse networks): C = z/n =O(1/n) 68

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Given the huge differences in scope, size, and average degree, there is a clear disagreement.

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Erdos-Renyi Graphs are “small world”  path lengths are O(logn)  Erdos-Renyi Graphs are not “scale-free”  Degree distribution binomial and highly-concentrated (no power- law)  Exponentially small probability to have “hubs” (no heavy-tail)  Erdos-Renyi Graphs are not “clustered”  C  0, as N becomes larger Conclusion: ER random graphs are not a good model of real networks  BUT: still provide a great deal of insight! 70 √ X X

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Some of your neighbors neighbors are also your own Exponential growth: Clustering inhibits the small-worldness

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  Short paths must be combined with  High clustering coefficient Watts and Strogatz model [WS98]  Start with a ring, where every node is connected to the next k nodes  With probability p, rewire every edge (or, add a shortcut) to a random node 72 order randomness p = 0 p = 1 0 < p < 1

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis The Watts Strogatz Model : It takes a lot of randomness to ruin the clustering, but a very small amount to overcome locality 73 log-scale in p When p = 0, C = 3(k-2)/4(k-1) ~ ¾ L = n/k For small p, C ~ ¾ L ~ logn Clustering Coefficient – Characteristic Path Length

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Nodes: online user Links: contact, tweet, or friendship Alan Mislove, Measurement and Analysis of Online Social Networks All distributions show a fat-tail behavior: there are orders of magnitude spread in the degrees

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The configuration model  input: the degree sequence [d 1,d 2,…,d n ]  process: -Create d i copies of node i; link them randomly -Take a random matching (pairing) of the copies self-loops and multiple edges are allowed But: Too artificial!

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Networks continuously expand by the addition of new nodes Barabási & Albert, Science 286, 509 (1999) ER, WS models: the number of nodes, N, is fixed (static models)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis (1) Networks continuously expand by the addition of new nodes Add a new node with m links Barabási & Albert, Science 286, 509 (1999)

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis Barabási & Albert, Science 286, 509 (1999) PREFERENTIAL ATTACHMENT: the probability that a node connects to a node with k links is proportional to k. A: New nodes prefer to link to highly connected nodes. Q: Where will the new node link to? ER, WS models: choose randomly.

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis “The rich get richer”  First considered by [Price 65] as a model for citation networks  each new paper is generated with m citations (on average)  new papers cite previous papers with probability proportional to their indegree (citations)  what about papers without any citations? -each paper is considered to have a “default” citation -probability of citing a paper with degree k, proportional to k+1  Power law with exponent α = 2+1/m 80

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  The BA model (undirected graph)  input: some initial subgraph G 0, and m the number of edges per new node  the process: -nodes arrive one at the time -each node connects to m other nodes selecting them with probability proportional to their degree -if [d 1,…,d t ] is the degree sequence at time t, the node t+1 links to node i with probability  Results in power-law with exponent α = 3  Various Problems: cannot account for every power law observed (Web), correlates age with degree, etc. 81

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  As p increases, so does the density of the graph  For small p (<0.2) notice that not all nodes are connected  For p = 0.2 only one isolated node 82 p = 0p = 0.1p = 0.2

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  We saw that increasing p  denser networks  In the large N case we increase z = Np the average degree  But what really happens as p (or z) increases? 83 A random network on 50 nodes: p = 0.01  disconnected, largest component = 3

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  p = 0.03  large component appears  But almost 40% of nodes still disconnected 84

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  p = 0.05  “giant” component emerges  Only 3 nodes disconnected  Giant component  the graph “percolates” 85

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis  p = 0.10  all nodes connected 86

Thrasyvoulos Spyropoulos / Eurecom, Sophia-Antipolis S: the fraction of nodes in the giant component, S=N GC /N there is a phase transition at =1:  for < 1 there is no giant component  for > 1 there is a giant component  for large the giant component contains all nodes (S=1) 10/10/erdos-renyi.png S