Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Slides:

Advertisements

Similar presentations

Routing Complexity of Faulty Networks Omer Angel Itai Benjamini Eran Ofek Udi Wieder The Weizmann Institute of Science.

Advertisements

Peer-to-Peer and Social Networks Power law graphs Small world graphs.

Algorithmic and Economic Aspects of Networks Nicole Immorlica.

CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.

Analysis and Modeling of Social Networks Foudalis Ilias.

Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Week 4 – Random Graphs Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Random Geometric Graph Diameter in the Unit Disk Robert B. Ellis, Texas A&M University coauthors Jeremy L. Martin, University of Minnesota Catherine Yan,

Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.

SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.

1 Dimension matching in Facebook and LinkedIn networks Anthony Bonato Ryerson University Seminar on Social Networks, Big Data, Influence, and Decision-Making.

Universal Random Semi-Directed Graphs

1 Seepage as a model of counter-terrorism Anthony Bonato Ryerson University CMS Winter Meeting December 2011.

The influence of search engines on preferential attachment Dan Li CS3150 Spring 2006.

Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec

1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.

Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.

1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.

On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.

Power law random graphs. Loose definition: distribution is power-law if Over some range of values for some exponent Examples  Degree distributions of.

CS Lecture 6 Generative Graph Models Part II.

1 University of Freiburg Computer Networks and Telematics Prof. Christian Schindelhauer Mobile Ad Hoc Networks Theory of Data Flow and Random Placement.

Advanced Topics in Data Mining Special focus: Social Networks.

On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.

Computer Science 1 Web as a graph Anna Karpovsky.

Week 10 – Meyniel’s conjecture Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,

Networks - Bonato1 Modelling, Mining, and Searching Networks Anthony Bonato Ryerson University Master’s Seminar November 2012.

The Erdös-Rényi models

Log Dimension Hypothesis1 The Logarithmic Dimension Hypothesis Anthony Bonato Ryerson University MITACS International Problem Solving Workshop July 2012.

Complex networks - Bonato1 Complex networks and their models Anthony Bonato Ryerson University Graduate Seminar October 2011.

1 Vertex-pursuit in heirarchical social networks Anthony Bonato Ryerson University TAMC’12 Complex Networks.

Week 3 - Complex Networks and their Properties

Models and Algorithms for Complex Networks Power laws and generative processes.

Exponential Functions Section 1. Exponential Function f(x) = a x, a > 0, a ≠ 1 The base is a constant and the exponent is a variable, unlike a power function.

Percolation in self-similar networks Dmitri Krioukov CAIDA/UCSD M. Á. Serrano, M. Boguñá UNT, March 2011.

1 Burning a graph as a model of social contagion Anthony Bonato Ryerson University Institute of Software Chinese Academy of Sciences.

Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.

On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-Line Social Networks Anthony Bonato Ryerson University WAW’2009 February 13, 2009 nt.

Class 9: Barabasi-Albert Model-Part I

Week 1 – Introduction to Graph Theory I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.

1 Modelling, Mining, and Searching Networks Anthony Bonato Ryerson University Graduate Seminar October 2015.

Miniconference on the Mathematics of Computation

1 How to burn a graph Anthony Bonato Ryerson University GRASCan 2015.

On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-line Social Networks Anthony Bonato Ryerson University ICMCM’09 December, 2009.

A short course on complex networks

1 Dimension matching in Facebook and LinkedIn networks Anthony Bonato Ryerson University Toronto, Canada ICMCE 2015.

Random Walk for Similarity Testing in Complex Networks

Topics In Social Computing (67810)

Miniconference on the Mathematics of Computation

Peer-to-Peer and Social Networks

Generative Model To Construct Blog and Post Networks In Blogosphere

The Watts-Strogatz model

The likelihood of linking to a popular website is higher

Discrete Mathematics and its Applications Lecture 1 – Graph Theory

Random Geometric Graph Diameter in the Unit Disk

Modelling and Searching Networks Lecture 3 – ILT model

Discrete Mathematics and its Applications Lecture 3 – ILT model

Miniconference on the Mathematics of Computation

Modelling and Searching Networks Lecture 9 – Meyniel’s Conjecture

Modelling and Searching Networks Lecture 4 – ILAT model

Modelling and Searching Networks Lecture 9 – Meyniel’s Conjecture

Modelling and Searching Networks Lecture 2 – Complex Networks

Modelling and Searching Networks Lecture 5 – Random graphs

Modelling and Searching Networks Lecture 6 – PA models

18th Ontario Combinatorics Workshop On-line Social Networks

Discrete Mathematics and its Applications Lecture 5 – Random graphs

Network Models Michael Goodrich Some slides adapted from:

Discrete Mathematics and its Applications Lecture 6 – PA models

Cops and Robbers from a Distance

Presentation transcript:

Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014

Key properties of complex networks 1.Large scale. 2.Evolving over time. 3.Power law degree distributions. 4.Small world properties. Other properties are also important: densification power law, shrinking distances,… 2

3 Geometry of the web? idea: web pages exist in a topic-space –a page is more likely to link to pages close to it in topic-space

4 Random geometric graphs nodes are randomly placed in space nodes are joined if their distance is less than a threshold value d; nodes each have a region of influence which is a ball of radius d (Penrose, 03)

Simulation with 5000 nodes 5

6 Geometric Preferential Attachment (GPA) model (Flaxman, Frieze, Vera, 04/07) nodes chosen on-line u.a.r. from sphere with surface area 1 each node has a region of influence with constant radius new nodes have m out-neighbours, chosen a)by preferential attachment; and b)only in the region of influence a.a.s. model generates power law, low diameter graphs with small separators/sparse cuts

7 Spatially Preferred Attachment graphs regions of influence shrink over time (motivation: topic space growing with time), and are functions of in-degree non-constant out-degree

8 Spatially Preferred Attachment (SPA) model (Aiello,Bonato,Cooper,Janssen,Prałat, 08) parameter: p a real number in (0,1] nodes on a 3-dimensional sphere with surface area 1 at time 0, add a single node chosen u.a.r. at time t, each node v has a region of influence B v with radius at time t+1, node z is chosen u.a.r. on sphere if z is in B v, then add vz independently with probability p

9 Simulation: p=1, t=5,000

10 as nodes are born, they are more likely to enter some B v with larger radius (degree) over time, a power law degree distribution results

11 power law exponent 1+1/p Theorem 6.1 (ACBJP, 08) Define Then a.a.s. for t ≤ n and i ≤ i f,

12 Rough sketch of proof derive an asymptotic expression for E(N i,t )

13 solve the recurrence asymptotically:

14 prove that N i,t is concentrated on E(N i,t ) via martingales standard approach is to use c-Lipshitz condition: change in N i,t is bounded above by constant c c-Lipschitz property may fail: new nodes may appear in an unbounded number of overlapping regions of influence prove this happens with exponentially small probabilities using the differential equation method

15 Models of OSNs few models for on-line social networks goal: find a model which simulates many of the observed properties of OSNs, –densification and shrinking distance –must evolve in a natural way…

Geometry of OSNs? OSNs live in social space: proximity of nodes depends on common attributes (such as geography, gender, age, etc.) IDEA: embed OSN in 2-, 3- or higher dimensional space 16

Dimension of an OSN dimension of OSN: minimum number of attributes needed to classify nodes like game of “20 Questions”: each question narrows range of possibilities what is a credible mathematical formula for the dimension of an OSN? 17

Geometric model for OSNs we consider a geometric model of OSNs, where –nodes are in m- dimensional Euclidean space –threshold value variable: a function of ranking of nodes 18

Geometric Protean (GEO-P) Model (Bonato, Janssen, Prałat, 10) parameters: α, β in (0,1), α+β < 1; positive integer m nodes live in m-dimensional hypercube each node is ranked 1,2, …, n by some function r –1 is best, n is worst –we use random initial ranking at each time-step, one new node v is born, one randomly node chosen dies (and ranking is updated) each existing node u has a region of influence with volume add edge uv if v is in the region of influence of u 19

Notes on GEO-P model models uses both geometry and ranking number of nodes is static: fixed at n –order of OSNs at most number of people (roughly…) top ranked nodes have larger regions of influence 20

Simulation with 5000 nodes 21

Simulation with 5000 nodes 22 random geometric GEO-P

Properties of the GEO-P model Theorem 6.2 (Bonato, Janssen, Prałat, 2010) A.a.s. the GEO-P model generates graphs with the following properties: –power law degree distribution with exponent b = 1+1/α –average degree d = (1+o(1))n (1-α-β) /2 1-α densification –diameter D = O(n β/(1-α)m log 2α/(1-α)m n) small world: constant order if m = Clog n. 23

Density average number of edges added at each time-step parameter β controls density if β < 1 – α, then density grows with n (as in real OSNs) 24

Dimension of OSNs given the order of the network n, power law exponent b, average degree d, and diameter D, we can calculate m gives formula for dimension of OSN: 25

Uncovering the hidden reality reverse engineering approach –given network data (n, b, d, D), dimension of an OSN gives smallest number of attributes needed to identify users that is, given the graph structure, we can (theoretically) recover the social space 26

6 Dimensions of Separation OSNDimension YouTube6 Twitter4 Flickr4 Cyworld7 27

Discussion 1.Speculate as to what the feature space would be for protein interaction networks. 2.Verify that for fixed constants α, β in (0,1): 28

Transitivity 29

30 Iterated Local Transitivity (ILT) model (Bonato, Hadi, Horn, Prałat, Wang, 08) key paradigm is transitivity: friends of friends are more likely friends nodes often only have local influence evolves over time, but retains memory of initial graph

31 ILT model start with a graph of order n to form the graph G t+1 for each node x from time t, add a node x’, the clone of x, so that xx’ is an edge, and x’ is joined to each node joined to x order of G t is n2 t

32 G 0 = C 4

Degrees Lemma 6.3 In the ILT model, let deg t (z) be the degree of z at time t. If x is in V(G t ), then we have the following: a) deg t +1 (x) = 2deg t (x)+1. b) deg t +1 (x’) = deg t (x)

34 Properties of ILT model average degree increasing to with time average distance bounded by constant and converging, and in many cases decreasing with time; diameter does not change clustering coefficient higher than in a random generated graph with same average degree bad expansion: small gaps between 1 st and 2 nd eigenvalues in adjacency and normalized Laplacian matrices of G t

35 Densification n t = order of G t, e t = size of G t Lemma 6.4: For t > 0, n t = 2 t n 0, e t = 3 t (e 0 +n 0 ) - n t. → densification power law: e t ≈ n t a, where a = log(3)/log(2).

36 Average distance Theorem 6.5: If t > 0, then average distance bounded by a constant, and converges; for many initial graphs (large cycles) it decreases diameter does not change from time 0

37 Clustering Coefficient Theorem 6.6: If t > 0, then c(G t ) = n t log(7/8)+o(1). higher clustering than in a random graph G(n t,p) with same order and average degree as G t, which satisfies c(G(n t,p)) = n t log(3/4)+o(1)

38 …Degree distribution –generate power law graphs from ILT? ILT model gives a binomial-type distribution