Miniconference on the Mathematics of Computation

Slides:



Advertisements
Similar presentations
Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Advertisements

1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Week 4 – Random Graphs Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
1 Dimension matching in Facebook and LinkedIn networks Anthony Bonato Ryerson University Seminar on Social Networks, Big Data, Influence, and Decision-Making.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Universal Random Semi-Directed Graphs
Advanced Topics in Data Mining Special focus: Social Networks.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Networks - Bonato1 Modelling, Mining, and Searching Networks Anthony Bonato Ryerson University Master’s Seminar November 2012.
The Erdös-Rényi models
Lecture 6 - Models of Complex Networks II Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Log Dimension Hypothesis1 The Logarithmic Dimension Hypothesis Anthony Bonato Ryerson University MITACS International Problem Solving Workshop July 2012.
Complex networks - Bonato1 Complex networks and their models Anthony Bonato Ryerson University Graduate Seminar October 2011.
1 Vertex-pursuit in heirarchical social networks Anthony Bonato Ryerson University TAMC’12 Complex Networks.
Popularity versus Similarity in Growing Networks Fragiskos Papadopoulos Cyprus University of Technology M. Kitsak, M. Á. Serrano, M. Boguñá, and Dmitri.
Week 3 - Complex Networks and their Properties
1 Burning a graph as a model of social contagion Anthony Bonato Ryerson University Institute of Software Chinese Academy of Sciences.
On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-Line Social Networks Anthony Bonato Ryerson University WAW’2009 February 13, 2009 nt.
Week 1 – Introduction to Graph Theory I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
1 Modelling, Mining, and Searching Networks Anthony Bonato Ryerson University Graduate Seminar October 2015.
How Do “Real” Networks Look?
Miniconference on the Mathematics of Computation
1 How to burn a graph Anthony Bonato Ryerson University GRASCan 2015.
On-line Social Networks - Anthony Bonato 1 Dynamic Models of On-line Social Networks Anthony Bonato Ryerson University ICMCM’09 December, 2009.
A short course on complex networks
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
1 Dimension matching in Facebook and LinkedIn networks Anthony Bonato Ryerson University Toronto, Canada ICMCE 2015.
NN k Networks for browsing and clustering image collections Daniel Heesch Communications and Signal Processing Group Electrical and Electronic Engineering.
Modelling, Mining, and Searching Networks Anthony Bonato Ryerson University Graduate Seminar September 2016.
Mining and Modeling Character Networks Anthony Bonato Ryerson University Complex Systems Seminar Queen Mary University of London.
Network (graph) Models
Lecture 23: Structure of Networks
Cohesive Subgraph Computation over Large Graphs
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Groups of vertices and Core-periphery structure
Modelling, Mining, and Searching Networks
Lecture 1: Complex Networks
Topics In Social Computing (67810)
Peer-to-Peer and Social Networks
How Do “Real” Networks Look?
Dieudo Mulamba November 2017
Lecture 23: Structure of Networks
Miniconference on the Mathematics of Computation
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Lecture 13 Network evolution
Peer-to-Peer and Social Networks Fall 2017
How Do “Real” Networks Look?
Discrete Mathematics and its Applications Lecture 1 – Graph Theory
Modelling and Searching Networks Lecture 3 – ILT model
Lecture 23: Structure of Networks
Miniconference on the Mathematics of Computation
Discrete Mathematics and its Applications Lecture 3 – ILT model
Miniconference on the Mathematics of Computation
Graph and Link Mining.
Lecture 21 Network evolution
Modelling and Searching Networks Lecture 2 – Complex Networks
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
18th Ontario Combinatorics Workshop On-line Social Networks
Affiliation Network Models of Clusters in Networks
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
What did we see in the last lecture?
Presentation transcript:

Miniconference on the Mathematics of Computation 1st Symposium on Spatial Networks Oxford University The Geometry of Social Networks Anthony Bonato Ryerson University

Geometry of Social Networks Friendship networks network of on- and off-line friends form a large web of interconnected links Geometry of Social Networks

Geometry of Social Networks 6 degrees of separation (Stanley Milgram,67): famous chain letter experiment Geometry of Social Networks

Geometry of Social Networks 4 Degrees in Facebook 1.71 billion users (Backstrom,Boldi,Rosa, Ugander,Vigna,2012) 4 degrees of separation in Facebook when considering another person in the world, a friend of your friend knows a friend of their friend, on average similar results for Twitter and other OSNs Geometry of Social Networks

Are we really that similar? Geometry of Social Networks

Geometry of Social Networks Social distance 4 or 6 degrees of separation does not reflect our true social distance D. Liben-Nowell, J. Kleinberg, Tracing information flow on a global scale using Internet chain-letter data PNAS 105 (2008) 4633-4638. Geometry of Social Networks

Geometry of Social Networks Hidden geometry vs Geometry of Social Networks

Complex networks in the era of Big Data web graph, social networks, biological networks, internet networks, … Geometry of Social Networks

What is a complex network? no precise definition however, there is general consensus on the following observed properties large scale evolving over time power law degree distributions small world properties Geometry of Social Networks

Examples of complex networks technological/informational: web graph, router graph, AS graph, call graph, e-mail graph social: on-line social networks (Facebook, Twitter, LinkedIn,…), collaboration graphs, co-actor graph biological networks: protein interaction networks, gene regulatory networks, food networks, connectomes Geometry of Social Networks

Geometry of Social Networks Other properties densification power law (Leskovec, Kleinberg, Faloutsos,05): |(E(Gt)| ≈ |V(Gt)|a where 1 < a ≤ 2: densification exponent community structure spectral expansion Geometry of Social Networks

Geometry of Social Networks Blau space OSNs live in social space or Blau space: each user identified with a point in a multi-dimensional space coordinates correspond to socio-demographic variables/attributes homophily principle: the flow of information between users is a declining function of distance in Blau space Geometry of Social Networks

Geometry of Social Networks Dimensionality Question: What is the dimension of the Blau space of OSNs? what is a credible mathematical formula for the dimension of an OSN? Geometry of Social Networks

Geometry of Social Networks

Random geometric graphs n nodes are randomly placed in the unit square each node has a constant sphere of influence, radius r nodes are joined if their Euclidean distance is at most r G(n,r), r = r(n) Geometry of Social Networks

Some properties of G(n,r) Theorem (Penrose,97) Let μ = nexp(-πr2n). If μ = o(1), then asymptotically almost surely (a.a.s.) G is connected. If μ = Θ(1), then a.a.s. G has a component of order Θ(n). If μ →∞, then a.a.s. G is disconnected. many other properties studied of G(n,r): chromatic number, clique number, Hamiltonicity, random walks, … Geometry of Social Networks

Geometry of Social Networks Spatially Preferred Attachment (SPA) model (Aiello, Bonato, Cooper, Janssen, Prałat,08), (Cooper, Frieze, Prałat,12) volume of sphere of influence proportional to in-degree nodes are added and spheres of influence shrink over time a.a.s. leads to power laws graphs, low directed diameter, and small separators Geometry of Social Networks

Geometry of Social Networks Ranking models (Fortunato,Flammini,Menczer,06), (Łuczak,Prałat,06), (Janssen,Prałat,09) parameter: α in (0,1) each node is ranked 1,2, …, n by some function r 1 is best, n is worst at each time-step, one new node is born, one randomly node chosen dies (and ranking is updated) link probability r-α many ranking schemes a.a.s. lead to power law graphs: random initial ranking, degree, age, etc. Geometry of Social Networks

Geometric model for OSNs we consider a geometric model of OSNs, where nodes are in m-dimensional Euclidean space volume of spheres of influence variable: a function of ranking of nodes Geometry of Social Networks

Geometric Protean (GEO-P) Model (Bonato,Janssen,Prałat,12) parameters: α, β in (0,1), α+β < 1; positive integer m nodes live in an m-dimensional hypercube each node is ranked 1,2, …, n by some function r 1 is best, n is worst we use random initial ranking at each time-step, one new node v is born, one randomly node chosen dies (and ranking is updated) each existing node u has a region of influence with volume add edge uv if v is in the region of influence of u Geometry of Social Networks

Geometry of Social Networks Notes on GEO-P model models uses both geometry and ranking number of nodes is static: fixed at n order of OSNs at most number of people (roughly…) top ranked nodes have larger regions of influence Geometry of Social Networks

Geometry of Social Networks Simulation with 5000 nodes Geometry of Social Networks

Geometry of Social Networks Simulation with 5000 nodes random geometric GEO-P Geometry of Social Networks

Properties of the GEO-P model (BJP,2012) a.a.s. the GEO-P model generates graphs with the following properties: power law degree distribution with exponent b = 1+1/α average degree d = (1+o(1))n(1-α-β)/21-α densification diameter D = nΘ(1/m) small world: constant order if m = Clog n bad spectral expansion and high clustering coefficient Geometry of Social Networks

Geometry of Social Networks Dimension of OSNs given the order of the network n and diameter D, we can calculate m gives formula for dimension of OSN: Geometry of Social Networks

Logarithmic Dimension Hypothesis In an OSN of order n and diameter D, the dimension of its Blau space is posed independently by (Leskovec,Kim,11), (Frieze, Tsourakakis,11) Geometry of Social Networks

Few dimensions implies greater difference low dimensional separation high dimensional separation Geometry of Social Networks

Uncovering the hidden reality reverse engineering approach given network data (n, D), dimension of an OSN gives smallest number of attributes needed to identify users that is, given the graph structure, we can (theoretically) recover the Blau space Geometry of Social Networks

6 Dimensions of Separation OSN Dimension Facebook 7 YouTube 6 Twitter 4 Flickr Cyworld Geometry of Social Networks

Geometry of Social Networks

MGEO-P (Bonato,Gleich,Mitsche,Prałat,Tian,Young,14) time-steps in GEO-P form a computational bottleneck consider a GEO-P where we forget the history of ranks memoryless GEO-P (MGEO-P) place n points u.a.r. in the hypercube assign ranks from via a random permutation σ for each pair i > j, ij is an edge if j is in the ball of volume σ(i)–αn-β Geometry of Social Networks

Contrasting the models by considering the evolution of ranks in GEO-P, the probability that an edge is present in GEO-P and not in MGEO-P is: intuitively, the models generate similar graphs many a.a.s properties hold in MGEO-P with similar parameters Geometry of Social Networks

Properties of the MGEO-P model (BGMPTY,14) a.a.s. the MGEO-P model generates graphs with the following properties: power law degree distribution with exponent b = 1+1/α average degree d = (1+o(1))n(1-α-β)/21-α densification diameter D = nΘ(1/m) Geometry of Social Networks

Proof sketch: diameter eminent node: highly ranked: ranking greater than some fixed R partition hypercube into small hypercubes choose size of hypercubes and R so that each hypercube contains at least log2n eminent nodes sphere of influence of each eminent node covers each hypercube and all neighbouring hypercubes choose eminent node in each hypercube: backbone show all nodes in hypercube distance at most 2 from backbone Geometry of Social Networks

Geometry of Social Networks Back to question… How would we measure the dimensionality of Blau space? Geometry of Social Networks

Aside: machine learning machine learning is a branch of AI that infers structure from data examples: spam filters Netflix recommender systems text and image categorization especially useful when the data or number of decisions are too large for humans to process Geometry of Social Networks

Model selection in complex networks (Middendorf,Ziv,Wiggins,05) used ADTs and motifs for model selection in protein networks predicted duplication/mutation model (Memišević,Milenković,Pržulj,10) model selection predicting random geometric graphs as best fit for protein networks (Janssen,Hurshman,Kalyaniwalla,12) ADT with motif classifiers predict PA and SPA models best fit Facebook 100 graphs Geometry of Social Networks

Support Vector Machine (SVM) Sec. 15.1 Support Vector Machine (SVM) support vectors maximizes margin SVM maximizes the margin around the separating hyperplane solving SVMs is a quadratic programming problem successful text and image classification method Geometry of Social Networks

Geometry of Social Networks Facebook100 Geometry of Social Networks

Geometry of Social Networks Validating the LDH we tested the dimensionality of large-scale samples from real OSN data FB100 and LinkedIn (sampled over time) Idea: use machine learning (SVM) to predict dimensions features: small subgraph counts (3- and 4-vertex subgraphs) compared sampled data vs simulations of MGEO-P with dimensions 1 through 12 Geometry of Social Networks

Miniconference on the Mathematics of Computation Motifs/Graphlets Geometry of Social Networks

Miniconference on the Mathematics of Computation Experimental design Geometry of Social Networks

Miniconference on the Mathematics of Computation Sample: Michigan Geometry of Social Networks

Miniconference on the Mathematics of Computation Stanford3: n: 11621 edges: 568330 avgdeg: 97.81086 plexp: 3.730000 GeoP parameters alphabeta: 0.510389 alpha: 0.366300 beta: 0.144089 python geop_dim_experiment.py --logcount -s 50 -t 0 --mmax 12 --prob 0.001 Stanford3 11621 568330 0.366300 0.144089 M-GeoP dimensions: LADTree: 2 J48: 3 Logistic: 5 SVM: 5 Geometry of Social Networks

Miniconference on the Mathematics of Computation FB and LinkedIn - SVM Geometry of Social Networks

FB and LinkedIn - Eigenvalues Miniconference on the Mathematics of Computation FB and LinkedIn - Eigenvalues Geometry of Social Networks

Miniconference on the Mathematics of Computation Figure 6. For three of the Facebook networks, we show the eigenvalue histogram in red, the eigenvalue histogram from the best fit MGEO-P network in blue, and the eigenvalue histograms for samples from the other dimensions in grey. Bonato A, Gleich DF, Kim M, Mitsche D, et al. (2014) Dimensionality of Social Networks Using Motifs and Eigenvalues. PLoS ONE 9(9): e106052. doi:10.1371/journal.pone.0106052 http://www.plosone.org/article/info:doi/10.1371/journal.pone.0106052 Geometry of Social Networks

Miniconference on the Mathematics of Computation Underlying geometry Feature space thesis (B,16+) every complex network has an underlying metric (or feature) space, where nodes are identified with points in the feature space, and edges are influenced by node similarity and proximity in the space For e.g.: web graph: topic space OSNs: Blau space PPIs: biochemical space Geometry of Social Networks

Miniconference on the Mathematics of Computation Implications of FST new way of viewing complex networks not just graph structure, but underlying, hidden geometry that matters graph structure can help uncover this hidden geometry Geometry of Social Networks

Miniconference on the Mathematics of Computation Future directions other data sets fractal or other dimension underlying metric? what are the attributes? what implications does LDH have for OSNs or social networks in general? Geometry of Social Networks

Miniconference on the Mathematics of Computation Character networks cultural work: fictional works such novels or short stories, movies, biographies, historical works, religious texts character networks: nodes: characters or persona in a cultural work edges: co-occurrence edges may be weighted Geometry of Social Networks

Miniconference on the Mathematics of Computation E.g.: Marvel universe 10K nodes diameter 9 10 communities average degree 41 Character networks

Miniconference on the Mathematics of Computation moviegalaxies.com moviegalaxies.com,catalogues the social networks in 800+ movies Geometry of Social Networks

Dimensionality of character networks? Miniconference on the Mathematics of Computation Dimensionality of character networks? Geometry of Social Networks