Lecture I Introduction to complex networks Santo Fortunato.

Slides:



Advertisements
Similar presentations
Complex Networks Luis Miguel Varela COST meeting, Lisbon March 27 th 2013.
Advertisements

Introduction to Network Theory: Modern Concepts, Algorithms
Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.
Analysis and Modeling of Social Networks Foudalis Ilias.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Advanced Topics in Data Mining Special focus: Social Networks.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Mining and Searching Massive Graphs (Networks)
Fast algorithm for detecting community structure in networks.
Complex networks and random matrices. Geoff Rodgers School of Information Systems, Computing and Mathematics.
Global topological properties of biological networks.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
CS8803-NS Network Science Fall 2013
Social Media Mining Graph Essentials.
GoMore Network Analysis Kate Lyndegaard GEOG 596A Mentor: Frank Hardisty.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Complex networks A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, Orsay, France) M. Barthélemy (CEA, France) L. Dall’Asta (LPT, Orsay,
Traceroute-like exploration of unknown networks: a statistical analysis A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, France) L.
Class 2: Graph theory and basic terminology Learning the language Network Science: Graph Theory 2012 Prof. Albert-László Barabási Dr. Baruch Barzel, Dr.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Network theory David Lusseau BIOL4062/5062
Mathematics of Networks (Cont)
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Lecture 3 1.Different centrality measures of nodes 2.Hierarchical Clustering 3.Line graphs.
Graphs A graphs is an abstract representation of a set of objects, called vertices or nodes, where some pairs of the objects are connected by links, called.
Class 19: Degree Correlations PartII Assortativity and hierarchy
Introduction to complex networks Part I: Structure
Properties of Growing Networks Geoff Rodgers School of Information Systems, Computing and Mathematics.
March 3, 2009 Network Analysis Valerie Cardenas Nicolson Assistant Adjunct Professor Department of Radiology and Biomedical Imaging.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Class 2: Graph Theory IST402.
Lecture III Introduction to complex networks Santo Fortunato.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Lecture II Introduction to complex networks Santo Fortunato.
Weighted Networks IST402 – Network Science Acknowledgement: Roberta Sinatra Laszlo Barabasi.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Structures of Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Groups of vertices and Core-periphery structure
Frontiers of Network Science Class 15: Degree Correlations II
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Network analysis.
How Do “Real” Networks Look?
Community detection in graphs
Network Science: A Short Introduction i3 Workshop
Section 8.6 of Newman’s book: Clustering Coefficients
The Watts-Strogatz model
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Peer-to-Peer and Social Networks Fall 2017
How Do “Real” Networks Look?
Department of Computer Science University of York
Graph Operations And Representation
Clustering Coefficients
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Lecture I Introduction to complex networks Santo Fortunato

References  Evolution of networks S.N. Dorogovtsev, J.F.F. Mendes, Adv. Phys. 51, 1079 (2002), cond-mat/  Statistical mechanics of complex networks R. Albert, A-L Barabasi Reviews of Modern Physics 74, 47 (2002), cond-mat/  The structure and function of complex networks M. E. J. Newman, SIAM Review 45, (2003), cond-mat/  Complex networks: structure and dynamics S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, D.-U. Hwang Physics Reports 424, (2006)  Community detection in graphs S. Fortunato, Physics Reports 486, (2010) arXiv:

Plan of the course I. Networks: definitions, characteristics, basic concepts in graph theory II. Real world networks: basic properties III. Models IV. Community structure I V. Community structure II VI. Dynamic processes in networks

What is a network? Network or graph=set of vertices joined by edges very abstract representation very general convenient to describe many different systems

Some examples VerticesEdges Social networksIndividualsSocial relations InternetRouters AS Cables Commercial agreements WWWWebpagesHyperlinks Protein interaction networks ProteinsChemical reactions and many more ( , P2P, foodwebs, transport….)

Interdisciplinary science Science of complex networks: -graph theory -sociology -communication science -biology -physics -computer science

Interdisciplinary science Science of complex networks:  Empirics  Characterization  Modeling  Dynamical processes

Graph Theory Origin: Leonhard Euler (1736) L. Euler, Solutio problematis ad geometriam situs pertinentis, Comment. Academiae Sci. J. Petropolitanae 8, (1736)

Graph theory: basics Graph G=(V,E)  V=set of nodes/vertices i=1,…,n  E=set of links/edges (i,j), m Undirected edge: Directed edge: i Bidirectional communication/ interaction j ij

Graph theory: basics Maximum number of edges  Undirected: n(n-1)/2  Directed: n(n-1) Complete graph: (all to all interaction/communication)

Adjacency matrix n vertices i=1,…,n a ij = 1 if (i,j) E 0 if (i,j) E

Adjacency matrix Symmetric for undirected networks n vertices i=1,…,n a ij = 1 if (i,j) E 0 if (i,j) E

Adjacency matrix Non symmetric for directed networks n vertices i=1,…,n a ij = 1 if (i,j) E 0 if (i,j) E

Sparse graphs Density of a graph D=|E|/(n(n-1)/2) Number of edges Maximal number of edges D = Representation: lists of neighbours of each vertex Sparse graph: D <<1Sparse adjacency matrix l(i, V(i)) V(i)=neighbourhood of i

Paths G=(V,E) Path of length l = ordered collection of  l+1 vertices i 0,i 1,…,i l ε V  l edges (i 0,i 1 ), (i 1,i 2 )…,(i l-1,i l ) ε E i2i2 i0i0 i1i1 i5i5 i4i4 i3i3 Cycle/loop = closed path (i 0 =i l ) with all other vertices and edges distinct

Paths and connectedness G=(V,E) is connected if and only if there exists a path connecting any two vertices in G is connected is not connected is formed by two components

Trees  n vertices, n-1 edges  Maximal loopless graph  Minimal connected graph A tree is a connected graph without loops/cycles

Paths and connectedness G=(V,E)=> distribution of components’ sizes Giant component= component whose size scales with the number of vertices n Existence of a giant component Macroscopic fraction of the graph is connected

Paths and connectedness: directed graphs TubeTendril Tendrils Giant SCC: Strongly Connected Component Giant OUT Component Giant IN Component Disconnected components Paths are directed

Shortest paths i j Shortest path between i and j: minimum number of traversed edges distance l(i,j)=minimum number of edges traversed on a path between i and j Diameter of the graph= max[l(i,j)] Average shortest path=  ij l(i,j)/(n(n-1)/2) Complete graph: l(i,j)=1 for all i,j “Small-world”  “small” diameter

Graph spectra Spectrum of a graph: set of eigenvalues of adjacency matrix A If A is symmetric (undirected graph), n real eigenvalues with real orthogonal eigenvectors If A is asymmetric, some eigenvalues may be complex Perron-Frobenius theorem: any graph has (at least) one positive eigenvalue μ n with one non-negative eigenvector, such that |μ|≤ μ n for any eigenvalue μ. If the graph is connected, the multiplicity of μ n is one. Consequence: on an undirected connected graph there is only one eigenvector with positive components, the others have mixed-signed components

Graph spectra Spectral density Continuous function in the limit k-th moment of spectral density

Wigner’s semicircle law For real symmetric uncorrelated random matrices whose elements have finite moments in the limit

Centrality measures How to quantify the importance of a vertex?  Degree=number of neighbours=  j a ij i k i =5 Closeness centrality g i = 1 /  j l(i,j) For directed graphs: k in, k out

Betweenness centrality for each pair of vertices (l,m) in the graph, there are σ lm shortest paths between l and m σ i lm shortest paths going through i b i is the sum of σ i lm / σ lm over all pairs (l,m) NB: similar quantity= load l i =   i lm NB: generalization to edge betweenness centrality Path-based quantity j b i is large b j is small i

Eigenvector centrality i x3x3 x4x4 x1x1 xixi x5x5 x2x2 Basic principle = the importance of a vertex is proportional to the sum of the importances of its neighbors Solution: eigenvectors of adjacency matrix!

Eigenvector centrality Not all eigenvectors are good solutions! Requirement: the values of the centrality measure have to be positive Because of Perron-Frobenius theorem only the eigenvector with largest eigenvalue (principal eigenvector) is a good solution! The principal eigenvector can be quickly computed with the power method!

Structure of neighborhoods Clustering: My friends will know each other with high probability! (typical example: social networks) k C(i) = # of links between 1,2,…n neighbors k(k-1)/ Clustering coefficient of a vertex i

Structure of neighborhoods Average clustering coefficient of a graph C=  i C(i)/n

Statistical characterization Degree distribution List of degrees k 1,k 2,…,k n Not very useful! Histogram: n k = number of vertices with degree k Distribution: P(k)=n k /n=probability that a randomly chosen vertex has degree k Cumulative distribution : P > (k)=probability that a randomly chosen vertex has degree at least k

Statistical characterization Cumulative degree distribution Conclusion: power laws and exponentials can be easily recognized

Statistical characterization Degree distribution P(k)=n k /n=probability that a randomly chosen vertex has degree k Average = =  i k i /n =  k k P(k)=2|E|/n Fluctuations : - 2 =  i k 2 i /n =  k k 2 P(k) =  k k n P(k) Sparse graphs: << n

Statistical characterization Multipoint degree correlations P(k): not enough to characterize a network Large degree vertices tend to connect to large degree vertices Ex: social networks Large degree vertices tend to connect to small degree vertices Ex: technological networks

Statistical characterization Multipoint degree correlations Measure of correlations: P(k’,k’’,…k (n) |k): conditional probability that a vertex of degree k is connected to vertices of degree k’, k’’,… Simplest case: P(k’|k): conditional probability that a vertex of degree k is connected to a vertex of degree k’ often inconvenient (statistical fluctuations)

Statistical characterization Multipoint degree correlations Practical measure of correlations : Average degree of nearest neighbors i k=3 k=7 k=4 k i =4 k nn,i =( )/4=4.5

Statistical characterization Average degree of nearest neighbors Correlation spectrum: putting together vertices having the same degree class of degree k

Statistical characterization Case of random uncorrelated networks P(k’|k) independent of k prob. that an edge points to a vertex of degree k’ proportional to k’ itself P unc (k’|k)=k’P(k’)/ number of edges from vertices of degree k’ number of edges from vertices of any degree

Typical correlations  Assortative behaviour: growing k nn (k) Example: social networks Large sites are connected with large sites  Disassortative behaviour: decreasing k nn (k) Example: internet Large sites connected with small sites, hierarchical structure

Correlations: Clustering spectrum P(k’,k’’|k): cumbersome, difficult to estimate from data Average clustering coefficient C=average over vertices with very different characteristics Clustering spectrum: putting together vertices which have the same degree class of degree k (link with hierarchical structures)

Motifs Motifs: subgraphs occurring more often than on random versions of the graph Significance of motifs: Z-score!

Weighted networks Real world networks: edges  carry traffic (transport networks, Internet…)  have different intensities (social networks…) General description: weights i jw ij a ij : 0 or 1 w ij : continuous variable

● Scientific collaborations: number of common papers ● Internet, s: traffic, number of exchanged s ● Airports: number of passengers ● Metabolic networks: fluxes ● … Weights: examples usually w ii =0 symmetric: w ij =w ji

Weighted networks Weights: on the edges Strength of a vertex: s i =  j ε V(i) w ij =>Naturally generalizes the degree to weighted networks =>Quantifies for example the total traffic at a vertex

Weighted clustering coefficient I s i =16 c i w =0.625 > c i k i =4 c i =0.5 s i =8 c i w =0.25 < c i w ij =1 w ij =5 i i A. Barrat, M. Barthélemy, R. Pastor-Satorras, A. Vespignani, PNAS 101, 3747 (2004)

Weighted clustering coefficient II J. Saramäki, M. Kivela, J.-P. Onnela, K. Kaski, J. Kertész, Phys. Rev. E 75, (2007) Definition based on subgraph intensity

Weighted clustering coefficient Random(ized) weights: C = C w C < C w : more weights on cliques C > C w : less weights on cliques w ik i j k (w jk ) w ij Average clustering coefficient C=  i C(i)/n C w =  i C w (i)/n Clustering spectra

Weighted assortativity k i =5; k nn,i = i

Weighted assortativity k i =5; k nn,i = i

Weighted assortativity k i =5; s i =21; k nn,i =1.8 ; k nn,i w =1.2: k nn,i > k nn,i w i

k i =5; s i =9; k nn,i =1.8 ; k nn,i w =3.2: k nn,i < k nn,i w i Weighted assortativity

Participation ratio 1/k i if all weights equal close to 1 if few weights dominate

Plan of the course I. Networks: definitions, characteristics, basic concepts in graph theory II. Real world networks: basic properties III. Models IV. Community structure I V. Community structure II VI. Dynamic processes in networks