Introduction to complex networks Part I: Structure

Slides:



Advertisements
Similar presentations
Complex Networks: Complex Networks: Structures and Dynamics Changsong Zhou AGNLD, Institute für Physik Universität Potsdam.
Advertisements

The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Analysis and Modeling of Social Networks Foudalis Ilias.
School of Information University of Michigan Network resilience Lecture 20.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Network Measurements.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Introduction to complex networks Part II: Models Ginestra Bianconi Physics Department,Northeastern University, Boston,USA NetSci 2010 Boston, May
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
Biological Networks Feng Luo.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
Network Statistics Gesine Reinert. Yeast protein interactions.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Global topological properties of biological networks.
Advanced Topics in Data Mining Special focus: Social Networks.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
Computer Science 1 Web as a graph Anna Karpovsky.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
The Erdös-Rényi models
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Complex networks A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, Orsay, France) M. Barthélemy (CEA, France) L. Dall’Asta (LPT, Orsay,
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
School of Information University of Michigan SI 614 Network subgraphs (motifs) Biological networks Lecture 11 Instructor: Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Self-Similarity of Complex Networks Maksim Kitsak Advisor: H. Eugene Stanley Collaborators: Shlomo Havlin Gerald Paul Zhenhua Wu Yiping Chen Guanliang.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Communities. Questions 1.What is a community (intuitively)? Examples and fundamental hypothesis 2.What do we really mean by communities? Basic definitions.
Complex Network Theory – An Introduction Niloy Ganguly.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Bioinformatics Center Institute for Chemical Research Kyoto University
Class 19: Degree Correlations PartII Assortativity and hierarchy
Transport in weighted networks: optimal path and superhighways Collaborators: Z. Wu, Y. Chen, E. Lopez, S. Carmi, L.A. Braunstein, S. Buldyrev, H. E. Stanley.
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Class 2: Graph Theory IST402.
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Abstract Networks. WWW (2000) Scientific Collaboration Girvan & Newman (2002)
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Weighted Networks IST402 – Network Science Acknowledgement: Roberta Sinatra Laszlo Barabasi.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Graph clustering to detect network modules
Structures of Networks
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Peer-to-Peer and Social Networks Fall 2017
Advanced Topics in Data Mining Special focus: Social Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Introduction to complex networks Part I: Structure NetSci 2010 Boston, May 10 2010 Introduction to complex networks Part I: Structure Ginestra Bianconi Physics Department,Northeastern University, Boston,USA

Biological, Social and Technological systems. Complex networks describe the underlying structure of interacting complex Biological, Social and Technological systems.

Identify your network !

Why working on networks? Because NETWORKS encode for the ORGANIZATION, FUNCTION, ROBUSTENES AND DYNAMICAL BEHAVIOR of the entire complex system

Types of networks Simple Each link is either existent or non existent, the links do not have directionality (protein interaction map, Internet,…) Directed The links have directionality, i.e., arrows (World-Wide-Web, social networks…) Signed The links have a sign (transcription factor networks, epistatic networks…)

Types of networks Weighted The links are associated to a real number indicating their weight (airport networks, phone-call networks…) With features of the nodes The nodes might have weight or color (social networks, diseasome, ect..)

Bipartite networks: ex. Metabolic network Reaction pathway Bipartite Graph A+B C+D A+D E B+C E A B C D E 1 2 3 1 2 3 B Metabolites projection 1 Reactions (enzymes) projection A C E 2 3 D

Adjacency matrix Network: Adjacency matrix: 1 Network: A set of labeled nodes and links between them 3 2 5 4 Adjacency matrix: The matrix of entries a(i,j)=1 if there is a ink between node i and j a(i,j)=0 otherwise

Total number of nodes N and links L The total number of nodes N and the total number of links L are the most basic characteristics of a network

Degree distribution Degree if node i: Degree distribution: Network: Number of links of node i Degree distribution: P(k): how many nodes have degree k Network: 1 2 3 5 4

Each pair of nodes linked Random graphs G(N,L) ensemble Graphs with exactly N nodes and L links G(N,p) ensemble Graphs with N nodes Each pair of nodes linked with probability p Binomial Poisson distribution distribution

Universalities: Scale-free degree distribution Actor networks WWW Internet k k Faloutsos et al. 1999 Barabasi-Albert 1999

Topology of the yeast protein network H. Jeong et al. Nature (2001)

Transportation networks: Scale-free networks Technological networks: Internet, World-Wide Web Biological networks : Metabolic networks, protein-interaction networks, transcription networks Transportation networks: Airport networks Social networks: Collaboration networks citation networks Economical networks: Networks of shareholders in the financial market World Trade Web

Why this universality? Growing networks: Preferential attachment Barabasi & Albert 1999, Dorogovtsev Mendes 2000,Bianconi & Barabasi 2001, etc. Static networks: Hidden variables mechanism Chung & Lu 2002, Caldarelli et al. 2002, Park & Newman 2003

Scale-free networks with with with

Hyppocampus functional neural network Bonifazi et al . Science 2009 Dense scale-free networks

Social network of phone calls J. P Onnela PNAS 2007 Finite scale network

Clustering coefficient Definition of local clustering coefficient Clustering coefficient of nodes 2,3 Network 1 2 3 5 4

Shortest distance The shortest distance between two nodes is the minimal number of links than a path must hop to go from the source to the destination 1 The shortest distance between node 4 and node 1 is 3 between node 3 and node 1 is 2 2 3 5 4

Diameter and average distance The diameter D of a network is the maximal length of the shortest distance between any pairs of nodes in the network The average distance is the average length of the shortest distance between any pair of nodes in the network

Universality: Small world # of links between 1,2,…ki neighbors Ci = ki(ki-1)/2 Ki Networks are clustered (large average Ci, i.e.C) but have a small characteristic path length (small L). Watts and Strogatz (1999)

Small world in social systems 1967 Milligran experiment People from Nebraska and Kansas where ask to contact a person in Boston though their network of acquaintances The strength of weak ties (Granovetter 1973) Weak ties help navigate the social networks Small-world model (Watts & Strogatz 1998) The model for coexistence of high clustering and small distances in complex networks

Diameter and average length Diameter Average distance Poisson networks Scale-free Chung & Lu 2004 Cohen &Havlin 2003 Small world property Ultra small world property

Giant component A connected component of a network is a subgraph such that for each pair of nodes in the subgraph there is at least one path linking them The giant component is the connected component of the network which contains a number of nodes of the same order of magnitude of the total number of links

Molloy-Reed principle A network has a giant component if Molloy-Reed 1995 Second moment <k2> Molloy-Reed condition Poisson networks Scale-free

Fraction of removed nodes, f Robustness Complex systems maintain their basic functions even under errors and failures fc 1 Fraction of removed nodes, f S Node failure

Robustness of scale-free networks Failures Topological error tolerance 1 fc Attacks R. Albert et al. 2000 Cohen et al. 2000 Cohen et al. 2001 S f 1

Robustness Failure=Attack Robust against Failure Weak against Attack R. Albert et al. 2000

Betweenness Centrality The betweenness centrality of a node v is given by where st is the total number of shortest paths between s and t st(v) are the number of such paths passing through v The nodes A and B are also called bridges and have high betweeness

Link probability in uncorrelated networks In uncorrelated networks the probability that a node i is linked to a node j is given by i j

Degree correlations The map of Link probability between nodes of degree classes k and k’ The map of reveals correlations in the protein interaction map S. Maslov and K. Sneppen Science 2002

Specific characteristic of networks: Degree correlations Assortative networks >0 Uncorrelated networks =0 Disassortative networks <0 log(knn(k)) log (k) Average degree of a neighbor of a node of degree k

Degree-degree correlations in the Internet at the AS level the network is disassortative Vazquez et al. PRL 2001 Average degree of a neighbor of a node of degree k

clustering coefficient Modular networks >0 Correlations and clustering coefficient Uncorrelated networks =0 Modular networks >0 log(C(k)) log(k) Average clustering coefficient C(k) of nodes of degree k

Clustering coefficient of metabolic networks There are important correlations in the network! The network is not ‘random’ Highly connected nodes link more “distant” nodes of the network Ravasz,et al. Science (2002).

Loops or cycles of size L A loop or a cycle of size L is a path of the network that start at one point and ends after hopping L links on the same point without crossing the intermediate nodes more than one time 1 This network has 2 loops of size 3 and 1 loop of size 4 2 3 5 4

Small loops in the Internet at the AS level The number of loops of size L= 3,4,5 grows with the network size N as In Poisson networks instead they are a fixed number Independent on N G. Bianconi et al.PRE 2005

Cliques of size c Clique of size c is a fully connected subgraph of the network of c nodes and c(c-1)/2 links 1 This network contains 4 cliques of size 3 (triangles) and 1 clique of size 4 2 3 5 4

Small subgraphs appear abruptly when we increase the average number of links in the random graphs

Finite average connectivity Subgraph thresholds -1 -1/2 -1/3 -1/4 0 1/3 1/2  Poisson distribution Diverging average connectivity Finite average connectivity

Small subgraphs in SF networks  3 2 1 Small loops Of length L Cmax=3 Cliques of Size C G. Bianconi and M. Marsili (2004), (2005)

K-core of a network A K-core of a network is the subgraph of a network obtained by removing the nodes with connectivity ki<K iteratively until the network has only nodes with K-core of the Internet DIMES Internet data Visualization: Alvarez-Hamelin et al. 2005

K-cores on trees with given degree distributions Scale-free networks Maximal K of the K-cores Size of the minimal K-core S. Dorogovtsev, et al. PRL (2006) Carmi et al. PNAS (2007)

How to build a null model form a given network: swap of connections Choose two random links linking four distinct nodes If possible (not already existing links) swap the ends of the links Maslov & Sneppen 2002

Motifs The motifs are subgraphs which appear with higher frequency in real networks than in randomized networks In biological networks the motifs are told to be selected by evolution and are relevant to understand the function of the network. Motifs in the transcriptome network of e.coli. S.S. Shen-Orr, et al., 2002 Milo et al. 2002

Motifs of size 3 in directed networks The number of distinguishable possible motifs increases exponentially with the motif size limiting the extension of this method to large subgraphs

Specific characteristics of a network: communities Dolphins social network High-school dating networks A community of a network define a set of nodes with similar connectivity pattern. S. Fortunato Phys. Rep. 2010

Girvan and Newman algorithm The algorithm The betweenness of all existing edges in the network is calculated first. The edge with the highest betweenness is removed. The betweenness of all edges affected by the removal is recalculated. Steps 2 and 3 are repeated until no edges remain. Girvan and Newman PNAS 2002

Modularity Assign a community si=1,2,…K to each node, The modularity Q is given by Tight communities can be found by maximizing the modularity function Newman PNAS 2006

Cliques for community detection Overlapping set of cliques can be helpful to identify community structures in different networks For this method to work systematically networks must have many cliques (ex. Scale-free networks) Palla et al. Nature (2005)

Weight distribution and weight-degree correlations The weights of a network might correlate with the degree of the nodes which are linked by them. Strength Sk Is a measure of the inhomogeneities of the distribution of the weights among the nodes of the network. Disparity Y2k Is a measure of the inhomogeneities of the weights ending at a node of degree k.

Strength of the node and weight-degree correlations The strength of the nodes is the average weight of links connecting that node All weights distribute equally to the strength of the nodes Weight-degree correlations: Highly weighted links are connected to highly connected nodes

Coauthorship networks The coauthorship network is a scale-free network Power-law strength distribution and Linear dependence of strength versus degree A. Barrat et al. PNAS 2004

Airport network The airport network is a scale-free network With power-law strength distribution and The strength versus k show weight-degree correlations. A. Barrat et al. PNAS 2004

The Disparity All weights contribute equally There is one dominating link In many networks we observe indicating a hierarchy of weights

Metabolic Network The degree distribution of the metabolite projection of the metabolic network is scale-free. The distribution of the fluxes in metabolic networks (e.coli in succinate substrate) is power-law. The Y2(k) predicted by flux-balance analysis techniques is non trivial Almaas et al. Nature 2004

Extraction of sector information in financial markets Minimal-Spanning-Trees Planar maximally filtered graphs NYSE daily returns USA equity market 1995-98 Bonanno et al. (2003) Tumminello et al. (2007)

Why working on networks? Because NETWORKS encode for the ORGANIZATION, FUNCTION, ROBUSTENES AND DYNAMICAL BEHAVIOR of the entire complex system