Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.

Slides:



Advertisements
Similar presentations
Network analysis Sushmita Roy BMI/CS 576
Advertisements

Scale Free Networks.
The Architecture of Complexity: Structure and Modularity in Cellular Networks Albert-László Barabási University of Notre Dame title.
Analysis and Modeling of Social Networks Foudalis Ilias.
Online Social Networks and Media Navigation in a small world.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Network Measurements.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Biological Networks Feng Luo.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Global topological properties of biological networks.
Network Motifs Zach Saul CS 289 Network Motifs: Simple Building Blocks of Complex Networks R. Milo et al.
Advanced Topics in Data Mining Special focus: Social Networks.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Data Mining for Complex Network Introduction and Background.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
Statistical Properties of Massive Graphs (Networks) Networks and Measurements.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.
Information Networks Introduction to networks Lecture 1.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Complex Networks First Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Online Social Networks and Media
Analysis of biological networks Part III Shalev Itzkovitz Shalev Itzkovitz Uri Alon’s group Uri Alon’s group July 2005 July 2005.
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
Network Computing Laboratory The Structure and Function of Complex Networks (Episode I) M.E.J. Newman, Dept. of Physics, U. Michigan, presented.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Online Social Networks and Media Network Measurements.
Bioinformatics Center Institute for Chemical Research Kyoto University
How Do “Real” Networks Look?
Class 19: Degree Correlations PartII Assortativity and hierarchy
Introduction to complex networks Part I: Structure
Models and Algorithms for Complex Networks
Information Retrieval Search Engine Technology (10) Prof. Dragomir R. Radev.
Abstract Networks. WWW (2000) Scientific Collaboration Girvan & Newman (2002)
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Lecture II Introduction to complex networks Santo Fortunato.
Scale-free and Hierarchical Structures in Complex Networks L. Barabasi, Z. Dezso, E. Ravasz, S.H. Yook and Z. Oltvai Presented by Arzucan Özgür.
Lecture 23: Structure of Networks
Structures of Networks
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
Biological Networks Analysis Degree Distribution and Network Motifs
Section 8.6 of Newman’s book: Clustering Coefficients
How Do “Real” Networks Look?
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Network Science: A Short Introduction i3 Workshop
Clustering Coefficients
Lecture 23: Structure of Networks
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Models and Algorithms for Complex Networks Networks and Measurements Lecture 3

Types of networks  Social networks  Knowledge (Information) networks  Technology networks  Biological networks

Social Networks  Links denote a social interaction  Networks of acquaintances  collaboration networks actor networks co-authorship networks director networks  phone-call networks  networks  IM networks  Bluetooth networks  sexual networks  home page/blog networks

Knowledge (Information) Networks  Nodes store information, links associate information  Citation network (directed acyclic)  The Web (directed)  Peer-to-Peer networks  Word networks  Networks of Trust  Software graphs

Technological networks  Networks built for distribution of commodity  The Internet router level, AS level  Power Grids  Airline networks  Telephone networks  Transportation Networks roads, railways, pedestrian traffic

Biological networks  Biological systems represented as networks  Protein-Protein Interaction Networks  Gene regulation networks  Gene co-expression networks  Metabolic pathways  The Food Web  Neural Networks

Measuring Networks  Degree distributions  Small world phenomena  Clustering Coefficient  Mixing patterns  Degree correlations  Communities and clusters

The basic random graph model  The measurements on real networks are usually compared against those on “random networks”  The basic G n,p (Erdös-Renyi) random graph model:  n : the number of vertices  0 ≤ p ≤ 1  for each pair (i,j), generate the edge (i,j) independently with probability p

Degree distributions  Problem: find the probability distribution that best fits the observed data degree frequency k fkfk f k = fraction of nodes with degree k = probability of a randomly selected node to have degree k

Power-law distributions  The degree distributions of most real-life networks follow a power law  Right-skewed/Heavy-tail distribution  there is a non-negligible fraction of nodes that has very high degree (hubs)  scale-free: no characteristic scale, average is not informative  In stark contrast with the random graph model!  Poisson degree distribution, z=np  highly concentrated around the mean  the probability of very high degree nodes is exponentially small p(k) = Ck -α

Power-law signature  Power-law distribution gives a line in the log-log plot  α : power-law exponent (typically 2 ≤ α ≤ 3) degree frequency log degree log frequency α log p(k) = -α logk + logC

Examples Taken from [Newman 2003]

A random graph example

Exponential distribution  Observed in some technological or collaboration networks  Identified by a line in the log-linear plot p(k) = λe -λk log p(k) = - λk + log λ degree log frequency λ

Average/Expected degree  For random graphs z = np  For power-law distributed degree  if α ≥ 2, it is a constant  if α < 2, it diverges

Maximum degree  For random graphs, the maximum degree is highly concentrated around the average degree z  For power law graphs  Rough argument: solve nP[X≥k]=1

Collective Statistics (M. Newman 2003)

Clustering (Transitivity) coefficient  Measures the density of triangles (local clusters) in the graph  Two different ways to measure it:  The ratio of the means

Example

Clustering (Transitivity) coefficient  Clustering coefficient for node i  The mean of the ratios

Example  The two clustering coefficients give different measures  C (2) increases with nodes with low degree

Collective Statistics (M. Newman 2003)

Clustering coefficient for random graphs  The probability of two of your neighbors also being neighbors is p, independent of local structure  clustering coefficient C = p  when z is fixed C = z/n =O(1/n)

The C(k) distribution  The C(k) distribution is supposed to capture the hierarchical nature of the network  when constant: no hierarchy  when power-law: hierarchy degreek C(k) C(k) = average clustering coefficient of nodes with degree k

Millgram’s small world experiment  Letters were handed out to people in Nebraska to be sent to a target in Boston  People were instructed to pass on the letters to someone they knew on first-name basis  The letters that reached the destination followed paths of length around 6  Six degrees of separation: (play of John Guare)  Also:  The Kevin Bacon game  The Erdös number  Small world project:

Measuring the small world phenomenon  d ij = shortest path between i and j  Diameter:  Characteristic path length:  Harmonic mean  Also, distribution of all shortest paths

Collective Statistics (M. Newman 2003)

Is the path length enough?  Random graphs have diameter  d=logn/loglogn when z=ω(logn)  Short paths should be combined with other properties  ease of navigation  high clustering coefficient

Degree correlations  Do high degree nodes tend to link to high degree nodes?  Pastor Satoras et al.  plot the mean degree of the neighbors as a function of the degree

Degree correlations  Newman  compute the correlation coefficient of the degrees of the two endpoints of an edge  assortative/disassortative

Collective Statistics (M. Newman 2003)

Connected components  For undirected graphs, the size and distribution of the connected components  is there a giant component?  For directed graphs, the size and distribution of strongly and weakly connected components

Network Resilience  Study how the graph properties change when performing random or targeted node deletions

Graph eigenvalues  For random graphs  semi-circle law  For the Internet (Faloutsos 3 )

Motifs  Most networks have the same characteristics with respect to global measurements  can we say something about the local structure of the networks?  Motifs: Find small subgraphs that over- represented in the network

Example  Motifs of size 3 in a directed graph

Finding interesting motifs  Sample a part of the graph of size S  Count the frequency of the motifs of interest  Compare against the frequency of the motif in a random graph with the same number of nodes and the same degree distribution

Generating a random graph  Find edges (i,j) and (x,y) such that edges (i,y) and (x,j) do not exist, and swap them  repeat for a large enough number of times i j x y G i j x y G-swapped degrees of i,j,x,y are preserved

The feed-forward loop  Over-represented in gene-regulation networks  a signal delay mechanism X YZ Milo et al. 2002

Families of networks  Compute the relative frequency of different motifs, and group the networks if they exhibit similar frequencies

Experiments Milo et al. 2004

References  M. E. J. Newman, The structure and function of complex networks, SIAM Reviews, 45(2): , 2003  R. Albert and A.-L. Barabási, Statistical mechanics of complex networks, Reviews of Modern Physics 74, (2002).  S. N. Dorogovstev and J. F. F. Mendez, Evolution of Networks: From Biological Nets to the Internet and WWW.  Michalis Faloutsos, Petros Faloutsos and Christos Faloutsos. On Power-Law Relationships of the Internet Topology. ACM SIGCOMM  E. Ravasz, A. L. Somera, D. A. Mongru, Z. N. Oltvai, and A.-L. Barabási, Hierarchical organization of modularity in metabolic networks, Science 297, (2002).  R Milo, S Shen-Orr, S Itzkovitz, N Kashtan, D Chklovskii & U Alon, Network Motifs: Simple Building Blocks of Complex Networks. Science, 298: (2002).  R Milo, S Itzkovitz, N Kashtan, R Levitt, S Shen-Orr, I Ayzenshtat, M Sheffer & U Alon, Superfamilies of designed and evolved networks. Science, 303: (2004).