Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.

Slides:



Advertisements
Similar presentations
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Advertisements

Power Laws By Cameron Megaw 3/11/2013. What is a Power Law?
Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
School of Information University of Michigan Network resilience Lecture 20.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
What did we see in the last lecture?. What are we going to talk about today? Generative models for graphs with power-law degree distribution Generative.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Network Measurements.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
The influence of search engines on preferential attachment Dan Li CS3150 Spring 2006.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
School of Information University of Michigan SI 614 Random graphs & power law networks preferential attachment Lecture 7 Instructor: Lada Adamic.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.
CS728 Lecture 5 Generative Graph Models and the Web.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
Mining and Searching Massive Graphs (Networks)
Network Statistics Gesine Reinert. Yeast protein interactions.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
CS Lecture 6 Generative Graph Models Part II.
Advanced Topics in Data Mining Special focus: Social Networks.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
1 A Random-Surfer Web-Graph Model Avrim Blum, Hubert Chan, Mugizi Rwebangira Carnegie Mellon University.
Summary from Previous Lecture Real networks: –AS-level N= 12709, M=27384 (Jan 02 data) route-views.oregon-ix.net, hhtp://abroude.ripe.net/ris/rawdata –
Computer Science 1 Web as a graph Anna Karpovsky.
1 Dynamic Models for File Sizes and Double Pareto Distributions Michael Mitzenmacher Harvard University.
Measurement and Evolution of Online Social Networks Review of paper by Ophir Gaathon Analysis of Social Information Networks COMS , Spring 2011,
Online Social Networks and Media Network models. What is a network model? Informally, a network model is a process (radomized or deterministic) for generating.
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
1 Worm Modeling and Defense Cliff C. Zou, Don Towsley, Weibo Gong Univ. Massachusetts, Amherst.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Models and Algorithms for Complex Networks Networks and Measurements Lecture 3.
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Models and Algorithms for Complex Networks Power laws and generative processes.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
“Adversarial Deletion in Scale Free Random Graph Process” by A.D. Flaxman et al. Hammad Iqbal CS April 2006.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Percolation Processes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopPercolation Processes1.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Models of Web-Like Graphs: Integrated Approach
On the behaviour of an edge number in a power-law random graph near a critical points E. V. Feklistova, Yu.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Network (graph) Models
Random Walk for Similarity Testing in Complex Networks
Topics In Social Computing (67810)
Random Graph Models of large networks
Peer-to-Peer and Social Networks
Modelling and Searching Networks Lecture 6 – PA models
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
What did we see in the last lecture?
Presentation transcript:

Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4

Announcement  The first assignment will be handed out after the spring break  The second assignment around middle of April  You should send me an about the type of project that you intend to do within the next week.

Power Laws - Recap  A (continuous) random variable X follows a power-law distribution if it has density function  A (continuous) random variable X follows a Pareto distribution if it has cumulative function  A (discrete) random variable X follows Zipf’s law if the frequency of the r-th largest value satisfies power-law with α=1+β power-law with α=1+1/γ

Power Laws – Generative processes  We have seen that power-laws appear in various natural, or man-made systems  What are the processes that generate power-laws?  Is there a “universal” mechanism?

Combination of exponentials  If variable Y is exponentially distributed  If variable X is exponentially related to Y  Then X follows a power law  Model for population of organisms

Monkeys typing randomly  Consider the following generative model for a language [Miller 57]  The space appears with probability q s  The remaining m letters appear with equal probability (1-q s )/m  Frequency of words follows a power law!  Real language is not random. Not all letter combinations are equally probable, and there are not many long words

Least effort principle  Let C j be the cost of transmitting the j-th most frequent word  The average cost is  The average information content is  Minimizing cost per information unit C/H yields

Critical phenomena  When the characteristic scale of a system diverges, we have a phase transition.  Critical phenomena happen at the vicinity of the phase transition. Power-law distributions appear

Percolation on a square lattice  Each cell is occupied with probability p  What is the mean cluster size?

Critical phenomena and power laws  For p < p c mean size is independent of the lattice size  For p > p c mean size diverges (proportional to the lattice size - percolation)  For p = p c we obtain a power law distribution on the cluster sizes p c = …

Self Organized Criticality  Consider a dynamical system where trees appear in each cell with probability p, and fires strike cells randomly  The system eventually stabilizes at the critical point, resulting in power-law distribution of cluster (and fire) sizes

Preferential attachment  The main idea is that “the rich get richer”  first studied by Yule for the size of biological genera  revisited by Simon  reinvented multiple times  Also known as  Gibrat principle  cumulative advantage  Mathew effect

The Yule process  A genus obtains species with probability proportional to its current size  Every m new species, the m+1 species creates a new genus  The sizes of genera follows a power law with

Preferential Attachment in Networks  First considered by [Price 65] as a model for citation networks  each new paper is generated with m citations (mean)  new papers cite previous papers with probability proportional to their indegree (citations)  what about papers without any citations? each paper is considered to have a “default” citation probability of citing a paper with degree k, proportional to k+1  Power law with exponent α = 2+1/m

Barabasi-Albert model  Undirected(?) model: each node connects to other nodes with probability proportional to their degree  the process starts with some initial subgraph  each node comes with m edges  Results in power-law with exponent α = 3

The LCD model [Bollobas-Riordan]  Self loops and multiple edges are allowed  A new vertex v, connects to a vertex u with probability proportional to the degree of u, counting the new edge.  The m edges are inserted sequentially, thus the problem reduces to studying the single edge problem

Linearized Chord Diagram  Consider 2n nodes labeled {1,2,…,2n} placed on a line in order.

Linearized Chord Diagram  Generate a random matching of the nodes.

Linearized Chord Diagram  Starting from left to right identify all endpoints until the first right endpoint. This is node 1. Then identify all endpoints until the second right endpoint to obtain node 2, and so on.

Linearized Chord Diagram  Uniform distribution over matchings gives uniform distribution over all graphs in the preferential attachment model

Linearized Chord Diagram  Uniform distribution over matchings gives uniform distribution over all graphs in the preferential attachment model  each supernode has degree proportional to the nodes it contains  consider adding a new chord, with the right endpoint being in the rightmost position and the left being placed uniformly

Preferential attachment graphs  Expected diameter  if m = 1, the diameter is Θ(log n)  if m > 1, the diameter is Θ(log n/loglog n)  Expected clustering coefficient

Weaknesses of the BA model  It is not directed (not good as a model for the Web) and when directed it gives acyclic graphs  It focuses mainly on the (in-) degree and does not take into account other parameters (out-degree distribution, components, clustering coefficient)  It correlates age with degree which is not always the case  Yet, it was a significant contribution in the network research  Many variations have been considered some in order to address the above problems  edge rewiring, appearance and disappearance  fitness parameters  variable mean degree  non-linear preferential attachment

Copying model  Each node has constant out-degree d  A new node selects uniformly one of the existing nodes as a prototype  For the i-th outgoing link  with probability α it copies the i-th link of the prototype node  with probability 1- α it selects the target of the link uniformly at random

An example

Copying model properties  Power law degree distribution with exponent β = (2-α)/(1- α)  Number of bipartite cliques of size i x d is ne -i  The model was meant to capture the topical nature of the Web  It has also found applications in biological networks

Other graph models  Cooper Frieze model  multiple parameters that allow for adding vertices, edges, preferential attachment, uniform linking  Directed graphs [Bollobas et al]  allow for preferential selection of both the source and the destination  allow for edges from both new and old vertices

Other interesting distributions  The log-normal distribution  The variable Y = log X follows a normal distribution

Lognormal distribution  Generative model: Multiplicative process  Central Limit Theorem: If X 1,X 2,…,X n are i.i.d. variables with mean m and finite variance s, then if S n = X 1 +X 2 +…+X n  When the multiplicative process has a reflective boundary, it gives a power law distribution

Double Pareto distribution  Run the multiplicative process for T steps, where T is an exponentially distributed random variable  Double Pareto: Combination of two Pareto distributions

References  M. E. J. Newman, The structure and function of complex networks, SIAM Reviews, 45(2): , 2003  M. E. J. Newman, Power laws, Pareto distributions and Zipf's law, Contemporary Physics  M. Mitzenmacher, A Brief History of Generative Models for Power Law and Lognormal Distributions, Internet Mathematics  R. Albert and L.A. Barabasi, Statistical Mechanics of Complex Networks, Rev. Mod. Phys. 74, (2002).  B. Bollobas, Mathematical Results in Scale-Free random Graphs