Generative Models for the Web Graph José Rolim. Aim Reproduce emergent properties: –Distribution site size –Connectivity of the Web –Power law distriubutions.

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
RESILIENCE NOTIONS FOR SCALE-FREE NETWORKS GUNES ERCAL JOHN MATTA 1.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Analysis of Social Media MLD , LTI William Cohen
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
Information Retrieval Lecture 8 Introduction to Information Retrieval (Manning et al. 2007) Chapter 19 For the MSc Computer Science Programme Dell Zhang.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
1 A Random-Surfer Web-Graph Model (Joint work with Avrim Blum & Hubert Chan) Mugizi Rwebangira.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
CS728 Lecture 5 Generative Graph Models and the Web.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
WEB GRAPHS (Chap 3 of Baldi) Wen-Hsiang Lu ( 盧文祥 ) Department of Computer Science and Information Engineering, National Cheng Kung University 2005/10/6.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Common Properties of Real Networks. Erdős-Rényi Random Graphs.
CS Lecture 6 Generative Graph Models Part II.
1 On Compressing Web Graphs Michael Mitzenmacher, Harvard Micah Adler, Univ. of Massachusetts.
Advanced Topics in Data Mining Special focus: Social Networks.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Computer Science 1 Web as a graph Anna Karpovsky.
Modeling the Internet and the Web School of Information and Computer Science University of California, Irvine WEB GRAPHS.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Analysis of Social Media MLD , LTI William Cohen
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
3. SMALL WORLDS The Watts-Strogatz model. Watts-Strogatz, Nature 1998 Small world: the average shortest path length in a real network is small Six degrees.
Percolation Processes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopPercolation Processes1.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Comparison of Tarry’s Algorithm and Awerbuch’s Algorithm CS 6/73201 Advanced Operating System Presentation by: Sanjitkumar Patel.
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network models Tamer Kahveci.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Models of Web-Like Graphs: Integrated Approach
Network (graph) Models
Random Walk for Similarity Testing in Complex Networks
Lecture 1: Complex Networks
Topics In Social Computing (67810)
Minimum Spanning Tree 8/7/2018 4:26 AM
Generative Model To Construct Blog and Post Networks In Blogosphere
Models of Network Formation
Lecture 13 Network evolution
The likelihood of linking to a popular website is higher
Models of Network Formation
Peer-to-Peer and Social Networks Fall 2017
Models of Network Formation
Models of Network Formation
Peer-to-Peer and Social Networks
Lecture 21 Network evolution
Network Science: A Short Introduction i3 Workshop
Network Models Michael Goodrich Some slides adapted from:
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Generative Models for the Web Graph José Rolim

Aim Reproduce emergent properties: –Distribution site size –Connectivity of the Web –Power law distriubutions –Small World Properties

Classical Model Random Graphs Erdos-Renyi Graph G(n,p) – n number of nodes – p probability of connextion pc threshold probability – p < pc -many disconnected components – p=pc - a large connected component – p=1 – a complete graph

Limitations To model the web graph: –Constant number of nodes –Same probability among sommets – etc, etc

Web Page Growth Model Sites with short term (daily) size fluctuations proportional to their size Assume an overall growth rate a such that: – S(t+1)=a(1+vb)S(t) – S(t)= # pages of site s at time t – v=+-1 – Bernouilli variable avec prob. 0.5 – b= absolute rate of daily fluctuations

Web Page Growth Donc: S(T)= a T S(0) π T 0 (1+n i b) ou: logS(T)=Tloga+logS(0)+Σ T 0 log(1+n i b)= l log(1+b)+(T-l)log(1-b) –l= # positive fluctuations Therefore: S(T) has a lognormal distribution or follows a power law:

Web page growth Probability P(s) of a site to have s pages: –P(S)=Σ i P(s/b i )P(b i )= Σ i c i /S gi = c/S g –Power Law g has been experimentaly evaluated for the web as between 1.6 and 2.0

Small world models Properties: –Sparse –Cliquishness –Small Diameter Two models –Edge-reassigning small world network –Edge addition small world network

Edge reassigning model Evolution starts with a ring of n nodes and each node connected to d nearest neighbors Then each edge is randomly reassigned to distant nodes with probability p in a round robin fashion See example page 10 with n=10 and d=4

Edge addition model At the original ring additional edges are added randomly giving an expected number –p.d.n/2 new edges –p probability of addition of an edge –See example page 13 Criticism to small world: –No newpages neither deletion of pages –No deletion of links

Rich get richer Preferential attachement model Start with a null graph with no nodes At each time step add a new node and connect it to m nodes selected randomly with probability proportional to their degree See ex. page 16

Important measures Average diameter Cliquishness ( measure the average density of local connections): –Take a node v sith degree d –Its d neighbors have max=d.(d-1)/2 links –Let c v =real number of links / max –C= Σ v c v /V.

Remarks on rich get richer Reproduces the power law of number of links. Eg: the probability of a page i to have degree di is A/di c –A is proportional to the square of the network – c is a constant – c was found empirically to be 2.9 and theoretically 3

Criticism on Rich Get Richer Does not allow reconnection of existing edges Addition of new edges take place only when new nodes are added

Copy models At each time step a node is added –With prob. p a new edge is created between this node and a randomly chosen node –With prob. 1-p: we choose randomly a node and uniformly one of the out edges and we link the new node to the node that this chosen edge enters.

Remarks Why is called copy? There are more elaborated models which allow addition of more than a edge each time It is also a sort of « rich get richer »

Applications Distributed search algorithms Subgraph patterns and communities Robusteness and vulnerability Page rank algorithms