CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.

Slides:



Advertisements
Similar presentations
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Advertisements

Analysis and Modeling of Social Networks Foudalis Ilias.
Week 5 - Models of Complex Networks I Dr. Anthony Bonato Ryerson University AM8002 Fall 2014.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
Generative Models for the Web Graph José Rolim. Aim Reproduce emergent properties: –Distribution site size –Connectivity of the Web –Power law distriubutions.
SILVIO LATTANZI, D. SIVAKUMAR Affiliation Networks Presented By: Aditi Bhatnagar Under the guidance of: Augustin Chaintreau.
Universal Random Semi-Directed Graphs
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
The influence of search engines on preferential attachment Dan Li CS3150 Spring 2006.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
A Brief History of Lognormal and Power Law Distributions and an Application to File Size Distributions Michael Mitzenmacher Harvard University.
On the Spread of Viruses on the Internet Noam Berger Joint work with C. Borgs, J.T. Chayes and A. Saberi.
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
| May 2003 | Almaden Research Center, San Jose, CA © 2003 IBM Corporation IMA Tutorial (part III): Generative and probabilistic models of data May 5, 2003.
Network Design IS250 Spring 2010 John Chuang. 2 Questions  What does the Internet look like? -Why do we care?  Are there any structural invariants?
Network Statistics Gesine Reinert. Yeast protein interactions.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
AITIA International Inc. and Lorand Eotvos University, Budapest ROBUST NETWORKS FROM LOCAL OPTIMIZATION A Bottom-Up Model to Generate Networks with Skewed.
CS Lecture 6 Generative Graph Models Part II.
Advanced Topics in Data Mining Special focus: Social Networks.
SDSC, skitter (July 1998) A random graph model for massive graphs William Aiello Fan Chung Graham Lincoln Lu.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Computer Science 1 Web as a graph Anna Karpovsky.
1 Dynamic Models for File Sizes and Double Pareto Distributions Michael Mitzenmacher Harvard University.
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
The Erdös-Rényi models
Information Networks Power Laws and Network Models Lecture 3.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
1 Algorithmic Performance in Complex Networks Milena Mihail Georgia Tech.
Models and Algorithms for Complex Networks Power laws and generative processes.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
1 “Erdos and the Internet” Milena Mihail Georgia Tech. The Internet is a remarkable phenomenon that involves graph theory in a natural way and gives rise.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
1 “Expansion” in Power Law and Scale Free Graphs Milena Mihail Georgia Tech with Christos Gkantsidis, Christos Papadimitriou and Amin Saberi.
Percolation Processes Rajmohan Rajaraman Northeastern University, Boston May 2012 Chennai Network Optimization WorkshopPercolation Processes1.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
How Do “Real” Networks Look?
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
1 Milena Mihail Georgia Tech. Algorithmic Performance in Complex Networks.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Network (graph) Models
Random Walk for Similarity Testing in Complex Networks
Topics In Social Computing (67810)
How Do “Real” Networks Look?
Random Graph Models of large networks
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Models of Network Formation
Lecture 13 Network evolution
Models of Network Formation
Models of Network Formation
How Do “Real” Networks Look?
Models of Network Formation
Peer-to-Peer and Social Networks
Lecture 21 Network evolution
Modelling and Searching Networks Lecture 5 – Random graphs
Modelling and Searching Networks Lecture 6 – PA models
Network Science: A Short Introduction i3 Workshop
Shan Lu, Jieqi Kang, Weibo Gong, Don Towsley UMASS Amherst
Discrete Mathematics and its Applications Lecture 5 – Random graphs
Network Models Michael Goodrich Some slides adapted from:
Discrete Mathematics and its Applications Lecture 6 – PA models
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian

This lecture Probabilistic generative models for social networks (in particular web graph)

Why look for generative models? Designing and testing algorithms for the web  E.g.: Compressing the web graph Designing crawling strategies Search algorithms on P2P networks … Explaining why web has certain properties  For example, the central limit theorem tells us why we often see the Gaussian distribution in practice.  Is there a similar explanation for the power law distribution? Predicting what “might” happen in the future  E.g.: An AIDS epidemic? An Internet black out? A residential segregation?

Characteristics of a good model Simple Plausible Exihibits the observed properties  Power law  Small world  Locally dense, globally sparse

Power law distribution From last lecture: power laws everywhere!  Income distribution (Pareto 1896)  Word frequencies (Estoup 1916, Zipf 1932)  City population (Auerbach 1913, Zipf 1949)  Scientific productivity (Lotka 1926)  Internet graph degree dist (FFF 1999)  Web graph degree dist (BKMRRSTW 2000)  Dist. of file sizes  … Why?

Models and explanations for power law Optimization (“power law is the best design”)  Mandelbrot 1953: Zipf’s law is the most efficient design.  Carlson & Doyle 1999, Fabrikant et al (HOT) Monkeys typing randomly  Miller 1957: even a monkey typing randomly can generate a power law. Multiplicative processes & Log-normal dist.  Gibrat 1930, Champernowne 1955, Gabaix 1999 Preferential growth (“the rich get richer”)  Simon 1955, Yule 1925

Log-normal distribution Central limit Thm: Product of many indep. distributions is approximately log-normal.

Multiplicative process and power law Multiplicative processes can sometimes generate power law instead of log-normal:  Multiplicative process with a minimum Chambernowne 1953, Gabaix 1999  Random stopping time Montroll and Schlesinger 1982,1983

Preferential growth The system “grows”. The probability of a new member joining a group is proportional to its current size. Simon 1955, Yule 1925 (for biological systems) Barabasi and Albert 1999: preferential attachment for web graph

Random graph models Erdos-Renyi random graphs G(n,p)  n vertices, there is an edge between each pair independently with probability p. G(n,p) at a glance:  Average degree np. Binomial degree dist.  p < 1/n: union of small simple connected comp.  p > 1/n: a “giant” complex component emerges (still many small connected components)  p > ln(n)/n: connected.

The ACL model Proposed by Aiello, Chung, and Lu, Fix a degree sequence d (e.g., power law). Put d i copies of the i’th vertex. Pick a random matching. Contract the d i copies of the i’th vertex  Essentially a variant of G(n,p), with the degree distribution explicitly enforced.

Preferential attachment Start with a graph with one node. Vertices arrive one by one. When a vertex arrives, it connects itself to one (m, in general) of the previous vertices, with probability proportional to their degrees.

Preferential attachment Heuristic analysis (Barabasi-Albert): degree distribution follows a power law with exponent -3. Theorem (Bollobas, Riordan, Spencer, Tusnady). For d < n 1/16, the fraction of vertices that have degree d is almost surely around