DK-series: Systematic Topology Analysis and Generation Using Degree Correlations Dmitri Krioukov P. Mahadevan, K. Fall, and A. Vahdat LIAFA,

Slides:



Advertisements
Similar presentations
Clustering.
Advertisements

Hidden Metric Spaces and Navigability of Complex Networks
Sampling Research Questions
Evaluating Provider Reliability in Risk-aware Grid Brokering Iain Gourlay.
Correctness of Gossip-Based Membership under Message Loss Maxim GurevichIdit Keidar Technion.
Chapter 7 Sampling and Sampling Distributions
University of Nevada, Reno Router-level Internet Topology Mapping CS790 Presentation Modified from Dr. Gunes slides by Talha OZ.
Discrete time Markov Chain
David Luebke 1 6/7/2014 ITCS 6114 Skip Lists Hashing.
The Frequency Table or Frequency Distribution Table
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
1 Capacity analysis of mesh networks with omni or directional antennas Jun Zhang and Xiaohua Jia City University of Hong Kong.
Communications Research Centre (CRC) Defence R&D Canada – Ottawa 1 Properties of Mobile Tactical Radio Networks on VHF Bands Li Li & Phil Vigneron Communications.
1 Analysis of Random Mobility Models with PDE's Michele Garetto Emilio Leonardi Politecnico di Torino Italy MobiHoc Firenze.
Scale Free Networks.
Routing and Congestion Problems in General Networks Presented by Jun Zou CAS 744.
Face Alignment by Explicit Shape Regression
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Introduction to Ad-hoc & Sensor Networks Security In The Name of God ISC Student Branch in KNTU 4 th Workshop Ad-hoc & Sensor Networks.
 Over-all: Very good idea to use more than one source. Good motivation (use of graphics). Good use of simplified, loosely defined -- but intuitive --
Based on slides by Y. Peng University of Maryland
Fast Algorithms For Hierarchical Range Histogram Constructions
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.
It’s a Small World by Jamie Luo. Introduction Small World Networks and their place in Network Theory An application of a 1D small world network to model.
The Connectivity and Fault-Tolerance of the Internet Topology
1 2.5K-Graphs: from Sampling to Generation Minas Gjoka, Maciej Kurant ‡, Athina Markopoulou UC Irvine, ETZH ‡
Construction of Simple Graphs with a Target Joint Degree Matrix and Beyond Minas Gjoka, Balint Tillman, Athina Markopoulou University of California, Irvine.
STAT 497 APPLIED TIME SERIES ANALYSIS
Scale Free Networks Robin Coope April Abert-László Barabási, Linked (Perseus, Cambridge, 2002). Réka Albert and AL Barabási,Statistical Mechanics.
1 Epidemic Spreading in Real Networks: an Eigenvalue Viewpoint Yang Wang Deepayan Chakrabarti Chenxi Wang Christos Faloutsos.
1 Learning Entity Specific Models Stefan Niculescu Carnegie Mellon University November, 2003.
Before start… Earlier work single-path routing in sensor networks
Probabilistic Data Aggregation Ling Huang, Ben Zhao, Anthony Joseph Sahara Retreat January, 2004.
Presented by Ozgur D. Sahin. Outline Introduction Neighborhood Functions ANF Algorithm Modifications Experimental Results Data Mining using ANF Conclusions.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Navigability of Networks Dmitri Krioukov CAIDA/UCSD M. Boguñá, M. Á. Serrano, F. Papadopoulos, M. Kitsak, A. Vahdat, kc claffy May, 2010.
How could this have been avoided?. Today General sampling issues Quantitative sampling Random Non-random Qualitative sampling.
Probability and Statistics in Engineering Philip Bedient, Ph.D.
Introduction to compact routing Dmitri Krioukov UCSD/CAIDA IDRWS 2004.
COVERTNESS CENTRALITY IN NETWORKS Michael Ovelgönne UMIACS University of Maryland 1 Chanhyun Kang, Anshul Sawant Computer Science Dept.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
The Erdös-Rényi models
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
Traceroute-like exploration of unknown networks: a statistical analysis A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, France) L.
Efficient Gathering of Correlated Data in Sensor Networks
Popularity versus Similarity in Growing Networks Fragiskos Papadopoulos Cyprus University of Technology M. Kitsak, M. Á. Serrano, M. Boguñá, and Dmitri.
Complex network geometry and navigation Dmitri Krioukov CAIDA/UCSD F. Papadopoulos, M. Kitsak, kc claffy, A. Vahdat M. Á. Serrano, M. Boguñá UCSD, December.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Percolation in self-similar networks Dmitri Krioukov CAIDA/UCSD M. Á. Serrano, M. Boguñá UNT, March 2011.
Network Characterization via Random Walks B. Ribeiro, D. Towsley UMass-Amherst.
CS 4700 / CS 5700 Network Fundamentals Lecture 17: Network Modeling (Not Everyone has a Datacenter)
Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.
Understanding Crowds’ Migration on the Web Yong Wang Komal Pal Aleksandar Kuzmanovic Northwestern University
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
Click to Edit Talk Title USMA Network Science Center Specific Communication Network Measure Distribution Estimation Daniel.
Introduction to compact routing Dmitri Krioukov UCSD/CAIDA 3 rd CAIDA-WIDE Workshop.
Cooperative Location- Sensing for Wireless Networks Authors : Haris Fretzagias Maria Papadopouli Presented by cychen IEEE International Conference on Pervasive.
Percolation in self-similar networks PRL 106:048701, 2011
Brief Announcement : Measuring Robustness of Superpeer Topologies Niloy Ganguly Department of Computer Science & Engineering Indian Institute of Technology,
Scaling Properties of the Internet Graph Aditya Akella, CMU With Shuchi Chawla, Arvind Kannan and Srinivasan Seshan PODC 2003.
Distributed cooperation and coordination using the Max-Sum algorithm
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Degree-based graph construction & Sampling
Lecture 1: Complex Networks
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks Sushma Maramreddy.
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Intra-Domain Routing Jacob Strauss September 14, 2006.
Presentation transcript:

dK-series: Systematic Topology Analysis and Generation Using Degree Correlations Dmitri Krioukov P. Mahadevan, K. Fall, and A. Vahdat LIAFA, June 23 d, 2006

Motivation: topology analysis and generation New routing and other protocol design, development, testing, etc. Analysis: performance of a routing algorithm strongly depends on topology, the recent progress in routing theory has become topology analysis Generation: empirical estimation of scalability: new routing might offer X- time smaller routing tables for today but scale Y-time worse, with Y >> X Network robustness, resilience under attack, worm spreading, etc. Traffic engineering, capacity planning, network management, etc. In general: “what if”, predictive power, evolution

Network topology research

Important topology metrics Spectrum Distance distribution Betweenness distribution Degree distribution Assortativity Clustering

Problems No way to reproduce most of the important metrics No guarantee there will not be any other/new metric found important

Our approach Look at inter-dependencies among topology characteristics See if by reproducing most basic, simple, but not necessarily practically relevant characteristics, we can also reproduce (capture) all other characteristics, including practically important Try to find the one(s) defining all others

Outline Introduction dK-*: dK-distributions dK-series dK-graphs dK-randomness dK-explorations Construction Evaluation Conclusion

The main observation Graphs are structures of connections between nodes

dK-distributions as a series of graphs’ connectivity characteristics

0K Average degree

1K Degree distribution P(k)

2K Joint degree distribution P(k 1,k 2 )

3K “Joint edge degree” distribution P(k 1,k 2,k 3 )

3K, more exactly

4K

Definition of dK-distributions dK-distributions are degree correlations within simple connected graphs of size d

Definition of dK-series P d Given some graph G, graph G' is said to have property P d if G'’s dK-distribution is the same as G’s

Definition of dK-graphs dK-graphs are graphs having property P d

Nice properties of properties P d Constructability: we can construct graphs having properties P d (dK-graphs) Inclusion: if a graph has property P d, then it also has all properties P i, with i < d (dK- graphs are also iK-graphs) Convergence: the set of graphs having property P n consists only of one element, G itself (dK-graphs converge to G)

Convergence… …guarantees that all (even not yet defined!) graph metrics can be captured by sufficiently high d

Inclusion and dK-randomness 2K X 0K 0K-random 1K Given G 1K-random nK 2K-random

dK-explorations To identify the minimum d, s. t. dK(-random) graphs provide a sufficiently accurate approximation of G: Find simple (scalar) metrics that are defined by P d+1 but not by P d and construct dK-nonrandom-graphs with extreme (max or min) values of these metrics There are two extreme metrics of this type correlations of degrees of nodes at distance d concentration of d-simplices If differences between these dK-exotic graphs are small, then d is high enough

2K-exploration example Clustering S2S2 Low High Random clustering, random S 2 (2K-random) Random clustering, low S 2 High clustering, random S 2 Random clustering, high S 2 Low clustering, random S 2 2K graphs

dK-summary

Constructability Introduction dK-* Construction Stochastic Pseudograph Matching Rewiring dK-randomizing dK-targeting Evaluation Conclusion

Stochastic approach Classical (Erdos-Renyi) random graphs are 0K-random graph in the stochastic approach Easily generalizable for any d: Reproduce the expected value of the dK- distributions by connecting random d-plets of nodes with (conditional) probabilities extracted from G Best for theory Worst in practice

Pseudograph approach Reproduces dK-distributions exactly Constructs not necessarily connected pseudographs Extended for d = 2 Failed to generalize for d > 2: d-sized subgraphs start overlap over edges at d = 3

Pseudograph details 1K 1. dissolve graph into a random soup of nodes 2. crystallize it back 2K 1. dissolve graph into a random soup of edges 2. crystallize it back k1k1 k2k2 k1k1 k2k2 k3k3 k4k4 k1k1 k1k1 k 1 k 1 -ends

3K-pseudograph failure 1. dissolve graph into a random soup of d-plets 2. cannot crystallize it back

Matching approach Pseudograph + badness (loop) avoidance Extended for d = 2, but loop avoidance is difficult Failed to generalize for d > 2

Rewiring Generalizable for any d Works in practice

dK-randomizing rewiring dK-preserving random rewiring

dK-targeting rewiring d'K-preserving rewiring (d' < d) moving graph closer to dK dK-distance D d can be any non-negative scalar metric measuring the difference between the current and target values of the dK-distribution (e.g., the sum of squares of differences in numbers of d-sized subgraphs) Normally, accept a rewiring only if ∆D d ≤ 0 To check ergodicity: accept a rewiring even if ∆D d > 0 with probability exp(−∆D d /T) T → ∞: d'K-randomizing rewiring T → 0: dK-targeting rewiring start with a high temperature and gradually cool down the system

Outline Introduction dK-* Construction Evaluation Algorithms Topologies skitter HOT Conclusion

Algorithms All algorithms deliver consistent results for d = 0 All algorithms, except stochastic(!), deliver consistent results for d = 1 and d = 2 Both rewiring algorithms deliver consistent results for d = 3

skitter scalar metrics Metric0K1K2K3Kskitter r d dd n

skitter distance distribution

skitter clustering

HOT scalar metrics Metric0K1K2K3KHOT r d dd n

HOT distance distribution

HOT betweenness distribution

HOT 0K-porn

HOT 1K-porn

HOT 2K-porn

HOT 3K-porn

HOT HOT porn

HOT dK-porn 0K1K2K 3K HOT

Outline Introduction dK-* Construction Evaluation Conclusion

Conclusions Analysis: inter-dependencies among topology metrics and connections between local and global structure continuous and discrete worlds equilibrium and non-equilibrium models (if a topology is dK-random, its evolution models need to explain just the dK-distribution) Generation: topology generator with arbitrary level of accuracy