Download presentation
Presentation is loading. Please wait.
Published byMarcus Pearson Modified over 9 years ago
1
1 Spectral Analysis of Power-Law Graphs and its Application to Internet Topologies Milena Mihail Georgia Tech
2
2 The Internet Phenomenon Routers WWW P2P Open Decentralized Dynamic Market Competition Security, Privacy Paradigm Shift : Networks as Artifacts that we construct. Networks as Phenomena that we study !
3
3 Internet Performance Congestion (TCP/IP, ) Stability (Game Theory, ) Scalability (TCP ? Moore’s Law ?) WWW, P2P : Index, Search Van Jacobson 88 Kelly 99 (Kleinberg 97, Google 98)
4
4 Required Data & Models Routers WWW P2P Connectivity Capacity Traffic / Demand Internet Models, such as GT-ITM, Brite, Inet, for Analytic & Simulation based studies : How do elements organize ?
5
5 The Internet Phenomenon Routers WWW P2P Open Decentralized Dynamic Market Competition Security, Privacy Paradigm Shift : Networks as Artifacts that we construct. Networks as Phenomena that we study !
6
6 Level of Autonomous Systems SprintAT&T Georgia Tech CNN Topology data from BGP routing tables, collected by NLANR, looking glass -U. Oregon Decentralized Routing !
7
7 The AS Graph ~14K nodes in 2002 ( ~2K nodes in 1997) ~30K links in 2002 Georgia Tech CNN AT&T Sprint
8
8 The Directed AS Graph Georgia Tech CNN AT&T Sprint Peering Relationships : Customer – Provider Peers Gao 00, Subramanian et al 01 Five Tier Hierarchy Subramanian et al 01
9
9 The Real AS Graph CAIDA http://www.caida.org
10
10 Degree-Frequency Power Law Faloutsos et al degree 134510 frequency 2100
11
11 Rank-Degree Power Law rank degree 1234510 Faloutsos et al 99 UUNET Sprint C&WUSA AT&T BBN
12
12 Eigenvalue Power Law rank eigenvalue 1234510 Faloutsos et al 99
13
13 Eigenvalue Power Law rank eigenvalue 1234510 Faloutsos et al 99 UUNET Sprint C&WUSA AT&T BBN
14
14 Heavy Tailed Degree Distribution Departure from standard Internet Models such as Waxman, Transit-Stub Zegura et al 95 Models and techniques must be revisited Degrees not concentrated around mean Highly irregular graphs Departure from Erdos-Renyi Sharp concentration around mean, Exponential Tails
15
15 Power Law Graphs Which primitives drive their evolution ? Preferential attachment, Barabasi 99, Bollobas et al 00 Multiobjective Optimization, -------------------------------------- -----------------------------------------------Carlson & Doyle 00, Papadimitriou 02 What are their structural properties ? Hierarchy, Subramanian et al 01, Govindran et al 02 Clustering
16
16 Spectral Analysis of Matrices Examines eigenvalues and eigenvectors. Useful analogy to signal processing. All eigenvectors form a basis (complete representation). Focus on large eigenvalues and the corresponding eigenvectors. Pervasive in Algebra : Representation Theory Algorithms : Markov chain sampling Complexity : Expanders, Pseudorandomness Datamining, Information Retrieval Highly technical application specific adaptations
17
17 0. Spectral primitives : eigenvalues and eigenvectors. 1. Eigenvectors ~ Significance, hence HIERARCHY ( capacity / load ) 2. Eigenvectors ~ Clustering CLUSTERING impacts CONGESTION (1.) and (2.) use normalization preprocessing of the data 3. On Eigenvectors of Eigenvalue Power Law 4. Further Directions Outline of Results in this Talk
18
18 Eigenvectors & Eigenvalues 0 1 0 1 1 0 1 0 1 0 0 1 0 1 0 1 0 1 1 0 0 1 0 1 0 1 1 0 1 0 A = Axx=
19
19 Matrix as a Linear Transformation 0 1 0 1 1 0 1 0 1 0 0 1 0 1 0 1 0 1 1 0 0 1 0 1 0 1 1 0 1 0 A = 52 1 0 1 1 46 3 46 7 Matrix as a Linear Transformation
20
20 52 1 0 1 1 46 3 46 7 1912 17 15 13 4051 40 4447 51 Step 0 Step 1 Step 2 Step 3
21
21 1 1 11 1 1 = 3 3 3 33 3 39 9 99 9 9
22
22 Stochastic Normalization 0 1/3 0 1/3 1/3 0 1/3 0 1/3 0 0 1/3 0 1/3 0 1/3 0 1/3 1/3 0 0 1/3 0 1/3 0 1/3 1/3 0 1/3 0 0 1/3 1/3 0 0 1/3 A = Axx=
23
23 The Random Walk Eigenvalues between 1 and –1 ( 1 and 0 also easy). 1 0 0 00 0 1/3 0 0 0 2/91/9 2/9
24
24 In undirected graphs the weights of the principal eigenvector are proportional to degrees. 1. Principal Eigenvector ~ Significance Principal Eigenvector is Stationary Distribution, corresponding to to = 1 1/6 3/16 2/16 3/16 2/16 3/16
25
25 1. Principal Eigenvector ~ Significance In directed graphs the weights of the principal eigenvector can vary way beyond degrees. 10^-42*10^-45*10^-4 0.240.39
26
26 1. Hierarchy from Principal Eigenvector of Directed AS Graph Significance by High Degree Significance by Significant Peers and Customers Add 5% prob. Uniform jump to avoid sinks In WWW : Google’s pagerank
27
27 1. Principal Eigenvector Ranking
28
28 Eigenvector vs Five Tiers
29
29 1. Principal Eigenvector Ranking
30
30 0. Spectral primitives : eigenvalues and eigenvectors. 1. Eigenvectors ~ Significance, hence HIERARCHY ( capacity / load ) 2. Eigenvectors ~ Clustering CLUSTERING impacts CONGESTION (1.) and (2.) use normalization preprocessing of the data 3. On Eigenvectors of Eigenvalue Power Law 4. Further Directions Outline of Results in this Talk
31
31 2. Eigenvectors ~ Clustering 1/6 = 1
32
32 2. Eigenvectors ~ Clustering 1/6
33
33 2. Eigenvectors ~ Clustering 1/6 - -- - - -
34
34 2. Eigenvectors ~ Clustering 1/6
35
35 2. Eigenvectors ~ Clustering 1/6
36
36 2. Eigenvectors ~ Clustering 1/6 - -- - - - ~ ~ ~ ~ ~ ~ ~~ ~~ ~~ Matrix Perturbation Theory
37
37 Spectral Filtering 1 2 3 n K+1 k =1 > > >>> > Find clusters in most positive and most negative ends of eigenvectors associated with large eigenvalues. Heuristic : ( Kleinberg 97, Fiat et al 01 )
38
38 2. Eigenvectors ~ Clustering Weight of eigenvector k rank 1 2 3 n K+1 k =1 > > >>> >
39
An Example of a Cluster
44
44 Additional Matrices Similarity Matrix A*A^T, where A is directed AS graph. Complete and Pruned AS topology. In all cases prune leaves of very big degree nodes. Necessary frequency normalization. Clusters consistent and evolving over time. Synthetic Internet topologies have much weaker clustering properties.
45
45 Clustering and Congestion Assume 1 unit of traffic between each pair of ASes in each direction. Route traffic in the graph (like BGP). Compute # of connections using each link. This is a measure of congestion.
46
46 Effect of intra-cluster and inter-cluster traffic to most congested link InternetInet Internet Inet 0%100% 0%100% 20%91.5%97.7%20%126%109% 40%83.1%95.3%40%153%117% 60%74.2%92.9%60%172%128% 80%65.7%90.8%80%191%136% 100%57.3%88.5%100%207%142%
47
47 Outline of Results in this Talk 1. Eigenvectors ~ Significance, hence HIERARCHY ( capacity / load ) 2. Eigenvectors ~ Clustering CLUSTERING impacts CONGESTION (1) and (2) used normalization preprocessing of the data Normalization preprocessing of data is necessary. 3. Eigenvectors of Eigenvalue Power Law LOCALIZED 4. Further Directions
48
48 Which Eigenvectors correspond to Eigenvalue Power Law ? rank eigenvalue 1234510 Faloutsos et al 99 UUNET Sprint C&WUSA AT&T BBN
49
49 Large Degrees, or“Stars” of AS Graph Dominate Spectrum of Adjacency Matrix, Prior to Normalization
50
50 Principal Eigenvector of a Star 1 1 1 1 1 1 1 1 d
51
51 Disjoint Stars
52
52 “Mostly” Disjoint Stars Proof By Matrix Perturbation Theory, Spectral Graph Theory
53
53 3. Explanation of Eigenvalue Power Law Theorem : Random graphs whose largest degrees are, in expectation, d_1 > d_2 > … > d_k, d_j ~ j ^ -a have largest eigenvalues sharply concentrated around _j ~ j ^ -b for j = 1,…,k, and corresponding eigenvectors localized on corresponding largest degrees, with very high probability.
54
54 Summary 0. First Spectral Analysis on Internet Topologies. 1. PRINCIPAL EIGENVECTOR implies HIERARCHY 2. EIGENVECTORS of LARGE EIGENVALUES imply CLUSTERING 3. CLUSTERING impacts CONGESTION 4. Defined Intra-cluster and Inter-cluster Traffic. 5. Introduced Heady Tailed Specific Normalization Preprocessing. 6. Explained Eigenvalue Power Law. (1)Through (5) with C. Gkantsidis and E. Zegura (6) with C. Papadimitriou
55
55 Further Directions How does congestion scale in power law graphs ? Other properties, such as resilience. What are the growth primitives of power law graphs ? Optimization tradeoff primitives translate to cost – service Towards efficient and accurate synthetic data. Level of Autonomous Systems : Routing protocol (BGP) stability by game theory.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.