Boston University Computer Science
On The Marginal Utility of Network Topology Measurements
John Byers, with Paul Barford (now at Wisconsin), Azer Bestavros, and Mark Crovella

Measurement Philosophy
- Current dogma: when conducting a wide-area measurement study, “more is better.”
  - More = measurements
  - More = measurement sites
- True, but taking more measurements and deploying more infrastructure is expensive!
- Our focus: How much better is more?
  - Even harder: When can we stop measuring?
- Not much work on this topic in our community.

Problem Instance: Discovering Internet Topology
- Typical goal: discover the router-level Internet graph
- Typical approach: merge lists of known nodes and edges
- Traceroute reports the IP path from A to B
  - i.e., how IP paths are overlaid on the router graph
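
The "merge lists of known nodes and edges" step can be sketched as follows. This is a minimal illustration, not the authors' tooling: it assumes each traceroute is already available as an ordered list of hop identifiers, and the name `merge_paths` and the sample hops are hypothetical.

```python
def merge_paths(paths):
    """Merge traceroute paths into one set of nodes and one set of links.

    Each path is assumed to be an ordered list of hop identifiers
    (hypothetical input format, for illustration only).
    """
    nodes, edges = set(), set()
    for path in paths:
        nodes.update(path)
        # Each pair of consecutive hops is recorded as an undirected link.
        for a, b in zip(path, path[1:]):
            if a != b:  # ignore a hop that is repeated back to back
                edges.add(frozenset((a, b)))
    return nodes, edges


# Two paths from the same source that share their first hop.
paths = [["A", "r1", "r2", "B"], ["A", "r1", "r3", "C"]]
nodes, edges = merge_paths(paths)
print(len(nodes), len(edges))
```

Storing each link as a `frozenset` makes the merge insensitive to the direction in which a link was traversed, which is one possible design choice among several.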

Traceroute studies
- Yield overlays of projections from S’s to D’s
  - Sources: active, expensive
  - Destinations: passive, cheap
[Figure: sources (S) tracing to destinations (D)]

Motivating Questions
- How should we use traceroute and what can it discover?
  - Physical topology (nodes, links)?
  - IP routing topology?
- What’s a good way to organize a collection-of-traceroutes study?
  - Many sources?
  - Many destinations?
  - How much is enough?

Theoretical Inroads
- Take a graph G = (V, E) and a routing algorithm R.
- Choose j sources and k destinations at random.
- Consider the subgraph G’ = (V’, E’) induced by routes from R between all (S, D) pairs.
- How do the expected values of |V’| and |E’| scale as a function of j and k?
- The Chuang-Sirbu scaling law is the special case j = 1.
- The marginal utility of adding the (j+1)’st source or the (k+1)’st destination is its expected contribution to |V’| or |E’|.
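
The setup above can be explored numerically. The sketch below is only one possible instantiation: it assumes R is unweighted shortest-path routing (breadth-first search with arbitrary tie-breaking), which the slides do not specify, and every function name is hypothetical.

```python
import random
from collections import deque


def bfs_parents(adj, src):
    """Shortest-path tree from src (stand-in for the routing algorithm R)."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in parent:
                parent[v] = u
                queue.append(v)
    return parent


def induced_subgraph(adj, sources, dests):
    """Return (V', E') covered by the routes between all (S, D) pairs."""
    nodes, edges = set(), set()
    for s in sources:
        parent = bfs_parents(adj, s)
        for d in dests:
            if d not in parent:  # destination unreachable from this source
                continue
            v = d
            nodes.add(v)
            while parent[v] is not None:  # walk back towards the source
                u = parent[v]
                nodes.add(u)
                edges.add(frozenset((u, v)))
                v = u
    return nodes, edges


def marginal_utility(adj, sources, dests, new_dest):
    """Number of nodes newly discovered by adding one more destination."""
    before, _ = induced_subgraph(adj, sources, dests)
    after, _ = induced_subgraph(adj, sources, list(dests) + [new_dest])
    return len(after) - len(before)


def expected_nodes(adj, j, k, trials=200, seed=0):
    """Monte Carlo estimate of E[|V'|] for j random sources, k random dests."""
    rng = random.Random(seed)
    vertices = list(adj)
    total = 0
    for _ in range(trials):
        sources = rng.sample(vertices, j)
        dests = rng.sample(vertices, k)
        total += len(induced_subgraph(adj, sources, dests)[0])
    return total / trials


# Toy graph: a path 0 - 1 - 2 - 3 - 4.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
print(marginal_utility(adj, [0], [2], 4))  # destination beyond the known path
print(marginal_utility(adj, [0], [2], 1))  # destination already on the path
```

Plotting `expected_nodes` against k for a fixed j on a graph of interest would give a simulated analogue of the diminishing-returns curves, under the routing assumption stated above.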

What might we expect?
- Two extremal cases:
  - Clique: each new (S, D) discovers a new path
  - Star: each new S or D discovers only a small neighborhood
[Figure: two panels, Clique and Star, showing sources (S) and destinations (D)]
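
The two extremes can be made concrete with a small count. The sketch below rests on simplifying assumptions that are not in the slides: sources and destinations are disjoint, routing uses the direct link in the clique, and in the star every source and destination is a leaf attached to a single hub. The function names are hypothetical.

```python
def clique_counts(j, k):
    """Discovered nodes and links in a clique with j sources, k destinations.

    Every (S, D) pair is joined by its own direct link, so each pair
    contributes one new edge and no intermediate nodes.
    """
    return {"nodes": j + k, "edges": j * k}


def star_counts(j, k):
    """Discovered nodes and links in a star with j sources, k destinations.

    Every route passes through the hub, so each endpoint contributes
    only itself and its single link to the hub.
    """
    return {"nodes": j + k + 1, "edges": j + k}


def marginal_edges(counts, j, k):
    """Extra links discovered by adding one more destination."""
    return counts(j, k + 1)["edges"] - counts(j, k)["edges"]


print(marginal_edges(clique_counts, 2, 3))  # grows with the number of sources
print(marginal_edges(star_counts, 2, 3))    # one link, regardless of j
```

Under these assumptions a new destination in the clique reveals j new links, while in the star it reveals only its own link to the hub.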

Skitter to the Rescue
- Two datasets from CAIDA
- Small dataset: May 2000
  - 8 sources, 1277 destinations, 20K paths
  - Sources in: New Zealand, Japan, Singapore, San Jose (2), Ottawa, London, Washington
  - All sources traced to all destinations
- Large dataset: October 2000, 30 times bigger
  - 12 sources, 313709 destinations, 600K paths
  - No destination common to all sources, or vice versa

Interface Disambiguation
- Traceroutes report only on interfaces used
  - Routers often have multiple interfaces
  - But merging traceroutes requires matching routers
- Solution: probe each interface from some site X
  - Routers are supposed to respond on the interface used for routing to X
- Results in set of (probe interface, response interface) pairs
  - Each connected component is taken to be a router
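
Grouping the (probe interface, response interface) pairs into connected components can be sketched with a union-find structure. This is an illustrative sketch only: the function name `group_interfaces` is hypothetical, and the addresses are documentation-range placeholders rather than data from the study.

```python
def group_interfaces(pairs):
    """Group interfaces into routers via connected components.

    pairs: iterable of (probed_interface, responding_interface) tuples.
    Returns a sorted list of sorted interface lists, one per inferred router.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for probed, responded in pairs:
        union(probed, responded)

    routers = {}
    for iface in list(parent):
        routers.setdefault(find(iface), set()).add(iface)
    return sorted(sorted(group) for group in routers.values())


pairs = [
    ("192.0.2.1", "192.0.2.1"),        # responds on the probed interface
    ("192.0.2.2", "192.0.2.1"),        # responds on a different interface
    ("198.51.100.1", "198.51.100.1"),
    ("192.0.2.3", "192.0.2.2"),        # links transitively to the first group
]
print(group_interfaces(pairs))
```

Interfaces that never respond produce no pair at all, so they cannot be merged by this procedure and would need to be handled separately.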

Classifying Nodes
- Core, border, stub, leaf
- Solely from traceroute information
[Figure labels: Leaf, Border, Core, Stub]

Classification depends on measurements
[Figure labels: Core, Stub, Border]

Limitations and Caveats
- Interface disambiguation
  - 13% of interfaces never responded
- Node classification
  - Identifying a border node requires two paths to it
- Representativeness
  - Datasets are small, may not be representative
  - Skitter sources not selected at random
- Unknown coverage of true network
  - Diminishing returns may not signify good coverage

Diminishing Returns (Small Dataset)

Diminishing Returns (Large Dataset)

Diminishing returns by Classification (Small Dataset)
[Figure panels: Core, Stub, Border]

What Does This Suggest?
[Figure: sources (S) and destinations (D)]

Adding Destinations: Nodes
- Slope is about 3

Adding Destinations: Links
- Slope is about 4

Add Sources or Destinations?
- Isolines represent constant node discovery, varying S’s or D’s

Node Degree Distribution
[Figure curves: 8 Sources, 1 Source]

Node Degree Distribution: Tail
[Figure curves: 1 Source, 8 Sources]

Degree distribution convergence: RMSE
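
One way such a convergence measure could be computed is sketched below. The slide does not say how its RMSE was defined, so this is an assumption: the error is taken between two empirical degree distributions over the union of observed degrees. The function names and the toy edge lists are hypothetical.

```python
import math
from collections import Counter


def degree_distribution(edges):
    """Fraction of nodes at each degree, from an undirected edge list."""
    degree = Counter()
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    n = len(degree)
    counts = Counter(degree.values())
    return {d: c / n for d, c in counts.items()}


def rmse(p, q):
    """Root-mean-square error between two degree distributions.

    Assumed definition: mean taken over every degree seen in either one.
    """
    support = set(p) | set(q)
    if not support:
        return 0.0
    squared = sum((p.get(d, 0.0) - q.get(d, 0.0)) ** 2 for d in support)
    return math.sqrt(squared / len(support))


# Toy example: a partial view (a path) versus a fuller view (a triangle).
partial = degree_distribution([("a", "b"), ("b", "c")])
full = degree_distribution([("a", "b"), ("b", "c"), ("c", "a")])
print(rmse(partial, full))
```

Tracking this value as sources are added, with the distribution from all sources as the reference, would show how quickly the observed degree distribution settles.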

Information Theory Plug
- Can compare marginal utility of different processes.
[Figure panels: Link Discovery, Node Discovery]

Related Work
- Pansiot & Grad ’98
  - First multi-traceroute study
  - Similar methodology, incl. interface disambiguation
- Chuang & Sirbu ’98; Phillips, Shenker & Tangmunarunkit ’99
  - Single-source case, found sublinear growth of multicast tree with added destinations
- Govindan & Tangmunarunkit ’00
  - Extensive node discovery, overcoming limitations of traceroute
- Broido & Claffy ’01
  - Larger datasets; more detailed look at graph structure

Conclusions
- Rigorous quantification of marginal utility of additional measurements.
- To discover all physical nodes, traceroute is inefficient
  - Diminishing returns: many S’s and D’s needed
- Trading off S’s and D’s
  - Adding destinations seems more cost-effective
- To discover how “typical” routes pass through network, traceroute is informative
  - Routing core and feeders
  - Much of routing core is visible from few S’s (given enough D’s)
