On The Marginal Utility of Network Topology Measurements

Slides: 26

Provided by: MarkCr8

Learn more at: https://www.cs.bu.edu

Transcript and Presenter's Notes

1
On The Marginal Utility of Network Topology Measurements

John Byers
with Paul Barford (now at Wisconsin), Azer Bestavros, and Mark Crovella

2
Measurement Philosophy

Current dogma: when conducting a wide-area measurement study, more is better.
  More measurements
  More measurement sites
True, but taking more measurements and deploying more infrastructure is expensive!
Our focus: how much better is more?
Even harder: when can we stop measuring?
Not much work on this topic in our community.

3
Problem Instance: Discovering Internet Topology

Typical goal: discover the router-level Internet graph
Typical approach: merge lists of known nodes and edges
Traceroute reports the IP path from A to B, i.e., how IP paths are overlaid on the router graph

4
Traceroute studies

Yield overlays of projections from sources (S) to destinations (D)
Sources: active, expensive
Destinations: passive, cheap

[Figure: sources (S) tracing routes to destinations (D)]

5
Motivating Questions

How should we use traceroute, and what can it discover?
  Physical topology (nodes, links)?
  IP routing topology?
What's a good way to organize a collection-of-traceroutes study?
  Many sources?
  Many destinations?
  How much is enough?

6
Theoretical Inroads

Take a graph G = (V, E) and a routing algorithm R.
Choose j sources and k destinations at random.
Consider the subgraph G' = (V', E') induced by routes from R between all (S, D) pairs.
How do the expected values of |V'| and |E'| scale as a function of j and k?
The Chuang-Sirbu scaling law is the special case j = 1.
The marginal utility of adding the (k+1)-st source or destination is its expected contribution to |V'| or |E'|.
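
The setup above can be tried on a toy graph. The following is a minimal sketch, not the authors' method: it assumes the routing algorithm R is unweighted shortest-path routing (implemented with breadth-first search), and the helper names and the 5-node example graph are hypothetical.

```python
from collections import deque


def shortest_path(adj, src, dst):
    """Return one BFS shortest path from src to dst as a list of nodes."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            break
        for nbr in adj[node]:
            if nbr not in parent:
                parent[nbr] = node
                queue.append(nbr)
    if dst not in parent:
        return []  # dst is unreachable from src
    path = []
    node = dst
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def induced_subgraph(adj, sources, dests):
    """Nodes and edges lying on the routes between all (S, D) pairs."""
    nodes, edges = set(), set()
    for s in sources:
        for d in dests:
            path = shortest_path(adj, s, d)
            nodes.update(path)
            edges.update(frozenset(e) for e in zip(path, path[1:]))
    return nodes, edges


def marginal_utility(adj, sources, dests, new_dest):
    """Extra (nodes, edges) contributed by adding one more destination."""
    v0, e0 = induced_subgraph(adj, sources, dests)
    v1, e1 = induced_subgraph(adj, sources, dests + [new_dest])
    return len(v1) - len(v0), len(e1) - len(e0)


# Toy example (assumed data): a 5-node path graph 0-1-2-3-4.
line = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
print(marginal_utility(line, [0], [2], 4))  # (2, 2)
```

Averaging `marginal_utility` over many random choices of sources and destinations would estimate the expected contribution that the slide refers to.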

7
What might we expect?

Two extremal cases:
Clique: each new (S, D) pair discovers a new path
Star: each new S or D discovers only a small neighborhood

[Figure: two example topologies with destinations (D) marked, labeled "Clique" and "Star"]
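
The two extremes can be checked with simple counting. This is an illustrative sketch under stated assumptions: shortest-path routing, j >= 1 sources and k >= 1 destinations on distinct nodes, and (for the star) no endpoint placed on the hub. The function names are hypothetical.

```python
def clique_discovery(j, k):
    """(nodes, edges) seen in a clique with j sources and k destinations."""
    # Every (S, D) pair is joined by its own direct link.
    return j + k, j * k


def star_discovery(j, k):
    """(nodes, edges) seen in a star whose leaves hold j sources and k destinations."""
    # Every route is S - hub - D, so each endpoint adds itself and one link.
    return j + k + 1, j + k


def marginal_gain(discovery, j, k):
    """Extra (nodes, edges) from adding the (k+1)-st destination."""
    n0, e0 = discovery(j, k)
    n1, e1 = discovery(j, k + 1)
    return n1 - n0, e1 - e0


print(marginal_gain(clique_discovery, 8, 100))  # (1, 8)
print(marginal_gain(star_discovery, 8, 100))    # (1, 1)
```

In the clique, a new destination yields one new link per source; in the star, it yields a single new link no matter how many sources are already deployed.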
8
Skitter to the Rescue

Two datasets from CAIDA
Small dataset: May 2000
  8 sources, 1277 destinations, 20K paths
  Sources in New Zealand, Japan, Singapore, San Jose (2), Ottawa, London, Washington
  All sources traced to all destinations
Large dataset: October 2000, 30 times bigger
  12 sources, 313709 destinations, 600K paths
  No destination common to all sources, or vice versa

9
Interface Disambiguation

Traceroutes report only on interfaces used
Routers often have multiple interfaces
But merging traceroutes requires matching routers
Solution: probe each interface from some site X
Routers are supposed to respond on the interface used for routing to X
This results in a set of (probe interface, response interface) pairs
Each connected component is taken to be a router
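
The grouping step can be sketched with a union-find structure. This is a minimal illustration of taking connected components over (probe interface, response interface) pairs, not the tooling used in the study; the function name is hypothetical and the addresses are documentation-range placeholders.

```python
def group_interfaces(pairs):
    """Group interface addresses into routers via connected components."""
    parent = {}

    def find(x):
        # Register unseen interfaces as their own component root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Each pair says: probing one interface drew a reply from another,
    # so both are assumed to belong to the same router.
    for probed, responded in pairs:
        root_a, root_b = find(probed), find(responded)
        if root_a != root_b:
            parent[root_b] = root_a

    routers = {}
    for iface in list(parent):
        routers.setdefault(find(iface), set()).add(iface)
    return sorted(sorted(group) for group in routers.values())


# Assumed example data: two probes answered from the same interface.
pairs = [
    ("192.0.2.1", "192.0.2.9"),
    ("192.0.2.5", "192.0.2.9"),
    ("198.51.100.7", "198.51.100.7"),
]
print(group_interfaces(pairs))
# [['192.0.2.1', '192.0.2.5', '192.0.2.9'], ['198.51.100.7']]
```

Interfaces that never respond produce no pairs, so they cannot be merged into any router by this procedure.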

10
Classifying Nodes

Core, border, stub, leaf
Solely from traceroute information

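
[Figure: example topology with leaf, border, core, and stub nodes labeled]

11
Classification Depends on Measurements

[Figure: topology with core, stub, and border nodes labeled]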
12
Limitations and Caveats

Interface disambiguation
  13% of interfaces never responded
Node classification
  Identifying a border node requires two paths to it
Representativeness
  Datasets are small, may not be representative
  Skitter sources not selected at random
  Unknown coverage of true network
  Diminishing returns may not signify good coverage

13
Diminishing Returns (Small Dataset)
14
Diminishing Returns (Large Dataset)
15
Diminishing Returns by Classification (Small Dataset)
[Figure panels: Core, Stub, Border]
16
What Does This Suggest?
[Figure: diagram of sources (S) and destinations (D)]
17
Adding Destinations: Nodes
Slope is about 3
18
Adding Destinations: Links
Slope is about 4
19
Add Sources or Destinations?
Isolines represent constant node discovery while varying the number of sources or destinations
20
Node Degree Distribution
[Figure panels: 1 source, 8 sources]
21
Node Degree Distribution: Tail
[Figure panels: 8 sources, 1 source]
22
Degree Distribution Convergence: RMSE
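
The slide does not spell out how the RMSE is computed. One plausible reading, sketched here purely as an assumption, is the root-mean-square error between two empirical degree distributions, for example the graph discovered from a subset of sources versus the graph discovered from all of them. The function names and edge lists are hypothetical.

```python
import math
from collections import Counter


def degree_distribution(edges):
    """Map each degree to the fraction of nodes having that degree."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    counts = Counter(degree.values())
    total = sum(counts.values())
    return {d: c / total for d, c in counts.items()}


def rmse(dist_a, dist_b):
    """RMSE between two degree distributions over the union of their degrees."""
    degrees = sorted(set(dist_a) | set(dist_b))
    if not degrees:
        return 0.0
    squared = [(dist_a.get(d, 0.0) - dist_b.get(d, 0.0)) ** 2 for d in degrees]
    return math.sqrt(sum(squared) / len(degrees))


# Assumed toy data: a partial view and a fuller view of the same graph.
partial = [(0, 1), (1, 2)]
full = [(0, 1), (1, 2), (1, 3), (3, 4)]
print(rmse(degree_distribution(partial), degree_distribution(full)))
```

An error that shrinks as sources are added would indicate that the measured degree distribution is converging, which says nothing by itself about coverage of the true network.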
23
Information Theory Plug

Can compare the marginal utility of different processes.

[Figure panels: Link Discovery, Node Discovery]
24
Related Work

Pansiot & Grad '98
  First multi-traceroute study
  Similar methodology, incl. interface disambiguation
Chuang & Sirbu '98; Phillips, Shenker & Tangmunarunkit '99
  Single-source case; found sublinear growth of multicast tree with added destinations
Govindan & Tangmunarunkit '00
  Extensive node discovery, overcoming limitations of traceroute
Broido & Claffy '01
  Larger datasets; more detailed look at graph structure

25
Conclusions