SNMP - Simple Network Measurements Please! - PowerPoint PPT Presentation

About This Presentation

Title:

SNMP - Simple Network Measurements Please!

Description:

An Information-Theoretic Approach to Traffic Matrix Estimation ... Seismology. Astronomy. Statistical intuition = Regularization. Penalty function J(x) ... – PowerPoint PPT presentation

Number of Views:46

Avg rating:3.0/5.0

Slides: 38

Provided by: Matthew493

Category:

more less

Transcript and Presenter's Notes

Title: SNMP - Simple Network Measurements Please!

1
Internet Measurement Conference 2003 27-29 Of
October, 2003 Miami, Florida, USA http//www.icir.
org/vern/imc-2003/ Date for student travel grant
applications Sept 5th
2
An Information-Theoretic Approach to Traffic
Matrix EstimationYin Zhang, Matthew Roughan,
Carsten Lund ATT ResearchDavid Donoho
Stanford
3
Problem
Have link traffic measurements
Want to know demands from source to destination
B
C
A
4
Example App reliability analysis
Under a link failure, routes change want to find
an traffic invariant
B
C
A
5
Approach

Principle
Dont try to estimate something
if you dont have any information about it
Maximum Entropy
Entropy is a measure of uncertainty
More information less entropy
To include measurements, maximize entropy subject
to the constraints imposed by the data
Impose the fewest assumptions on the results
Instantiation Maximize relative entropy
Minimum Mutual Information

6
Mathematical Formalism
Only measure traffic at links
Traffic y1
1
link 1
2
router
link 2
link 3
3
7
Mathematical Formalism
Traffic y1
1
Traffic matrix element x1
route 1
2
router
route 3
route 2
3
Problem Estimate traffic matrix (xs) from the
link measurements (ys)
8
Mathematical Formalism
1
route 1
2
router
route 3
route 2
3
Problem Estimate traffic matrix (xs) from the
link measurements (ys)
9
Mathematical Formalism
1
route 1
2
router
route 3
route 2
3
Problem Estimate traffic matrix (xs) from the
link measurements (ys)
10
Mathematical Formalism
1
route 1
2
router
route 3
Routing matrix
route 2
3
y Ax
For non-trivial network UNDERCONSTRAINED
11
Regularization

Want a solution that satisfies constraints y
Ax
Many more unknowns than measurement O(N2) vs
O(N)
Underconstrained system
Many solutions satisfy the equations
Must somehow choose the best solution
Such (ill-posed linear inverse) problems occur in
Medical imaging e.g CAT scans
Seismology
Astronomy
Statistical intuition gt Regularization
Penalty function J(x)
solution

12
How does this relate to other methods?

Previous methods are just particular cases of
J(x)
Tomogravity (Zhang, Roughan, Greenberg and
Duffield)
J(x) is a weighted quadratic distance from a
gravity model
A very natural alternative
Start from a penalty function that satisfies the
maximum entropy principle
Minimum Mutual Information

13
Minimum Mutual Information (MMI)

Mutual Information I(S,D)
Information gained about Source from Destination
I(S,D) -relative entropy with respect to
independent S and D
I(S,D) 0
S and D are independent
p(DS) p(D)
gravity model
Natural application of principle
Assume independence in the absence of other
information
Aggregates have similar behavior to network
overall
When we get additional information (e.g. y Ax)
Maximize entropy ? Minimize I(S,D) (subject to
constraints)
J(x) I(S,D)

equivalent
14
MMI in practice

In general there arent enough constraints
Constraints give a subspace of possible solutions

y Ax
15
MMI in practice

Independence gives us a starting point

independent solution
y Ax
16
MMI in practice

Find a solution which
Satisfies the constraint
Is closest to the independent solution

solution
Distance measure is the Kullback-Lieber divergence
17
Is that it?

Not quite that simple
Need to do some networking specific things
e.g. conditional independence to model hot-potato
routing
Can be solved using standard optimization
toolkits
Taking advantage of sparseness of routing matrix
A
Back to tomogravity
Conditional independence generalized gravity
model
Quadratic distance function is a first order
approximation to the Kullback-Leibler divergence
Tomogravity is a first-order approximation to MMI

18
Results Single example

20 bounds for larger flows
Average error 11
Fast (lt 5 seconds)
Scales
O(100) nodes

19
More results
Large errors are in small flows
gt80 of demands have lt20 error
tomogravity method
simple approximation
20
Other experiments

Sensitivity
Very insensitive to lambda
Simple approximations work well
Robustness
Missing data
Erroneous link data
Erroneous routing data
Dependence on network topology
Via Rocketfuel network topologies
Additional information
Netflow
Local traffic matrices

21
Dependence on Topology
star (20 nodes)
clique
22
Additional information Netflow
23
Local traffic matrix (George Varghese)
for reference previous case
0 1 5 10
24
Conclusion

We have a good estimation method
Robust, fast, and scales to required size
Accuracy depends on ratio of unknowns to
measurements
Derived from principle
Approach gives some insight into other methods
Why they work regularization
Should provide better idea of the way forward
Additional insights about the network and traffic
Traffic and network are connected
Implemented
Used in ATTs NA backbone
Accurate enough in practice

Additional Slides
26
Results

Methodology
Use netflow based partial (80) traffic matrix
Simulate SNMP measurements using routing sim, and
y Ax
Compare estimates, and true traffic matrix
Advantage
Realistic network, routing, and traffic
Comparison is direct, we know errors are due to
algorithm not errors in the data
Can do controlled experiments (e.g. introduce
known errors)
Data
One hour traffic matrices (dont need fine
grained data)
506 data sets, comprising the majority of June
2002
Includes all times of day, and days of week

27
Robustness (input errors)
28
Robustness (missing data)
29
Point-to-multipoint
We dont see whole Internet What if an edge
link fails?
Point-to-point traffic matrix isnt invariant
30
Point-to-multipoint

Included in this approach
Implicit in results above
Explicit results worse
Ambiguity in demands in increased
More demands use exactly the same sets of routes
use in applications is better

Link failure analysis
Point-to-point
Point-to-multipoint
31
Independent model
32
Conditional independence

Internet routing is asymmetric
A provider can control exit points for traffic
going to peer networks

33
Conditional independence

Internet routing is asymmetric
A provider can control exit points for traffic
going to peer networks
Have much less control of where traffic enters

34
Conditional independence
35
Minimum Mutual Information (MMI)

Mutual Information I(S,D)0
Information gained about S from D
I(S,D) relative entropy with respect to
independence
Can also be given by Kullback-Leibler information
divergence
Why this model
In the absence of information, lets assume no
information
Minimal assumption about the traffic
Large aggregates tend to behave like overall
network?

36
Dependence on Topology
Unknowns per Relative Errors ()
Network PoPs Links measurement Geographic Random
Exodus 17 58 4.69 12.6 20.0
Sprint 19 100 3.42 8.0 18.9
Abovenet 11 48 2.29 3.8 11.7
Star N 2(N-1) N/210 24.0 24.0
Clique N N(N-1) 1 0.2 0.2
ATT - - 3.54-3.97 10.6
These are not the actual networks, but only
estimates made by Rocketfuel
37