Trajectory Sampling for Direct Traffic Observation - PowerPoint PPT Presentation

About This Presentation
Title:

Trajectory Sampling for Direct Traffic Observation

Description:

Set of packets with 'same' src and dest IP addresses. Packets that are 'close' together in time (a few seconds) ... Rule for acyclic subgraphs unicast packets: ... – PowerPoint PPT presentation

Number of Views:29
Avg rating:3.0/5.0
Slides: 30
Provided by: MG59
Category:

less

Transcript and Presenter's Notes

Title: Trajectory Sampling for Direct Traffic Observation


1
Trajectory Sampling forDirect Traffic Observation
  • Matthias Grossglauser
  • joint work with Nick Duffield
  • ATT Labs Research

2
Traffic Engineering
Two large flows
overload!
3
Traffic Engineering
overload!
New egress pointfor first flow
Multi-homed customer
4
Traffic Engineering
OSPF shortest path splitting
overload!
5
Traffic Engineering
  • Goal domain-wide control management to
  • Satisfy performance goals
  • Use resources efficiently
  • Knobs
  • Configuration topology provisioning, capacity
    planning
  • Routing OSPF weights, MPLS tunnels, BGP
    policies,
  • Traffic classification (diffserv), admission
    control,
  • Measurements are key closed control loop
  • Characterize demand whats coming in?
  • Observe network state how is the network
    reacting? (low-level adaptivity!)
  • Check performance whats the customers QoS?

6
Traffic Matrix vs. Path Matrix
  • Traffic matrix
  • bytes from ingress i to egress j
  • Path matrix
  • Spatial flow of traffic through domain
  • bytes for every path from i to j

7
Flow Measurement
flow 4
flow 1
flow 2
flow 3
  • IP flow abstraction
  • Set of packets with same src and dest IP
    addresses
  • Packets that are close together in time (a few
    seconds)
  • Cisco NetFlow
  • Router maintains a cache of statistics about
    active flows
  • Router exports a measurement record for each flow

8
Inferring the Path Matrix from the Traffic Matrix
9
Network State Uncertainty
  • Hard to get an up-to-date snapshot of
  • routing
  • Large state space
  • Vendor-specific implementation
  • Deliberate randomness
  • Multicast
  • element states
  • Links, cards, protocols,
  • element performance
  • Packet loss, delay at links

10
missing alarms
missing down alarms
spurious down
noise
11
Direct Traffic Observation
  • Goal direct observation
  • No network model state estimation
  • Basic idea
  • Sample packets at each link
  • Sampling decision based on hash over packet
    content
  • Consistent sampling ? trajectories
  • Labels based on second hash function
  • Exploit entropy in packet content to obtain
    statistically representative set of trajectories

12
Sampling and Labeling
  • Fields of interest collected only once
  • Multicast trajectory is a tree

13
Fields Included in Hashes
14
Collisions Identical Packets
15
Sampling and Labeling Hashes
  • x subset of packet bits, represented as binary
    number
  • Sampling hash
  • h(x) x mod A
  • Sample if h(x) lt r
  • r/A thinning factor
  • Labeling hash
  • g(x) x mod M
  • Make appropriate choice of A, M
  • predictable patterns should mix well

16
Pseudo-Random Sampling
  • Goal infer metrics of interest from trajectory
    samples
  • E.g., what fraction of traffic of customer x on a
    link y?
  • Question is sample set statistically
    representative?
  • Obvious for really random sampling
  • Distribution of a field in the sampled subset
    real distribution?
  • In other words does the complement of the field
    provide enough entropy?

17
Quality of Deterministic Sampling
  • Experiment statistical test to check if sampled
    and full distributions are close
  • Chi-square statistic to verify independence
    hypothesis
  • Hypothesis sampled distribution consistent with
    full distribution
  • Confidence level C(T) for hypothesis, where C is
    cdf of with I-1 degrees of freedom

18
Chi-square Test on Source Address
If , then accept hypothesis
19
Bitwise Independence
  • 2x2 contingency table formed by
  • sampling decision
  • l-th bit of packet

20
Optimal Sampling
  • Fix amount of measurement traffic c per time
    period
  • Problem
  • n number of samples in sampling period
  • M alphabet size, mlog2(M) bits/label
  • nm total amount of measurement traffic bits
  • Goal maximize unique labels, subject to nmltc
  • Result
  • optimal alphabet size Mc log(2)
  • optimal number of samples nM/log(M)
  • example c1Mb/period ?

21
Label Collisions and Trajectory Ambiguity
22
Ambiguity cont.
  • Rule for acyclic subgraphs unicast packets
  • unambiguous if each connected component of the
    subgraph is
  • (a) a source tree
  • (b) a sink tree without loss

23
InferenceExperiment
  • Experiment infer from trajectory samples
  • Estimate fraction of traffic from customer
  • Source address ? customer
  • Source address ? sampling label
  • Fraction of customer traffic on backbone link

24
Estimated Fraction (c1000bit)
25
Estimated Fraction (c10kbit)
26
Sampling Device
MPLS simple additional logic to look behind
label stack
27
Sampling Device Implementation
  • Interface vs. processing speed
  • OC-192 10 Gbps
  • State of the art DSP
  • Proc 600M MACs x 32 bit 20 Gbps
  • I/O 300MHz x 256 bit 70 Gbps
  • Moores law vs. interface speed growth
  • Vendor interest cisco, juniper, avici

28
(No Transcript)
29
Summary
  • Advantages
  • Trajectory sampling estimates path matrixand
    other metrics loss, link delay
  • Direct observation no routing model network
    state estimation
  • No router state
  • Multicast (source tree), DDoS (sink tree)
  • Control over measurement overhead
  • Small measurement delay
  • Disadvantages
  • Requires support on linecards
  • Open questions research problems
  • Collection, storage, querying (in progress)
  • Management interface
Write a Comment
User Comments (0)
About PowerShow.com