Measuring and monitoring Microsofts enterprise network

About This Presentation

Title:

Measuring and monitoring Microsofts enterprise network

Description:

1. Measuring and monitoring Microsoft's enterprise ... Packet encapsulated in shim header plus trailer. IP protocol moved into trailer and header rewritten ... – PowerPoint PPT presentation

Number of Views:42

Avg rating:3.0/5.0

Slides: 19

Provided by: mort98

Category:

more less

Transcript and Presenter's Notes

Title: Measuring and monitoring Microsofts enterprise network

1
Measuring and monitoring Microsofts enterprise
network

Richard Mortier (mort), Rebecca Isaacs, Laurent
Massoulié, Peter Key

2
We monitored our network

and this is how
and this is what we saw
How did we monitor it?
What did we see?

3
Microsoft CorpNet _at_ MSR Cambridge
CORPNET
EMEA
MSRC
area3
area2
LatinAmerica
area1
Area 0
eBGP
NorthAmerica
AsiaPacific
4
Capture setup

MSRC site organized using IP subnets
Roughly one per wing plus one for datacenter
Datacenter is by far the most active
Captured using VLAN spanning
11 mapping between (Ethernet) VLAN and IP subnet
Mapped all VLANs to one port (NS trace)
except datacenter, mapped to second port (DC
trace)
Also took a capture at one VLANs Ethernet switch
Allowed us to estimate amount of traffic not
captured
gt99 traffic is routed (i.e. goes off-VLAN)
Missed printer, some subnet broadcast, some SMB

5
Capture setup VLAN spanning
offsite
north side
NS trace
filter on VLAN
port
MSR Cambridge site router
subnet
DC trace
data center
south side
6
Packet processing

Assigned packets to application
Used port numbers, RPC GUID, signature byte
strings, server name
Assigned applications to category
40 applications ? 10 categories
Generated packet and flow records
Reduce disk IO, increase performance
Still took 10 days per complete run
Python scripts processed records

7
Problems with this setup

Duplication
No DC switch some hosts directly connected to
router
See their packets twice (on the way in and out)
Deduplicate both traces careful selection from
NS trace
IPSec transport mode deployment
Packet encapsulated in shim header plus trailer
IP protocol moved into trailer and header
rewritten
Wrote custom capture tools to unpick
encapsulation
Flow detection
Network flow ? transport flow ? application flow
Used IP 5-tuple and timeout 90 seconds

8
Flow idle timeout
9
Trace characteristics
10
Traffic classification
11
Protocol distribution
12
Communication patterns
flows src ports suggesting client behaviour
flows use few src ports suggests server behaviour
13
flows src ports suggesting client behaviour
flows use few src ports suggests server behaviour
neither client nor server suggests peer-to-peer
neither client nor server suggests peer-to-peer
14
Communication patterns
15
Traffic dynamics

Headlines seasonal, highly volatile
Examine through
Autocorrelations
Variation per-application per-hour
Variation per-application per-host
Variation in heavy-hitter set

16
Correlograms onsite traffic
17
Correlograms offsite traffic
18
Variation per-application per-hour

Onsite (left)
Offsite (down)

Exponential decay
Light-tailed

19
Variation per-application per-host

Onsite (left)
Offsite (down)

Linear decay
Heavy-tailed
Heavy hitters

20
Implications for modelling

Timeseries modelling is hard
Tried ARMA, ARIMA models but per-application only
Exponentiation leads to large errors in
forecasting
Client/server distinction unclear
Tried PCA, projection pursuit method
Neither found anything
PCA discovered singleton clusters in rank order...

21
Implications for endsystem measurement

Heavy hitter tracking a useful approach for
network monitoring
Must be dynamic since heavy hitter set varies
between applications and
over time per-application
but is it possible to define a baseline against
which to detect (volume) anomalies?

22
Questions?

Write a Comment

User Comments (0)