Traffic Management

About This Presentation

Title:

Traffic Management

Description:

Traffic Management & Traffic Engineering An example Executives participating in a worldwide videoconference Proceedings are videotaped and stored in an archive Edited ... – PowerPoint PPT presentation

Number of Views:80

Avg rating:3.0/5.0

Slides: 207

Provided by: s236

Category:

more less

Transcript and Presenter's Notes

Title: Traffic Management

1
Traffic ManagementTraffic Engineering
2
An example

Executives participating in a worldwide
videoconference
Proceedings are videotaped and stored in an
archive
Edited and placed on a Web site
Accessed later by others
During conference
Sends email to an assistant
Breaks off to answer a voice call

3
What this requires

For video
sustained bandwidth of at least 64 kbps
low loss rate
For voice
sustained bandwidth of at least 8 kbps
low loss rate
For interactive communication
low delay (lt 100 ms one-way)
For playback
low delay jitter
For email and archiving
reliable bulk transport

4
What if

A million executives were simultaneously
accessing the network?
What capacity should each trunk have?
How should packets be routed? (Can we spread load
over alternate paths?)
How can different traffic types get different
services from the network?
How should each endpoint regulate its load?
How should we price the network?
These types of questions lie at the heart of
network design and operation, and form the basis
for traffic management.

5
Traffic management

Set of policies and mechanisms that allow a
network to efficiently satisfy a diverse range of
service requests
The mechanisms and policies have to be deployed
at both node level as well as network level
Tension is between diversity and efficiency
Traffic management is necessary for providing
Quality of Service (QoS)
Subsumes congestion control (congestion loss
of efficiency)

6
Traffic Engineering

Engineering of a given network so that the
underlying network can support the services with
requested quality
Encompasses
Network Design
Capacity Design (How many nodes, where)
Link Dimensioning (How many links, what capacity)
Path Provisioning (How much bandwidth end-to-end)
Multi-homing (Reliability for customer)
Protection for Reliability (Reliability in
Network)
Resource Allocation
Congestion Control
routing around failures
adding more capacity

7
Why is it important?

One of the most challenging open problems in
networking
Commercially important
AOL burnout
Perceived reliability (necessary for
infrastructure)
Capacity sizing directly affects the bottom line
At the heart of the next generation of data
networks
Traffic management Connectivity Quality of
Service

8
Outline

Economic principles
Traffic classes
Time scales
Mechanisms
Queueing
Scheduling
Congestion Control
Admission Control
Some open problems

9
Lets order Pizza for home delivery

Customer
calls a closest pizza outlet (what is selection
based on??)
orders a pizza
Requirement specification
type, toppings (measurable quantities)
order arrives at home
Service Quality
How fast it arrived
Is the right pizza? Anything missing (quality
measurements)
Customer Satisfaction (based on feeling!!, all
parameters not measurable)
How was the service?
Is Pizza cold or hot? Is it fresh?
Pizza company
How many customers and how fast to serve
Customer Satisfaction Only through complaints
(cannot really measure)
What they know only what customer ordered
(Requirement!!)

10
Economics Basics utility function

Users are assumed to have a utility function that
maps from a given quality of service to a level
of satisfaction, or utility
Utility functions are private information
Cannot compare utility functions between users
Rational users take actions that maximize their
utility
Can determine utility function by observing
preferences
Generally networks do not support signaling of
utility
They only support signaling of requirements
(bandwidth, delay)
Networks use resource allocation to make sure
requirements are satisfied
Measurements and Service Level Agreements (SLAs)
determine customer satisfaction!!

11
Example File Transfer

Let u(t) S - ? t
u(t) utility from file transfer
S satisfaction when transfer infinitely fast
t transfer time
? rate at which satisfaction decreases with
time
As transfer time increases, utility decreases
If t gt S/ ?, user is worse off! (reflects time
wasted)
Assumes linear decrease in utility
S and ? can be experimentally determined

12
Example Video Conference

Every packet must receive before a deadline
Otherwise, the packet is too late and cannot be
used
Model
u(t) if (t lt D) then S
else (-?)
t is the end to end delay experienced by a
packet
D is the delay deadline
S is the satisfaction
-? is the cost (penalty) for missing deadline
causes performance degradation
Sophisticated Utility measures not only delay but
packet loss too
u(?) S(1- ?) where ? is the packet loss
probability

13
Social welfare

Suppose network manager knew the utility function
of every user
Social Welfare is maximized when some combination
of the utility functions (such as sum) is
maximized while minimizing the infrastructure
cost
An economy (network) is efficient when increasing
the utility of one user must necessarily decrease
the utility of another
An economy (network) is envy-free if no user
would trade places with another (better
performance also costs more)
Goal maximize social welfare
subject to efficiency, envy-freeness, and making
a profit

14
Example

Assume
Single switch, each user imposes load (?0.4)
As utility 4 - d
Bs utility 8 - 2d
Same delay (d) to both users
Conservation law ?(?idi) Constant
0.4d 0.4d C gt d 1.25 C gt Sum of utilities
12-3.75 C
If B wants lower delay say to 0.5C, then As
delay 2C
Sum of utilities 12 - 3C (Larger than before)
By giving high priority to users that want lower
delay, network can increase its utility
Increase in social welfare need not benefit
everyone
A loses utility, but may pay less for service

15
Some economic principles

A single network that provides heterogeneous QoS
is better than separate networks for each QoS
unused capacity is available to others
Lowering delay of delay-sensitive traffic
increases welfare
can increase welfare by matching service menu to
user requirements
BUT need to know what users want (signaling)
For typical utility functions, welfare increases
more than linearly with increase in capacity
individual users see smaller overall fluctuations
can increase welfare by increasing capacity

16
Principles applied

A single wire that carries both voice and data is
more efficient than separate wires for voice and
data
ADSL
IP Phone
Moving from a 20 loaded 10 Mbps Ethernet to a
20 loaded 100 Mbps Ethernet will still improve
social welfare
increase capacity whenever possible
Better to give 5 of the traffic lower delay than
all traffic low delay
should somehow mark and isolate low-delay traffic

17
The two camps

Can increase welfare either by
matching services to user requirements or
increasing capacity blindly
Which is cheaper?
no one is really sure!
small and smart vs. big and dumb
It seems that smarter ought to be better
otherwise, to get low delays for some traffic, we
need to give all traffic low delay, even if it
doesnt need it
But, perhaps, we can use the money spent on
traffic management to increase capacity
We will study traffic management, assuming that
it matters!

18
How useful are utility functions and economic
framework?

Do users really have such functions that can be
expressed mathematically?
Practically no or less clear
Even if users cannot come up with a mathematical
formula, they can express preference of one set
of resources over other
These preferences can be codified as utility
function
Best way to think about utility functions is that
they may allow us to come up with a mathematical
formulation of the traffic management problem
that gives some insight
Practical economic algorithms may never be
feasible
But policies and mechanisms based on these are
still relevant

19
Network Types

Single-Service Networks
Provide services for single type of traffic
e.g., Telephone Networks (Voice), Cable Networks
(Video), Internet (Best effort Data)
Multi-Service Networks
Provide services for multiple traffic types on
the same network
e.g., Asynchronous Transfer Mode (CBR, VBR, ABR,
UBR), Frame Relay, Differentiated Services
(Diff-Serv), Integrated Services (Int-Serv), MPLS
with Traffic Engineering
Application types need to match the service
provided
Traffic models are used for the applications in
order to match services, design, deploy the
equipment and links.

20
Application Types

Elastic applications (Adjust bandwidth and take
what they get)
Wide range of acceptable rates, although faster
is better
E.g., data transfers such as FTP
Continuous media applications.
Lower and upper limit on acceptable performance
Sometimes called tolerant real-time since they
can adapt to the performance of the network
E.g., changing frame rate of video stream
Network-aware applications
Hard real-time applications.
Require hard limits on performance intolerant
real-time
E.g., control applications

21
Traffic models

To align services, need to have some idea of how
applications, users or aggregates of users behave
traffic model
e.g. how long a user uses a modem
e.g. average size of a file transfer
Models change with network usage
We can only guess about the future
Two types of models
measurements
educated guesses

22
Telephone traffic models

How are calls placed?
call arrival model
studies show that time between calls is drawn
from an exponential distribution
call arrival process is therefore Poisson
memoryless the fact that a certain amount of
time has passed since the last call gives no
information of time to next call
How long are calls held?
usually modeled as exponential
however, measurement studies show it to be heavy
tailed
means that a significant number of calls last a
very long time
specially after usage of modems!!

23
Traffic Engineering for Voice Networks

For a switch with N trunks, and with large
population of users (M??), the probability of
blocking (i.e., a call is lost) is given by
Erlang-B formula
? is the call arrival rate (calls /sec)
1/? is the call holding time (3 minutes)
Example (For A 12 Erlangs)
PB 1 for N 20 A/N 0.6
PB 8 for N 18 A/N 0.8
PB 30 for N 7 A/N 1.7

24
Distributions

Long/heavy-tailed distributions
power law
PX gt x ? cx?a x??, a,c gt 0
Pareto
PX gt x c a x? a, x gt b
Exponential Distribution
PX gt x e-ax

25
Pareto distribution

1lt?lt2 gt infinite variance

Power law decays more slowly than exponential ?
heavy tail
26
Internet traffic modeling

A few apps account for most of the traffic
WWW
FTP
telnet
A common approach is to model apps (this ignores
distribution of destination!)
time between app invocations
connection duration
bytes transferred
packet inter-arrival distribution
Little consensus on models
But two important features

27
Internet traffic models features

LAN connections differ from WAN connections
Higher bandwidth (more bytes/call)
longer holding times
Many parameters are heavy-tailed
examples
bytes in call
call duration
means that a few calls are responsible for most
of the traffic
these calls must be well-managed
also means that even aggregates with many calls
not be smooth
can have long bursts
New models appear all the time, to account for
rapidly changing traffic mix

28
Outline

Economic principles
Traffic classes
Time scales
Mechanisms
Some open problems

29
Traffic classes

Networks should match offered service to source
requirements (corresponds to utility functions)
Example telnet requires low bandwidth and low
delay
utility increases with decrease in delay
network should provide a low-delay service
or, telnet belongs to the low-delay traffic class
Traffic classes encompass both user requirements
and network service offerings
Applications match the traffic to the service
offering
Request resources from the network accordingly

30
Traffic classes - details

A basic division guaranteed service and best
effort
like flying with reservation or standby
Guaranteed-service
utility is zero unless app gets a minimum level
of service quality
bandwidth, delay, loss
open-loop flow control with admission control
e.g. telephony, remote sensing, interactive
multiplayer games
Best-effort
send and pray
closed-loop flow control
e.g. email, net news

31
GS vs. BE (cont.)

Degree of synchrony
time scale at which peer endpoints interact
GS are typically synchronous or interactive
interact on the timescale of a round trip time
e.g. telephone conversation or telnet
BE are typically asynchronous or non-interactive
interact on longer time scales
e.g. Email
Sensitivity to time and delay
GS apps are real-time
performance depends on wall clock
BE apps are typically indifferent to real time
automatically scale back during overload

32
Best Effort (Flow Control)

Explicit
Network Tells at what rate the source should send
the packets
Network elements may compute connection fair
share based on Max-Min allocation (e.g, ABR in
ATM Networks)
Or it can be based on 1 bit congestion indicator
(e.g., EFCI in ABR of ATM Networks)
Implicit
Packet drop is detected by the source and adjusts
the window transmission (e.g., TCP)
No flow control
Packets are dropped by the network nodes
Sources may not react (e.g, UDP, UBR)
Problems are caused if these two types are mixed!!

33
Traffic subclasses (roadmap)

ATM Forum
based on sensitivity to bandwidth
GS
CBR, VBR
BE
ABR, UBR

IETF
based on ToS
IETF based on RSVP
based on sensitivity to delay
GS
intolerant
tolerant
BE
interactive burst
interactive bulk
asynchronous bulk
IETF based in DiffServ
PHB
EF, 4 AFs and BE

34
ATM Basics

See the ATM Forum Presentation

35
ATM Basics

Logical or Virtual Connection
Connection is first established using signaling
protocol
Route from the source to the destination is
chosen
The same route is used for all cells (fixed size
packets) of the connection
No routing decision for every cell (they are
switched in the same path)

36
Virtual Circuits in ATM

Virtual Circuit Identifier is represented jointly
by
Virtual Channel Identifier (VCI)
Virtual Path Identifier (VPI)
Virtual Channel (VC)
Path for cell associated with a connection
Supports transportation of a data stream
Each VC is assigned a unique VCI on a link

37
Virtual Channels in ATM

Virtual Path (VP)
Grouping of virtual channels on a physical link
Switching can be performed on the path basis
reduced overheads
Each virtual path is assigned Virtual Path
Identifier (VPI)

38
VCs In ATM
39
Virtual Path Switch (VP - Switch)
40
VP / VC Switch
41
ATM Network Example
D2
Switch 2
Access
Switch 1
Switch 3
Mux
S2
D1
S1
Core
CAC
CAC

Each connection has its own traffic descriptors
such as PCR, SCR, MBS, CDVT, CLR, MCR
A Connection Admission Control algorithm (CAC)
will check for the resources at queuing points to
make a decision on admissibility
Network efficiency depends upon the CAC

42
ATM Forum GS subclasses

Constant Bit Rate (CBR)
constant, cell-smooth traffic
mean and peak rate are the same
e.g. telephone call evenly sampled and
uncompressed
constant bandwidth, variable quality
Variable Bit Rate (VBR)
long term average with occasional bursts
try to minimize delay
can tolerate loss and higher delays than CBR
e.g. compressed video or audio with constant
quality, variable bandwidth

43
ATM Forum BE subclasses

Available Bit Rate (ABR)
users get whatever is available
zero loss if network signals (in RM cells) are
obeyed
no guarantee on delay or bandwidth
Unspecified Bit Rate (UBR)
like ABR, but no feedback
no guarantee on loss
presumably cheaper
Guaranteed Frame Rate (GFR)
like UBR/ABR, expressed in terms of frame rate

44
ATM Attributes

How do we describe a flow (connection) of ATM
Service?
Service Category
Traffic Parameters or descriptors
QoS parameters
Congestion (for ABR)
Other (for UBR)
Cell Loss Priority (CLP0 or CLP01)
Connections are signaled with various parameters
A Connection Admission Control (CAC) procedure
checks for resources in the network
If connection is accepted, a traffic contract
is awarded to the user (Service Level Agreement)

45
Traffic Descriptors or Parameters

Connection Traffic Descriptor
Source Traffic Descriptor PCR, SCR, MBS, MCR,
MFS
Cell Delay Variation Tolerance (?) upper bound
on amount of cell delay that is introduced by the
network interface and the UNI (due to
interleaving, physical layer overhead,
multiplexing, etc.)
Conformance Definition unambiguous specification
of conforming cells of a connection at the UNI (
a policing function is used to check for
conformance such as Generic Cell Rate Algorithm
(GCRA))

46
Traffic Parameters (Source Traffic Descriptor)

Peak Cell Rate (PCR) upper bound on traffic
submitted by source (PCR 1/T, where T minimum
cell spacing
Sustainable Cell Rate (SCR) upper bound on
average rate of traffic submitted by source
(over a larger T)
Maximum Burst Size (MBS) maximum number of cells
sent continuously at PCR
Minimum Cell Rate (MCR) used with ABR and GFR,
minimum cell rate requested, access to unused
capacity up to PCR (elastic capacity PCR-MCR)
Maximum Frame Size (MFS) maximum size of a frame
in cells available for GFR service

47
Cell Rates

Peak Cell Rate (PCR), Line Cell Rate (LCR)

T1/PCR
t1/LCR

Sustained Cell Rate (SCR) PCR(Ton/TonToff)

Ton
Toff
48
Quality of Service

Cell Transfer Delay (CTD)
Cell Delay Variation (CDV)

time
Cell arrival pattern
Queuing point (e.g. mux, switch)
Cell departure pattern without CDV
Cell departure pattern with CDV
Switch transit delay
Positive CDV
Negative CDV
49
Cell Transfer Delay Probability Density
Variable component of delay, due to buffering
and cell scheduling.
50
QoS Parameters

Peak-to-peak cell delay variation (CDV)
acceptable delay variation at destination. The
peak-to-peak CDV is the (1 - ?) quantile of the
CTD minus the fixed CTD that could be experienced
by any delivered cell on a connection during the
entire connection holding time.
Maximum Cell Transfer Delay (maxCTD) maximum
time between transmission of first bit of a cell
at the source UNI to receipt of its last bit at
the destination UNI
Cell Loss Ratio ratio of lost cells to total
transmitted cells on a connection Lost
Cells/Total Transmitted Cells

51
Other Attributes

Congestion Control
defined only for ABR service category
uses network feedback controls
ABR flow control mechanism (more later)
Other Attributes (introduced July 2000)
Behavior class selector (BCS)
for IP differentiated services (DiffServ)
provides for different levels of service among
UBR connections
implementation dependent, no guidance in specs
Minimum desired cell rate (MDCR)
UBR application minimum capacity objective

52
Attributes of Each Service Category
53
Service Paradigm

Quantitative Commitments
Sets explicit values
Ensures service quality through resource
allocation and traffic policing
Qualitative Commitments
Relative measure and no explicit guarantees
Some unspecified level of quality through
network engineering

54
Quantitative Commitments

Generally connection oriented transport
Network nodes maintain per-flow state info
QoS (or GOS) requirements of each connection is
explicitly specified and signaled
Network enforces traffic regulation (policing,
shaping) if necessary and allocates resources for
each connection
Examples Voice networks (POTS), ATM, FR
Expensive and under-utilized

55
Qualitative Commitments

Generally connection less transport
no per-flow state info is maintained due to flow
aggregation
QoS requirements are not explicitly specified
Network may not enforce traffic regulation
May allocate resources for logical groups (such
as VPN)
Examples IP, LANs
Cheap and over-utilized

56
QoS Building Blocks

Backbone supporting QoS speed and scale
Packet / Service classification (sorting)
Bandwidth management and admission control
Queue management
Congestion management
Granular measurements

57
Functions Needed

Admission control - some way to limit usage
relative to resources.
Packet scheduling - some way to treat different
packets differently.
Classifier mechanism - some way to sort packets
into different treatment groups.
Policies and rules for allocating resources.

58
IETF

Internet currently provides only single class of
best-effort service.
No admission control and no assurances about
delivery
Existing applications are elastic.
Tolerate delays and losses
Can adapt to congestion
Future real-time applications may be inelastic.
Should we modify these applications to be more
adaptive or should we modify the Internet to
support inelastic behavior?

59
IETF ToS (1-byte Type-of-Service)

Bits 0-2 Precedence.
Bit 3 0 Normal Delay, 1 Low Delay.
Bits 4 0 Normal Throughput, 1 High
Throughput.
Bits 5 0 Normal Relibility, 1 High
Relibility.
Bit 6-7 Reserved for Future Use

60
IETF int-serv (Integrated Services)

Focus on per-flow QoS.
Support specific applications such as video
streaming.
Based on mathematical guarantees.
Many concerns
Complexity
Scalability
Business model
Charging
Uses RSVP (Resource-Reservation Protocol)
To signal QoS requirements

61
IETF int-serv (Integrated Services)

Guaranteed service
Targets hard real-time applications.
User specifies traffic characteristics and a
service requirement.
Requires admission control at each of the
routers.
Can mathematically guarantee bandwidth, delay,
and jitter.
Controlled load.
Targets applications that can adapt to network
conditions within a certain performance window.
User specifies traffic characteristics and
bandwidth.
Requires admission control at each of the
routers.
Guarantee not as strong as with the guaranteed
service.
e.g., measurement-based admission control.
Best effort

62
RSVP

Sender sends PATH message to network
PATH leads data through the network
Routers install per-flow state
Receiver responds with RESV
RESV follows PATH trail back towards sender
Routers accept resource request (commit resources
to flow) or reject resource request
Data is handled in network elements

63
IETF GS subclasses

Tolerant GS
nominal mean delay, but can tolerate occasional
variation
not specified what this means exactly
uses controlled-load service
even at high loads, admission control assures a
source that its service does not suffer
it really is this imprecise!
Intolerant GS
need a worst case delay bound
equivalent to CBRVBR in ATM Forum model

64
IETF BE subclasses

Interactive burst
bounded asynchronous service, where bound is
qualitative, but pretty tight
e.g. paging, messaging, email
Interactive bulk
bulk, but a human is waiting for the result
e.g. FTP
Asynchronous bulk
junk traffic
e.g netnews

65
IETF Diff-Serv (Differentiated Services)

Intended to address the following difficulties
with Intserv and RSVP
Scalability maintaining states by routers in
high speed networks is difficult due to the very
large number of flows
Flexible Service Models Intserv has only two
classes, want to provide more qualitative service
classes want to provide relative service
distinction (Platinum, Gold, Silver, )
Simpler signaling (than RSVP) many applications
and users may only want to specify a more
qualitative notion of service

66
Diffserv PHB (Per-Hop-Behavior)

Packet is marked in the Type of Service (TOS) in
IPv4, and Traffic Class in IPv6.
6 bits used for Differentiated Service Code Point
(DSCP) and determine PHB that the packet will
receive.
EF, 4 classes of AF, each with 3 drop priorities
(AF11, AF12, AF13, AF21, AF22, AF23, AF31, AF32,
AF33, AF41, AF42, AF43)and Best-Effort (BE)
2 bits are currently unused.

67
PHB Class Selector

Derived from IP Precedence values
6 bit diff-serv code point (DSCP) determines
per-hop behavior of packet treatment
Expedited Forwarding (EF) low loss and latency
Assured Forwarding (AF) 4 classes, 3 drop
precedence
Best Effort (BE) classical IP
No absolute guarantees

68
DiffServ

Routers configured for certain PHBs (Per Hop
Behavior)
Resources are allocated to PHBs
Edge routers are configured to mark DSCP
(requests PHB) based on classification
information
Traffic arriving at edge router marked with DSCP
Traffic in core routers go to PHB requested by
DSCP

DSCP marked at edge
SLA defines capacity at each service level (DSCP)
69
Diff-Serv Network Architecture
Backbone
Scalable Solutions Require Cooperative Edge
and Backbone Functions

Edge Functions
Packet classification
Bandwidth management
L3 metering
Security filtering
Access aggregation

Backbone Functions
High-speed switching and transport
QoS enforcement
QoS interworking

70
Packet Classification

Up to six traffic classes via ToS precedence bits
Classification by physical port, IP address,
application, IP protocol, etc.
Network or external assignment

71
Multi-field Packet Classification
L3-DA
L3-SA
L4-PROT
Field 1 Field 2 Field k Action
Rule 1 5.3.40.0/21 2.13.8.11/32 UDP A1
Rule 2 5.168.3.0/24 152.133.0.0/16 TCP A2

Rule N 5.168.0.0/16 152.0.0.0/8 ANY AN
Packet Classification Find the action associated
with the highest priority rule matching an
incoming packet header.
Courtesy Nick McKeown_at_Stanford
72
Formal Problem Definition

Given a classifier C with N rules, Rj, 1 ? j ? N,
where Rj consists of three entities
A regular expression Rji, 1 ? i ? d, on each of
the d header fields,
A number, pri(Rj), indicating the priority of the
rule in the classifier, and
An action, referred to as action(Rj).

For an incoming packet P with the header
considered as a d-tuple of points (P1, P2, ,
Pd), the d-dimensional packet classification
problem is to find the rule Rm with the highest
priority among all the rules Rj matching the
d-tuple i.e., pri(Rm) gt pri(Rj), ? j ? m, 1 ? j
? N, such that Pi matches Rji, 1 ? i ? d. We
call rule Rm the best matching rule for packet P.
Courtesy Nick McKeown_at_Stanford
73
Routing Lookup Instance of 1D Classification

One-dimension (destination address)
Forwarding table ? classifier
Routing table entry ? rule
Outgoing interface ? action
Prefix-length ? priority

Courtesy Nick McKeown_at_Stanford
74
Example 4D Classifier
Rule L3-DA L3-SA L4-DP L4-PROT Action
R1 152.163.190.69/255.255.255.255 152.163.80.11/255.255.255.255 Deny
R2 152.168.3/255.255.255 152.163.200.157/255.255.255.255 eq www udp Deny
R3 152.168.3/255.255.255 152.163.200.157/255.255.255.255 range 20-21 udp Permit
R4 152.168.3/255.255.255 152.163.200.157/255.255.255.255 eq www tcp Deny
R5 Deny
Courtesy Nick McKeown_at_Stanford
75
Example Classification Results
Pkt Hdr L3-DA L3-SA L4-DP L4-PROT Rule, Action
P1 152.163.190.69 152.163.80.11 www tcp R1, Deny
P2 152.168.3.21 152.163.200.157 www udp R2, Deny
Courtesy Nick McKeown_at_Stanford
76
Classification algorithms

Types
Linear search
Associative search
Trie-based techniques
Crossproducting
Heuristic algorithms
Algorithms So far
Good for two fields, but do not scale to more
than two fields, OR
Good for very small classifiers (lt 50 rules)
only, OR
Have non-deterministic classification time, OR
Either too slow or consume too much storage

Another Project Item
77
DiffServ Routers

DiffServ Edge Router
Classifier
Meter
Policer
Marker

DiffServ Core Router
PHB
PHB
Select PHB
Local conditions
PHB
PHB
Extract DSCP
Packet treatment
78
Edge Router/Host Functions

Classification marks packets according to
classification rules to be specified.
Metering checks whether the traffic falls within
the negotiated profile.
Marking marks traffic that falls within profile.
Conditioning delays and then forwards, discards,
or remarks other traffic.

79
Core Functions

Forwarding according to Per-Hop-Behavior or
PHB specified for the particular packet class
such PHB is strictly based on class marking (no
other header fields can be used to influence
PHB).
BIG ADVANTAGE
No state info to be maintained by routers!

80
Forwarding (PHB)

PHB results in a different observable
(measurable) forwarding performance behavior.
PHB does not specify what mechanisms to use to
ensure required PHB performance behavior.
Examples
Class A gets x of outgoing link bandwidth over
time intervals of a specified length.
Class A packets leave first before packets from
class B.

81
Forwarding (PHB)

Expedited Forwarding (EF)
Guarantees a certain minimum rate for the EF
traffic.
Implies isolation guarantee for the EF traffic
should not be influenced by the other traffic
classes.
Admitted based on peak rate.
Non-conformant traffic is dropped or shaped.
Possible service providing a virtual wire.

82
Forwarding (PHB)

Assured Forwarding (AF)
AF defines 4 classes with some bandwidth and
buffers allocated to them.
The intent is that it will be used to implement
services that differ relative to each other
(e.g., gold, silver,).
Within each class, there are three drop
priorities, which affect which packets will get
dropped first if there is congestion.
Lots of studies on how these classes and drop
priorities interact with TCP flow control.
Non-conformant traffic is remarked.

83
Example of EF A Virtual Leased Line Service

Service offers users a dedicated traffic pipe.
Guaranteed bandwidth between two points.
Very low latency and jitter since there should be
no queuing delay (peak rate allocation).
Admission control makes sure that all links in
the network core have sufficient EF bandwidth.
Simple case sum of all virtual link bandwidth is
less than the capacity of the slowest link.
Traffic enforcement for EF traffic limits how
much EF traffic enters the network.

84
Differentiated Services Issues

The key to making Diffserv work is bandwidth
management in the network core.
Simple for simple services such as the virtual
pipe, but it is much more challenging for complex
service level agreements.
Notion of a bandwidth broker that manages the
core network bandwidth.
Definition of end-to-end services for paths that
cross networks with different forwarding
behaviors
Some packets will be handled differently in
different routers.
Some routers are not DiffServ capable.
Per-Domain Behavior (PDB)

85
Some points to ponder

The only thing out there is CBR and asynchronous
bulk!
There are application requirements. There are
also organizational requirements (link sharing)
Users needs QoS for other things too!
billing
privacy and security
reliability and availability

86
Outline

Economic principles
Traffic classes
Time scales
Mechanisms
Some open problems

87
Time scales

Some actions are taken once per call
tell network about traffic characterization and
request resources
in ATM networks, finding a path from source to
destination
Other actions are taken during the call, every
few round trip times
feedback flow control
Still others are taken very rapidly,during the
data transfer
scheduling
policing and regulation
Traffic management mechanisms must deal with a
range of traffic classes at a range of time scales

88
Summary of mechanisms at each time scale

Less than one round-trip-time (cell or packet
level)
Scheduling and buffer management
Regulation and policing
Policy routing (datagram networks)
One or more round-trip-times (burst-level)
Feedback flow control
Retransmission
Renegotiation

89
Summary (cont.)

Session (call-level)
Signaling
Admission control
Service pricing
Routing (connection-oriented networks)
Day
Peak load pricing
Weeks or months
Capacity planning

90
Outline

Economic principles
Traffic classes
Mechanisms at each time scale
Faster than one RTT
scheduling and buffer management
regulation and policing
policy routing
One RTT
Session
Day
Weeks to months
Some open problems

91
Faster than RTT

Scheduling and buffer management
Policing and Regulation
In separate set of slides

92
Renegotiation
93
Renegotiation

An option for guaranteed-service traffic
Static descriptors dont make sense for many real
traffic sources
interactive video
Multiple-time-scale traffic
burst size B that lasts for time T
for zero loss, descriptors (P,0), (A, B)
P peak rate, A average B Burst Size
T large gt serving even slightly below P leads to
large buffering requirements
one-shot descriptor is inadequate

94
Renegotiation (cont.)

Renegotiation matches service rate to traffic
Renegotiating service rate about once every ten
seconds is sufficient to reduce bandwidth
requirement nearly to average rate
works well in conjunction with optimal smoothing
Fast buffer reservation is similar
each burst of data preceded by a reservation
Renegotiation is not free
signaling overhead
call admission ?
perhaps measurement-based admission control

95
RCBR

Extreme viewpoint
All traffic sent as CBR
Renegotiate CBR rate if necessary
No need for complicated scheduling!
Buffers at edge of network
much cheaper
Easy to price
Open questions
when to renegotiate?
how much to ask for?
admission control
what to do on renegotiation failure

96
Outline

Economic principles
Traffic classes
Mechanisms at each time scale
Faster than one RTT
One RTT
Session
Signaling
Admission control
Day
Weeks to months
Some open problems

97
Signaling
98
Signaling

How a source tells the network its utility
function or resource requirements
Two parts
how to carry the message (transport)
how to interpret it (semantics)
Useful to separate these mechanisms

99
Signaling semantics

Classic scheme sender initiated
SETUP, SETUP_ACK, SETUP_RESPONSE
Admission control
Tentative resource reservation and confirmation
Simplex and duplex setup
Doesnt work for multicast

100
Resource translation

Application asks for end-to-end quality
How to translate to per-hop requirements?
E.g. end-to-delay bound of 100 ms
What should be bound at each hop?

101
Signaling transport

Telephone network uses Signaling System 7 (SS7)
Carried on Common Channel Interoffice Signaling
(CCIS) network
CCIS is a datagram network
SS7 protocol stack is loosely modeled on ISO (but
predates it)
Signaling in ATM networks uses Q.2931 standard
part of User Network Interface (UNI)
complex
layered over Service Specific Connection Oriented
Protocol SSCOP (a reliable transport protocol)
and AAL5

102
Internet signaling transport RSVP

Main motivation is to efficiently support
multipoint multicast with resource reservations
In unicast, a source communicates with only one
destination
In multicast, a source communicates with more
than one destination
Signalling Progression
Unicast
Naive multicast
Intelligent multicast
Naive multipoint multicast
RSVP

103
RSVP motivation
104
Multicast reservation styles

Naive multicast (source initiated)
source contacts each receiver in turn
wasted signaling messages
Intelligent multicast (merge replies)
two messages per link of spanning tree
source needs to know all receivers
and the rate they can absorb
doesnt scale
Naive multipoint multicast
two messages per source per link
cant share resources among multicast groups

105
RSVP

Receiver initiated
Reservation state per group, instead of per
connection
PATH and RESV messages
PATH sets up next hop towards source(s)
RESV makes reservation
Travel as far back up as necessary
how does receiver know of success?

106
Reservation Styles

How resource reservations are aggregated/merged
for multiple receivers in the same multicast
group
Two options, specified in the receivers
reservation requests
Reservation attribute reservation is shared over
flows from multiple senders, or distinct for each
sender
Sender selection explicit list or wildcard
Three reservation styles are defined

107
Filters

Allow receivers to separate reservations
Fixed filter
receive from exactly one source
Dynamic filter
dynamically choose which source is allowed to use
reservation

Fixed-Filter
Specifies a distinct reservation for each sender
and an explicit list of senders
Symbolic representation FF(S1Q1, S2Q2, )

Shared-Explicit
Specifies that a single resource reservation is
to be shared by an explicit list of senders
Symbolic representation SE(S1, S2, Q)

Wildcard-Filter
Specifies that a single resource reservation is
to be shared by all senders to this address
Symbolic representation WF(Q)

108
Soft state

State in switch controllers (routers) is
periodically refreshed
On a link failure, automatically find another
route
Transient!
But, probably better than with ATM

109
Why is signaling hard ?

Complex services
Feature interaction
call screening call forwarding
Tradeoff between performance and reliability
Extensibility and maintainability

110
Outline

Economic principles
Traffic classes
Mechanisms at each time scale
Faster than one RTT
One RTT
Session
Signaling
Admission control
Day
Weeks to months
Some open problems

111
Admission control
112
Connection Admission Control (CAC)

Can a call be admitted?
? (bandwidth allocated for all connections) ?
Link Rate
Otherwise call is inadmissible
What bandwidth to allocate to connections??
Depends upon the traffic, traffic model assumed
and the Queueing methodology deployed and model
used to estimate the required bandwidth
Procedure
Map the traffic descriptors associated with a
connection onto a traffic model
Use this traffic model with an appropriate
queuing model for each congestion point, to
estimate whether there are enough system
resources to admit the connection in order to
guarantee the QoS at every congestion (or
queuing) point.
Allocate resources if the connection is accepted.

113
CAC (continued ..)

Depending on the traffic models used, the CAC
procedures can be too conservative by over
allocating the resources.
This reduces the statistical gains
An efficient CAC is the one which produces
maximum amount of statistical gain at a given
congestion point without violating the QoS.
The efficiency of the CAC thus depends on how
closely the two steps (traffic model and queuing
model) above model reality.
Both the traffic and queuing models are well
researched and widely published in the
literature.

114
CBR and UBR Admission Control

CBR admission control (Peak Rate Allocation)
simple
on failure try again, reroute, or hold
Best-effort admission control
trivial
if minimum bandwidth needed, use CBR test

115
CAC for CBR (with small jitter)

Given the buffer size B, the link capacity C and
the peak cell rate of the connection PCRi,
determine a load ? such that the probability of
queue length exceeding B is less than ?, where ?
is a small number such as 10-10
Using M/D/1 model
Using nD/D/1 model

116
Cell Loss Probability versus Buffer Size

?0.9
M/D/1 is conservative
For large N, both give similar performance

117
VBR admission control

VBR
peak rate differs from average rate burstiness
if we reserve bandwidth at the peak rate, wastes
bandwidth
if we reserve at the average rate, may drop
packets during peak
key decision how much to overbook
Four known approaches
peak rate admission control
worst-case admission control
admission control with statistical guarantees
measurement-based admission control

118
1. Peak-rate admission control

Reserve at a connections peak rate
Pros
simple (can use FIFO scheduling)
connections get negligible delay and loss
works well for a small number of sources
Cons
wastes bandwidth
peak rate may increase because of scheduling
jitter

119
2. Worst-case admission control

Characterize source by average rate and burst
size (LBAP)
Use WFQ or rate-controlled discipline to reserve
bandwidth at average rate
Pros
may use less bandwidth than with peak rate
can get an end-to-end delay guarantee
Cons
for low delay bound, need to reserve at more than
peak rate!
implementation complexity

rate
time
120
3. Admission with statistical guarantees

Key insight is that as number of calls increases,
probability that multiple sources send a burst
decreases
sum of connection rates is increasingly smooth
With enough sources, traffic from each source can
be assumed to arrive at its average rate
Put in enough buffers to make probability of loss
low
Theory of large deviations quantitatively bounds
the overflow probability
By allowing a small loss, we can reduce the
resources considerably

121
Example

Consider an ensemble of 10 identical and
independent sources, each of which is on with a
probability 0.1. When on has a transmission
rate of 1.0. What is the probability that they
overflow a shared link of capacity 8?
The probability that n sources are on out of 10
is given by

The probability of loss is less than 10-6 For
peak allocation we need a capacity of 10 By
allowing loss, we reduced resources by 20!!
122
3. Admission with statistical guarantees (contd.)

Assume that traffic from a source is sent to a
buffer of size B which is drained at a constant
rate R
If source sends a burst, its delay goes up
If the burst is too large, bits are lost
Equivalent bandwidth (EBW) of the source is the
rate at which we need to drain this buffer so
that the probability of loss is less than L (and
the delay in leaving the buffer is less than d)
If many sources share a buffer, the equivalent
bandwidth of each source decreases (why?)
Equivalent bandwidth of an ensemble of
connections is the sum of their equivalent
bandwidths

123
3. Admission with statistical guarantees (contd.)

When a source arrives, use its performance
requirements and current network state to assign
it an equivalent bandwidth
Admission control sum of equivalent bandwidths
at the link should be less than link capacity
Pros
can trade off a small loss probability for a
large decrease in bandwidth reservation
mathematical treatment possible
can obtain delay bounds
Cons
assumes uncorrelated sources
hairy mathematics

124
Effective Bandwidth

This model maps each connections traffic
parameters into a real number EBWi, called the
Equivalent Bandwidth or Effective Bandwidth of
the connection such that the QoS constraints are
satisfied.
Thus, the effective bandwidth is derived as a
source property and with this mapping, the CAC
rule becomes very simple
For a connection with an average rate SCRi and
peak rate as PCRi, the effective bandwidth is a
number between the SCRi and PCRi. That is,
There are many methods and models published in
the literature

125
Properties of EBW

Additive Property Effective bandwidths are
additive, i.e., the total effective bandwidth
needed for N connections equals to the sum of
effective bandwidth of each connection
Independence Property Effective bandwidth for a
given connection is only a function of that
connections parameters.
due to the independence property, the effective
bandwidth method could be far more conservative
than a method which considers the true
statistical multiplexing (i.e., the method which
considers the presence of other connections)
With the effective bandwidths method, the CAC
function can add (or subtract) the effective
bandwidth of the connection which is being set-up
(or torn down) from the total effective
bandwidth. This is not easily possible with any
method which does not have the independence
property

126
EBW (First Approach by Roberts)

Assumes fluid sources and zero buffering (so that
two simultaneously active sources would cause
data loss)
Let each source has a peak rate P, mean rate m
and link capacity is C and required cell loss is
smaller than 10-9
The heuristic to estimate the EBW of a source is
EBW 1.2m 60m(P-m) / C
First term says EBW is 1.2 times of mean rate
Second term increases EBW in proportion to the
gap between peak and mean (an indicator of source
burstiness). This is mitigated by the large link
capacity.
Expression is independent of cell loss!!

127
EBW (Second approach by Gibbens and Hunt)

on-off sources with exponentially distributed
on and off periods
Let a source mean on period be and mean
off period be . When the source is on,
it is assumed to produce information at a
constant rate
Let B be the buffer size CLR is the cell loss
ratio required and
The Effective Bandwidth is given by

128
Example

Let traffic descriptors are SCR, PCR100Mb/s,
CLR10-7 and ABS (Average Burst Size)50 cells

129
EBW Observations

Equation implies that for large B, ??0 and EBW
(ci ) equals to the mean rate of the source
For a small buffer B, ??-? and the effective
bandwidth of the source will be , the peak
information rate
The queue length distribution is assumed to be
asymptotically exponential of form

130
EBW for Self-similar traffic (By Norros)

Let m is the mean bit rate of the traffic stream,
a is the coefficient of variation, B is the
buffer size, H is the Hurst parameter of the
stream (0.5?H?1), CLR is the target cell loss
ratio.
The EBW is given by
Note that this equation does not follow the
asymptotic exponential queue length distribution

131
Multi-class CAC

In the real world, the traffic flow consists of
multiple QoS classes, where, the services may be
partitioned and queued separately
To guarantee QoS, a certain amount of bandwidth
(or capacity) is reserved for each of the service
categories.
With effective bandwidth approach, this
assignment becomes very simple.
Let Nj be the number of sources for class j and
let ?j be the effective bandwidth of a source
belonging to class j. Let there be K such
classes. Then, the CAC for multi-class traffic
should check that the total estimated capacity is
less than the service rate. That is,

132
4. Measurement-based admission