Fair Queuing for Aggregated Multiple Links - PowerPoint PPT Presentation

1 / 45

About This Presentation

Title:

Fair Queuing for Aggregated Multiple Links

Description:

Fair Queuing service disciplines allocates bandwidth fairly among competing traffic. ... Calculation Method for Sorted-priority Fair Queuing Liu et al., 2004 ... – PowerPoint PPT presentation

Number of Views:66

Avg rating:3.0/5.0

Slides: 46

Provided by: the119

Category:

more less

Transcript and Presenter's Notes

Title: Fair Queuing for Aggregated Multiple Links

1
Fair Queuing for Aggregated Multiple Links

Josep M. Blanquer and Banu Özden
Proceedings of the ACM SIGCOMM, August 2001

2
ABSTRACT

Fair Queuing algorithms
Proportionally sharing a single server among
competing flows
Do not address the problem of sharing multiple
servers.
Multiserver applications
Link aggregation
Multiprocessors
Multi-path storage I/O

We introduce a new service discipline for
multi-server systems, MSF2Q, that provides
guarantees for competing flows.
We prove that this new service discipline is a
close approximation of the idealized Generalized
Processor Sharing (GPS) discipline.
We calculate its maximum packet delay and service
discrepancy with respect to GPS.

4
1. INTRODUCTION

A large increase in networked services ? a much
larger variety of traffic? different network
requirements to be met simultaneously over the
same links.
High bandwidth guarantee ? backups
low jitter guarantees? video streaming low
delay guarantees ?network data acquisition
Network resources must be appropriately
scheduled.

Fair Queuing service disciplines allocates
bandwidth fairly among competing traffic.
Protection from misbehaving traffic
Effective congestion control
Better services for rate-adaptive applications
Strict QoS guarantees, with admission control.

Growing demand for bandwidth ? Incremental
scaling techniques? Grouping multiple links into
a single logical interface 3
Implementations
1 3Coms Dynamic Access
2 Adaptec Duralink Software Suite
12 Hewlett Packards Auto-Port Aggregation
14 Intel Load Balancing
6 J. Blanquer, al. et. Resource Management for
QoS in Eclipse/BSD, Proceedings of the First
FreeBSD Conference, Berkeley, California, Oct.
1999.

7
Adaptec Duralink
8
HP Auto Port Aggregation
9
Intel Load Balancing
10
2. BACKGROUND

GPS (Generalized Processor Sharing)
Guaranteed fairness
Wx(t, t) the amount of traffic for flow x
served in the interval t, t, while any flow x
that is continuously backlogged during t, t.
?x weight of flow x proportion of the
server bandwidth that flow x receives when it
is backlogged.
Guaranteed rate
ri rate of flow ir server rate

Generalized Processor Sharing (GPS)
An idealized system that serves as a reference
model for the fair queuing disciplines.
The server transmits more than one flow
simultaneously and that the traffic is infinitely
divisible.
A number of packetized approximations to GPS have
been devised.
WFQ (Weighted Fair Queueing) 89 Demers et al.
VC (Virtual Clock) 90 Zhang
GPS (General Processor Sharing) 93 Parekh et al.
SCFQ (Self-Clocked Fair Queueing) 94 Golestani
WF2Q (Worst-case Fair Weighted Fair Queueing) 96
Bennett et al.
SFQ (Start Time Fair Queueing) 96 Goyal et al.

12
A New Priority Calculation Method for
Sorted-priority Fair Queuing Liu et al., 2004

B. Current packet priority calculation methods
Three best known packet priority calculation
methods are 9
Smallest Finish time First (SFF)
Packet selection PiX(t) li/?I (li packet
length)
WFQ and SCFQ
Smallest Start time First (SSF)
Packet selection PiX(t)
SFQ
Smallest Eligible Finish time First (SEFF)
Pre-selection sessions with session potentials
smaller than the system potential.
Packet selection (SFF) PiX(t) li/?i
WF2Q

13
3. PROPORTIONAL SHARING OF MULTISERVER SYSTEMS

Numerous applications utilizing multi-server
systems that can benefit from service guarantees
Network Multiple network adapters to a web or
file server
Storage Multiple I/O channels to a RAID server

System Model

WFQ
15
WFQ
16
3.1 A Packetized Fair Queuing Discipline for
Multi-Servers

MSFQs Scheduling discipline is the same as GPS
When a server is idle and there is a packet
waiting for service, MSFQ schedules the next
packet.
The next packet is defined as the first packet
that would complete service in the (GPS, 1,Nr)
system if no more packets were to arrive.
To compare how well a (MSFQ ,N, r) system
approximates a (GPS, 1,Nr) system, calculate
(i) the worst case delay
(ii) the traffic discrepancy

17
3.2 Preliminary Properties

Delay and service properties of MSFQ do not
trivially follow from the single server case,
WFQ.
GPS and MSFQ busy periods do not coincide.

Nr
Finish Time?1 L / Nr
(GPS, 1,Nr)
Bits left L r (L/Nr) L (L/N) (N-1)L
/ N
r
r
(MSFQ ,N, r)

r
Finish Time ?2 L / r
t
?W(0, t) W (0, t)
18

When GPS is busy, MSFQ is busy. However, the
converse is not true.
Thus for any t , W(0, t) W(0, t),
(2)where W(0, t) and W (0, t) denote the total
number of bits serviced by GPS and MSFQ ,
respectively, by time t.
We will use the term busy period to refer to a
busy period in the reference (GPS, 1,Nr) system.

Work from previous busy periods can accumulate
under MSFQ.
This may happen either at the beginning or in the
middle of a busy period.

Arrival Time
Delayed Finish Service Time
20
Arrival Time
Delayed Start Service Time
21

Theorem 1 For any t, W(0, t) - W (0, t) (N
- 1) Lmaxwhere Lmax denote the maximum packet
length.
Proof
The slope of W (GPS) alternates between Nr (when
a busy period resumes) and 0 (idle, between two
consecutive busy periods).
The slope of W (MSFQ) is at most Nr at any
given time,

22
(No Transcript)
23

Case 1 At most N - 1 MSFQ servers are busy at
t
Since MSFQ is work-conserving, if a server is
idle, we know that there is no packet waiting for
transmission.
In the worst case, all the k busy servers have
just started transmitting a packet of maximum
length (Lmax).
W(0, t) - W (0, t) k Lmax (a)
where k N 1

Case 2 All MSFQ servers are busy at t
Let to, t be the largest interval in which all
MSFQ servers are busy.
Since in to, t the slope of W is Nr , W(0,
t) - W(0, t) W(0, to) - W(0, to) (b)

If to 0, then W(0, t) W(0, t).Otherwise, if
to gt 0, we know from (a), W(0, to) - W(0, to)
(N - 1) Lmax (c)
From (b) and (c), we have
W(0, t) - W (0, t) (N - 1) Lmax ?
This theorem implies the need for a buffer space
of (N - 1) Lmax.

The discrepancy of packet departure times (i.e.
begin transmitting/servicing) between
multi-server and single-server
Let dp be the time at which packet p departs from
(GPS, 1,Nr) system.
MSFQ packets may not depart in increasing order
of dp.

Lemma 1 Packet k will be scheduled no later
than
where ak and bk be respectively the arrival
time and scheduling time of packet k over N
servers, each with a rate of r, P be the set of
packets scheduled before packet k since time ak,
including the packets in service at ak, Li be
the length of packet i.

Proof
Given a load that must be scheduled before packet
k, a work conserving service discipline schedules
packet k latest, if the load is equally divided
among the N servers such that all of them finish
the work at the same time. ?

29
4. PACKET DELAY

Theorem 2 For all packets p,
where dp and dp be the time at which packet p
departs from the (MSFQ,N, r) and (GPS,1, Nr)
system, respectively.
Proof
Skipped

30
5. SERVICE PER-FLOW

Theorem 3 For any t ,
Wi(0, t) - Wi (0, t) N Lmax
Proof
Skipped

31
6. FAIRNESS

Example 3
4 servers
11 flows (fixed packet length)
F1 Weight 0.5, 10 packets at t 0
F2 F11 Weight 0.05, each with 1 packet at t
0

GPS Scheduled by WFQ (? finish time)

F1A 0 L / 0.5
F1B F1A L / 0.5 2L / 0.5

F2 0 L / 0.05
F3 0 L / 0.05

33

MSFQ Scheduled by WFQ (? finish time)

GPS Scheduled by WF2Q (eligible start time (HOL)
finish time)

Not Smooth?
?
35

The direct application of WF2Q technique to
multi-server systems does not fix the undesired
burstiness problem and moreover, it makes the
discipline non-workconserving.

Not eligible until the previous pkt is
scheduled? non-workconserving
36
6.1 MSF2Q

(MSF2Q,N, r)
A packet is outstanding if it is being
transmitted.
Let ôi(t) denote the number of outstanding flow i
packets at the MSF2Q system at time t.
Wi(t, t) the work completed for flow i under
MSF2Q over the interval t, t

At time t, when a server is idle and there is a
packet waiting for service, MSF2Q schedules among
the flows (eligible) that satisfy or
and
That would complete service in the GPS system
earliest

Example 3 F1 r1 0.5 F2F10 rx 0.05 r
1/4 0.25 ? ô1 ?0.5/0.25? 2 ôx ?0.05/0.25?
1
38

The output of MSF2Q in Example 3

Smooth scheduling
Example 3 F1 r1 0.5 F2F10 rx 0.05 r
1/4 0.25 ? ô1 ?0.5/0.25? 2 ôx ?0.05/0.25?
1
39
6.2 Properties of MSF2Q

Theorem 4 Let Li,max denote the maximum packet
length of flow i. For any time t and flow i, the
following property holds (8)
Proof
Skipped

40
7. APPLICATIONS

Link Aggregation
Logical grouping of several Ethernet network
interfaces to allow for cost-effective, load
balancing, better scalability, and
fault-tolerance.
IEEE 802.3ad
Currently ranges from two to eight Fast/Gigabit
Ethernet ports in either servers or switching
elements.

Access of storage I/O
To connect the RAID system to a host (e.g., Web
server) with multiple SCSI or Fiber Channels to
improve the I/O performance.
Load balancing, failover

42
8. RELATEDWORK

Skipped

43
9. CONTRIBUTIONS AND FUTUREWORK

Link aggregation, or the aggregation of multiple
interfaces into a single logical link, is
becoming the predominant approach for bandwidth
scaling.
Numerous fair queuing results previously obtained
for single server systems do not directly apply
to multi-server systems.

We first analyzed the cumulative service, packet
delay and per-flow cumulative service bounds for
Weighted Fair Queuing (WFQ) applied to a
multi-server system.
We then presented a new fair queuing algorithm -
MSF2Q that leads to smooth and fair schedules in
finer time scales.

Our future plans include
Investigation of implementation issues
Quantitative comparison of the approach presented
in this paper to the alternative approach of
partitioning flows among servers
Enhancing the algorithms for multiprocessors and
cluster of servers
Hierarchal GPS
Servers with different rates
Misordering of packets