Interconnection Network Design - PowerPoint PPT Presentation

About This Presentation
Title:

Interconnection Network Design

Description:

Title: Introduction to Network Design Author: David Culler Last modified by: bhuyan Created Date: 3/17/1999 12:54:39 AM Document presentation format – PowerPoint PPT presentation

Number of Views:117
Avg rating:3.0/5.0
Slides: 24
Provided by: DavidCu164
Learn more at: http://www.cs.ucr.edu
Category:

less

Transcript and Presenter's Notes

Title: Interconnection Network Design


1
Interconnection Network Design
  • Adapted from UC, Berkeley Notes

2
Scalable, High Perf. Interconnection Network
  • At Core of Parallel Computer Arch.
  • Requirements and trade-offs at many levels
  • Elegant mathematical structure
  • Deep relationships to algorithm structure
  • Managing many traffic flows
  • Electrical / Optical link properties
  • Little consensus
  • interactions across levels
  • Performance metrics?
  • Cost metrics?
  • Workload?
  • gt need holistic understanding

3
Requirements from Above
  • Communication-to-computation ratio
  • gt bandwidth that must be sustained for given
    computational rate
  • traffic localized or dispersed?
  • bursty or uniform?
  • Programming Model
  • protocol
  • granularity of transfer
  • degree of overlap (slackness)
  • gt job of a parallel machine network is to
    transfer information from source node to dest.
    node in support of network transactions that
    realize the programming model

4
Goals
  • latency as small as possible
  • as many concurrent transfers as possible
  • operation bandwidth
  • data bandwidth
  • cost as low as possible

5
Links and Channels
  • transmitter converts stream of digital symbols
    into signal that is driven down the link
  • receiver converts it back
  • tran/rcv share physical protocol
  • trans link rcv form Channel for digital info
    flow between switches
  • link-level protocol segments stream of symbols
    into larger units packets or messages (framing)
  • node-level protocol embeds commands for dest
    communication assist within packet

6
Formalism
  • network is a graph V switches and nodes
    connected by communication channels C Í V V
  • Channel has width w and signaling rate f 1/t
  • channel bandwidth b wf
  • phit (physical unit) data transferred per cycle
  • flit - basic unit of flow-control
  • Number of input (output) channels is switch
    degree
  • Sequence of switches and links followed by a
    message is a route
  • Think streets and intersections

7
What characterizes a network?
  • Topology (what)
  • physical interconnection structure of the network
    graph
  • direct node connected to every switch
  • indirect nodes connected to specific subset of
    switches
  • Routing Algorithm (which)
  • restricts the set of paths that msgs may follow
  • many algorithms with different properties
  • gridlock avoidance?
  • Switching Strategy (how)
  • how data in a msg traverses a route
  • circuit switching vs. packet switching
  • Flow Control Mechanism (when)
  • when a msg or portions of it traverse a route
  • what happens when traffic is encountered?

8
Topological Properties
  • Routing Distance - number of links on route
  • Diameter - maximum routing distance between any
    two nodes in the network
  • Average Distance Sum of distances between
    nodes/number of nodes
  • Degree of a Node Number of links connected to a
    node gt Cost high if degree is high
  • A network is partitioned by a set of links if
    their removal disconnects the graph
  • Fault-tolerance Number of alternate paths
    between two nodes in a network

9
Typical Packet Format
  • Two basic mechanisms for abstraction
  • encapsulation
  • fragmentation

10
Communication Perf Latency
  • Time(n)s-d overhead routing delay channel
    occupancy contention delay
  • occupancy (n ne) / b
  • Routing delay?
  • Contention?

11
Review Performance Metrics
Sender
(processor busy)
Transmission time (size bandwidth)
Time of Flight
Receiver Overhead
Receiver
(processor busy)
Transport Latency
Total Latency
Total Latency Sender Overhead Time of Flight
Message Size BW
Receiver Overhead
Includes header/trailer in BW calculation?
12
StoreForward vs Cut-Through Routing
  • h(n/b D) vs n/b h D
  • what if message is fragmented?
  • wormhole vs virtual cut-through

13
Store and Forward vs. Cut-Through
  • Store-and-forward policy each switch waits for
    the full packet to arrive in switch before
    sending to the next switch (good for WAN)
  • Cut-through routing or worm hole routing switch
    examines the header, decides where to send the
    message, and then starts forwarding it
    immediately
  • In worm hole routing, when head of message is
    blocked, message stays strung out over the
    network, potentially blocking other messages
    (needs only buffer the piece of the packet that
    is sent between switches). CM-5 uses it, with
    each switch buffer being 4 bits per port.
  • Cut through routing lets the tail continue when
    head is blocked, accordioning the whole message
    into a single switch. (Requires a buffer large
    enough to hold the largest packet).

14
Store and Forward vs. Cut-Through
  • Advantage
  • Latency reduces from function ofnumber of
    intermediate switches X by the size of the packet
    to time for 1st part of the packet to
    negotiate the switches the packet size
    interconnect BW

15
Contention
  • Two packets trying to use the same link at same
    time
  • limited buffering
  • drop?
  • Most parallel mach. networks block in place
  • link-level flow control
  • tree saturation
  • Closed system - offered load depends on delivered

16
Congestion Control
  • Packet switched networks do not reserve
    bandwidth this leads to contention (connection
    based limits input)
  • Solution prevent packets from entering until
    contention is reduced (e.g., freeway on-ramp
    metering lights)
  • Options
  • Packet discarding If packet arrives at switch
    and no room in buffer, packet is discarded (e.g.,
    UDP)
  • Flow control between pairs of receivers and
    senders use feedback to tell sender when
    allowed to send next packet
  • Back-pressure separate wires to tell to stop
  • Window give original sender right to send N
    packets before getting permission to send more
    overlapslatency of interconnection with overhead
    to send receive packet (e.g., TCP), adjustable
    window
  • Choke packets aka rate-based Each packet
    received by busy switch in warning state sent
    back to the source via choke packet. Source
    reduces traffic to that destination by a fixed
    (e.g., ATM)

17
Protocols HW/SW Interface
  • Internetworking allows computers on independent
    and incompatible networks to communicate reliably
    and efficiently
  • Enabling technologies SW standards that allow
    reliable communications without reliable networks
  • Hierarchy of SW layers, giving each layer
    responsibility for portion of overall
    communications task, called protocol families or
    protocol suites
  • Transmission Control Protocol/Internet Protocol
    (TCP/IP)
  • This protocol family is the basis of the Internet
  • IP makes best effort to deliver TCP guarantees
    delivery
  • TCP/IP used even when communicating locally NFS
    uses IP even though communicating across
    homogeneous LAN

18
TCP/IP packet
  • Application sends message
  • TCP breaks into 64KB segements, adds 20B header
  • IP adds 20B header, sends to network
  • If Ethernet, broken into 1500B packets with
    headers, trailers
  • Header, trailers have length field, destination,
    window number, version, ...

Ethernet
IP Header
TCP Header
IP Data
TCP data ( 64KB)
19
Bandwidth
  • What affects local bandwidth?
  • packet density b x n/(n ne)
  • routing delay b x n / (n ne wD)
  • contention
  • endpoints
  • within the network
  • Aggregate bandwidth
  • bisection bandwidth
  • sum of bandwidth of smallest set of links that
    partition the network
  • total bandwidth of all the channels Cb
  • suppose N hosts issue packet every M cycles with
    ave dist
  • each msg occupies h channels for l n/w cycles
    each
  • C/N channels available per node
  • link utilization r MC/Nhl lt 1

20
Switches
21
Switch Components
  • Output ports
  • transmitter (typically drives clock and data)
  • Input ports
  • synchronizer aligns data signal with local clock
    domain
  • essentially FIFO buffer
  • Crossbar
  • connects each input to any output
  • degree limited by area or pinout
  • Buffering
  • Control logic
  • complexity depends on routing logic and
    scheduling algorithm
  • determine output port for each incoming packet
  • arbitrate among inputs directed at same output

22
Interconnection Topologies
  • Class networks scaling with N
  • Logical Properties
  • distance, degree
  • Physcial properties
  • length, width
  • Fully connected network
  • diameter 1
  • degree N
  • cost?
  • bus gt O(N), but BW is O(1) - actually worse
  • crossbar gt O(N2) for BW O(N)
  • VLSI technology determines switch degree

23
Summary
Topology Degree Diameter Ave Dist Bisection D (D
ave) _at_ P1024 1D Array 2 N-1 N / 3 1 huge 1D
Ring 2 N/2 N/4 2 2D Mesh 4 2 (N1/2 - 1) 2/3
N1/2 N1/2 63 (21) 2D Torus 4 N1/2 1/2
N1/2 2N1/2 32 (16) k-ary n-cube 2n nk/2 nk/4 nk/4
15 (7.5) _at_n3 Hypercube n log N n n/2 N/2 10
(5)
Write a Comment
User Comments (0)
About PowerShow.com