3F4 Error Control Coding - PowerPoint PPT Presentation

About This Presentation
Title:

3F4 Error Control Coding

Description:

3F4 Error Control Coding Dr. I. J. Wassell Introduction Error Control Coding (ECC) Extra bits are added to the data at the transmitter (redundancy) to permit error ... – PowerPoint PPT presentation

Number of Views:512
Avg rating:3.0/5.0
Slides: 71
Provided by: ijw9
Category:
Tags: 3f4 | coding | control | error | noise

less

Transcript and Presenter's Notes

Title: 3F4 Error Control Coding


1
3F4 Error Control Coding
  • Dr. I. J. Wassell

2
Introduction
  • Error Control Coding (ECC)
  • Extra bits are added to the data at the
    transmitter (redundancy) to permit error
    detection or correction at the receiver
  • Done to prevent the output of erroneous bits
    despite noise and other imperfections in the
    channel
  • The positions of the error control coding and
    decoding are shown in the transmission model

3
Transmission Model
4
Error Models
  • Binary Symmetric Memoryless Channel
  • Assumes transmitted symbols are binary
  • Errors affect 0s and 1s with equal
    probability (i.e., symmetric)
  • Errors occur randomly and are independent from
    bit to bit (memoryless)

1-p
0
0
p is the probability of bit error or the Bit
Error Rate (BER) of the channel
p
IN
OUT
p
1
1
1-p
5
Error Models
  • Many other types
  • Burst errors, i.e., contiguous bursts of bit
    errors
  • output from DFE (error propagation)
  • common in radio channels
  • Insertion, deletion and transposition errors
  • We will consider mainly random errors

6
Error Control Techniques
  • Error detection in a block of data
  • Can then request a retransmission, known as
    automatic repeat request (ARQ) for sensitive data
  • Appropriate for
  • Low delay channels
  • Channels with a return path
  • Not appropriate for delay sensitive data, e.g.,
    real time speech and data

7
Error Control Techniques
  • Forward Error Correction (FEC)
  • Coding designed so that errors can be corrected
    at the receiver
  • Appropriate for delay sensitive and one-way
    transmission (e.g., broadcast TV) of data
  • Two main types, namely block codes and
    convolutional codes. We will only look at block
    codes

8
Block Codes
  • We will consider only binary data
  • Data is grouped into blocks of length k bits
    (dataword)
  • Each dataword is coded into blocks of length n
    bits (codeword), where in general ngtk
  • This is known as an (n,k) block code

9
Block Codes
  • A vector notation is used for the datawords and
    codewords,
  • Dataword d (d1 d2.dk)
  • Codeword c (c1 c2..cn)
  • The redundancy introduced by the code is
    quantified by the code rate,
  • Code rate k/n
  • i.e., the higher the redundancy, the lower the
    code rate

10
Block Code - Example
  • Dataword length k 4
  • Codeword length n 7
  • This is a (7,4) block code with code rate 4/7
  • For example, d (1101), c (1101001)

11
Error Control Process
Source code data chopped into blocks
Codeword (n bits)
101101
Dataword (k bits)
Codeword possible errors (n bits)
Dataword (k bits)
Error flags
12
Error Control Process
  • Decoder gives corrected data
  • May also give error flags to
  • Indicate reliability of decoded data
  • Helps with schemes employing multiple layers of
    error correction

13
Parity Codes
  • Example of a simple block code Single Parity
    Check Code
  • In this case, n k1, i.e., the codeword is the
    dataword with one additional bit
  • For even parity the additional bit is,
  • For odd parity the additional bit is 1-q
  • That is, the additional bit ensures that there
    are an even or odd number of 1s in the
    codeword

14
Parity Codes Example 1
  • Even parity
  • (i) d(10110) so,
  • c(101101)
  • d(11011) so,
  • c(110110)

15
Parity Codes Example 2
  • Coding table for (4,3) even parity code

16
Parity Codes
  • To decode
  • Calculate sum of received bits in block (mod 2)
  • If sum is 0 (1) for even (odd) parity then the
    dataword is the first k bits of the received
    codeword
  • Otherwise error
  • Code can detect single errors
  • But cannot correct error since the error could be
    in any bit
  • For example, if the received dataword is (100000)
    the transmitted dataword could have been (000000)
    or (110000) with the error being in the first or
    second place respectively
  • Note error could also lie in other positions
    including the parity bit

17
Parity Codes
  • Known as a single error detecting code (SED).
    Only useful if probability of getting 2 errors is
    small since parity will become correct again
  • Used in serial communications
  • Low overhead but not very powerful
  • Decoder can be implemented efficiently using a
    tree of XOR gates

18
Hamming Distance
  • Error control capability is determined by the
    Hamming distance
  • The Hamming distance between two codewords is
    equal to the number of differences between them,
    e.g.,
  • 10011011
  • 11010010 have a Hamming distance 3
  • Alternatively, can compute by adding codewords
    (mod 2)
  • 01001001 (now count up the ones)

19
Hamming Distance
  • The Hamming distance of a code is equal to the
    minimum Hamming distance between two codewords
  • If Hamming distance is
  • 1 no error control capability i.e., a single
    error in a received codeword yields another
    valid codeword
  • XXXXXXX X is a valid codeword
  • Note that this representation is diagrammatic
    only.
  • In reality each codeword is surrounded by n
    codewords. That is, one for every bit that
    could be changed

20
Hamming Distance
  • If Hamming distance is
  • 2 can detect single errors (SED) i.e., a
    single error will yield an invalid codeword
  • XOXOXO X is a valid codeword
  • O in not a valid codeword
  • See that 2 errors will yield a valid (but
    incorrect) codeword

21
Hamming Distance
  • If Hamming distance is
  • 3 can correct single errors (SEC) or can detect
    double errors (DED)
  • XOOXOOX X is a valid codeword
  • O in not a valid codeword
  • See that 3 errors will yield a valid but
    incorrect codeword

22
Hamming Distance - Example
  • Hamming distance 3 code, i.e., SEC/DED
  • Or can perform single error correction (SEC)

X is a valid codeword O is an invalid codeword
23
Hamming Distance
  • The maximum number of detectable errors is
  • That is the maximum number of correctable errors
    is given by,
  • where dmin is the minimum Hamming distance
    between 2 codewords and means the smallest
    integer

24
Linear Block Codes
  • As seen from the second Parity Code example, it
    is possible to use a table to hold all the
    codewords for a code and to look-up the
    appropriate codeword based on the supplied
    dataword
  • Alternatively, it is possible to create codewords
    by addition of other codewords. This has the
    advantage that there is now no longer the need to
    held every possible codeword in the table.

25
Linear Block Codes
  • If there are k data bits, all that is required is
    to hold k linearly independent codewords, i.e., a
    set of k codewords none of which can be produced
    by linear combinations of 2 or more codewords in
    the set.
  • The easiest way to find k linearly independent
    codewords is to choose those which have 1 in
    just one of the first k positions and 0 in the
    other k-1 of the first k positions.

26
Linear Block Codes
  • For example for a (7,4) code, only four codewords
    are required, e.g.,
  • So, to obtain the codeword for dataword 1011, the
    first, third and fourth codewords in the list are
    added together, giving 1011010
  • This process will now be described in more detail

27
Linear Block Codes
  • An (n,k) block code has code vectors
  • d(d1 d2.dk) and
  • c(c1 c2..cn)
  • The block coding process can be written as cdG
  • where G is the Generator Matrix

28
Linear Block Codes
  • Thus,
  • ai must be linearly independent, i.e.,
  • Since codewords are given by summations of the
    ai vectors, then to avoid 2 datawords having the
    same codeword the ai vectors must be linearly
    independent

29
Linear Block Codes
  • Sum (mod 2) of any 2 codewords is also a
    codeword, i.e.,
  • Since for datawords d1 and d2 we have

So,
30
Linear Block Codes
  • 0 is always a codeword, i.e.,
  • Since all zeros is a dataword then,

31
Error Correcting Power of LBC
  • The Hamming distance of a linear block code (LBC)
    is simply the minimum Hamming weight (number of
    1s or equivalently the distance from the all 0
    codeword) of the non-zero codewords
  • Note d(c1,c2) w(c1 c2) as shown previously
  • For an LBC, c1 c2c3
  • So min (d(c1,c2)) min (w(c1 c2)) min (w(c3))
  • Therefore to find min Hamming distance just need
    to search among the 2k codewords to find the min
    Hamming weight far simpler than doing a pair
    wise check for all possible codewords.

32
Linear Block Codes example 1
  • For example a (4,2) code, suppose

a1 1011 a2 0101
  • For d 1 1, then

33
Linear Block Codes example 2
  • A (6,5) code with
  • Is an even single parity code

34
Systematic Codes
  • For a systematic block code the dataword appears
    unaltered in the codeword usually at the start
  • The generator matrix has the structure,

k
R
R n - k
  • P is often referred to as parity bits

35
Systematic Codes
  • I is kk identity matrix. Ensures dataword
    appears as beginning of codeword
  • P is kR matrix.

36
Decoding Linear Codes
  • One possibility is a ROM look-up table
  • In this case received codeword is used as an
    address
  • Example Even single parity check code
  • Address Data
  • 000000 0
  • 000001 1
  • 000010 1
  • 000011 0
  • .
  • Data output is the error flag, i.e., 0 codeword
    ok,
  • If no error, dataword is first k bits of codeword
  • For an error correcting code the ROM can also
    store datawords

37
Decoding Linear Codes
  • Another possibility is algebraic decoding, i.e.,
    the error flag is computed from the received
    codeword (as in the case of simple parity codes)
  • How can this method be extended to more complex
    error detection and correction codes?

38
Parity Check Matrix
  • A linear block code is a linear subspace Ssub of
    all length n vectors (Space S)
  • Consider the subset Snull of all length n vectors
    in space S that are orthogonal to all length n
    vectors in Ssub
  • It can be shown that the dimensionality of Snull
    is n-k, where n is the dimensionality of S and k
    is the dimensionality of Ssub
  • It can also be shown that Snull is a valid
    subspace of S and consequently Ssub is also the
    null space of Snull

39
Parity Check Matrix
  • Snull can be represented by its basis vectors. In
    this case the generator basis vectors (or
    generator matrix H) denote the generator matrix
    for Snull - of dimension n-k R
  • This matrix is called the parity check matrix of
    the code defined by G, where G is obviously the
    generator matrix for Ssub- of dimension k
  • Note that the number of vectors in the basis
    defines the dimension of the subspace

40
Parity Check Matrix
  • So the dimension of H is n-k ( R) and all
    vectors in the null space are orthogonal to all
    the vectors of the code
  • Since the rows of H, namely the vectors bi are
    members of the null space they are orthogonal to
    any code vector
  • So a vector y is a codeword only if yHT0
  • Note that a linear block code can be specified by
    either G or H

41
Parity Check Matrix
  • So H is used to check if a codeword is valid,

R n - k
  • The rows of H, namely, bi, are chosen to be
    orthogonal to rows of G, namely ai
  • Consequently the dot product of any valid
    codeword with any bi is zero

42
Parity Check Matrix
  • This is so since,

and so,
  • This means that a codeword is valid (but not
    necessarily correct) only if cHT 0. To ensure
    this it is required that the rows of H are
    independent and are orthogonal to the rows of G
  • That is the bi span the remaining R ( n - k)
    dimensions of the codespace

43
Parity Check Matrix
  • For example consider a (3,2) code. In this case G
    has 2 rows, a1 and a2
  • Consequently all valid codewords sit in the
    subspace (in this case a plane) spanned by a1 and
    a2
  • In this example the H matrix has only one row,
    namely b1. This vector is orthogonal to the plane
    containing the rows of the G matrix, i.e., a1 and
    a2
  • Any received codeword which is not in the plane
    containing a1 and a2 (i.e., an invalid codeword)
    will thus have a component in the direction of b1
    yielding a non- zero dot product between itself
    and b1

44
Parity Check Matrix
  • Similarly, any received codeword which is in the
    plane containing a1 and a2 (i.e., a valid
    codeword) will not have a component in the
    direction of b1 yielding a zero dot product
    between itself and b1

45
Error Syndrome
  • For error correcting codes we need a method to
    compute the required correction
  • To do this we use the Error Syndrome, s of a
    received codeword, cr
  • s crHT
  • If cr is corrupted by the addition of an error
    vector, e, then
  • cr c e
  • and
  • s (c e) HT cHT eHT
  • s 0 eHT
  • Syndrome depends only on the error

46
Error Syndrome
  • That is, we can add the same error pattern to
    different codewords and get the same syndrome.
  • There are 2(n - k) syndromes but 2n error
    patterns
  • For example for a (3,2) code there are 2
    syndromes and 8 error patterns
  • Clearly no error correction possible in this case
  • Another example. A (7,4) code has 8 syndromes and
    128 error patterns.
  • With 8 syndromes we can provide a different value
    to indicate single errors in any of the 7 bit
    positions as well as the zero value to indicate
    no errors
  • Now need to determine which error pattern caused
    the syndrome

47
Error Syndrome
  • For systematic linear block codes, H is
    constructed as follows,
  • G I P and so H -PT I
  • where I is the kk identity for G and the RR
    identity for H
  • Example, (7,4) code, dmin 3

48
Error Syndrome - Example
  • For a correct received codeword cr 1101001
  • In this case,

49
Error Syndrome - Example
  • For the same codeword, this time with an error in
    the first bit position, i.e.,
  • cr 1101000
  • In this case a syndrome 001 indicates an error in
    bit 1 of the codeword

50
Comments about H
  • The minimum distance of the code is equal to the
    minimum number of columns (non-zero) of H which
    sum to zero
  • We can express

Where do, d1, dn-1 are the column vectors of H
  • Clearly crHT is a linear combination of the
    columns of H

51
Comments about H
  • For a codeword with weight w (i.e., w ones), then
    crHT is a linear combination of w columns of H.
  • Thus we have a one-to-one mapping between weight
    w codewords and linear combinations of w columns
    of H
  • Thus the min value of w is that which results in
    crHT0, i.e., codeword cr will have a weight w (w
    ones) and so dmin w

52
Comments about H
  • For the example code, a codeword with min weight
    (dmin 3) is given by the first row of G, i.e.,
    1000011
  • Now form linear combination of first and last 2
    cols in H, i.e., 011010001 0
  • So need min of 3 columns ( dmin) to get a zero
    value of cHT in this example

53
Standard Array
  • From the standard array we can find the most
    likely transmitted codeword given a particular
    received codeword without having to have a
    look-up table at the decoder containing all
    possible codewords in the standard array
  • Not surprisingly it makes use of syndromes

54
Standard Array
  • The Standard Array is constructed as follows,

All patterns in row have same syndrome
Different rows have distinct syndromes
  • The array has 2k columns (i.e., equal to the
    number of valid codewords) and 2R rows (i.e., the
    number of syndromes)

55
Standard Array
  • The standard array is formed by initially
    choosing ei to be,
  • All 1 bit error patterns
  • All 2 bit error patterns
  • Ensure that each error pattern not already in the
    array has a new syndrome. Stop when all syndromes
    are used

56
Standard Array
  • Imagine that the received codeword (cr) is c2
    e3 (shown in bold in the standard array)
  • The most likely codeword is the one at the head
    of the column containing c2 e3
  • The corresponding error pattern is the one at the
    beginning of the row containing c2 e3
  • So in theory we could implement a look-up table
    (in a ROM) which could map all codewords in the
    array to the most likely codeword (i.e., the one
    at the head of the column containing the received
    codeword)
  • This could be quite a large table so a more
    simple way is to use syndromes

57
Standard Array
  • This block diagram shows the proposed
    implementation

58
Standard Array
  • For the same received codeword c2 e3, note that
    the unique syndrome is s3
  • This syndrome identifies e3 as the corresponding
    error pattern
  • So if we calculate the syndrome as described
    previously, i.e., s crHT
  • All we need to do now is to have a relatively
    small table which associates s with their
    respective error patterns. In the example s3 will
    yield e3
  • Finally we subtract (or equivalently add in
    modulo 2 arithmetic) e3 from the received
    codeword (c2 e3) to yield the most likely
    codeword, c2

59
Hamming Codes
  • We will consider a special class of SEC codes
    (i.e., Hamming distance 3) where,
  • Number of parity bits R n k and n 2R 1
  • Syndrome has R bits
  • 0 value implies zero errors
  • 2R 1 other syndrome values, i.e., one for each
    bit that might need to be corrected
  • This is achieved if each column of H is a
    different binary word remember s eHT

60
Hamming Codes
  • Systematic form of (7,4) Hamming code is,
  • The original form is non-systematic,
  • Compared with the systematic code, the column
    orders of both G and H are swapped so that the
    columns of H are a binary count

61
Hamming Codes
  • The column order is now 7, 6, 1, 5, 2, 3, 4,
    i.e., col. 1 in the non-systematic H is col. 7 in
    the systematic H.

62
Hamming Codes - Example
  • For a non-systematic (7,4) code
  • d 1011
  • c 1110000
  • 0101010
  • 1101001
  • 0110011
  • e 0010000
  • cr 0100011
  • s crHT eHT 011
  • Note the error syndrome is the binary address of
    the bit to be corrected

63
Hamming Codes
  • Double errors will always result in wrong bit
    being corrected, since
  • A double error is the sum of 2 single errors
  • The resulting syndrome will be the sum of the
    corresponding 2 single error syndromes
  • This syndrome will correspond with a third single
    bit error
  • Consequently the corrected codeword will now
    contain 3 bit errors, i.e., the original double
    bit error plus the incorrectly corrected bit!

64
Bit Error Rates after Decoding
  • For a given channel bit error rate (BER), what is
    the BER after correction (assuming a memoryless
    channel, i.e., no burst errors)?
  • To do this we will compute the probability of
    receiving 0, 1, 2, 3, . errors
  • And then compute their effect

65
Bit Error Rates after Decoding
  • Example A (7,4) Hamming code with a channel BER
    of 1, i.e., p 0.01
  • P(0 errors received) (1 p)7 0.9321
  • P(1 error received) 7p(1 p)6 0.0659
  • P(3 or more errors) 1 P(0) P(1) P(2)
    0.000034

66
Bit Error Rates after Decoding
  • Single errors are corrected, so,
  • 0.9321 0.0659 0.998 codewords are correctly
    detected
  • Double errors cause 3 bit errors in a 7 bit
    codeword, i.e., (3/7)4 bit errors per 4 bit
    dataword, that is 3/7 bit errors per bit.
  • Therefore the double error contribution is
    0.0023/7 0.000856

67
Bit Error Rates after Decoding
  • The contribution of triple or more errors will be
    less than 0.000034 (since the worst that can
    happen is that every databit becomes corrupted)
  • So the BER after decoding is approximately
    0.000856 0.000034 0.0009 0.09
  • This is an improvement over the channel BER by a
    factor of about 11

68
Perfect Codes
  • If a codeword has n bits and we wish to correct
    up to t errors, how many parity bits (R) are
    needed?
  • Clearly we need sufficient error syndromes (2R of
    them) to identify all error patterns up to t
    errors
  • Need 1 syndrome to represent 0 errors
  • Need n syndromes to represent all 1 bit errors
  • Need n(n-1)/2 to syndromes to represent all 2 bit
    errors
  • Need nCe n!/(n-e)!e! syndromes to represent all
    e bit errors

69
Perfect Codes
  • So,

If equality then code is Perfect
  • Only known perfect codes are SEC Hamming codes
    and TEC Golay (23,12) code (dmin7). Using
    previous equation yields

70
Summary
  • In this section we have
  • Used block codes to add redundancy to messages to
    control the effects of transmission errors
  • Encoded and decoded messages using Hamming codes
  • Determined overall bit error rates as a function
    of the error control strategy
Write a Comment
User Comments (0)
About PowerShow.com