Title: Data Engineering 04 07186
1Data Engineering 04 07186
- Dr S. I. Woolley
- http://www.eee.bham.ac.uk/woolleysi
- Electronic, Electrical and Computer Engineering,
- The University of Birmingham, U.K.
2Lecture slides (excluding guest lectures)
Please note that these lecture slides do not form the complete assessable course material. Notes should be taken in class. Your laboratory work, tutorial solutions and private study work (including exercises given in classes) all form part of the assessable material. Please check the course web site and bb forum for notices.
3Summary of aims
- To provide:
- An understanding of lossless and lossy data compression methods.
- An understanding of the issues affecting data quality and quality assessment.
- An appreciation of the effects of uncorrected channel errors.
- An appreciation of data security issues and an understanding of encryption fundamentals.
4The recommended text
- The Data Compression Book
- (recently out of print)
- Mark Nelson and Jean-loup Gailly,
- M&T Books
- 2nd Edition.
- ISBN 1-55851-434-1
5The course website and forum
- www.eee.bham.ac.uk/woolleysi and
- www.bb.bham.ac.uk
6Private study and assessment
- There are several tutorial and private study tasks identified in the text. Additional tasks may be set during the module. Students are reminded that these will form part of the assessable material.
- Recommended laboratory exercise: lossy and lossless image compression and quality assessment.
- Coursework: one assessed piece of coursework.
- It is essential that you supplement the summary notes during the course with the results of tutorial and private study tasks.
7Contents
- An introduction to data compression
- Compression fundamentals
- Lossy and lossless methods
- Static and adaptive methods
- Advantages and disadvantages
- Tradeoffs and selection criteria
- Lossless data compression methods
- Run-length encoding
- Huffman compression
- Arithmetic compression
- Lempel-Ziv compression ('77 and '78)
8Contents
- Lossy compression methods
- DCT, wavelet and fractal methods
- Quality considerations: subjective vs. objective, regions of interest/importance, non-linear rate/distortion, application/cost considerations
- Audio coding: overview of MIDI and MP3
- Video compression: an introduction to MPEG (MPEG-1 encoding laboratory)
9Contents
- Data security
- Introduction to security issues
- Basic encryption methods: DES, RSA and PGP
- Channel errors and the effects of errors on compressed data
- Introduction to error types: amplitude and synchronization errors
- Modelling errors: Gilbert and modified Gilbert models
- Investigation of error propagation in compressed data streams and consideration of remedial methods
10Assessment
- A simple fractal image compressor
- A simple C implementation and report summarising
code design, results and analysis.
11(No Transcript)
12Data coding
13Coding
- The appropriate type of coding depends on, for example, the:
- amount and type of information (audio, image, video, data, instrumentation)
- available bandwidth and/or storage
- type of channel (one-way, two-way)
- delay tolerance
- importance of the data
- importance of secure communications
- type of errors which can be tolerated
- quality of the channel (types and frequency of errors)
- system cost
14Compression
Abbreviation is a simple example of compression. E.g., a classified advertisement: "Lux S/C aircon refurb apt, N/S, lge htd pool, slps 4, 850pw, avail wks or w/es Jul-Oct. Tel (eves)" decompresses to: Luxury self-contained air-conditioned refurbished apartment for non-smokers. Large heated pool, sleeps 4, 850 per week, available weeks or weekends July to October. Telephone (evenings).
15Compression
- Compression is as old as language.
- Humans naturally compress. For example, commonly used words tend to be short (see next slide).
- Receivers and senders naturally strive to establish a robust, error-free communication channel, building shared vocabularies and shared models of their environments and experiences (based on shared knowledge).
- With confidence in shared understandings, compression naturally evolves.
16The 40 most commonly used English words
- 1 the
- 2 of
- 3 to
- 4 and
- 5 a
- 6 in
- 7 is
- 8 it
- 9 you
- 10 that
- 11 he
- 12 was
- 13 for
- 14 on
- 15 are
- 16 with
- 17 as
- 18 I
- 19 his
- 20 they
- 21 be
- 22 at
- 23 one
- 24 have
- 27 or
- 28 had
- 29 by
- 30 hot
- 31 word
- 32 but
- 33 what
- 34 some
- 35 we
- 36 can
- 37 out
- 38 other
- 39 were
- 40 all
17Run-length encoding
A very simple method for compressing long runs of character repetitions:
0000 0000 0000 5555 0000 0000
compresses to (12,0)(4,5)(8,0).
24 bytes reduced to 6 gives 24/6 = 4, i.e. a 4:1 compression ratio.
Or should that be (0,12)(5,4)(0,8)?
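A minimal run-length encoder along these lines (a sketch, not the course's reference code; the (count,value) output order and the sample data follow the slide above):

#include <stdio.h>

/* Emit (count, value) pairs for each run of repeated bytes.
   A practical coder would cap the run length (cf. the 4-bit
   limit discussed in the patent slide that follows). */
void rle_encode(const unsigned char *in, int n)
{
    int i = 0;
    while (i < n) {
        int run = 1;
        while (i + run < n && in[i + run] == in[i])
            run++;
        printf("(%d,%d)", run, in[i]);
        i += run;
    }
    printf("\n");
}

int main(void)
{
    unsigned char data[24] = {0,0,0,0,0,0,0,0,0,0,0,0,
                              5,5,5,5,0,0,0,0,0,0,0,0};
    rle_encode(data, 24);   /* prints (12,0)(4,5)(8,0) */
    return 0;
}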
18Patent issues (from comp.compression faq)
(a) Run length encoding: Tsukiyama has two patents on run length encoding, 4,586,027 and 4,872,009, granted in 1986 and 1989 respectively. The first one covers run length encoding in its most primitive form: a length byte followed by the repeated byte. The second patent covers the 'invention' of limiting the run length to 16 bytes and thus the encoding of the length on 4 bits. Here is the start of claim 1 of patent 4,872,009, just for pleasure:
1. A method of transforming an input data
string comprising a plurality of data bytes,
said plurality including portions of a plurality
of consecutive data bytes identical to one
another, wherein said data bytes may be of a
plurality of types, each type representing
different information, said method comprising
the steps of ...
19Popular compression
20Text message examples
21Text message quiz
- IYSS
- BTW
- L8
- OIC
- PCM
- IYKWIMAITYD
- ST2MORO
- TTFN
- LOL
- The abuse selection
- <-(
- (
- --------)
- IUTLUVUBIAON
22(No Transcript)
23(No Transcript)
24(No Transcript)
25Data compression
- Data compression requires the identification and extraction of source redundancy.
- All meaningful data contains redundancy (patterns) in its uncoded form - otherwise it would resemble noise.
- We can compress data by:
- giving shorter codewords to frequently occurring symbols or words
- sending only the difference between symbols or pictures
- sending only the difference between the data and some model of normal data
- referring to previously repeated strings
- sending approximations of signal values
26Lossless and lossy compression
Compression methods which enable exact recovery
of the data on decompression are called lossless
or reversible. Methods which permanently remove
redundancy by approximating data values are
called lossy or irreversible.
27Some data powers of ten
- 1 bit: a binary decision (yes/no) (on/off)
- 100 Kbyte: a low resolution photograph
- 2 Mbyte: a high resolution photograph
- 5 Mbyte: complete works of Shakespeare, or 5 s of TV-quality video
- 10 Mbyte: a digital chest X-ray
- 100 Mbyte: 1 metre of shelved books
- 1 Gbyte: a low resolution video
- 2 Terabytes (1 000 000 000 000 bytes): an academic research library
- 2 Petabytes (1 000 000 000 000 000 bytes): all US research libraries
- 5 Exabytes (1 000 000 000 000 000 000 bytes): all words ever spoken
- Zettabyte (1 000 000 000 000 000 000 000 bytes)
- Yottabyte (1 000 000 000 000 000 000 000 000 bytes)
28The compression trade-off
Costs: delay, legal issues, specialized hardware, data more sensitive to error, need for a decompression key.
Benefits: reduced time and cost, more efficient storage, faster transmission.
29Compression and channel errors
- Noisy/busy channels are problematic for compressed data.
- Unless compressed data is delivered 100% error-free (i.e., no changes, no lost packets) the whole file is often destroyed.
- E.g.:
30Robust methods
(Diagram: compress -> channel error/s -> decompress)
31Compression
- All meaningful data contains redundancy (patterns) in its uncoded form - otherwise it would resemble noise.
- We can compress data by:
- giving shorter codewords to frequently occurring symbols or words
- sending only the difference between symbols or pictures
- sending only the difference between the data and some model of normal data
- referring to previously repeated strings
- sending approximations of signal values
32Measuring information
For any discrete probability distribution, the value of the entropy function (H) is given by:
H = -sum(i=1..q) pi log_r(pi)   (r = radix; r = 2 for binary)
Units: bits/symbol, where the source alphabet has q symbols of probability pi (i = 1..q).
Note the change of base: log2(x) = log(x)/log(2).
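A short sketch of this calculation in C (the probabilities are taken from the Huffman exercise later in these notes):

#include <stdio.h>
#include <math.h>

/* H = -sum(pi * log2(pi)) bits/symbol; log2(x) = log(x)/log(2)
   is the change of base noted above. */
double entropy(const double *p, int q)
{
    double h = 0.0;
    for (int i = 0; i < q; i++)
        if (p[i] > 0.0)
            h -= p[i] * (log(p[i]) / log(2.0));
    return h;
}

int main(void)
{
    double p[] = {0.1, 0.3, 0.2, 0.05, 0.15, 0.2};
    printf("H = %.3f bits/symbol\n", entropy(p, 6));
    return 0;
}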
33Private study
- Revise basic information theory.
- Consider the advantages and disadvantages of data
compression in real systems. - Investigate patent issues relevant to data
compression.
34- Static vs. adaptive methods
35Compression continued Static vs. adaptive
compression
Compression algorithms remove source redundancy
by using some definition (model) of the source
characteristics. Compression algorithms which
use a pre-defined source model are static.
Algorithms which use the data itself to fully
or partially define this model are referred to as
adaptive.
36Compression continued Static vs. adaptive
compression
Static implementations can achieve very good
compression ratios for well defined sources.
Adaptive algorithms are more versatile, and
update their source models according to current
characteristics. However, they have lower compression performance, at least until a suitable model has been generated. An understanding of source models and their trajectories is therefore important.
37- Lossless compression - Huffman
38Huffman compression
Source character frequency statistics are used to allocate codewords for output. Compression can be achieved by allocating shorter codewords to the more frequently occurring characters (e.g., in Morse: E = . and Y = -.--).
39Huffman compression
By arranging the source alphabet in descending
order of probability, then repeatedly adding the
two lowest probabilities and resorting, a Huffman
tree can be generated. The resultant codewords
are formed by tracing the tree path from the root
node to the codeword leaf. Rewriting the table as a tree, 0s and 1s are assigned to the branches. The codeword for each symbol is simply constructed by following the path to its node.
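A compact sketch of the tree-building step in C (not the course's reference implementation; it uses the symbol probabilities from the exercise below, and a simple scan in place of a priority queue, so tie-breaking and hence exact codewords may differ from a hand-built tree):

#include <stdio.h>

#define NSYM 6

int main(void)
{
    const char sym[NSYM] = {'a','b','c','d','e','f'};
    double p[2*NSYM] = {0.1, 0.3, 0.2, 0.05, 0.15, 0.2};
    int parent[2*NSYM], bit[2*NSYM], alive[2*NSYM] = {0};
    int n = NSYM;

    for (int i = 0; i < NSYM; i++) alive[i] = 1;

    /* repeatedly merge the two lowest-probability live nodes */
    for (int merges = 0; merges < NSYM - 1; merges++) {
        int lo1 = -1, lo2 = -1;
        for (int i = 0; i < n; i++) {
            if (!alive[i]) continue;
            if (lo1 < 0 || p[i] < p[lo1]) { lo2 = lo1; lo1 = i; }
            else if (lo2 < 0 || p[i] < p[lo2]) lo2 = i;
        }
        p[n] = p[lo1] + p[lo2];            /* new internal node */
        parent[lo1] = n; bit[lo1] = 0;
        parent[lo2] = n; bit[lo2] = 1;
        alive[lo1] = alive[lo2] = 0;
        alive[n++] = 1;
    }

    /* read each codeword by walking from leaf to root, then reverse */
    for (int i = 0; i < NSYM; i++) {
        char code[NSYM]; int len = 0;
        for (int j = i; j != n - 1; j = parent[j])
            code[len++] = '0' + bit[j];
        printf("%c (p=%.2f): ", sym[i], p[i]);
        while (len--) putchar(code[len]);
        putchar('\n');
    }
    return 0;
}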
40Huffman compression
41Huffman compression
42Huffman compression
43Huffman compression - exercises
Create a Huffman tree for the sources described
below char. frequency a 0.1 b 0.3
c 0.2 d 0.05 e 0.15 f 0.2 Generate the
compressed output for the source
c,b,d,b,d,d,e,a,d,f,d,b and calculate the entropy
of the original source (as described in 1), the
number of bits required for the encoded string
above and the compression ratio
achieved. Consider the effects of uncorrected
channel errors on a Huffman compressed source.
Try encoding a sequence and decoding it with
single bit errors in different locations. What
happens and why?
44(No Transcript)
45(No Transcript)
46(No Transcript)
47(No Transcript)
48(No Transcript)
49- Lossless compression - arithmetic
50Arithmetic coding
The method of compression employed by Huffman coding involves the allocation of shorter codewords for more frequently occurring
characters. It is, however, unable to allocate
fractional codeword lengths, so that a character
must be allocated at least a one-bit codeword no
matter how high its frequency. Huffman coding
cannot, therefore, achieve optimal compression.
Arithmetic coding offers an alternative to
Huffman coding, enabling characters to be
represented as fractional bit lengths. This is
achieved by representing the source as a real number greater than or equal to zero and less than one, i.e., in the range [0,1).
51Arithmetic coding patent issues (from
comp.compression faq)
IBM holds many patents on arithmetic coding
(4,122,440 4,286,256 4,295,125 4,463,342
4,467,317 4,633,490 4,652,856 4,792,954 4,891,643
4,901,363 4,905,297 4,933,883 4,935,882
5,045,852 5,099,440 5,142,283 5,210,536
5,414,423 5,546,080). It has patented in
particular the Q-coder implementation of
arithmetic coding. The JBIG standard and the arithmetic coding option of the JPEG standard require use of the patented algorithm.
52Arithmetic coding (example)
53- Lossless compression - Lempel-Ziv
54Lempel-Ziv compression
- Lempel-Ziv 1977 and 1978 (LZ77 and LZ78)
- Welch 1984
- LZW - implemented in many popular compression
methods. - Lossless, universal (adaptive)
- Exploits string-based redundancy
- Not good for image compression (why?)
- Unisys patent on .GIF implementation
55LZ77 Patent issues (from comp.compression faq)
Waterworth patented (4,701,745) the algorithm
now known as LZRW1, because Ross Williams
reinvented it later and posted it on
comp.compression on April 22, 1991. The
same algorithm was later patented by Gibson & Graybill. The patent office failed to recognize that the same algorithm was patented twice, even though the wording used in the two patents is very similar. The Waterworth patent is now owned by Stac Inc, which won a lawsuit against Microsoft concerning the compression feature of MS-DOS 6.0. Damages awarded were $120 million. (Microsoft and Stac later settled out of court.)
56LZ78 Patent issues (from comp.compression faq)
One form of the original LZ78 algorithm was
patented (4,464,650) by its authors Lempel,
Ziv, Cohn and Eastman. This patent is owned by
Unisys. The LZW algorithm used in 'compress'
is patented by IBM (4,814,746) and Unisys
(4,558,302). It is also used in the V.42bis
compression standard, in Postscript Level 2, in
GIF and TIFF. Unisys sells the license to modem manufacturers for a one-time fee. CompuServe is licensing the usage of LZW in GIF products for 1.5% of the product price, of which 1% goes to Unisys; usage of LZW in non-GIF products must be licensed directly from Unisys.
The IBM patent application was first filed three
weeks before that of Unisys, but the US patent
office failed to recognize that they covered
the same algorithm. (The IBM patent is more
general, but its claim 7 is exactly LZW.)
57(No Transcript)
58Lempel-Ziv dictionaries
- How they work:
- Parse data character by character, generating a dictionary of previously seen strings
- LZ77: sliding window dictionary
- LZ78: full dictionary history
- LZ78 description:
- With a source of 8 bits/character (2^8 = 256, i.e., source characters 0-255 taken), extra characters are needed to describe strings (in the dictionary).
- Output all 9-bit, i.e., 0-511.
- Need to reserve some characters for special codewords, say 256-262, so dictionary entries begin at 263.
- We can refer to dictionary entries as D1, D2, D3 etc. (equivalent to 263, 264, 265 etc.)
- Dictionaries typically grow to 12- and 15-bit lengths.
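An LZ78-style parse sketch in C (illustrative only: it stores phrases as strings, searches linearly, and emits (dictionary index, next character) pairs with index 0 meaning "no prefix"; the dictionary sizes are assumptions and the input is exercise 3 below):

#include <stdio.h>
#include <string.h>

#define MAXD 64   /* assumed dictionary capacity */
#define MAXS 32   /* assumed maximum phrase length */

int main(void)
{
    const char *src = "less careless less lossless";
    char dict[MAXD][MAXS];
    int ndict = 0;
    char cur[MAXS] = "";

    for (const char *s = src; *s; s++) {
        char trial[MAXS];
        snprintf(trial, MAXS, "%s%c", cur, *s);
        int match = -1;
        for (int i = 0; i < ndict; i++)
            if (strcmp(dict[i], trial) == 0) { match = i; break; }
        if (match >= 0) {
            strcpy(cur, trial);            /* keep extending the phrase */
        } else {
            int idx = 0;                   /* find index of prefix 'cur' */
            for (int i = 0; i < ndict; i++)
                if (strcmp(dict[i], cur) == 0) { idx = i + 1; break; }
            printf("(%d,'%c') ", idx, *s); /* emit pair, grow dictionary */
            if (ndict < MAXD) strcpy(dict[ndict++], trial);
            cur[0] = '\0';
        }
    }
    if (cur[0]) printf("(trailing phrase \"%s\" already in dictionary)", cur);
    printf("\n");
    return 0;
}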
59(No Transcript)
60(No Transcript)
61(No Transcript)
62Lempel-Ziv continued
Compression of our example: the original source contains 13 x 8-bit characters (104 bits) and the compressed output contains 10 x 9-bit codewords (90 bits). Compression ratio = old size/new size = 104/90, i.e. approximately 1.16:1.
63Lempel-Ziv exercises
1. (i) How is decompression performed? Using the source output from our example, build a dictionary and decompress the source. (ii) Why are 9-bit codewords required? (iii) What happens when the dictionary has 256 entries? How can it be extended? (iv) Why should the dictionary size be limited and how can it be made more efficient?
2. (i) How is decompression performed? (ii) Using the source output from our example, build a dictionary and decompress the source.
3. Compress the string less careless less lossless.
64Lempel-Ziv private study
1. Identify applications of LZ-based methods and
investigate patent issues (see comp.compression
faq) 2. What is the LZ exception? How is it
interpreted?
65(No Transcript)
66(No Transcript)
67(No Transcript)
68(No Transcript)
69The test image set
70The digital image
71Image pixels
72(x,y) or (r,c)? 0 or 1?
- A common problem in manipulating image files is the mismatch between (x,y) notation and matrix (row, column), i.e., (y,x). Both are correct, but programs must treat them consistently.
- The other common error is the +1/-1 error. Pixels can be defined as starting at 0 or 1; again, both are correct but programs must be consistent.
73mouse
Mouse.raw has 320x200 pixels. Mouse is an 8-bit grey-scale test image we will use later. 8 bits per pixel (bpp) means we have 256 values (black = 0, white = 255).
(Figure: the 320 x 200 mouse image)
74A closer look at mouse
75Negative mouse: (xn,yn) = |(xn,yn) - 255|
76Reducing resolution Mouse at 80x50 pixels
77Mouse with only 16 colours (i.e., 4 bits per pixel)
78Mouse with just 2 colours (i.e., 1 bit per pixel)
79Paintshop Pro software (Jasc)
80Paintshop Pro software (Jasc)
81Differencing neighbouring pixels
82Accessing .raw image data
83Mouse.raw at 10x10 pixels!!
84Opening a 10x10 pixel binary file in C
#include "stdio.h"
#include "stdlib.h"

int main(void)
{
    printf("Very simple program for reading image values into an array ....\n");
    FILE *image_in;
    int r, c;
    unsigned char image[10][10];             /* image size in rows and columns */
    image_in = fopen("mouse10.raw", "rb");   /* open binary image for reading */
    if (image_in == NULL)
        printf("File not found ... \n");
    else {
        fread(image, 1, 100, image_in);      /* read the 10x10 = 100 bytes */
        for (r = 0; r < 10; r++) {
            for (c = 0; c < 10; c++)
                printf("%d\t", image[r][c]); /* print values with tabs */
            printf("\n");                    /* one row of pixels per line */
        }
        fclose(image_in);
    }
    return 0;
}
85- Lossless image compression
86Lossless image compression
- What images might we want to compress losslessly?
- How would Huffman and arithmetic methods perform on images?
- How could we improve their performance?
87Differencing (W.C.Tham)
88.GIF (Graphics Interchange Format)
- Unisys patent
- Differencing (DPCM, differential pulse code modulation)
- LZ78
- Lossless but limited to 8 bits (i.e., 256 colours)
- Recommended for graphics and line drawings and NOT natural or real-world images.
89(No Transcript)
90(No Transcript)
91- Human vision
- Lossy image and video compression methods use
models of the human visual system to determine
which aspects of the data can be lost. - The human eye is a marvellously sophisticated and
highly adaptive system. There is also a
significant amount of image processing performed
by the brain. - Newer compression methods attempt simple but
improved methods of modelling regions of interest
and the human visual system.
92The Human Vision System (HVS)
- The eye can adapt to a range of intensities of the order of 10^10, from lowest visible light to highest bearable glare.
- Non-linear contrast sensitivity
- Cortex filter
- Saccades and fixations
93The rods and cones of the eye
- Cones and rods: two types of discrete light receptors.
- 6-7 million cones, centrally located (in the area of the retina called the fovea); highly sensitive to colour and bright light; do not work well in dim light. Each has its own nerve to the brain.
- 75 million rods across the surface of the retina; sensitive to light but not colour. Share nerve endings. Work in dim and bright conditions.
94Eye-tracking
95The portrait artist's eye movement
96Our research angiogram video
97Foveated imaging
98Foveated imaging
99Foveated imaging
100Visual perception: selected examples
- "The Mach band effect" (studied by Ernst Mach in the 1860s) involves an exaggeration of contrast at edges.
- See http://www.langara.bc.ca/psychology/brightness.htm
- "The change in the sensitivity of one part of the retina by the level of brightness excitation in an adjacent retinal area is called lateral inhibition and accounts for most of the contrast effects...the visual system responds mostly to variations in light intensity across the visual field which interact with each other within the visual pathways."
101Contrast Sensitivity
(Figure: grey levels 0-4; circle constant, background constant. Just noticeable difference (JND) at 2.)
102Contrast Sensitivity
(Figure: as above; circle constant, background constant. Just noticeable difference (JND) at 2.)
103Contrast Sensitivity
(Figure: grey levels 0-4; background different than both halves (top), background same as right half (bottom). Just noticeable difference (JND) 4 (top) and 2 (bottom).)
104Contrast Sensitivity
(Figure: as above. Just noticeable difference (JND) 4 (top) and 2 (bottom).)
105Interpreting images "Spot" the dog
Top right is a well-known image from textbooks on
visual perception. If you have seen it before
you will have no trouble detecting the
dog. Bottom right is an upside down
world. Below is the ambiguous bunny/duck and
seal/donkey.
106What do you see?
- What do you see in each of the four images top right?
- Bottom left (proximity): what is the relationship between the two circles?
- Bottom: compare the three images.
107Can you accurately read the pen colours in order?
- red green blue green red yellow blue
- yellow red blue yellow green red blue
- blue yellow yellow blue red blue yellow
- red green green red green green green
- green blue blue yellow yellow yellow
- yellow red green yellow blue green red
- blue green red red green red green blue
- red yellow yellow red blue yellow blue
- yellow blue red blue green green yellow
- green red yellow blue yellow blue red
- blue red blue green red yellow blue
- green green red yellow blue yellow blue
108(No Transcript)
109- Lossy image compression methods
110Lossless and lossy compression
Compression methods which enable exact recovery
of the data on decompression are called lossless
or reversible. Methods which permanently remove
redundancy by approximating data values are
called lossy or irreversible.
111What is Quality?
Measurement methods:
- Objective: impartial measuring methods.
- Subjective: based on personal feelings.
We need definitions of quality (degree of excellence?) and to define how we will compare the original and decompressed images.
112Measuring Quality
Objectively: e.g., Root Mean Square Error (RMSE).
Subjectively: e.g., Mean Opinion Scores (MOS): 5 = very good ... 1 = very poor, or: 5 = perfect, 4 = just noticeable, 3 = slightly annoying, 2 = annoying, 1 = very annoying.
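A minimal RMSE sketch in C (the four-pixel arrays are made-up values for illustration):

#include <stdio.h>
#include <math.h>

/* RMSE = sqrt( (1/N) * sum over pixels of (original - reconstructed)^2 ) */
double rmse(const unsigned char *a, const unsigned char *b, int n)
{
    double sum = 0.0;
    for (int i = 0; i < n; i++) {
        double d = (double)a[i] - (double)b[i];
        sum += d * d;
    }
    return sqrt(sum / n);
}

int main(void)
{
    unsigned char orig[4]  = {10, 20, 30, 40};
    unsigned char recon[4] = {12, 20, 28, 40};
    printf("RMSE = %.3f\n", rmse(orig, recon, 4));  /* sqrt(8/4) = 1.414 */
    return 0;
}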
113Lossy methods
114Optimizing the trade-off
115Original Approx. detail
116The rate/distortion trade-off
117Subjective testing - things to consider
- Which images will be shown?
- E.g., direct comparison (is the original always visible?)
- What are the viewing conditions?
- Lighting, distance from screen, monitor resolution, etc.
- What is the content and how important is it?
- Is all content equally important?
- Who are the viewers and how do they perform?
- Viewer expertise/cooperation/consistency/calibration (are viewers' scores relevant to the application, consistent over time, consistent between each other?)
118(No Transcript)
119The test image set
120(No Transcript)
121(No Transcript)
122DCT image compression
- The philosophy behind DCT image compression is
that the human eye is less sensitive to
higher-frequency information (and also more
sensitive to intensity than to colour), so that
compression can be achieved by more coarsely
quantising the large amount of high-frequency
components usually present. - Firstly, the image must be transformed into the
frequency domain. Since it would be
computationally expensive to transform even a low
resolution image in one full block, the image is
subdivided. - The JPEG (Joint CCITT and ISO Photographic
Experts Group) standard algorithm for full-colour
and grey-scale image compression uses 8x8 blocks.
123DCT image compression
- The DCT itself does not achieve compression, but
rather prepares the image for compression. - Once in the frequency domain the image's
high-frequency coefficients can be coarsely
quantised so that many of them (>50%) can be
truncated to zero. - The coefficients can then be arranged so that the
zeroes are clustered (zig-zag collection) and
Run-Length Encoding (RLE), whereby repeated
values are referenced and followed by a counter
indicating the number of successive occurrences,
can be applied. - The remaining data is then compressed with
Huffman coding (arithmetic coding is also an
option in the standard though patent issues have
hindered application).
124DCT stages
- Blocking (8x8)
- DCT (Discrete Cosine Transformation)
- Quantization
- Zigzag Scan
- DPCM on DC component
- RLE on AC Components
- Entropy Coding
125DCT compression the DCT bases
126DCT mathematics
127The JPEG quantization matrix
16 11 10 16 24 40 51 61 12 12 14
19 26 58 60 55 14 13 16 24 40 57
69 56 14 17 22 29 51 87 80 62
18 22 37 56 68 109 103 77 24 35 55
64 81 104 113 92 49 64 78 87 103 121
120 101 72 92 95 98 112 100 103 99
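A sketch in C of how this matrix is applied (the quantisation step only; the DCT itself and the zig-zag/RLE/Huffman stages are omitted, and the three non-zero coefficients are made-up values):

#include <stdio.h>
#include <math.h>

int main(void)
{
    /* the JPEG luminance quantisation matrix above */
    const int Q[8][8] = {
        {16,11,10,16, 24, 40, 51, 61}, {12,12,14,19, 26, 58, 60, 55},
        {14,13,16,24, 40, 57, 69, 56}, {14,17,22,29, 51, 87, 80, 62},
        {18,22,37,56, 68,109,103, 77}, {24,35,55,64, 81,104,113, 92},
        {49,64,78,87,103,121,120,101}, {72,92,95,98,112,100,103, 99}
    };
    double coeff[8][8] = {{0}};          /* 8x8 block of DCT coefficients */
    coeff[0][0] = 236.0;                 /* illustrative values only */
    coeff[0][1] = -23.0;
    coeff[1][0] = -11.0;

    /* divide each coefficient by the matrix entry and round:
       small high-frequency values collapse to zero */
    for (int u = 0; u < 8; u++) {
        for (int v = 0; v < 8; v++)
            printf("%4d ", (int)round(coeff[u][v] / Q[u][v]));
        printf("\n");
    }
    return 0;
}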
128Nelson's simpler linear quantizer
129The JPEG quantization
130DCT compression
131DCT compression
132DCT compression
133DCT compression
134Gibbs phenomenon
- The presence of artefacts around sharp edges is referred to as the Gibbs phenomenon.
- These are caused by the inability of a finite
combination of continuous functions to describe
jump discontinuities (e.g. edges). - At higher compression ratios these losses become
more apparent, as does the blocked nature of the
compressed form. - The loss of edge clarity can be clearly seen in a
difference mapping comparing an original image
with its heavily compressed equivalent.
135Original test image
136Lossy DCT reconstruction
Q = 25, CR = 11.6:1
137The difference (Gibbs phenomenon)
138(No Transcript)
139(No Transcript)
140Iterative generation of the Sierpinski Triangle/Gasket
141Fractal methods
- Fractal compression attempts to reverse the process of fractal generation by searching an image for self-similarity and generating IFS descriptions.
- Decompression is achieved by applying PIFS iteratively until convergence is achieved.
- Asymmetry: the compression process requires significant searching and calculation of best matches, making it a slow process; the decompression process requires only simple PIFS constructions and is by comparison very fast.
- Enhanced resolution: reconstructed images can be regenerated (calculated from the fractal self-similarities) such that they can appear higher resolution than the original!
142Fractal image compression
143Fractal image compression
144Fractal methods
- Compression losses - these introduce shifting errors.
- Reconstructed images look "painted".
- They tend to look more natural compared to Gibbs artefacts and DCT blocking - more subjectively acceptable - though pixel-based error measures (e.g. RMSE) are not so good.
145Fractal compression theory - I
146Fractal compression theory - II
147Fractal compression theory - III
148(No Transcript)
149(No Transcript)
150(No Transcript)
151(No Transcript)
152(No Transcript)
153(No Transcript)
154(No Transcript)
155(No Transcript)
156(No Transcript)
157- MPEG Digital Video
- An introduction from 2001 lecture notes from Dr
Nick Flowers
158Contents
- Analogue TV
- Basic digital TV - the problem
- MPEG-1 definition
- Decimation (spatial, temporal and colour)
- Spatial compression
- Temporal compression
- Difference coding
- Motion compensation
- Errors - GOP
159Analogue TV
- European TV format: 625 scan lines, 25 interlaced frames per second, 4:3 aspect ratio
- Interlacing reduces the vertical resolution to 312.5 lines
- Horizontal resolution is 312.5 x (4/3) = 417 lines
- Bandwidth required: 625 x 417 x 25 = 6.5 MHz
- Analogue colour information is cleverly added without increasing bandwidth (NTSC, PAL and SECAM standards)
160Digital TV: raw video
- For digital use, 8-bit resolution is adequate
- For colour pictures we have Red, Green and Blue (RGB)
- To digitise, we need to sample at twice the highest frequency (6.5 MHz) and convert three colours (RGB) at 8 bits each
- Bitrate = (6.5 x 2) x 3 x 8 = 312 Mbits/sec
- (compare with the analogue bandwidth of 6.5 MHz)
- Digital TV seems to have created a big problem using raw digitisation - we need coding to help
161Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbits/sec. International Standard IS-11172, completed in 10/92
Commonly known as MPEG-1
- Moving Picture Experts Group, 1st phase
- Video CD - a standard for video on CDs - VHS quality
- Audio CDs have a data rate of 1.5 Mb/s; video has a raw data rate of 312 Mb/s - 200 times higher!
- Something has to be lost
162MPEG-1 decimation
- This means just throwing data away - not even attempting to preserve the data
- Three different areas for decimation:
- Spatial
- Colour
- Temporal
- (plus audio)
- Temporal: interlacing is dropped, giving 25 full frames per second
163Spatial decimation
- European broadcast TV standard (625 half-lines, 417-line horizontal resolution)
- Resolution is reduced to 352 (width) by 288 (height) pixels - Source Input Format (SIF)
164Colour decimation
- Human perception is most sensitive to luminance (brightness) changes
- Colour is less important: e.g. a black and white photograph is still recognisable
- RGB encoding is wasteful - human perception tolerates poorer colour.
- Use YUV and only encode chrominance (UV) at half resolution in each direction (176 by 144, Quarter SIF). This gives 25% of the data for U and V compared to Y
165Original (100%)
0.5 UV (25%)
0.25 UV (6.25%)
0.2 UV (4%)
0.1 UV (1%)
0.0 UV (0%)
166Temporal decimation
- Three standards for frame rate in use today
- Cinema uses 24 FPS
- European TV uses 25 FPS
- American TV uses 30 FPS
- Lowest acceptable frame rate is 25 FPS so little
decimation can be achieved for Video CD - MPEG-1 does allow much lower frame rates e.g. for
internet video but quality is reduced
15 FPS
25 FPS
167Decimation: the result
- After throwing away all this information, we still have a data rate of (assuming 8 bits per YUV):
- Y: (352 x 288) x 25 x 8 = 20.3 Mb/s
- U: (352/2 x 288/2) x 25 x 8 = 5.07 Mb/s
- V: (352/2 x 288/2) x 25 x 8 = 5.07 Mb/s
- TOTAL (for video) = 30.45 Mb/s
- MPEG-1 audio runs at 128 Kb/s
- Video CD target is 1.5 Mb/sec
- Space for video = 1.5 - 0.128 = 1.372 Mb/s
- So now use compression to get a saving of 22:1
168Spatial compression
- A video is a sequence of images, and images can be compressed
- JPEG uses lossy compression; typical compression ratios are 10:1 to 20:1
- We could just compress images and send these
- Time does not enter into the process
- This is called intra-coding (intra = within)
169Spatial compression
- Very similar to JPEG
- Image divided into 8 by 8 pixel sub-blocks
- Number of blocks = 352/8 by 288/8 = 44 by 36 blocks
- Each block DCT coded
- Quantisation - dropping low-amplitude coefficients
- Huffman coded
- This produces a complete frame called an Intra frame (I)
170Temporal compression
- Spatial compression does not take into account similarities between adjacent frames
- Talking heads - backgrounds don't change
- Consecutive images (1/25th second apart) are very similar
- Just send the difference between adjacent frames
171Difference coding
- Only send the difference between this frame and the previous frame
- The result is very sparse - high compression is now possible using block-based DCT as before
172Difference coding
- Using the previous frame and the difference frame we can re-create the original - this is called a predicted frame (P)
- This re-created frame can then be used to form the next frame and the process repeated
173Difference coding
- Difference coding is good for talking heads
- Not good for scenes with lots of movement
174Motion compensation
- Difference coding is good, but often an object will simply change position between frames
- The difference image is then not sparse, so DCT coding is not as effective
175Motion compensation
- Video is three-dimensional (X,Y, Time)
- DCT coding reduces information in X and Y
- Stationary objects do not move in time
- Motion compensation takes time into account
- No need to code the image of the object just
send a motion vector indicating where it has
moved to
176Motion compensation
- Called Motion Compensation since we actually
adjust the position of the object to compensate
for the movement
177Motion compensation the problems
- Objects rarely move and retain their shape
- If object moves and changes shape a little
- Find movement and send motion vector
- Subtract moved object in last frame from object
in new frame - DCT code the difference
- But what is an object? We have an array of
pixels. - Could try and segment image into separate objects
but very intense processing! - Simple option - split image up into blocks that
dont correspond to objects in the image
macroblocks
178Macroblocks
- Macroblocks can be any shape or size
- If small, then we need to send lots of vectors
- If large, then we are unlikely to find a matching
macroblock - MPEG-1 uses a 16 by 16 pixel macroblock
- Each macroblock is the unit for motion
compensation
- Find macroblock in previous frame similar to this one (a sum-of-absolute-differences search is sketched after this list)
- If match found, send motion vector
- Subtract this macroblock from previous displaced
macroblock - DCT code the difference
- If no matching block found, abandon motion
compensation and just DCT code the macroblock
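A minimal block-matching sketch in C (illustrative: the frame size, search range and synthetic gradient data are all assumptions, and real encoders use cleverer search orders than this full search):

#include <stdio.h>
#include <stdlib.h>

#define W 64
#define H 64
#define MB 16     /* MPEG-1 macroblock size */
#define RANGE 4   /* search +/- 4 pixels each way */

/* sum of absolute differences between the current macroblock at (bx,by)
   and the previous frame displaced by (dx,dy) */
long sad(const unsigned char prev[H][W], const unsigned char cur[H][W],
         int bx, int by, int dx, int dy)
{
    long s = 0;
    for (int y = 0; y < MB; y++)
        for (int x = 0; x < MB; x++)
            s += labs((long)cur[by + y][bx + x] -
                      (long)prev[by + dy + y][bx + dx + x]);
    return s;
}

int main(void)
{
    static unsigned char prev[H][W], cur[H][W];
    /* fake data: a gradient that shifts 2 pixels right between frames */
    for (int y = 0; y < H; y++)
        for (int x = 0; x < W; x++) {
            prev[y][x] = (unsigned char)(3 * x + y);
            cur[y][x]  = (unsigned char)(3 * (x - 2) + y);
        }

    int bx = 24, by = 24, best_dx = 0, best_dy = 0;
    long best = -1;
    for (int dy = -RANGE; dy <= RANGE; dy++)        /* full search */
        for (int dx = -RANGE; dx <= RANGE; dx++) {
            long s = sad(prev, cur, bx, by, dx, dy);
            if (best < 0 || s < best) { best = s; best_dx = dx; best_dy = dy; }
        }
    printf("motion vector (%d,%d), SAD %ld\n", best_dx, best_dy, best);
    return 0;
}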
179MPEG-1 compression
- Eyes - difference data DCT coded
- Ball - motion vector coded, actual image data not
coded - Rabbit - Intra coded with no temporal compression
- Coding method varies between macroblocks; the whole is a P frame
180Group of pictures - GOP
- Problem with P frames is that any errors are propagated (like making copies of copies of copies) - so we regularly send full (I) frames to eliminate errors
- Approximately every 0.5 seconds we send a full frame (I)
- I P P P P P P P P P P P I P P P P P P P P P P P I P P
- |<--------- GOP --------->|
- In the event of an error, the data stream is resynchronised after at most 12/25th of a second (or 15/30th for the USA)
- The sequence between Is is called a Group Of Pictures
181- Data security and encryption
182Security and Cryptographic Algorithms
- Source: a summary of Chapter Eleven and Simon Singh's "The Code Book"
- A summary of threats
- Security requirements
- Steganography and cryptography
- Substitution ciphers
- Keys and the Caesar cipher
- Cryptanalysis
- The Vigenère cipher
- DES (Data Encryption Standard)
- Diffie-Hellman-Merkle key exchange
- RSA (Rivest, Shamir and Adleman)
- PGP (Pretty Good Privacy)
183Network security threats
- Information can be observed and recorded by eavesdroppers.
- Imposters can attempt to gain unauthorised access
to a server. - An attacker can flood a server with requests,
causing a denial-of-service for legitimate
clients. - An imposter can impersonate a legitimate server
and gain sensitive information from a client. - An imposter can place themselves in the middle,
convincing a server that it is a legitimate
client and a client that it is a legitimate
server.
184Network scrutiny
- Footprinting
- Gathering information on a network (creating a profile of an organization's security posture - identifying a list of network and IP addresses.)
- Scanning
- Identifying live and reachable target systems.
(Ping sweeps, port scans, application of
automated discovery tools). - Enumeration
- Extracting account information. (Examining
active connections to systems).
185Passwords
- Some users, when allowed to choose any password,
select absurdly short ones. - Stallings quotes the example below from Purdue
University. - People also tend to select guessable passwords.
186Passwords
- Stallings references a report which demonstrates the effectiveness of password guessing.
- The author collected UNIX passwords from a variety of encrypted password files.
- Nearly 25% of the passwords were guessed with the following strategy:
- Try the user's name, initials, account name (130 permutations for each).
- Try dictionary words, including the system's own on-line dictionary (60,000 words).
- Try permutations of the words from the step above, including making the first letter uppercase or a control character, making the entire word uppercase, reversing the word, changing 'o's to '0's, etc. (another 1 million words to try).
- More capitalization permutations (another million words to check).
187(No Transcript)
188Behaviour profiles
189Security requirements
- Privacy or confidentiality: information should be readable only by the intended recipient.
- Integrity: the recipient can confirm that the message has not been altered during transmission.
- Authentication: it is possible to verify the identity of the sender and/or receiver.
- Nonrepudiation: the sender cannot deny having sent a given message.
- The above requirements are not new, and various security mechanisms have been used for many years in important transactions.
- What is new is the speed at which break-in attempts can be made from a distance by using a network.
190Steganography
- Steganography (from the Greek steganos, covered, and graphein, to write) involves hiding the existence of a message.
- Herodotus (chronicler of the 5th century BC Greek/Persian conflicts) recounts how an important message was written onto the shaved head of a messenger and delivered when his hair had grown back.
- Many hidden message systems have been used in the past:
- The Chinese wrote on fine silk which was covered in wax and swallowed.
- A 16th century Italian scientist described how to write on a hard-boiled egg with alum and vinegar. The solution passes through the shell and stains the egg inside.
- The FBI found the first microdot (a photographed page reduced to the size of a full stop, pasted into a document) in 1941.
- More recently, images were shown to be easily communicated in the LSBs of higher-resolution images.
191Cryptography
- Cryptography (Greek kryptos-hidden) is the
science of making messages secure. - The original message is the plaintext.
- The encryption/decryption algorithm is called the
cipher. - The encrypted message is the ciphertext.
- (Simon Singh mentions that the correct title for
his book would be The Code and Cipher Book.) - Cryptography can be divided into two branches
transposition and substitution.
192Substitution ciphers
- The Kama-sutra recommends women study 64 arts, including chess, bookbinding, carpentry and the art of secret writing, suggesting a simple substitution cipher involving simple pairing of letters of the alphabet.
- The Caesar cipher applies a simple shift between the plain alphabet and cipher alphabet. The exact shift can be considered as the cipher key.
- An example of a Caesar cipher with a shift of 3 (lower case for plaintext and UPPERCASE for ciphertext):
- a b c d e f g h i j k l m n o p q r s t u v w x y z
- D E F G H I J K L M N O P Q R S T U V W X Y Z A B C
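A small Caesar cipher sketch in C (the key of 3 and the sample plaintext are illustrative):

#include <stdio.h>
#include <ctype.h>

/* shift each letter by 'key' places; print ciphertext in UPPERCASE,
   following the slide's convention */
void caesar(const char *plain, int key)
{
    for (const char *s = plain; *s; s++) {
        if (isalpha((unsigned char)*s))
            putchar('A' + (toupper((unsigned char)*s) - 'A' + key) % 26);
        else
            putchar(*s);
    }
    putchar('\n');
}

int main(void)
{
    caesar("attack at dawn", 3);   /* prints DWWDFN DW GDZQ */
    return 0;
}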
193Keys and the Caesar cipher
- The simple Caesar cipher has just 25 keys (i.e., 25 possible shifts), so cryptanalysts can quickly break the code by trying all possible shifts.
- Allowing any pairing of substitutions results in many, many more combinations (approx. 4 x 10^26), but the communication and safe preservation of the key becomes more difficult.
- A compromise involves the use of a keyword or keyphrase, e.g.:
- a b c d e f g h i j k l m n o p q r s t u v w x y z
- J U L I S C A E R T V W X Y Z B D F G H K M N O P Q
194Cryptanalysis
- Singh describes how early Arab scholars invented
cryptanalysis, for example, using frequency
analysis to identify substitutions. - Relative frequencies of letters of the alphabet
195The Vigenère cipher
- The Vigenère cipher was published in 1586. It is a polyalphabetic cipher (as opposed to a monoalphabetic cipher) because it uses several cipher alphabets per message (and hence makes frequency cryptanalysis more difficult).
- Again a key (keyword or keyphrase) is required.
196(No Transcript)
197DES: The Data Encryption Standard
- IBM invented "Lucifer", an encryption system adopted as the Data Encryption Standard (DES) in 1976.
- DES repeatedly scrambles (mangles) blocks of 64 bits with an encryption key of 56 bits.
- The key was reduced from a longer key to 56 bits as required by the American NSA (National Security Agency).
198Triple DES
- For added security, DES can use two keys as follows:
- C = EK1(DK2(EK1(P)))
- and
- P = DK1(EK2(DK1(C)))
199The key distribution problem
- How can secret keys be exchanged by parties who
want to communicate? - In the late 1970s, banks distributed keys by
employing special dispatch riders who had been
vetted and were among the company's most trusted
employees. They would travel across the world
with padlocked briefcases, personally
distributing keys to everyone who would receive
messages from the bank over the next week.
200Diffie-Hellman-Merkle
- Whitfield Diffie and Martin Hellman
- Diffie accepted a research position with Hellman and was later joined by Ralph Merkle at Stanford.
- Diffie imagined two strangers (Alice and Bob) meeting on the Internet and wondered how they could send each other an encrypted message which an eavesdropper (Eve) could not read.
- Although safe key exchange had been considered impossible ...
201A simple padlock example
- It is possible to imagine secure message exchange over an insecure communication system.
- Imagine Alice sends a package to Bob, securing it with a padlock. Bob can't open it, but adds his own padlock to it and sends it back to Alice, who removes her padlock and sends it back to Bob. Bob can now open his own padlock. QED.
- Alice and Bob both kept their keys safe and the package was never unlocked in the system.
- The problem with applying this simple solution was the order of events. Encryption methods up to this time had required a "last on, first off" ordering.
202One-way functions
- Most mathematical functions are two-way. E.g.,
doubling functions can be undone by halving.
That is, most operations are reversible and the
two operations tend to be of similar orders of
complexity. - One-way functions are impossible, or very
difficult to reverse.
203Modular arithmetic for one-way functions
- Solutions to modular arithmetic functions have apparently random results, which makes guessing solutions based on adjacent results impossible.
- x:            1  2  3   4    5    6
- 3^x:          3  9  27  81   243  729
- 3^x (mod 7):  3  2  6   4    5    1
- In the simple example above it is very easy to calculate 3^x (mod 7) given x, but more difficult to reverse the process, i.e., to find x given 3^x (mod 7).
- With larger values, e.g., 453^x (mod 21,997), it is still relatively easy to encode x, but decoding would be extremely difficult.
- In 1976, Diffie, Hellman and Merkle invented a system for safe key exchange using modular arithmetic to provide one-way functions.
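The easy (forward) direction in C, using square-and-multiply modular exponentiation (a sketch; the values reproduce the table above):

#include <stdio.h>

/* compute b^e mod m without ever forming the huge intermediate b^e */
unsigned long modexp(unsigned long b, unsigned long e, unsigned long m)
{
    unsigned long result = 1;
    b %= m;
    while (e > 0) {
        if (e & 1)
            result = (result * b) % m;   /* include this power of b */
        b = (b * b) % m;                 /* square for the next bit */
        e >>= 1;
    }
    return result;
}

int main(void)
{
    for (unsigned long x = 1; x <= 6; x++)
        printf("3^%lu (mod 7) = %lu\n", x, modexp(3, x, 7));
    printf("453^10 (mod 21997) = %lu\n", modexp(453, 10, 21997));
    return 0;
}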
204Public-key cryptography
- A disadvantage of the Diffie-Hellman-Merkle key
exchange is that it requires interaction (mutual
exchange of information) between Alice and Bob,
i.e., spontaneous interchange of encrypted
messages is not possible. - Diffie went on to specify the requirements for an
asymmetric key system, i.e., a system where the
encryption and decryption keys are different. - The encryption key is the public key and the
decryption key is the private key. - Again, with the padlock analogy, the public key
is like a padlock - anyone can lock it - but
opening it requires a private key kept safe by
the owner. - So Alice can encrypt messages to Bob (without any
special exchanges) using his widely-available
public key.
205RSA (Rivest, Shamir and Adleman)
- Rivest, Shamir and Adleman at MIT developed the public-key cryptography (RSA) specified by Diffie.
- RSA was announced in Scientific American in August 1977.
- The system involves large primes, p and q, which are multiplied together (N = p x q) as part of the public key.
- Factoring N into p and q is extremely difficult for large N.
- For banking transactions, N > 10^308 provides an extremely high level of security (a hundred million PCs would take more than 1000 years to find p and q).
206RSA
- Select two large primes, p and q
- N = p x q
- Select an integer, e, ensuring e and (p-1) x (q-1) are relatively prime.
- Public key: N, e
- (N should be unique, but e need not be)
- The ciphertext, C, of a message, M, is given by:
- C = M^e (mod N)
- So, everyone can encrypt their messages, since N and e are publicly available.
- The private key, d, is calculated as:
- e x d = 1 (mod (p-1) x (q-1))
- Decryption is performed by solving:
- M = C^d (mod N)
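A toy round trip in C reusing the modexp routine from the one-way-functions slide (the textbook-sized primes p = 61 and q = 53 give N = 3233, (p-1) x (q-1) = 3120, e = 17, d = 2753; real keys use primes hundreds of digits long, so this only illustrates the arithmetic):

#include <stdio.h>

unsigned long modexp(unsigned long b, unsigned long e, unsigned long m)
{
    unsigned long r = 1;
    b %= m;
    while (e > 0) {
        if (e & 1) r = (r * b) % m;
        b = (b * b) % m;
        e >>= 1;
    }
    return r;
}

int main(void)
{
    unsigned long N = 3233, e = 17, d = 2753;
    unsigned long M = 65;                 /* the plaintext "message"   */
    unsigned long C = modexp(M, e, N);    /* C = M^e (mod N), public   */
    unsigned long P = modexp(C, d, N);    /* M = C^d (mod N), private  */
    printf("M = %lu -> C = %lu -> decrypted = %lu\n", M, C, P);
    return 0;
}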
207Pretty Good Privacy (PGP)
- Phil Zimmermann believed everybody had a right to the kind of privacy offered by RSA encryption. He developed a user-friendly implementation of RSA called PGP.
- Since RSA is quite computationally complex, Zimmermann designed PGP to use RSA to encrypt the key of a cipher called IDEA, which is similar to DES.
- With the key safely encrypted with RSA, all message data is then encrypted with the simpler cipher, IDEA.
- To send a message to Bob, Alice encrypts (and sends) an IDEA key with Bob's public RSA key and encrypts (and sends) her message with the IDEA key.
- Bob uses his private RSA key to decrypt Alice's IDEA key, which he then uses to decrypt Alice's message.
208Digital signatures for verification
- How can Bob be sure the message he receives is from Alice? Anyone can use his public key to encrypt messages for him.
- Solution: Alice can use her PRIVATE key to ENCRYPT the message (note - the private key is usually used for decryption).
- Any message encrypted with the private key can be decrypted by the public key - so this is not secure (everyone has the public key) - but it does prove authorship.
- So, if Alice encrypts with Bob's public key, privacy is guaranteed. If she encrypts with her private key, she can prove authorship.
- To ensure privacy AND authorship, she first encrypts the message with her private key, then encrypts the result with Bob's public key.
209(No Transcript)
210(No Transcript)
211- Channel errors and compressed data
212Effects of errors on compressed data
(Diagram: compress -> channel error/s -> decompress)
Consider the simple run-length encoding example again: 0000 0000 0000 5555 0000 0000 compressed to (12,0)(4,5)(8,0). With a single error in the compressed version, say (12,2)(4,5)(8,0), this becomes 2222 2222 2222 5555 0000 0000; or (1,0)(4,5)(8,0) becomes 0555 5000 0000 0.
213Channel errors
- Errors can occur singly or in bursts.
- Most channel errors can be approximated by simple state models.
- The BER (byte error rate) is a simple measure of channel quality.
The modified Gilbert model
In the Good state there are no errors. Bad Type 1 and Bad Type 2 represent two types of error events. p1 and p2 = prob. of start of errored states; q1 and q2 = prob. of end of errored states.
214Channel errors
- Burst length distributions provide important information about channel error activity.
(Figure: probability vs. error length; the curves change with increasing p1, decreasing q1, increasing p2 and decreasing q2)
215Channel errors
- Designing error protection systems:
- Know your enemy
- Burst length distribution (and gap length distribution)
- Trade off the correction of data against the addition of redundancy
- System cost/delay
- Should errors be detected and corrected?
- Detect and request retransmission? Detect and conceal?
- How important is the data?
- Is all data equally important? Should some data elements be protected more than others?
- Is accepting bad data as good worse than rejecting good data as bad?
216The Hamming (7,4) code
An example of an (n,k) block code. Each codeword contains n bits: k information bits and (n-k) check bits. Hamming (7,4) is a nice easy code but is not very efficient (a 75% overhead!). It generates codewords with a Hamming distance of 3, i.e., all codewords differ in 3 locations. It can correct one bit in error and detect two.
A codeword C = k1 k2 k3 k4 c1 c2 c3, where k1-k4 are information bits and c1-c3 are check bits (the systematic redundancy), calculated with modulo-2 addition as:
- c1 = k1 + k2 + k4
- c2 = k1 + k3 + k4
- c3 = k2 + k3 + k4
So the data X = 0 1 1 0 becomes the codeword 0 1 1 0 (0+1+0) (0+1+0) (1+1+0) = 0 1 1 0 1 1 0
217The Hamming (7,4) code
If we insert an error at bit 2, 0 1 1 0 1 1 0 becomes 0 0 1 0 1 1 0. Note c1 and c3 are wrong - these intersect at k2, hence k2 is in error.
(Venn diagram: three overlapping circles for c1 = k1+k2+k4, c2 = k1+k3+k4 and c3 = k2+k3+k4, with the received bit values entered)
218The Hamming (7,4) code
If we insert an error at bit 4, 0 1 1 0 1 1 0 becomes 0 1 1 1 1 1 0. Note c1, c2 and c3 are all wrong, hence k4 is in error.
(Venn diagram as before, with the received bit values entered)
219Product codes
Imagine we have 4 x 4 data bits: k11-k14, k21-k24, k31-k34, k41-k44. Arranging them horizontally, we can add error protection in the vertical direction as well:
k11 k12 k13 k14  c11 c12 c13
k21 k22 k23 k24  c21 c22 c23
k31 k32 k33 k34  c31 c32 c33
k41 k42 k43 k44  c41 c42 c43
d11 d12 d13 d14  f11 f12 f13
d21 d22 d23 d24  f21 f22 f23
d31 d32 d33 d34  f31 f32 f33
220Interleaving
Interleaving (systematically reordering) the protected data stream means that errors are distributed, i.e., more correctable. For example, three codewords:
k1 k2 k3 k4 c1 c2 c3  k1 k2 k3 k4 c1 c2 c3  k1 k2 k3 k4 c1 c2 c3
With a three-way interleave the stream is sent as:
k1 k1 k1 k2 k2 k2 k3 k3 k3 k4 k4 k4 c1 c1 c1 c2 c2 c2 c3 c3 c3
so a burst of three channel errors (x x x) hits three adjacent stream positions, and on de-interleaving the errors are distributed, one per codeword (and correctable):
k1 k2 k3 k4 c1 c2 c3  k1 k2 k3 k4 c1 c2 c3  k1 k2 k3 k4 c1 c2 c3
   x                     x                     x
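A three-way block interleaver sketch in C (the three "codewords" are just strings here so the reordering is visible; the depth and length values are assumptions):

#include <stdio.h>
#include <string.h>

#define DEPTH 3   /* codewords per interleave block           */
#define LEN   7   /* symbols per codeword, e.g. Hamming (7,4) */

int main(void)
{
    const char *cw[DEPTH] = {"ABCDEFG", "abcdefg", "1234567"};
    char out[DEPTH * LEN + 1];

    /* write codewords in as rows, read the stream out by columns */
    for (int col = 0; col < LEN; col++)
        for (int row = 0; row < DEPTH; row++)
            out[col * DEPTH + row] = cw[row][col];
    out[DEPTH * LEN] = '\0';
    printf("interleaved: %s\n", out);   /* Aa1Bb2Cc3... */

    /* a 3-symbol channel burst now damages each codeword only once */
    memset(out + 6, 'x', 3);
    printf("after burst: %s\n", out);
    return 0;
}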
221More sophisticated codes
Sophisticated interleaving strategies are used in most advanced digital communication systems. CDs use product codes but with Reed-Solomon codes (not Hamming); they also use interleaving. Reed-Solomon (RS) codes work on groups (e.g., bytes) of inputs. For bytes, n < 2^8 (codewords are a maximum of 255 bytes). There is only a very small probability of crypto-errors, i.e., correcting good bytes by mistake. They can correct (n-k)/2 bytes and detect (n-k) bytes in error. For example, RS(122,106) can correct (122-106)/2 = 8 bytes in error. Other systems use layers of error correction: if a lower (simpler) layer detects errors, the next (more powerful) layer is inspected; if errors are still detected, the final layer is interrogated. This method increases the speed of decoding by only computing check bytes when errors are suspected. For example, the DAT drive uses 3 layers of error control.
222Uncorrected channel errors and compressed data
- Efficiently compressed data bits represent a significantly greater number of bits from the original source, therefore the effects of any errors are correspondingly magnified.