Title: Image Processing Fundamentals
1Digital Image Processing
Chapter 13 Image Compression
Prepared by Eng. Mohamed Hassan
Supervised by Dr. Ashraf Aboshosha
http://www.icgst.com/A_Aboshosha.html | editor_at_icgst.com | Tel. 0020-122-1804952 | Fax. 0020-2-24115475
2Goal of Image Compression
- Digital images require huge amounts of space for storage and large bandwidths for transmission.
- A 640 x 480 color image requires close to 1 MB of space.
- The goal of image compression is to reduce the amount of data required to represent a digital image.
- Reduce storage requirements and increase transmission rates.
3Approaches
- Lossless
- Information preserving
- Low compression ratios
- Lossy
- Not information preserving
- High compression ratios
- Trade-off: image quality vs. compression ratio
4Data ≠ Information
- Data and information are not synonymous terms!
- Data is the means by which information is conveyed.
- Data compression aims to reduce the amount of data required to represent a given quantity of information while preserving as much information as possible.
5Data vs Information
- The same amount of information can be represented by varying amounts of data, e.g.:
Ex 1: Your wife, Helen, will meet you at Logan Airport in Boston at 5 minutes past 6:00 pm tomorrow night.
Ex 2: Your wife will meet you at Logan Airport at 5 minutes past 6:00 pm tomorrow night.
Ex 3: Helen will meet you at Logan at 6:00 pm tomorrow night.
6Data Redundancy
Compression ratio: C = n1 / n2
Relative data redundancy: R = 1 - 1/C
(n1, n2: number of information-carrying units, e.g., bits, in the original and compressed data sets)
For instance, C = 10 gives R = 0.9, i.e., 90% of the data in the original representation is redundant.
7Data Redundancy
Example
8Types of Data Redundancy
- Coding redundancy
- Interpixel redundancy
- Psychovisual redundancy
- Compression attempts to reduce one or more of
these redundancy types.
9Coding Redundancy
- Code: a list of symbols (letters, numbers, bits, etc.)
- Code word: a sequence of symbols used to represent a piece of information or an event (e.g., gray levels).
- Code word length: number of symbols in each code word.
10Coding Redundancy
For an N x M image:
  rk: k-th gray level
  P(rk): probability of rk
  l(rk): number of bits used to represent rk
Expected (average) code word length:
  Lavg = E[l(rk)] = Σk l(rk) P(rk)
Total number of bits: N x M x Lavg
(A small numeric sketch follows below.)
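To make these quantities concrete, here is a minimal Python sketch. The 3-bit source distribution and the variable-length code lengths are assumed for illustration; they are not numbers taken from the slides.

```python
import numpy as np

# Assumed example: 8 gray levels (3-bit image) with a skewed distribution P(r_k).
p = np.array([0.19, 0.25, 0.21, 0.16, 0.08, 0.06, 0.03, 0.02])
l_fixed = np.full(8, 3)                      # natural 3-bit binary code
l_var = np.array([2, 2, 2, 3, 4, 5, 6, 6])   # an assumed variable-length code

L_fixed = float(np.sum(l_fixed * p))         # L_avg for the fixed-length code
L_var = float(np.sum(l_var * p))             # L_avg for the variable-length code

C = L_fixed / L_var                          # compression ratio
R = 1 - 1 / C                                # relative data redundancy

print(f"L_avg fixed = {L_fixed:.2f} bits, L_avg variable = {L_var:.2f} bits")
print(f"C = {C:.2f}, R = {R:.2f}")
```

With these assumed numbers the variable-length code needs 2.7 bits/pixel instead of 3, so C ≈ 1.11 and R ≈ 0.1.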
11Coding Redundancy
Example
12Coding Redundancy
- Use a variable code word length l(rk).
- Take the probabilities of the gray levels into account: assign shorter code words to the more probable gray levels.
13Interpixel redundancy
- Interpixel redundancy implies that any pixel
value can be reasonably predicted by its
neighbors (i.e., correlated).
autocorrelation: the correlation f(x) ∘ g(x) with g(x) = f(x)
14Interpixel redundancy
- To reduce interpixel redundancy, the data must be transformed into another format (i.e., through a transformation).
- e.g., thresholding, differences between adjacent pixels, DFT.
(1 + 10) bits/pair
15Psychovisual redundancy
- The human eye does not respond with equal sensitivity to all visual information.
- It is more sensitive to the lower frequencies than to the higher frequencies in the visual spectrum.
- Idea: discard data that is perceptually insignificant!
16Psychovisual redundancy
Example: quantization from 256 gray levels (8 bits/pixel) to 16 gray levels (4 bits/pixel), so C = 8/4 = 2:1.
[Figure panels: original with 256 gray levels; uniform quantization to 16 gray levels; improved quantization to 16 gray levels, i.e., add to each pixel a small pseudo-random number prior to quantization.]
17How do we measure information?
- What is the information content of a message/image?
- What is the minimum amount of data that is sufficient to describe an image completely, without loss of information?
18Modeling Information
- Information generation is assumed to be a probabilistic process.
- Idea: associate information with probability!
- A random event E with probability P(E) contains I(E) = log(1/P(E)) = -log P(E) units of information.
- Note: I(E) = 0 when P(E) = 1.
19How much information does a pixel contain?
- Suppose that the gray-level values are generated by a random variable; then rk contains I(rk) = -log P(rk) units of information!
20How much information does an image contain?
- Average information content of an image:
  E = Σk I(rk) P(rk)
- Using I(rk) = -log2 P(rk), this is the entropy:
  H = -Σk P(rk) log2 P(rk)   units/pixel
  (assumes statistically independent random events)
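A minimal Python sketch of this estimate, using the normalized gray-level histogram as P(rk). The test image here is a hypothetical random array, so the printed value is only illustrative.

```python
import numpy as np

def first_order_entropy(img):
    """First-order entropy estimate (bits/pixel) from the gray-level histogram.

    Treats pixels as statistically independent samples, which is exactly the
    assumption behind the first-order estimate.
    """
    hist = np.bincount(img.ravel(), minlength=256).astype(np.float64)
    p = hist / hist.sum()          # P(r_k) ~ normalized histogram
    p = p[p > 0]                   # skip empty bins (0 * log 0 = 0)
    return -np.sum(p * np.log2(p))

# Hypothetical 8-bit test image.
rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)
print(f"H ~ {first_order_entropy(img):.2f} bits/pixel")
```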
21Redundancy (revisited)
Redundancy relative to the entropy: R = 1 - 1/C, where C = Lavg / H.
Note: if Lavg = H, then R = 0 (no redundancy).
22Entropy Estimation
- It is not easy to estimate H reliably!
image
23Entropy Estimation
- First-order estimate of H: use the relative frequencies of the individual gray levels (i.e., the normalized histogram).
24Estimating Entropy
- Second order estimate of H
- Use relative frequencies of pixel blocks
image
25Estimating Entropy
- The first-order estimate provides only a lower bound on the compression that can be achieved.
- Differences between higher-order estimates of entropy and the first-order estimate indicate the presence of interpixel redundancy!
- Need to apply transformations!
26Estimating Entropy
- For example, consider the differences between adjacent pixels:
27Estimating Entropy
- Entropy of the difference image.
- Better than before (i.e., H = 1.81 bits/pixel for the original image).
- However, a better transformation could be found, since the entropy of the difference image still exceeds the higher-order estimates for the original image.
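The effect is easy to reproduce. The sketch below compares the first-order entropy of a synthetic, smoothly varying image with that of its horizontal difference image; the image is made up, so the numbers will differ from the slide's example.

```python
import numpy as np

def first_order_entropy(values):
    """Entropy (bits/sample) of an integer array, estimated from its histogram."""
    _, counts = np.unique(values, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

# Hypothetical smooth test image: neighboring pixels are highly correlated.
x = np.linspace(0, 4 * np.pi, 128)
img = (127.5 + 100 * np.sin(x)[None, :] * np.ones((128, 1))).astype(np.uint8)

# Difference image along rows (first column kept as-is), as on slide 26.
diff = np.diff(img.astype(np.int16), axis=1)

print(f"H(original)    ~ {first_order_entropy(img):.2f} bits/pixel")
print(f"H(differences) ~ {first_order_entropy(diff):.2f} bits/pixel")
```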
28Image Compression Model
29Image Compression Model
- Mapper: transforms the input data into a format that facilitates reduction of interpixel redundancies.
30Image Compression Model
- Quantizer: reduces the accuracy of the mapper's output in accordance with some pre-established fidelity criterion.
31Image Compression Model
- Symbol encoder: assigns the shortest code words to the most frequently occurring output values.
32Image Compression Models
- Inverse operations are performed.
- But quantization is irreversible in general.
33Fidelity Criteria
- How close is the reconstructed image f^(x, y) to the original image f(x, y)?
- Criteria:
  - Subjective: based on human observers.
  - Objective: mathematically defined criteria.
34Subjective Fidelity Criteria
35Objective Fidelity Criteria
- Root-mean-square error (RMSE):
  e_rms = sqrt( (1/(M·N)) Σx Σy [f^(x,y) - f(x,y)]² )
- Mean-square signal-to-noise ratio (SNR):
  SNR_ms = Σx Σy f^(x,y)² / Σx Σy [f^(x,y) - f(x,y)]²
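A minimal sketch of both criteria in Python; the "reconstruction" here is just a hypothetical noisy copy of a random image, purely to exercise the formulas.

```python
import numpy as np

def rmse(f, f_hat):
    """Root-mean-square error between an image f and its reconstruction f_hat."""
    err = f_hat.astype(np.float64) - f.astype(np.float64)
    return np.sqrt(np.mean(err ** 2))

def snr_ms(f, f_hat):
    """Mean-square signal-to-noise ratio of the reconstruction."""
    f_hat = f_hat.astype(np.float64)
    err = f_hat - f.astype(np.float64)
    return np.sum(f_hat ** 2) / np.sum(err ** 2)

# Hypothetical example: an 8-bit image and a noisy reconstruction of it.
rng = np.random.default_rng(1)
f = rng.integers(0, 256, size=(64, 64))
f_hat = np.clip(f + rng.normal(0, 5, f.shape), 0, 255)
print(f"RMSE = {rmse(f, f_hat):.2f}, SNR_ms = {snr_ms(f, f_hat):.1f}")
```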
36Objective Fidelity Criteria (contd)
[Figure: three reconstructed images with RMSE = 5.17, 15.67, and 14.17, respectively.]
37Lossless Compression
38Lossless Methods Taxonomy
39Huffman Coding (coding redundancy)
- A variable-length coding technique.
- Optimal code (i.e., minimizes the number of code symbols per source symbol).
- Assumption: symbols are encoded one at a time!
40Huffman Coding
- Forward Pass
- 1. Sort the symbol probabilities.
- 2. Combine the two lowest probabilities.
- 3. Repeat Step 2 until only two probabilities remain.
41Huffman Coding
- Backward Pass
- Assign code symbols going backwards
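The forward and backward passes just described can be sketched compactly in Python. The source probabilities below are assumed for illustration (the slide's own table is not reproduced in the text).

```python
import heapq
from itertools import count

def huffman_code(probabilities):
    """Build a Huffman code from a {symbol: probability} dict.

    Minimal sketch: repeatedly merge the two least probable nodes (forward
    pass), prepending a 0/1 to the code words of each merged branch (backward
    pass, done incrementally here).
    """
    tie = count()  # tie-breaker so the heap never has to compare dicts
    heap = [(p, next(tie), {s: ""}) for s, p in probabilities.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        p1, _, c1 = heapq.heappop(heap)      # two lowest probabilities
        p2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + w for s, w in c1.items()}
        merged.update({s: "1" + w for s, w in c2.items()})
        heapq.heappush(heap, (p1 + p2, next(tie), merged))
    return heap[0][2]

# Assumed example probabilities (not necessarily the slide's numbers).
probs = {"a1": 0.4, "a2": 0.3, "a3": 0.1, "a4": 0.1, "a5": 0.06, "a6": 0.04}
code = huffman_code(probs)
L_avg = sum(probs[s] * len(code[s]) for s in probs)
print(code)
print(f"L_avg = {L_avg:.2f} bits/symbol")
```

Because the code is prefix-free, decoding a bit stream with a look-up table (slide 43) is unambiguous.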
42Huffman Coding
- Lavg using Huffman coding
- Lavg assuming binary codes
43Huffman Coding/Decoding
- After the code has been created, coding/decoding can be implemented using a look-up table.
- Note that decoding is done unambiguously.
44Arithmetic (or Range) Coding (coding redundancy)
- Does not assume that source symbols are encoded one at a time.
- Sequences of source symbols are encoded together.
- There is no one-to-one correspondence between source symbols and code words.
- Slower than Huffman coding, but typically achieves better compression.
45Arithmetic Coding (contd)
- A sequence of source symbols is assigned a single arithmetic code word which corresponds to a sub-interval of [0, 1).
- As the number of symbols in the message increases, the interval used to represent it becomes smaller.
- Smaller intervals require more information units (i.e., bits) to be represented.
46Arithmetic Coding (contd)
Encode the message a1 a2 a3 a3 a4:
1) Assume the message occupies [0, 1).
2) Subdivide [0, 1) based on the probabilities of the ai.
3) Update the interval by processing the source symbols.
47Example
Encode a1 a2 a3 a3 a4:
the final interval is [0.06752, 0.0688), so e.g. 0.068 can be transmitted.
(A short encoding sketch follows below.)
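A minimal floating-point sketch of the interval narrowing. The symbol probabilities are assumed here (a1, a2, a4 with probability 0.2 and a3 with 0.4), since the slide's probability table is not in the text; with that model the sketch reproduces the interval quoted above.

```python
# Minimal arithmetic-encoding sketch (floating point; real coders use
# integer/renormalized arithmetic to avoid precision problems).
def arithmetic_encode(message, intervals):
    low, high = 0.0, 1.0
    for sym in message:
        s_low, s_high = intervals[sym]
        width = high - low
        low, high = low + width * s_low, low + width * s_high
    return low, high          # any number in [low, high) encodes the message

# Assumed source model: a1, a2, a4 with probability 0.2; a3 with probability 0.4.
intervals = {"a1": (0.0, 0.2), "a2": (0.2, 0.4), "a3": (0.4, 0.8), "a4": (0.8, 1.0)}
low, high = arithmetic_encode(["a1", "a2", "a3", "a3", "a4"], intervals)
print(f"{low:.5f} {high:.5f}")   # -> 0.06752 0.06880
```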
48Example
- The message a1 a2 a3 a3 a4 is encoded using 3 decimal digits, or 3/5 = 0.6 decimal digits per source symbol.
- The entropy of this message is
  -(3 x 0.2 log10(0.2) + 0.4 log10(0.4)) = 0.5786 digits/symbol
- Note: finite-precision arithmetic might cause problems due to truncation!
49Arithmetic Decoding
Decode the value 0.572 using the same source model (a1: [0.0, 0.2), a2: [0.2, 0.4), a3: [0.4, 0.8), a4: [0.8, 1.0)).
[Figure: decoding diagram. At each step the decoder finds the sub-interval that contains the current value and rescales it into [0, 1); the interval boundaries shown on the slide (1.0, 0.8, 0.72, 0.592, 0.5728, ...) trace this successive subdivision.]
Decoded message: a3 a3 a1 a2 a4
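A minimal decoding sketch matching the encoder above; it assumes the same source model and that the decoder knows the message length (here 5 symbols, otherwise an end-of-message symbol would be needed).

```python
# Minimal decoding sketch: locate the sub-interval containing the code value,
# emit that symbol, then rescale the value into [0, 1) and repeat.
def arithmetic_decode(value, intervals, n_symbols):
    message = []
    for _ in range(n_symbols):
        for sym, (s_low, s_high) in intervals.items():
            if s_low <= value < s_high:
                message.append(sym)
                value = (value - s_low) / (s_high - s_low)   # rescale
                break
    return message

intervals = {"a1": (0.0, 0.2), "a2": (0.2, 0.4), "a3": (0.4, 0.8), "a4": (0.8, 1.0)}
print(arithmetic_decode(0.572, intervals, 5))   # -> ['a3', 'a3', 'a1', 'a2', 'a4']
```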
50LZW Coding (interpixel redundancy)
- Requires no a priori knowledge of the pixel probability distribution.
- Assigns fixed-length code words to variable-length sequences.
- Patented algorithm (US Patent 4,558,302).
- Included in the GIF, TIFF, and PDF file formats.
51LZW Coding
- A codebook (or dictionary) needs to be constructed.
- Initially, the first 256 entries of the dictionary are assigned to the gray levels 0, 1, 2, ..., 255 (i.e., assuming 8 bits/pixel).
Initial Dictionary
Consider a 4x4, 8-bit image:
39 39 126 126
39 39 126 126
39 39 126 126
39 39 126 126
52LZW Coding (contd)
39 39 126 126
39 39 126 126
39 39 126 126
39 39 126 126
As the encoder examines the image pixels, gray-level sequences (i.e., blocks) that are not in the dictionary are assigned to a new entry.
- Is 39 in the dictionary? ... Yes
- What about 39-39? ... No
- Then add 39-39 to the dictionary at entry 256: 39-39
53Example
Concatenated sequence: CS = CR + P
(CR = currently recognized sequence, P = next pixel)
Pixel stream: 39 39 126 126 39 39 126 126 39 39 126 126 39 39 126 126
Initially, CR is empty.
If CS is found in the dictionary D: (1) no output; (2) CR = CS.
Else: (1) output D(CR); (2) add CS to D; (3) CR = P.
(A runnable sketch of this procedure follows below.)
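A minimal Python sketch of the procedure above, run on the slide's 4x4 image scanned row by row. The output sequence shown in the comment is what this sketch produces.

```python
def lzw_encode(pixels, bits_per_pixel=8):
    """Minimal LZW encoding sketch for a 1-D pixel sequence.

    The dictionary starts with every single gray level; new entries
    (starting at 256 for 8-bit data) are added for sequences not yet seen.
    """
    dictionary = {(g,): g for g in range(2 ** bits_per_pixel)}
    next_code = 2 ** bits_per_pixel
    cr = ()                      # currently recognized sequence CR
    output = []
    for p in pixels:
        cs = cr + (p,)           # concatenated sequence CS = CR + P
        if cs in dictionary:
            cr = cs              # keep growing the recognized sequence
        else:
            output.append(dictionary[cr])
            dictionary[cs] = next_code
            next_code += 1
            cr = (p,)
    if cr:
        output.append(dictionary[cr])
    return output

# The 4x4 image from the slide, scanned row by row.
pixels = [39, 39, 126, 126] * 4
print(lzw_encode(pixels))   # -> [39, 39, 126, 126, 256, 258, 260, 259, 257, 126]
```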
54Decoding LZW
- The dictionary which was used for encoding need not be sent with the image.
- It can be built on the fly by the decoder as it reads the received code words.
55Differential Pulse Code Modulation (DPCM) Coding
(interpixel redundancy)
- A predictive coding approach.
- Each pixel value (except at the boundaries) is predicted from its neighbors (e.g., by a linear combination) to produce a predicted image.
- The difference between the original and predicted images yields a differential (residual) image.
- The residual image has a much smaller dynamic range of pixel values.
- The differential image is encoded using Huffman coding.
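A minimal sketch of the predictive step only, using a simple previous-pixel (left-neighbor) predictor and omitting the Huffman stage; the test image is a made-up random array, used just to show that the round trip is lossless.

```python
import numpy as np

def dpcm_residual(img):
    """Minimal DPCM sketch: predict each pixel from its left neighbor.

    Returns the residual image; the first column is kept unpredicted so the
    decoder can rebuild the image exactly (lossless DPCM).
    """
    img = img.astype(np.int16)
    residual = img.copy()
    residual[:, 1:] = img[:, 1:] - img[:, :-1]   # e = f - prediction
    return residual

def dpcm_reconstruct(residual):
    """Invert the predictor: cumulative sum along each row."""
    return np.cumsum(residual, axis=1).astype(np.uint8)

rng = np.random.default_rng(2)
img = rng.integers(0, 256, size=(8, 8), dtype=np.uint8)
res = dpcm_residual(img)
assert np.array_equal(dpcm_reconstruct(res), img)   # lossless round trip
```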
56Run-length coding (RLC) (interpixel redundancy)
- Used to reduce the size of a repeating string of characters (i.e., runs):
  1 1 1 1 1 0 0 0 0 0 0 1 → (1,5) (0,6) (1,1)
  a a a b b b b b b c c → (a,3) (b,6) (c,2)
- Encodes a run of symbols into two bytes: (symbol, count).
- Can compress any type of data, but cannot achieve high compression ratios compared to other compression methods.
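A minimal sketch that reproduces the two examples above:

```python
from itertools import groupby

def run_length_encode(seq):
    """Minimal run-length coding sketch: map each run to a (symbol, count) pair."""
    return [(sym, len(list(run))) for sym, run in groupby(seq)]

print(run_length_encode([1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 1]))  # [(1, 5), (0, 6), (1, 1)]
print(run_length_encode("aaabbbbbbcc"))                         # [('a', 3), ('b', 6), ('c', 2)]
```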
57Bit-plane coding (interpixel redundancy)
- An effective technique to reduce interpixel redundancy is to process each bit plane individually.
- (1) Decompose the image into a series of binary images.
- (2) Compress each binary image (e.g., using run-length coding).
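A minimal sketch of step (1) for an 8-bit image, with a check that the decomposition loses nothing; the test image is a small made-up array.

```python
import numpy as np

def bit_planes(img):
    """Decompose an 8-bit image into its 8 binary bit planes (LSB first)."""
    return [((img >> b) & 1).astype(np.uint8) for b in range(8)]

rng = np.random.default_rng(3)
img = rng.integers(0, 256, size=(4, 4), dtype=np.uint8)

planes = bit_planes(img)
reassembled = np.zeros_like(img)
for b, plane in enumerate(planes):
    reassembled += plane << b             # put each bit back in its position
assert np.array_equal(reassembled, img)   # the decomposition is lossless
```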
58Combining Huffman Coding with Run-length Coding
- Assuming that a message has been encoded using Huffman coding, additional compression can be achieved using run-length coding.
e.g., (0,1)(1,1)(0,1)(1,0)(0,2)(1,4)(0,2)
59Lossy Compression
- Transform the image into a domain where compression can be performed more efficiently (i.e., reduce interpixel redundancies).
- The N x N image is divided into (N/n)² subimages of size n x n, each of which is transformed separately.
60Example Fourier Transform
The magnitude of the FT decreases as u, v increase!
Keep only K x K coefficients with K << N, i.e., let the sums in the inverse transform run from 0 to K-1 instead of 0 to N-1.
61Transform Selection
- T(u,v) can be computed using various transformations, for example:
- DFT (Discrete Fourier Transform)
- DCT (Discrete Cosine Transform)
- KLT (Karhunen-Loève Transform)
62DCT
63DCT (contd)
- Basis set of functions for a 4x4 image (i.e., cosines of different frequencies).
64DCT (contd)
[Figure: approximations using the DFT, WHT, and DCT with 8 x 8 subimages, 64 coefficients per subimage, and 50% of the coefficients truncated. RMS errors: DFT 2.32, WHT 1.78, DCT 1.13.]
65DCT (contd)
[Figure: DCT reconstructions using 2 x 2, 4 x 4, and 8 x 8 subimages, compared with the original.]
66JPEG Compression
- JPEG is an image compression standard which was accepted as an international standard in 1992.
- Developed by the Joint Photographic Experts Group of the ISO/IEC for coding and compression of color/grayscale images.
- Yields acceptable compression in the 10:1 range.
- A scheme for video compression based on JPEG, called Motion JPEG (MJPEG), exists.
67JPEG Compression (contd)
- JPEG uses DCT for handling interpixel redundancy.
- Modes of operation
- (1) Sequential DCT-based encoding
- (2) Progressive DCT-based encoding
- (3) Lossless encoding
- (4) Hierarchical encoding
68JPEG Compression (Sequential DCT-based encoding)
[Block diagram: sequential DCT-based encoder and decoder, including the entropy encoder/decoder.]
69JPEG Steps
- 1. Divide the image into 8x8 subimages.
- For each subimage do:
- 2. Shift the gray levels into the range [-128, 127]
  (the DCT requires the range to be centered around 0).
- 3. Apply the DCT (i.e., 64 coefficients):
  - 1 DC coefficient: F(0,0)
  - 63 AC coefficients: F(u,v)
70JPEG Steps
- 4. Quantize the coefficients (i.e., reduce the amplitude of coefficients that do not contribute much):
  F'(u,v) = round( F(u,v) / Q(u,v) )
  Q(u,v): quantization table
71Example
72Example (contd)
Quantization
73JPEG Steps (contd)
- 5. Order the coefficients using zig-zag ordering:
  - Place non-zero coefficients first.
  - Create long runs of zeros (i.e., good for run-length encoding).
- (A numeric sketch of steps 2-5 on a single block follows below.)
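The sketch below runs steps 2-5 on one 8x8 block. The quantization table is a made-up stand-in (not the standard JPEG luminance table), and the input block is a hypothetical smooth ramp, chosen so that the effect of quantization and zig-zag ordering is visible.

```python
import numpy as np

def dct_matrix(n=8):
    """Orthonormal DCT-II basis matrix, so that F = C @ block @ C.T."""
    k = np.arange(n)[:, None]
    x = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * x + 1) * k / (2 * n))
    C[0, :] = np.sqrt(1.0 / n)
    return C

def zigzag_order(n=8):
    """(row, col) pairs in zig-zag order: by anti-diagonal, alternating direction."""
    return sorted(((i, j) for i in range(n) for j in range(n)),
                  key=lambda ij: (ij[0] + ij[1],
                                  ij[0] if (ij[0] + ij[1]) % 2 else ij[1]))

# Assumed quantization table (NOT the standard JPEG luminance table):
# coarser steps for higher frequencies, just to show the mechanism.
Q = 16 + 4 * np.add.outer(np.arange(8), np.arange(8))

def encode_block(block):
    """Steps 2-5 for one 8x8 block: level shift, DCT, quantize, zig-zag scan."""
    C = dct_matrix(8)
    shifted = block.astype(np.float64) - 128.0        # step 2: center around 0
    F = C @ shifted @ C.T                             # step 3: 2-D DCT
    Fq = np.round(F / Q).astype(int)                  # step 4: quantization
    return [Fq[i, j] for i, j in zigzag_order(8)]     # step 5: zig-zag ordering

# A smooth (hypothetical) block: most quantized coefficients come out as 0,
# which is what makes the later run-length/entropy coding effective.
block = (100 + 3 * np.add.outer(np.arange(8), np.arange(8))).astype(np.uint8)
print(encode_block(block))
```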
74Example
75JPEG Steps (contd)
- 6. Form an intermediate symbol sequence and encode the coefficients:
  - 6.1 DC coefficients: predictive encoding.
  - 6.2 AC coefficients: variable-length coding.
76Intermediate Coding
DC coefficients: symbol_1 (SIZE), symbol_2 (AMPLITUDE)
  e.g., DC = 61 → (6)(61)
AC coefficients: symbol_1 (RUN-LENGTH, SIZE), symbol_2 (AMPLITUDE)
  e.g., AC = -3 with no preceding zeros → (0,2)(-3)
SIZE: number of bits used to encode the amplitude
RUN-LENGTH: number of zeros preceding the coefficient
An end-of-block (EOB) symbol terminates each block.
77DC/AC Symbol Encoding
DC: symbol_1 (SIZE), symbol_2 (AMPLITUDE)
  - Predictive coding of the DC coefficients.
  - Amplitude range [-2048, 2047], i.e., [-2^11, 2^11 - 1], so 1 <= SIZE <= 11.
AC: symbol_1 (RUN-LENGTH, SIZE), symbol_2 (AMPLITUDE)
  - Amplitude range [-2^10, 2^10 - 1], so 1 <= SIZE <= 10.
  - e.g., 0 0 0 0 0 0 476 → (6,9)(476)
  - If RUN-LENGTH > 15, use the symbol (15,0), i.e., a run of 16 zeros.
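The SIZE category is just the bit length of the coefficient's magnitude. The short sketch below computes it and forms AC intermediate symbols, using the examples above (61 → 6, 476 → 9, -3 → 2) as checks.

```python
def size_category(amplitude):
    """SIZE = number of bits needed for the magnitude of a coefficient."""
    return abs(amplitude).bit_length()

def ac_symbols(run_length, amplitude):
    """Intermediate AC symbols (symbol_1, symbol_2), splitting runs longer than 15."""
    symbols = []
    while run_length > 15:                 # (15, 0) stands for a run of 16 zeros
        symbols.append(((15, 0), None))
        run_length -= 16
    symbols.append(((run_length, size_category(amplitude)), amplitude))
    return symbols

print(size_category(61))        # 6  -> DC symbols (6)(61)
print(ac_symbols(6, 476))       # [((6, 9), 476)]
print(ac_symbols(0, -3))        # [((0, 2), -3)]
```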
78Effect of Quality
[Figure: the same image at JPEG quality 90 (58 KB), 50 (21 KB), and 10 (8 KB); lower quality settings give higher compression, higher quality settings give lower compression.]
79Example 1 homogeneous 8 x 8 block
80Example 1 (contd)
Quantized
De-quantized
81Example 1 (contd)
Reconstructed
Error
82Example 2 less homogeneous 8 x 8 block
83Example 2 (contd)
Quantized
De-quantized
84Example 2 (contd)
Error
Reconstructed spatial
85JPEG for Color Images
- Could apply JPEG to the R/G/B components separately.
- It is more efficient to describe a color in terms of its luminance and chrominance content separately, to enable more efficient processing.
  - e.g., the YUV representation.
- Chrominance can be subsampled because human vision is relatively insensitive to chrominance detail.
86JPEG for Color Images
- Luminance: perceived brightness of the light (proportional to the total energy in the visible band).
- Chrominance: describes the perceived color tone of the light (depends on the wavelength composition of the light).
  - Hue: specifies the color tone (i.e., depends on the peak wavelength of the light).
  - Saturation: describes how pure the color is (i.e., depends on the spread, or bandwidth, of the light spectrum).
87JPEG for Color Images
Encoder
Decoder
88JPEG Modes
- JPEG supports several different modes:
  - Sequential Mode
  - Progressive Mode
  - Hierarchical Mode
  - Lossless Mode
- Sequential is the default mode:
  - Each image component is encoded in a single left-to-right, top-to-bottom scan.
  - This is the mode we have been describing.
89Progressive JPEG
- The image is encoded in multiple scans, in order to produce a quick, rough decoded image when the transmission time is long.
[Figure: sequential vs. progressive transmission of the same image.]
90Progressive JPEG (contd)
- Send the DCT coefficients in multiple scans:
  - (1) Progressive spectral selection algorithm
  - (2) Progressive successive approximation algorithm
  - (3) Hybrid progressive algorithm
91Progressive JPEG (contd)
- (1) Progressive spectral selection algorithm:
  - Group the DCT coefficients into several spectral bands.
  - Send the low-frequency DCT coefficients first.
  - Send the higher-frequency DCT coefficients next.
92Progressive JPEG (contd)
- (2) Progressive successive approximation algorithm:
  - Send all DCT coefficients, but with lower precision.
  - Refine the DCT coefficients in later scans.
93Progressive JPEG (contd)
- (3) Hybrid progressive algorithm:
  - Combines spectral selection and successive approximation.
94Hierarchical JPEG
- Hierarchical mode encodes the image at several different resolutions.
- The image is transmitted in multiple passes, with increased resolution at each pass.
95Hierarchical JPEG (contd)
[Figure: hierarchical pyramid with f4 at resolution N/4 x N/4, f2 at N/2 x N/2, and the full-resolution image f at N x N.]
96Hierarchical JPEG (contd)
97Hierarchical JPEG (contd)
98Lossless Differential Pulse Code Modulation
(DPCM) Coding
- Each pixel value (except at the boundaries) is predicted from certain neighbors (e.g., by a linear combination) to get a predicted image.
- The difference between the original and predicted images yields a differential (residual) image.
- The differential image is encoded using Huffman coding.
[Block diagram: input x_m, predictor output p_m, difference d_m = x_m - p_m fed to the entropy encoder.]
99Lossy Differential Pulse Code Modulation (DPCM)
Coding
- Similar to lossless DPCM except that (i) it uses
quantization and (ii) the pixels are predicted
from the reconstructed values of certain
neighbors.
100Block Truncation Coding
- Divide the image into non-overlapping blocks of pixels.
- Derive a bitmap (0/1) for each block using thresholding.
  - e.g., use the mean pixel value in each block as the threshold.
- For each group of 1s and 0s, determine a reconstruction value.
  - e.g., the average of the corresponding pixel values in the original block.
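A minimal sketch of the variant described above (mean threshold, group averages as reconstruction values) on one made-up 4x4 block:

```python
import numpy as np

def btc_block(block):
    """Block truncation coding of one block: bitmap + two reconstruction levels."""
    threshold = block.mean()
    bitmap = block >= threshold
    low = block[~bitmap].mean() if (~bitmap).any() else threshold
    high = block[bitmap].mean() if bitmap.any() else threshold
    return bitmap, low, high

def btc_reconstruct(bitmap, low, high):
    """Rebuild the block from the bitmap and the two reconstruction levels."""
    return np.where(bitmap, high, low)

rng = np.random.default_rng(5)
block = rng.integers(0, 256, size=(4, 4)).astype(np.float64)
bitmap, low, high = btc_block(block)
print(np.round(btc_reconstruct(bitmap, low, high)))
```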
101Subband Coding
- Analyze the image to produce components containing frequencies in well-defined bands (i.e., subbands).
  - e.g., using the wavelet transform.
- Optimize quantization/coding in each subband.
102Vector Quantization
- Develop a dictionary of fixed-size vectors (i.e., code vectors), usually blocks of pixel values.
- Partition the image into non-overlapping blocks (i.e., image vectors).
- Encode each image vector by the index of its closest code vector.
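A minimal sketch of the encode/decode step with a hypothetical 4-entry codebook of flattened 2x2 blocks; real systems would learn the codebook from training images (e.g., with a clustering algorithm).

```python
import numpy as np

def vq_encode(image_vectors, codebook):
    """Map each image vector to the index of its nearest code vector (L2 distance)."""
    d = np.linalg.norm(image_vectors[:, None, :] - codebook[None, :, :], axis=2)
    return np.argmin(d, axis=1)

def vq_decode(indices, codebook):
    """Replace each index by its code vector."""
    return codebook[indices]

# Hypothetical codebook of four 2x2 blocks (flattened to length-4 vectors).
codebook = np.array([[0, 0, 0, 0], [255, 255, 255, 255],
                     [0, 255, 0, 255], [255, 0, 255, 0]], dtype=np.float64)
vectors = np.array([[10, 5, 0, 12], [250, 255, 240, 251]], dtype=np.float64)
indices = vq_encode(vectors, codebook)
print(indices, vq_decode(indices, codebook))
```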
103Fractal Coding
- What is a fractal?
  - A rough or fragmented geometric shape that can be split into parts, each of which is (at least approximately) a reduced-size copy of the whole.
- Idea: store images as collections of transformations!
104Fractal Coding (contd)
Generated by 4 affine transformations!
105Fractal Coding (contd)
- Decompose the image into segments (i.e., using standard segmentation techniques based on edges, color, texture, etc.) and look them up in a library of IFS codes.
- Best suited for textures and natural images.
106Fingerprint Compression
- An image coding standard for digitized fingerprints, developed and maintained by:
  - FBI
  - Los Alamos National Lab (LANL)
  - National Institute of Standards and Technology (NIST)
- The standard employs a discrete wavelet transform-based algorithm (Wavelet/Scalar Quantization, or WSQ).
107Memory Requirements
- The FBI is digitizing fingerprints at 500 dots per inch with 8 bits of grayscale resolution.
- A single fingerprint card turns into about 10 MB of data!
A sample fingerprint image: 768 x 768 pixels = 589,824 bytes.
108Preserving Fingerprint Details
The "white" spots in the middle of the black ridges are sweat pores. They're admissible points of identification in court, as are the little black flesh islands in the grooves between the ridges.
These details are just a couple of pixels wide!
109What compression scheme should be used?
- Better to use a lossless method to preserve every pixel perfectly?
- Unfortunately, in practice lossless methods haven't done better than 2:1 on fingerprints!
- Would JPEG work well for fingerprint compression?
110Varying compression ratio
- The FBI's target bit rate is around 0.75 bits per pixel (bpp).
  - i.e., this corresponds to a target compression ratio of 10.7:1 (assuming 8-bit images).
- The target bit rate is set via a "knob" on the WSQ algorithm.
  - i.e., similar to the "quality" parameter in many JPEG implementations.
- Fingerprints coded with WSQ at a target of 0.75 bpp will actually come in at around 15:1.
111End
Thank you