Image Compression by Learning Matrix Ortho-normal Bases

About This Presentation

Title:

Image Compression by Learning Matrix Ortho-normal Bases

Description:

Competitive with JPEG. 2. Vector. Conventional learning methods in vision like PCA, ICA, etc. ... Results competitive with JPEG standard. Future extensions: ... – PowerPoint PPT presentation

Number of Views:63

Avg rating:3.0/5.0

Slides: 24

Provided by: cir9

Learn more at: https://www.cise.ufl.edu

Category:

more less

Transcript and Presenter's Notes

Title: Image Compression by Learning Matrix Ortho-normal Bases

1
Image Compression by Learning Matrix Ortho-normal
Bases

Karthik Gurumoorthy
Ajit Rajwade
Arunava Banerjee
Anand Rangarajan
Department of CISE
University of Florida

2
Overview

A new approach to lossy image compression based
on machine learning.
Key idea Learning of Matrix Ortho-normal Bases
from training data to efficiently code images.
Applied to compression of well-known face
databases like ORL, Yale.
Competitive with JPEG.

3
Background Images as Vectors
Vector

Image
Conventional learning methods in vision like PCA,
ICA, etc.
4
Background Images as Matrices
Treated as a

Image
Matrix
Our approach following Rangarajan EMMCVPR-2001
Ye JMLR-2004
5
Image Patches

Image
Image of size divided into N
patches of size each treated as a
Matrix.
6
SVD of a Patch
P
U
S
V

U and V Ortho-normal matrices
S Diagonal Matrix of singular values
7
Exponentially decreasing Singular Values
useful for compression (e.g. SSVD Ranade et
al-IVC 2007).
8
SVD for Ensemble of Patches?

Consider a set of N image patches
SVD of each patch gives
Costly in terms of storage as we need to store N
ortho-normal basis pairs.

9
Common Orthonormal Basis-pairs?

Produce ortho-normal
basis-pairs, common for all N patches.
Since storing the basis pairs
is not expensive.

10
Away from SVD

Non-diagonal
Non-sparse

11
Away from SVD

What sparse matrix will optimally
reconstruct from ?
Optimally least error
Sparse matrix has at most some
non-zero elements.

12
Away from SVD

We have a simple, provably optimal greedy method
to compute such a
Compute the matrix .
In matrix , nullify all except the
largest elements to produce .

13
Learning algorithm

A set of N image patches
.
Learning K ltlt N ortho-normal basis pairs

Projection Matrices
Memberships
14
Summary of Training Algorithm

Input N image patches of size .
Output K pairs of ortho-normal bases
called as dictionary.

15
Testing phase

Divide each test image into patches of size
Fix per-pixel average error (say e), similar to
the quality user-parameter in JPEG.

16
Testing phase
. . .
. . .
. . .
17
Results ROC Curve (ORL Database)
RPP number of bits per pixel
18
Sample Reconstructions
0.92 bits
1.36 bits
0.5 bits
1.78 bits
3.023 bits
19
Results ORL Database

Size of original database is 3.46 MB.
Size of dictionary of 50 ortho-normal basis pairs
is 56 KB0.05MB.
Size of database after compression and coding
with our method with e 0.0001 is 1.3 MB.
Total compression rate achieved is 61.

20
Results ROC Curve (Yale Database)
RPP number of bits per pixel
21
Conclusions

New lossy image compression method using machine
learning.
Key idea 1 matrix based image representation.
Key idea2 Learning small set of matrix
ortho-normal basis pairs tuned to a database.
Results competitive with JPEG standard.
Future extensions video compression.

22
References

A. Rangarajan, Learning matrix space image
representations, Energy Minimizing Methods in
Computer Vision and Pattern Recognition, 2001.
J. Ye, Generalized low rank approximation of
matrices, Journal of Machine Learning Research
,2004.
M. Aharon, M. Elad and A. Bruckstein, The K-SVD
An algorithm for designing of overcomplete
dictionaries for sparse representation. IEEE
Transactions on Signal Processing, 2006.
A. Ranade, S. Mahabalarao and S. Kale. A
variation on SVD based image compression. Image
and Vision Computing, 2007.