Image Compression by Learning Matrix Ortho-normal Bases - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

Image Compression by Learning Matrix Ortho-normal Bases

Description:

Competitive with JPEG. 2. Vector. Conventional learning methods in vision like PCA, ICA, etc. ... Results competitive with JPEG standard. Future extensions: ... – PowerPoint PPT presentation

Number of Views:63
Avg rating:3.0/5.0
Slides: 24
Provided by: cir9
Learn more at: https://www.cise.ufl.edu
Category:

less

Transcript and Presenter's Notes

Title: Image Compression by Learning Matrix Ortho-normal Bases


1
Image Compression by Learning Matrix Ortho-normal
Bases
  • Karthik Gurumoorthy
  • Ajit Rajwade
  • Arunava Banerjee
  • Anand Rangarajan
  • Department of CISE
  • University of Florida

2
Overview
  • A new approach to lossy image compression based
    on machine learning.
  • Key idea Learning of Matrix Ortho-normal Bases
    from training data to efficiently code images.
  • Applied to compression of well-known face
    databases like ORL, Yale.
  • Competitive with JPEG.

3
Background Images as Vectors
Vector


Image
Conventional learning methods in vision like PCA,
ICA, etc.
4
Background Images as Matrices
Treated as a

Image
Matrix
Our approach following Rangarajan EMMCVPR-2001
Ye JMLR-2004
5
Image Patches


Image
Image of size divided into N
patches of size each treated as a
Matrix.
6
SVD of a Patch
P
U
S
V

U and V Ortho-normal matrices
S Diagonal Matrix of singular values
7
Exponentially decreasing Singular Values
useful for compression (e.g. SSVD Ranade et
al-IVC 2007).
8
SVD for Ensemble of Patches?
  • Consider a set of N image patches
  • SVD of each patch gives
  • Costly in terms of storage as we need to store N
    ortho-normal basis pairs.

9
Common Orthonormal Basis-pairs?
  • Produce ortho-normal
    basis-pairs, common for all N patches.
  • Since storing the basis pairs
    is not expensive.

10
Away from SVD
  • Non-diagonal
  • Non-sparse

11
Away from SVD
  • What sparse matrix will optimally
    reconstruct from ?
  • Optimally least error
  • Sparse matrix has at most some
    non-zero elements.

12
Away from SVD
  • We have a simple, provably optimal greedy method
    to compute such a
  • Compute the matrix .
  • In matrix , nullify all except the
    largest elements to produce .

13
Learning algorithm
  • A set of N image patches
    .
  • Learning K ltlt N ortho-normal basis pairs

Projection Matrices
Memberships
14
Summary of Training Algorithm
  • Input N image patches of size .
  • Output K pairs of ortho-normal bases
  • called as dictionary.


15
Testing phase
  • Divide each test image into patches of size
  • Fix per-pixel average error (say e), similar to
    the quality user-parameter in JPEG.

16
Testing phase
. . .
. . .
. . .
17
Results ROC Curve (ORL Database)
RPP number of bits per pixel
18
Sample Reconstructions
0.92 bits
1.36 bits
0.5 bits
1.78 bits
3.023 bits
19
Results ORL Database
  • Size of original database is 3.46 MB.
  • Size of dictionary of 50 ortho-normal basis pairs
    is 56 KB0.05MB.
  • Size of database after compression and coding
    with our method with e 0.0001 is 1.3 MB.
  • Total compression rate achieved is 61.

20
Results ROC Curve (Yale Database)
RPP number of bits per pixel
21
Conclusions
  • New lossy image compression method using machine
    learning.
  • Key idea 1 matrix based image representation.
  • Key idea2 Learning small set of matrix
    ortho-normal basis pairs tuned to a database.
  • Results competitive with JPEG standard.
  • Future extensions video compression.

22
References
  • A. Rangarajan, Learning matrix space image
    representations, Energy Minimizing Methods in
    Computer Vision and Pattern Recognition, 2001.
  • J. Ye, Generalized low rank approximation of
    matrices, Journal of Machine Learning Research
    ,2004.
  • M. Aharon, M. Elad and A. Bruckstein, The K-SVD
    An algorithm for designing of overcomplete
    dictionaries for sparse representation. IEEE
    Transactions on Signal Processing, 2006.
  • A. Ranade, S. Mahabalarao and S. Kale. A
    variation on SVD based image compression. Image
    and Vision Computing, 2007.

23
Questions
???
Write a Comment
User Comments (0)
About PowerShow.com