Pyramids and Texture - PowerPoint PPT Presentation

1 / 54

About This Presentation

Title:

Pyramids and Texture

Description:

... phenomenon Wagon wheels rolling the wrong way in movies Checkerboards misrepresented in ray tracing Striped shirts look funny on color ... few s Typically ... – PowerPoint PPT presentation

Number of Views:87

Avg rating:3.0/5.0

Slides: 55

Provided by: csNccuEd1

Category:

more less

Transcript and Presenter's Notes

Title: Pyramids and Texture

1
Pyramids and Texture
2
Scaled representations

Big bars and little bars are both interesting
Spots and hands vs. stripes and hairs
Inefficient to detect big bars with big filters
And there is superfluous detail in the filter
kernel
Alternative
Apply filters of fixed size to images of
different sizes
Typically, a collection of images whose edge
length changes by a factor of 2 (or root 2)
This is a pyramid (or Gaussian pyramid) by visual
analogy

3
A bar in the big images is a hair on the zebras
nose in smaller images, a stripe in the
smallest, the animals nose
4
Aliasing

Cant shrink an image by taking every second
pixel
If we do, characteristic errors appear
In the next few slides
Typically, small phenomena look bigger fast
phenomena can look slower
Common phenomenon
Wagon wheels rolling the wrong way in movies
Checkerboards misrepresented in ray tracing
Striped shirts look funny on color television

5
(No Transcript)
6

Constructing a pyramid by taking every second
pixel leads to layers that badly misrepresent the
top layer
7
Open questions

What causes the tendency of differentiation to
emphasize noise?
In what precise respects are discrete images
different from continuous images?
How do we avoid aliasing?
General thread a language for fast changes
The Fourier Transform

8
The Fourier Transform

Represent function on a new basis
Think of functions as vectors, with many
components
We now apply a linear transformation to transform
the basis
dot product with each basis element
In the expression, u and v select the basis
element, so a function of x and y becomes a
function of u and v
basis elements have the form

vectorized image
transformed image
Fourier transform base, also possible Wavelets,
steerable pyramids, etc.
9

Fourier basis element
example, real part
Fu,v(x,y)
Fu,v(x,y)const. for (uxvy)const.
Vector (u,v)
Magnitude gives frequency
Direction gives orientation.

10
Here u and v are larger than in the previous
slide.
11
And larger still...
12
Phase and Magnitude

Fourier transform of a real function is complex
difficult to plot, visualize
instead, we can think of the phase and magnitude
of the transform
Phase is the phase of the complex transform
Magnitude is the magnitude of the complex
transform
Curious fact
all natural images have about the same magnitude
transform
hence, phase seems to matter, but magnitude
largely doesnt
Demonstration
Take two pictures, swap the phase transforms,
compute the inverse - what does the result look
like?

13
(No Transcript)
14
This is the magnitude transform of the cheetah pic
15
This is the phase transform of the cheetah pic
16
(No Transcript)
17
This is the magnitude transform of the zebra pic
18
This is the phase transform of the zebra pic
19
Reconstruction with zebra phase, cheetah magnitude
20
Reconstruction with cheetah phase, zebra magnitude
21
Smoothing as low-pass filtering

The message of the FT is that high frequencies
lead to trouble with sampling.
Solution suppress high frequencies before
sampling
multiply the FT of the signal with something that
suppresses high frequencies
or convolve with a low-pass filter
A filter whose FT is a box is bad, because the
filter kernel has infinite support
Common solution use a Gaussian
multiplying FT by Gaussian is equivalent to
convolving image with Gaussian.

22
Sampling without smoothing. Top row shows the
images, sampled at every second pixel to get the
next bottom row shows the magnitude spectrum of
these images.
23
Sampling with smoothing. Top row shows the
images. We get the next image by smoothing the
image with a Gaussian with sigma 1 pixel, then
sampling at every second pixel to get the next
bottom row shows the magnitude spectrum of these
images.
24
Sampling with smoothing. Top row shows the
images. We get the next image by smoothing the
image with a Gaussian with sigma 1.4 pixels,
then sampling at every second pixel to get the
next bottom row shows the magnitude spectrum of
these images.
25
Applications of scaled representations

Search for correspondence
look at coarse scales, then refine with finer
scales
Edge tracking
a good edge at a fine scale has parents at a
coarser scale
Control of detail and computational cost in
matching
e.g. finding stripes
terribly important in texture representation

26
Example CMU face detection
27
The Gaussian pyramid

Smooth with gaussians, because
a gaussiangaussiananother gaussian
Synthesis
smooth and sample
Analysis
take the top image
Gaussians are low pass filters, so representation
is redundant

28
(No Transcript)
29
http//web.mit.edu/persci/people/adelson/pub_pdfs/
pyramid83.pdf
30
Texture

Key issue representing texture
Texture based matching
little is known
Texture segmentation
key issue representing texture
Texture synthesis
useful also gives some insight into quality of
representation
Shape from texture
will skip discussion

31
Texture synthesis

Given example, generate texture sample
(that is large enough, satisfies constraints, )

32
Texture analysis

Compare is this the same stuff?

33
pre-attentive texture discrimination
34
pre-attentive texture discrimination
35
pre-attentive texture discrimination

same or not?

36
pre-attentive texture discrimination
37
pre-attentive texture discrimination

same or not?

38
(No Transcript)
39
Representing textures

Textures are made up of quite stylized
subelements, repeated in meaningful ways
Representation
find the subelements, and represent their
statistics
But what are the subelements, and how do we find
them?
recall normalized correlation
find subelements by applying filters, looking at
the magnitude of the response
What filters?
experience suggests spots and oriented bars at a
variety of different scales
details probably dont matter
What statistics?
within reason, the more the merrier.
At least, mean and standard deviation
better, various conditional histograms.

40
Spots and bars at a fine scale
41
Spots and bars at a coarser scale
42
Fine scale
How many filters and what orientations?
Coarse scale
43
Texture Similarity based on Response Statistics

Collect statistics of responses over an image or
subimage
Mean of squared response
Mean and variance of squared response
Euclidean distance between vectors of response
statistics for two images is measure of texture
similarity

44
Example 1 Squared response
45
Example 2 Mean and variance of squared response

Compute the mean and standard deviation of the
filter outputs over the window, and use these for
the feature vector. (Ma and Manjunath, 1996)

Decreasing response vector similarity
46
The Choice of Scale

One approach start with a small window and
increase the size of the window until an increase
does not cause a significant change.

47
Laplacian Pyramids as Band-Pass Filters

courtesy of Wolfram
from Forsyth Ponce
Each level is the difference of a more smoothed
and less smoothed image ! It contains the band
of frequencies in between
48
(No Transcript)
49
(No Transcript)
50
Oriented Pyramids

Laplacian pyramid direction sensitivity

from Forsyth Ponce
51
Oriented Pyramids
Reprinted from Shiftable MultiScale Transforms,
by Simoncelli et al., IEEE Transactions on
Information Theory, 1992.
52
Gabor Filters