Bag of Words: recognition using texture - PowerPoint PPT Presentation

About This Presentation

Title:

Bag of Words: recognition using texture

Description:

A quiet meditation on the importance. of trying simple things first... Object. Bag of words' ... Of all the sensory impressions proceeding to the brain, the ... – PowerPoint PPT presentation

Number of Views:52

Avg rating:3.0/5.0

Slides: 37

Provided by: robf160

Learn more at: http://www.cs.cmu.edu

Category:

more less

Transcript and Presenter's Notes

Title: Bag of Words: recognition using texture

1
Bag of Words recognition using texture
A quiet meditation on the importance of trying
simple things first
16-721 Advanced Machine Perception A. Efros,
CMU, Spring 2006
Adopted from Fei-Fei Li, with some slides from
L.W. Renninger
2
(No Transcript)
3
Analogy to documents
Of all the sensory impressions proceeding to the
brain, the visual experiences are the dominant
ones. Our perception of the world around us is
based essentially on the messages that reach the
brain from our eyes. For a long time it was
thought that the retinal image was transmitted
point by point to visual centers in the brain
the cerebral cortex was a movie screen, so to
speak, upon which the image in the eye was
projected. Through the discoveries of Hubel and
Wiesel we now know that behind the origin of the
visual perception in the brain there is a
considerably more complicated course of events.
By following the visual impulses along their path
to the various cell layers of the optical cortex,
Hubel and Wiesel have been able to demonstrate
that the message about the image falling on the
retina undergoes a step-wise analysis in a system
of nerve cells stored in columns. In this system
each cell has its specific function and is
responsible for a specific detail in the pattern
of the retinal image.
4
(No Transcript)
5
(No Transcript)
6
1.Feature detection and representation
7
Feature detection

Sliding Window
Leung et al, 1999
Viola et al, 1999
Renninger et al 2002

8
Feature detection

Sliding Window
Leung et al, 1999
Viola et al, 1999
Renninger et al 2002
Regular grid
Vogel et al. 2003
Fei-Fei et al. 2005

9
Feature detection

Sliding Window
Leung et al, 1999
Viola et al, 1999
Renninger et al 2002
Regular grid
Vogel et al. 2003
Fei-Fei et al. 2005
Interest point detector
Csurka et al. 2004
Fei-Fei et al. 2005
Sivic et al. 2005

10
Feature detection

Sliding Window
Leung et al, 1999
Viola et al, 1999
Renninger et al 2002
Regular grid
Vogel et al. 2003
Fei-Fei et al. 2005
Interest point detector
Csurka et al. 2004
Fei-Fei et al. 2005
Sivic et al. 2005
Other methods
Random sampling (Ullman et al. 2002)
Segmentation based patches (Barnard et al. 2003

11
Feature Representation

Visual words, aka textons, aka keypoints
K-means clustered pieces of the image
Various Representations
Filter bank responses
Image Patches
SIFT descriptors
All encode more-or-less the same thing

12
Interest Point Features
Compute SIFT descriptor Lowe99
Normalize patch
Detect patches Mikojaczyk and Schmid 02 Matas
et al. 02 Sivic et al. 03
Slide credit Josef Sivic
13
Interest Point Features
14
Patch Features
15
dictionary formation
16
Clustering (usually k-means)
Vector quantization
Slide credit Josef Sivic
17
Clustered Image Patches
Fei-Fei et al. 2005
18
Filterbank
19
Textons (Malik et al, IJCV 2001)

K-means on vectors of filter responses

20
Textons (cont.)
21
Image patch examples of codewords
Sivic et al. 2005
22
Visual synonyms and polysemy
Visual Polysemy. Single visual word occurring on
different (but locally similar) parts on
different object categories.
Visual Synonyms. Two different visual words
representing a similar part of an object (wheel
of a motorbike).
23
Image representation
frequency
codewords
24
Scene Classification (Renninger Malik)
25
kNN Texton Matching
26
Discrimination of Basic Categories
27
Discrimination of Basic Categories
chance
28
Discrimination of Basic Categories
chance
29
Discrimination of Basic Categories
chance
30
Discrimination of Basic Categories
chance
31
Discrimination of Basic Categories
chance
32
Object Recognition using texture
33
Learn texture model