Title: Shape Context Indexing Methods
1. Lecture 27: Shape Context Indexing Methods
CSE 4392/6367 Computer Vision, Spring 2009
Vassilis Athitsos, University of Texas at Arlington
2. Beyond Color Histograms
- Sometimes, shape information is important.
3. Shape Context
- Choose radii r1, r2, ..., rb.
- Choose s, the number of sectors.
- Create a template consisting of rings and sectors, as shown in the image.
- Give a number to each sector of each ring.
- For each edge pixel:
  - Center the template on the pixel.
  - For each sector of each ring, count the number of edge pixels in that sector.
- Result: each point is mapped to s × b numbers (see the sketch below).
source: Wikipedia
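A minimal sketch of this histogram computation in Python/NumPy; the function name and the assumption that edge pixels come as an (N, 2) array of (x, y) coordinates are mine rather than the lecture's:

```python
import numpy as np

def shape_context(edge_points, center, radii, num_sectors):
    """One (ring, sector) histogram: b rings bounded by `radii`, s sectors."""
    diffs = edge_points - center
    dists = np.hypot(diffs[:, 0], diffs[:, 1])
    angles = np.arctan2(diffs[:, 1], diffs[:, 0])          # in [-pi, pi]
    ring = np.searchsorted(radii, dists)                   # which ring interval
    sector = ((angles + np.pi) / (2 * np.pi) * num_sectors).astype(int)
    sector = np.clip(sector, 0, num_sectors - 1)
    hist = np.zeros((len(radii), num_sectors), dtype=int)
    inside = ring < len(radii)                             # drop points beyond rb
    np.add.at(hist, (ring[inside], sector[inside]), 1)
    return hist.ravel()                                    # s x b numbers
```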
4. Shape Representation
- Pick T points from each shape, uniformly sampled.
- Extract the shape context vector for each point.
- Then, each shape is represented as a matrix of size T × k, where:
  - T: number of points we pick from each shape.
  - k = s × b.
  - s: number of sectors in each ring.
  - b: number of rings.
source: Wikipedia
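A sketch of building this T × k matrix on top of the shape_context function above; taking every (len/T)-th point assumes the edge points are ordered along the contour, which the slides do not specify:

```python
import numpy as np

def shape_representation(edge_points, T, radii, num_sectors):
    # Uniformly sample T of the edge points (assumes contour ordering).
    idx = np.linspace(0, len(edge_points) - 1, T).astype(int)
    return np.stack([shape_context(edge_points, edge_points[i], radii, num_sectors)
                     for i in idx])                        # (T, k), k = s * b
```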
5-6. Shape Matching
- Each shape is mapped to a matrix of size T × k.
  - T: number of points we pick from each shape.
  - k = s × b (s: number of sectors in each ring; b: number of rings).
- What is the cost of matching two shapes?
- Simpler question: what is the cost of matching two shape contexts?
  - One answer: Euclidean or Manhattan distance.
  - Better answer: the chi-square distance. With g(k) and h(k) the k-th values of the two shape contexts:
    χ²(g, h) = ½ · Σ_k (g(k) − h(k))² / (g(k) + h(k))
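A one-function sketch of that chi-square distance; the epsilon guarding against empty bins is my addition:

```python
import numpy as np

def chi_square(g, h, eps=1e-10):
    g, h = np.asarray(g, float), np.asarray(h, float)
    return 0.5 * np.sum((g - h) ** 2 / (g + h + eps))
```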
7. Shape Matching
- Key problem: we do not know which point in one image corresponds to which point in the other image.
- Solution: find optimal 1-1 correspondences.
  - The cost of each correspondence is the matching cost of the shape contexts of the two corresponding points.
- This is a bipartite matching problem.
  - Solution: the Hungarian Algorithm (see the sketch below).
  - Complexity: cubic in the number of points.
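A sketch of the whole matching step, assuming the two shapes are given as (T, k) arrays of shape contexts; scipy's linear_sum_assignment solves the bipartite matching:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def shape_match_cost(A, B, eps=1e-10):
    # Pairwise chi-square cost between every point of A and every point of B.
    T = len(A)
    cost = np.array([[0.5 * np.sum((A[i] - B[j]) ** 2 / (A[i] + B[j] + eps))
                      for j in range(T)] for i in range(T)])
    rows, cols = linear_sum_assignment(cost)   # optimal 1-1 correspondences
    return cost[rows, cols].sum()              # total matching cost
```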
8. Shape Context Distance
- Proposed by Belongie et al. (2001).
- Error rate: 0.63%, with a database of 20,000 images.
- Uses bipartite matching (cubic complexity!).
- 22 minutes per object, heavily optimized.
9-12. Problem Definition
[Figure: a database of n objects and a query object q]
- Goal: find the k nearest neighbors of query q.
- Brute-force time is linear in:
  - n (the size of the database).
  - the time it takes to measure a single distance.
13. Applications
- Nearest neighbor classification.
- Similarity-based retrieval.
- Image/video databases.
- Biological databases.
- Time series.
- Web pages.
- Browsing music or movie catalogs.
[Figures: example query types - handshapes, letters/digits]
14-16. Expensive Distance Measures
- Comparing d-dimensional vectors is efficient: O(d) time.
- Comparing strings of length d with the edit distance is more expensive: O(d²) time.
  - Reason: alignment.
17-19. Matching Handwritten Digits
[Figures: example correspondences between handwritten digit shapes]
20. More Examples
- Chamfer Distance.
- Time series: Dynamic Time Warping.
- Edit Distance for strings and DNA.
- These measures are non-Euclidean, and sometimes non-metric.
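For concreteness, a sketch of the directed Chamfer distance between two edge-point sets (the symmetric variant averages both directions); the array shapes are my assumption:

```python
import numpy as np

def chamfer(A, B):
    # A: (m, 2) and B: (n, 2) point coordinates.
    d = np.linalg.norm(A[:, None, :] - B[None, :, :], axis=2)  # (m, n) pairwise
    return d.min(axis=1).mean()        # mean distance from each A point to B
```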
21-24. Embeddings
[Figure: an embedding F maps the database and a query q from the original space into R^d]
- Measure distances between vectors (typically much faster).
- Caveat: the embedding must preserve similarity structure.
25-27. Reference Object Embeddings
[Figure: a database with reference objects r1, r2, r3 and an object x]
- F(x) = (D(x, r1), D(x, r2), D(x, r3)).
28. Example: F(x) = (D(x, LA), D(x, Lincoln), D(x, Orlando))

F(Sacramento)    = ( 386, 1543, 2920)
F(Las Vegas)     = ( 262, 1232, 2405)
F(Oklahoma City) = (1345,  437, 1291)
F(Washington DC) = (2657, 1207,  853)
F(Jacksonville)  = (2422, 1344,  141)
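A sketch of a generic reference-object embedding; `distance` stands in for whatever expensive exact measure is in use (city driving distance here, shape context matching for images):

```python
import numpy as np

def embed(x, reference_objects, distance):
    """Map x to its vector of distances to the reference objects."""
    return np.array([distance(x, r) for r in reference_objects])
```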
29-32. How Do We Use It?
- Filter-and-refine retrieval:
  - Offline step: compute the embedding F of the entire database.
  - Given a query object q:
    - Embedding step: compute the distances from the query to the reference objects → F(q).
    - Filter step: find the top p matches of F(q) in vector space.
    - Refine step: measure the exact distance from q to the top p matches.
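Putting the three steps together in a sketch; it assumes the `embed` function above, an exact `distance`, and database vectors `db_vectors` computed offline:

```python
import numpy as np

def filter_and_refine(q, database, db_vectors, refs, distance, p, k):
    Fq = embed(q, refs, distance)                      # embedding step
    approx = np.linalg.norm(db_vectors - Fq, axis=1)   # filter: vector distances
    candidates = np.argsort(approx)[:p]                # top p in vector space
    exact = sorted((distance(q, database[i]), i) for i in candidates)
    return [i for _, i in exact[:k]]                   # refine: k best by exact cost
```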
33-36. Ideal Embedding Behavior
[Figure: F maps the original space X, containing query q and its nearest neighbor a, into R^d]
- For any query q, we want F(NN(q)) = NN(F(q)).
- For any database object b besides NN(q), we want F(q) closer to F(NN(q)) than to F(b).
37-38. Embeddings As Classifiers
- Consider triples (q, a, b) such that:
  - q is a query object,
  - a = NN(q),
  - b is a database object.
- Classification task: is q closer to a or to b?
- Any embedding F defines a classifier F(q, a, b):
  - F checks whether F(q) is closer to F(a) or to F(b).
39. Classifier Definition
- For triples (q, a, b) as above, given an embedding F: X → R^d:
  - F(q, a, b) = ‖F(q) − F(b)‖ − ‖F(q) − F(a)‖.
  - F(q, a, b) > 0 means q is closer to a.
  - F(q, a, b) < 0 means q is closer to b.
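As a sketch, with F returning NumPy vectors:

```python
import numpy as np

def triple_classifier(F, q, a, b):
    # Positive: the embedding judges q closer to a; negative: closer to b.
    Fq = F(q)
    return np.linalg.norm(Fq - F(b)) - np.linalg.norm(Fq - F(a))
```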
40-41. Key Observation
[Figure: F mapping the original space X into R^d]
- If the classifier F is perfect, then for every q, F(NN(q)) = NN(F(q)).
- If F(q) is closer to F(b) than to F(NN(q)), then the triple (q, a, b) is misclassified.
- The classification error on triples (q, NN(q), b) measures how well F preserves nearest neighbor structure.
42. Optimization Criterion
- Goal: construct an embedding F optimized for k-nearest neighbor retrieval.
- Method: maximize the accuracy of F on triples (q, a, b) of the following type:
  - q is any object.
  - a is a k-nearest neighbor of q in the database.
  - b is in the database, but NOT a k-nearest neighbor of q.
- If F is perfect on those triples, then F perfectly preserves k-nearest neighbors (a sketch of this accuracy measure follows).
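The criterion, written out as a sketch over a list of such triples (triple_classifier is the function sketched above):

```python
def triple_accuracy(F, triples):
    # Fraction of (q, a, b) triples on which the embedding ranks a before b.
    correct = sum(triple_classifier(F, q, a, b) > 0 for q, a, b in triples)
    return correct / len(triples)
```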
43. 1D Embeddings as Weak Classifiers
- 1D embeddings define weak classifiers.
- Better than a random classifier (50% error rate).
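A sketch of the 1D case: the embedding F(x) = D(x, r) defined by a single reference object r, plugged into the triple classifier:

```python
def weak_classifier(r, distance, q, a, b):
    # F(x) = D(x, r); the 1D version of the triple classifier.
    Fq, Fa, Fb = distance(q, r), distance(a, r), distance(b, r)
    return abs(Fq - Fb) - abs(Fq - Fa)
```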
44. [Figure: city example (Lincoln, Detroit, LA, Chicago, New York, Cleveland) illustrating 1D embeddings]
45-46. Results on Hand Dataset
- Database: 80,640 synthetic images of hands.
- Query set: 710 real images of hands.
- Chamfer distance: 112 seconds per query.
47. Results on MNIST Dataset
- MNIST: 60,000 database objects, 10,000 queries.
- Shape context (Belongie 2001):
  - 0.63% error, 20,000 distances, 22 minutes.
  - 0.54% error, 60,000 distances, 66 minutes.