Title: Large Scale Discovery of Spatially Related Images
1Large Scale Discoveryof Spatially Related Images
- Ondrej Chum and Jirà Matas
- Center for Machine Perception
- Czech Technical University
- Prague
2Related Vision Problems
- Organize my holiday snapshots
- Schaffalitzky and Zisserman ECCV02
- Find images containing a given object
(window) - Sivic ICCV03, Nister CVPR06, Jegou CVPR07,
Philbin CVPR07, Chum ICCV07 - Find small object in a film
- Sivic and Zisserman CVPR04
- Match and reconstruct Saint Marco
- Snavely, Seitz and Szeliski SIGGRAPH06
This Work
- Find and match ALL spatially related images in a
large database, using only visual information,
i.e. not using (flicker) tags, EXIF info, GPS, .
3Visual Only Approach
- Large database (100 000 images in our
experiments) - Find spatially related clusters
- Fast method, even for sizes up to 250 images
- Probability of successful discovery of spatial
relation of images independent of database size
4Image Clustering and its Time Complexity
Standard Approach (using image retrieval) Quadrat
ic method in the size of database D -- O(D2) the
multiplicative constant at the quadratic term 1
quadratic even for small D
- Take each image in turn
- Use a image retrieval system to retrieve related
images - Compute connected components of the graph
- Proposed method
- Seed Generation hashingcharacterize images by
pseudo-random numbers stored in a hash table - time complexity equal to the sum of
variances of Poisson distributions - linear for database size D ¼ 250
-
- 2. Seed Growing retrieval
- complete the clusters only for cluster
members c ltlt D, complexity O(cD)
5Building on Two Methods
- Fast (low recall) seed generation based on
hashing - Thorough (high recall) seed growing based on
image retrieval
Chum, Philbin, Isard, and Zisserman Scalable
Near Identical Image and Shot Detection CIVR 2007
Chum, Philbin, Sivic, Isard, and Zisserman Total
Recall Automatic Query Expansion with a
Generative Feature Model for Object
Retrieval ICCV 2007
6Image Representation
Feature detector
SIFT descriptor Lowe04
Vector quantization
Visual vocabulary
Bag of words
Set of words
7Hypothesizing Seeds with min-Hash
- Spatially related images share visual words
- Problem Robustly estimate set overlap of high
dimensional sparse binary vectors in constant
time independent of the dimensionality (d¼105) - Set overlap probabilistically estimated via
min-Hash - Similar approach as LSH (locally sensitive
hashing)
Image similarity measured as a set overlap (using
min-Hash algorithm)
A1 n A2
A1
A2
A1 U A2
8min-Hash
- According to some (replicable) key select a small
number of non-zero elements - Similar vectors should have similar selected
elements - Key generate a random number (a hash) for each
dimension, choose nonzero element with minimal
value of the key
29 12 19
0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 0
0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1
26 3 26
0 0 1 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0
1 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 0
29 12 1
1 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 0
0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
35 27 7
0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 1 1 0 0 0
0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1
9Seed Generation Probability of Success
An image pair forms a seed if at least one of k
s-tuples of min-Hashes
agrees. Probability that an image pair is
retrieved is a function of the similarity
where s,k are user-controllable parameters of
the method s governs the size of the hashing
table k is number of hashing tables Successfully
retrieved pair of images at least one
collision in one of the tables (equivalent to
AND-OR)
10Probability of Retrieving an Image Pair
Images of the same object and unrelated images
Near duplicate Images
13.9 (sim 0,066)
100 (sim 0.746)
probability of retrieval
100 (sim 0.322)
8.9 (sim 0.057)
99.5 (sim 0,217)
5.1 (sim 0.047)
similarity (set overlap)
11Spatially Related Images
18.9 (sim 0,074)
5.1 (sim 0,047)
probability of retrieval (log scale)
8.9
10.7
7.2
9.8
similarity (set overlap)
13.9
5.1
8.9
16.3
13.9
12Seed Generation
5
4
6
4
7
10
94.00
85.73
68.88
P (no seed)
13Seed Generation
Resemblance to RANSAC Related image pair an
all inlier sample (there is no need to enumerate
them all, one hit is sufficient) Probability of
retrieving an image pair fraction of
inliers The number of related image pairs how
many times we can try
68.88
55.13
31.84
1.94
P (no seed)
14At Least One Seed in Cluster
Estimate of the probability of failure plot
against the size of the cluster assumption used
in this plot all images in the cluster are
related
P(no seed)
cluster size
15Growing the Seed
- Application of Total Recall
- Combining average query expansion and transitive
closure - 3D geometric constraint (not only affine
transformation) - Tighter geometric constraints (10 pixel threshold)
Average query expansion (from possibly multiple
coplanar structures)
backproject features
query
enhanced query
Transitive closure crawl
16Summary of the Method
Images
Rejected seed
Unknown structure
min-Hash seeds
Spatial verification
Query Expansion
x
Seed
Failed retrieval
Cluster skeleton
Missed cluster
17Experiment 1 Univ. of Kentucky Dataset
- Nister Stewenius
- 2550 clusters of size 4 very small clusters
- partial ground truth different cluster
share the same background - How many clusters have at least one seed?
46.9
CONTRAST DIFFERENT TASK If we were looking for
ALL results not ANY (seed) the standard retrieval
measure on this dataset would be only 1.63 out of
4
18Experimental Validation UKY dataset
In University of Kentucky dataset average
similarity slightly above 0.06
P(no seed)
cluster size
19Experimental Results on 100k Images
Images downloaded from FLICKR Includes 11 Oxford
Landmarks with manually labelled ground truth
All Soul's
Hertford
Ashmolean
Keble
Balliol
Magdalen
Bodleian
Pitt Rivers
Christ Church
Radcliffe Camera
Cornmarket
20Experimental Results on 100k Images
Settings scalable to millions images, also
finding small clusters
Settings scalable to billions images, only
finding larger clusters
Timing 17 min 13 sec 16 min 20 sec 0.019 sec
/ image
21Application Object Labelling
- Factorizing the clusters using multiple
constrains - Matches between images
- Weak geometric constraints (coplanarity,
disparity) - Photographers psychology tends to take
pictures of single objects
22(No Transcript)
23(No Transcript)
24Automatic 3D Reconstruction
25(No Transcript)
26Conclusions
- Novel method for fast clustering in large
collections - Combines fast low recall method (seed generation)
and thorough (total recall) method for seed
growing - Probability of finding a cluster rapidly
increases with its size and is independent of the
size of the database - Can be incrementally updated as the database
grows - Efficient 0.019 sec / image on a single PC
- Fully parallelizable
- A state of the art near duplicate detection comes
as a bonus (as a part of seed generation)
27Thank you!
Technical Report available http//cmp.felk.cvut.cz
/chum/papers/Chum-TR-08.pdf
Thanks to Daniel Martinec, Michal Perdoch, James
Philbin, Jakub Pokluda