Scaling Content Based Image Retrieval Systems - PowerPoint PPT Presentation

1 / 7
About This Presentation
Title:

Scaling Content Based Image Retrieval Systems

Description:

Scaling Content Based Image Retrieval Systems. Christine Lo, Sushant Shankar, Arun Vijayvergiya ... Content Based Image Retrieval (CBIR) offers a way to ... – PowerPoint PPT presentation

Number of Views:68
Avg rating:3.0/5.0
Slides: 8
Provided by: sushant
Category:

less

Transcript and Presenter's Notes

Title: Scaling Content Based Image Retrieval Systems


1
Scaling Content Based Image Retrieval Systems
  • Christine Lo, Sushant Shankar, Arun Vijayvergiya
  • CS 267

2
Motivation
  • Finding an efficient way to search for images has
    been increasingly important, especially since
    image databases are growing at an unprecedented
    rate. For example, there are about 550,000 images
    uploaded to Facebook each second. 
  • Content Based Image Retrieval (CBIR) offers a way
    to classify images based on implicit criteria
    rather than user generated tags. This will make
    large image databases more organized and more
    searchable.
  • The bottleneck of the CBIR system is the
    classification algorithm. k-means is one of the
    classification methods we use for CBIR. We chose
    k-means because it is an unsupervised learning
    technique that will allow us to organize and
    classify unlabeled features.
  • Because of the size of image databases such as
    Flickr and Facebook, it is important to scale the
    classification algorithm to handle a large number
    of features. We accomplish this by parallelizing
    the k-means algorithm.

3
K-means Algorithm
  • General k-means Algorithm
  • Takes as input a list of vectors and separates
    them into k clusters.
  • Parallelization
  • We parallelize the k-means algorithm to minimize
    the computational bottleneck of the CBIR system.
    We compare two implementations of this, an OpenMP
    and an MPI implementation.

4
CBIR System
5
Results
6
Evaluation
7
Future Work
  • Auto-tune parameters for best results
  • Integrate clustering code with CBIR system at
    Berkeley PAR lab
  • Test on larger datasets such as Flickr and
    Facebook
Write a Comment
User Comments (0)
About PowerShow.com