Object Recognition Using Alignment - PowerPoint PPT Presentation

1 / 36

About This Presentation

Title:

Object Recognition Using Alignment

Description:

Suggestive of a single 'mental rotation' mechanism. Object recognition ... Mental rotation of 3D objects shapes. A single mental rotation mechanism ... – PowerPoint PPT presentation

Number of Views:102

Avg rating:3.0/5.0

Slides: 37

Provided by: Tri94

Category:

more less

Transcript and Presenter's Notes

Title: Object Recognition Using Alignment

1
Object Recognition Using Alignment

Brian J. Stankiewicz

2
Approaches to Human Object Recognition

Alignment Approach
Store image(s) in memory
Use image transformations to bring new view into
alignment with viewed image.

3
Approaches to Human Object Recognition

Alignment Approach

Template matching Failures
4
Approaches to Human Object Recognition

Alignment Approach

Many different exemplars of category of object.
How does one handle this type of variability?
5
Approaches to Human Object Recognition

Structural Description
Pre-process image before storing in memory
Decompose object into simple parts
Describe the objects shape in terms of their
parts
Parts are described using specific non-accidental
properties

6
Structural Descriptions

Objects are decomposed into parts.
Objects are described by specifying configuration
of parts and their relations.

7
Structural Descriptions

Each part is describe by specifying the values of
particular shape parameters.
Varying parameter varies the shape.

8
Structural Descriptions

Challenge.
How do you decompose image into objects and
objects into parts?
How do you determine the shape parameters of a
part given an image.
This topic will be covered next week in Biederman
and Biederman Cooper papers.

9
Today

Begin by investigating the effect of viewpoint on
object recognition.
Look for evidence of alignment approach
Shepard Metzler
Mental rotation of 3d shapes
Picture Plane and Depth rotations
Tarr Pinker
Mental rotation of 2d shapes
Picture plane rotation only
Multiple-Views Hypothesis

10
Shepard Metzler

Wanted to understand how humans recognize
different views of the same object.
Different images of same 3D shape can be produced
by manipulating viewpoint
Investigated the effect of depth and
picture-plane rotations.

11
Same/Different Paraidgm
12
Shepard Metzler Stimuli

Novel stimuli Not a lot of previous experience
Fairly difficult task
Cannot simply use simple features
Able to carefully control view information.

13
Shepard Metzler Procedure

Two images presented simultaneously
Images of identical or mirror reflected objects
Subjects indicated whether two images depicted
same object
Responded by pulling a lever
Record response times

14
Shepard Metzler Results

Response times increased linearly with
orientation
Suggests that subjects are mentally rotating
images to determine match.

RT To Same Responses
Angle of Rotation
15
Shepard Metzler Results

Reaction times increased linearly with depth
orientation
Suggests a similar mechanism

16
Shepard Metzler Results

Not only are both depth and picture-plane
rotations linearly increasing, but they have very
similar slopes.
Suggestive of a single mental rotation
mechanism.

17
Object recognition

Two fundamental approaches to human object
recognition
Alignment approaches
Object recognition through alignment process
Structural description approach
Decomposition of features included in an object
Describe the objects shape in terms of their
parts and relation among the parts.

18
What is alignment

Definition
A process that transform stored images to bring
new view into alignment with viewed image.
Why we need alignment?
We cannot recognize object exactly only by
template matching
Need for some process which transform input
images or data ? alignment

19
2 studies in alignment approaches

Shepard Metzler
Mental rotation of 3D objects shapes
A single mental rotation mechanism
Evidence same results from rotated depth and
picture-plane pairs.
Tarr Pinker
Multiple view hypothesis (?)

20
Tarr Pinker

Wanted to investigate mental rotation in more
detail
Two hypotheses
Single canonical image stored in memory and all
new images are aligned to that single
representation
Multiple-Views stored in memory.
Align new view to closest stored view

21
Tarr Pinker Method

Train subjects to recognize small set of novel,
letter-like objects.
Did a handedness task
Is the image the trained image (standard)or its
mirror reversal?

22
Tarr Pinker Stimuli

Novel, letter-like images.
Subjects trained on 3 of the images
Reduce stimuli specific effects

23
Tarr Pinker Procedure

Trained subjects on 4 different orientations
(0,45,-90,135)
Tested on trained and surprise orientations
Measured response times

24
Tarr Pinker Exp. 1 Results
Block 112 practice Block 13 practice surprise
25
Tarr Pinker Exp. 1 Results
Compute best fittingline to compute slope
Surprise orientations required degree to be
rotated 90? 45 ?- 135? 45 ?- 45? 45
?but 180? 90?
4 different orientation- images stored in
memory?
26
Tarr Pinker Exp. 1 Results
High slope much rotation single canonical
image
27
Tarr Pinker Exp. 1 Summary

Stimuli showed a similar result to previous
findings
Increased RT with disparate orientations from
training
Subjects showed improvement following training
Even after training, subjects were slower on
non-trained (intermediate) orientations

28
Tarr Pinker Exp. 2 Motivation

Demonstrated an improvement in recognition times
with training.
Not a demonstration of canonical or multiple
views.
Experiment 2, train on a few orientations and
test on multiple orientations.
See if there is evidence for rotating to the
nearest trained orientation.

29
Tarr Pinker Methods

Similar to Experiment 1
However, classification task rather than
handedness task.
Three objects Kip, Kef, Kor, and
distractors
Record response times

30
Tarr Pinker Exp. 2 Procedure

Train on 3 orientations
Test on multiple intervening orientations
Look for rotation functions to nearest trained
orientation

31
Tarr Pinker Exp. 2 Results
32
Tarr Pinker Exp. 2 Summary

Investigated whether subjects show a linearly
increasing RT to canonical view or closest
trained view.
Showed mixed evidence.
For 0 and 210 it appears that there is a dip in
the surrounding RTs
Suggests rotation to nearest orientation
For 105 no evidence of alignment.

33
Tarr Pinker Exp. 2 Results
34
Tarr Pinker Study 3