How do we know that we solved vision?

About This Presentation

Title:

How do we know that we solved vision?

Description:

Famous Tsukuba Image. Issue 1. We might be testing too soon... Need to ... Detailed segmented/labeled, all the scenes in life. Semantic meaning in image/video ... – PowerPoint PPT presentation

Number of Views:79

Avg rating:3.0/5.0

Slides: 22

Provided by: efr5

Learn more at: http://www.cs.cmu.edu

Category:

more less

Transcript and Presenter's Notes

Title: How do we know that we solved vision?

1
How do we know that we solved vision?
16-721 Learning-Based Methods in Vision A.
Efros, CMU, Spring 2009
2
Columbia Object Image Library (COIL-100) (1996)
3
Corel Dataset
4
Yu Shi, 2004
5
(No Transcript)
6
Average Caltech categories (Torralba)
7
Flickr.com
8
Flickr Paris
9
Real Paris
10
Automated Data Collection
Kang, Efros, Hebert, Kanade, 2009
11
Something More Objective?
Famous Tsukuba Image
Middlebury Stereo Dataset
12
Issue 1

We might be testing too soon
Need to evaluate the entire system
Give it enough data
Ground it in the physical world
Allow it to affect / manipulate its environment
Do we need to solve Hard AI?
Maybe not. We dont need Human Vision per se
how about Rat Vision?

13
Issue 2

We might be looking for magic where none exist

14
Valentino Braitenberg, Vehicles Source Material
http//www.bcp.psych.ualberta.ca/mike/ Pearl_Stre
et/Margin/Vehicles/index.html Introduces a
series of (hypothetical) simple robots that
seem, to the outside observer, to exhibit complex
behavior. The complex behavior does not come
from a complex brain, but from a simple agent
interacting with a rich environment. Vehicle 1
Getting around A single sensor is attached to a
single motor. Propulsion of the motor is
proportional to the signal detected by the
sensor. The vehicle will always move in a
straight line, slowing down in the cold, speeding
up in the warm. Braitenberg Imagine, now,
what you would think if you saw such a vehicle
swimming around in a pond. It is restless, you
would say, and does not like warm water. But it
is quite stupid, since it is not able to turn
back to the nice cold sport it overshot in its
restless ness. Anyway, you would say, it is
ALIVE, since you have never seen a particle of
dead matter move around quite like that.
15
More complex vehicles
16
Moral of the Story

Law of Uphill Analysis and Downhill Invention
machines are easy to understand if youre
creating them much harder to understand from
the outside.
Psychological consequence if we dont know the
internal structure of a machine, we tend to
overestimate its complexity.

17
Turing Tests for Vision

Your thoughts

18
Have we solved vision if we solve all the
boundary cases?
Varum
19
Computer Vision Database Zhaoyin Jia

Object segmentation/recognition
Detailed segmented/labeled, all the scenes in
life.
Semantic meaning in image/video
Human understanding of the image/story behind
the image
Feeling/reaction after understanding

During the Spring break
Before the deadline
Failed in 16721
Best project in 16721
Love
Kiss
In the class
Cute Adorable Safe
More threatened Run faster Need more help
Threatened Run Call for help
20
How do we know that we solved vision?
Yuandong Tian

General Rule Turing test
If CVS HVS in
Training Performance Speed Failure
case
Then We declare vision is solved. Beers and
Being laid off.
Verifiable Specific Rules
Challenges in Training
Full-automatic object Discovery Categorization
from unlabeled, long video sequence.
Multi-view robust real-time Recognition of ten
of thousands of objects, given few trainings of
each object.
Challenges in Performance
Pixel-wise Localization and Registration in
cluttered and degraded scene
Long-term real-time robust Tracking for generic
objects in cluttered and degraded video sequence.
Human failure human vision illusion
Able to explain human vision illusions, and
Reproduce them.
Conclusion
Good luck for all!