Title: How do we know that we solved vision?
1How do we know that we solved vision?
16-721 Learning-Based Methods in Vision A.
Efros, CMU, Spring 2009
2Columbia Object Image Library (COIL-100) (1996)
3Corel Dataset
4Yu Shi, 2004
5(No Transcript)
6Average Caltech categories (Torralba)
7Flickr.com
8Flickr Paris
9Real Paris
10Automated Data Collection
Kang, Efros, Hebert, Kanade, 2009
11Something More Objective?
Famous Tsukuba Image
Middlebury Stereo Dataset
12Issue 1
- We might be testing too soon
- Need to evaluate the entire system
- Give it enough data
- Ground it in the physical world
- Allow it to affect / manipulate its environment
- Do we need to solve Hard AI?
- Maybe not. We dont need Human Vision per se
how about Rat Vision?
13Issue 2
- We might be looking for magic where none exist
14Valentino Braitenberg, Vehicles Source Material
http//www.bcp.psych.ualberta.ca/mike/ Pearl_Stre
et/Margin/Vehicles/index.html Introduces a
series of (hypothetical) simple robots that
seem, to the outside observer, to exhibit complex
behavior. The complex behavior does not come
from a complex brain, but from a simple agent
interacting with a rich environment. Vehicle 1
Getting around A single sensor is attached to a
single motor. Propulsion of the motor is
proportional to the signal detected by the
sensor. The vehicle will always move in a
straight line, slowing down in the cold, speeding
up in the warm. Braitenberg Imagine, now,
what you would think if you saw such a vehicle
swimming around in a pond. It is restless, you
would say, and does not like warm water. But it
is quite stupid, since it is not able to turn
back to the nice cold sport it overshot in its
restless ness. Anyway, you would say, it is
ALIVE, since you have never seen a particle of
dead matter move around quite like that.
15More complex vehicles
16Moral of the Story
- Law of Uphill Analysis and Downhill Invention
machines are easy to understand if youre
creating them much harder to understand from
the outside. - Psychological consequence if we dont know the
internal structure of a machine, we tend to
overestimate its complexity.
17Turing Tests for Vision
18Have we solved vision if we solve all the
boundary cases?
Varum
19Computer Vision Database Zhaoyin Jia
- Object segmentation/recognition
- Detailed segmented/labeled, all the scenes in
life. - Semantic meaning in image/video
- Human understanding of the image/story behind
the image - Feeling/reaction after understanding
During the Spring break
Before the deadline
Failed in 16721
Best project in 16721
Love
Kiss
In the class
Cute Adorable Safe
More threatened Run faster Need more help
Threatened Run Call for help
20How do we know that we solved vision?
Yuandong Tian
- General Rule Turing test
- If CVS HVS in
- Training Performance Speed Failure
case - Then We declare vision is solved. Beers and
Being laid off. - Verifiable Specific Rules
- Challenges in Training
- Full-automatic object Discovery Categorization
from unlabeled, long video sequence. - Multi-view robust real-time Recognition of ten
of thousands of objects, given few trainings of
each object. - Challenges in Performance
- Pixel-wise Localization and Registration in
cluttered and degraded scene - Long-term real-time robust Tracking for generic
objects in cluttered and degraded video sequence. - Human failure human vision illusion
- Able to explain human vision illusions, and
Reproduce them. - Conclusion
- Good luck for all!
16-721 Learning-based method in vision
21Turing Test for Vision
- From the blog
- No overall test. Vision is task-dependent. Do
one problem at a time. - Use Computer Graphics to generate tons of test
data - A well-executed Grand Challenge
- Genre Classification in Video
- The Ultimate Dataset (25-year-old grad student)
- Need to handle corner cases / illusions.
Dynamic range of difficulty. ? - Its all about committees, independent
evaluations, and releasing source code - Its hopeless