Title: General architecture
1 General architecture
2 Minimal Subscene
- Working definition: the smallest set of objects, actors and actions in a dynamic visual scene that are relevant to present behavior
- For now we will assume:
- Bottom-up: objects/actors/actions must be visible
- Top-down: relevance to present behavior explicitly specified, e.g., by specifying a question or task
- Knowledge base: the system may supplement explicit knowledge with long-term acquired knowledge
3 Motivation: Humans
- 1) Free examination
- 2) Estimate material circumstances of family
- 3) Give ages of the people
- 4) Surmise what family has been doing before arrival of unexpected visitor
- 5) Remember clothes worn by the people
- 6) Remember position of people and objects
- 7) Estimate how long the unexpected visitor has been away from family
Yarbus, 1967
4 Beobot
5 Visual Attention
6 Object Recognition
Riesenhuber & Poggio, Nat Neurosci, 1999 (MIT)
7 Action Recognition
Oztop & Arbib, 2001
8
- Start
- Issue question
- Parse question
- Extract keywords
- Expand to related concepts, using ontology/KB
- Fill initial task list
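The steps on this slide can be sketched in a few lines of Python. This is only an illustrative sketch, not the system's actual parser: the `ONTOLOGY` dictionary, the `STOPWORDS` set, and all function names are hypothetical stand-ins for whatever ontology/KB and question parser are really used.

```python
# Hypothetical toy ontology: each concept maps to a list of related concepts.
ONTOLOGY = {
    "hammer": ["tool", "hand", "nail"],
    "catch": ["ball", "hand", "throw"],
}

# Hypothetical stopword list; a real parser would be far more elaborate.
STOPWORDS = {"who", "what", "is", "the", "a", "an", "to", "with", "whom"}


def extract_keywords(question):
    """Parse the question into lowercase words and drop stopwords."""
    words = [w.strip("?.,!").lower() for w in question.split()]
    return [w for w in words if w and w not in STOPWORDS]


def expand(keywords, ontology):
    """Add the concepts the ontology relates to each keyword, without duplicates."""
    expanded = []
    for kw in keywords:
        for concept in [kw] + ontology.get(kw, []):
            if concept not in expanded:
                expanded.append(concept)
    return expanded


def initial_task_list(question, ontology=ONTOLOGY):
    """Issue question -> parse -> extract keywords -> expand -> initial task list."""
    return expand(extract_keywords(question), ontology)
```

For example, `initial_task_list("Who is holding the hammer?")` keeps the keywords `holding` and `hammer`, then adds the concepts this toy ontology relates to `hammer`.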
9 Task list
- Working list of currently relevant objects/actors/actions
- Initially empty
- Question/task specification provides initial filling-in
- As the scene is scanned and objects/actors/actions are recognized, contents of task list are updated
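A minimal sketch of the task list life cycle described above, under stated assumptions. The slide does not say how the list is updated, so the rule used here (a recognized entity is added when it, or something related to it, is already on the list) is an assumption, and the `TaskList` class and its method names are hypothetical.

```python
class TaskList:
    """Working list of currently relevant objects/actors/actions."""

    def __init__(self):
        # The task list is initially empty.
        self.items = set()

    def fill_from_task(self, concepts):
        """Initial filling-in from the question/task specification."""
        self.items.update(concepts)

    def update(self, recognized, related=()):
        """Update the list after something has been recognized in the scene.

        Assumed rule: the recognized entity is relevant if it, or any
        concept related to it, is already on the list. Returns True if
        the list changed.
        """
        relevant = recognized in self.items or any(r in self.items for r in related)
        if not relevant:
            return False
        before = len(self.items)
        self.items.add(recognized)
        self.items.update(related)
        return len(self.items) != before
```

With this rule, recognizing a nail while `hammer` is on the list pulls `nail` in, whereas an unrelated recognized object leaves the list unchanged.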
10 Where: attention, saliency map and task map
- Input: video stream
- Low-level vision: massively parallel extraction of simple visual features from video input
- Saliency map: localizes conspicuous (potentially interesting) objects irrespective of why they are salient
- Task map: acts as a spatial filter on the saliency map; only locations in the current minimal subscene can easily pass through. Other locations need to be exceptionally salient to pass through.
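One simple way to picture the task map acting as a spatial filter is a pointwise gain on the saliency map. The sketch below is an assumption, not the model's actual combination rule: locations inside the minimal subscene (task map value 1) pass unchanged, while other locations are attenuated by a hypothetical `leak` factor, so they win only when exceptionally salient. Maps are plain nested lists to keep the example self-contained.

```python
def filter_saliency(saliency, task_map, leak=0.2):
    """Gate the saliency map with the task map, location by location.

    task_map holds 1 for locations in the current minimal subscene and 0
    elsewhere. `leak` (hypothetical parameter) is the fraction of saliency
    that task-irrelevant locations are allowed to keep.
    """
    return [
        [s * (t + (1 - t) * leak) for s, t in zip(s_row, t_row)]
        for s_row, t_row in zip(saliency, task_map)
    ]


def most_active_location(filtered):
    """Return the (row, col) of the strongest location after filtering."""
    cells = [(r, c) for r, row in enumerate(filtered) for c in range(len(row))]
    return max(cells, key=lambda rc: filtered[rc[0]][rc[1]])
```

With `leak=0.2`, a task-irrelevant location has to be more than five times as salient as the best task-relevant one before it captures attention.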
11 What: memory
- Relates concepts to visual properties
- Bridge between visual and semantic knowledge
12 General architecture
13 Examples / experiments
- Examine video clips
- For each scene, please write down
- Most salient object
- Most salient action
- Minimal subscene
- Who is doing what to whom
14 Scene 001
15 Scene 001 Attentional Trajectory
16 Scene 002
17 Scene 002 Attentional Trajectory
18 Scene 003
19 Scene 003 Attentional Trajectory
20 Scene 004
21 Scene 004 Attentional Trajectory
22 Scene 005
23 Scene 005 Attentional Trajectory
24 Scene 006
25 Scene 006 Attentional Trajectory
26 Scene 007
27 Scene 007 Attentional Trajectory
28 Scene 008
29 Scene 008 Attentional Trajectory
30 Scene 009
31 Scene 009 Attentional Trajectory
32 Scene 010
33 Scene 010 Attentional Trajectory
34 Scene 011
35 Scene 011 Attentional Trajectory
36 Scene 012
37 Scene 012 Attentional Trajectory
38 Scene 013
39 Scene 013 Attentional Trajectory
40 Scene 014
41 Scene 014 Attentional Trajectory
42 Scene 015
43 Scene 015 Attentional Trajectory
44 Scene 016
45 Scene 016 Attentional Trajectory
46 Scene 017
47 Scene 017 Attentional Trajectory