Title: OSA 98 talk
1Nonlinear visual coding from an
intrinsic-geometry perspective
E. Barth A. B. Watson NASA Ames Research
Center http//vision.arc.nasa.gov
Supported by DFG grant Ba 1176/4-1 to EB and
NASA grant 199-06-12-39 to ABW
2Intrinsic dimensionality in 2D
- i0D constant in all directions
- i1D constant in one direction
- i2D no constant direction
3i0D
FT
4i1D
FT
- e.g. straight lines and edges, gratings
5i2D
FT
- e.g. corners, line ends, curved edges and lines
6i0D
i1D
i2D
7(No Transcript)
8i0D
i2D
i1D
9Intrinsic dimensionality in 3D
- i0D constant in all (space-time) directions
- i1D constant in 2 directions
- i2D constant in one direction
- i3D no constant direction
10i0D
FT
11i1D
FT
- e.g. drifting spatial grating
12i2D
FT
- e.g. drifting corner, flashed grating
13i3D
FT
- e.g. flow discontinuities, flashed corners
14Intrinsic dimensionality and motion
- FT of (rigid) motion signal is in a plane
15The visual input as a hypersurface
hypersurface
Visualization of surfaces is easier
16Geometric view on intrinsic dimensionality
17Curvature and motion
- (plane more than line but no volume)
18The Riemann tensor R
- most important property of (hyper)surfaces
- measures the curvature of the (hyper)surface
- has 6 independent components in 3D
- vanishes in 1D.
19The Riemann tensor components
are nonlinear combinations of derivatives, i.e.,
of linear filters with various spatio-temporal
orientations.
20R components and speed v
21R and direction of motion q
Multiple, distributed representation of direction.
22Sectional curvatures
23Direction tunings of R components
vertical motion
horizontal motion
24Barber pole
Wallach, 1935
25abolished illusion
Kooi, 1993
26Orthogonal orientation and direction tunings
Analytical predictions based on R components
Typical Type II MT neuron, macaque monkey
Direction tuning
Rodman Albright, 1989
Orientation tuning
27Multiple motions
Analytical predictions based on R components
Typical MT neuron, macaque monkey
Recanzone, Wurtz, Schwarz, 1997
28Conclusion
- Hypothesis that a basic (geometric) signal
property (the intrinsic dimensionality) is
encoded in early- and mid-level vision explains - orientation selectivity (derivatives, and R2, R3)
- endstopping
- (all R components are endstopped for
translations) - velocity selectivity
- direction selectivity
- some global-motion percepts (by integration)
- some properties reported for MT neurons.
(Reference to 3D world of moving objects is not
needed.)