A Young Mathematician

About This Presentation

Title:

A Young Mathematician

Description:

Title: SIAM Conf. Imaging Science, 2001 Author: Jackie Shen Last modified by: Jackie Shen Created Date: 6/25/2001 9:45:05 PM Document presentation format – PowerPoint PPT presentation

Number of Views:203

Avg rating:3.0/5.0

Slides: 75

Provided by: Jacki80

Category:

more less

Transcript and Presenter's Notes

Title: A Young Mathematician

1

A Young Mathematicians Reflection on
Vision
Illposedness Regularizations
Jackie (Jianhong) Shen
School of Mathematics
University of Minnesota, Minneapolis, MN 55455
Workshop on Regularization in Statistics
Banff International Research Station, Canada
September 6-11, 2003

2
Abstract

Vision, the perception of the 3-D world from its
2-D partial projections onto the left and right
retinas, is fundamentally an illposed inverse
problem. But after millions of years' of
evolution, human vision has become astonishingly
accurate and satisfactory. How could it have
become such a remarkable inverse problem solver,
and what are the hidden (or subconscious)
regularization techniques it employs?
This talk attempts to reflect and shed some
light on these billion-dollar or Nobel-level
questions (the works of several Nobel Laureates
will be mentioned indeed), based on the limited
but unique experience and philosophy of a young
mathematician. In particular, we discuss the
topological, geometric, statistical/Bayesian, and
psychological regularization techniques of visual
perception.
I owe deep appreciation to my former advisors,
mentors, and teachers leading me to the Door of
Vision Professors Gil Strang, Tony Chan, Stan
Osher, David Mumford, Stu Geman, and Dan Kersten.

3
Agenda

An Abstract View of Vision Imaging and
Perception
Illposedness of Visual Perception Root and
Solutions
Conscious or Subconscious Regularizations
Topological Regularization Generic Viewpoint
Principle
Geometric Regularization Perception Role of
Curvature
Statistical Regularization Gibbs Fields and
Learning
Psychological Regularization Role of Webers
Law.

4
Dedication

Dedicated to all pioneering mathematicians in
Mathematical Image and Vision Analysis (Miva),
on whose shoulders we the younger generations are
standing,
and on whose shoulders, we the younger must think
deeper, speak louder, and look further beyond
- Jackie Shen

An Abstract View of Vision
Passive Imaging
Active Visual Perception

6
Biological or Digital (Passive) Imaging Process
Lattice (or continuum) of photoreceptors
q viewing position angle
u 2-D image on the biological or digital retina
Passive Imaging Process
A 3-D world scene
Optical imaging process (human vision or digital
camera) can be modeled as a function (or
operator)
Here, 2 and 3 indicate the dimension of the
spatial variables. G and R denote the
configuration (scene geometry) and the
reflectance. All the as are parameters such as
I and q.
7
Active Visual Perception
Lattice (or continuum) of photoreceptors
q viewing position angle
u 2-D image on the biological or digital retina
Active Visual Perception
Perception is to reconstruct the 3-D world
(geometry, topology, material surface
properties, light source, etc.) from the
observed 2-D image
A 3-D world scene
8
A Quick Overview of Mathematicians Missions

Develop mathematical formulations for all the
important psychological and cognitive discoveries
in visual perception.
As in geometry and physics (Hamiltonian,
Statistical or Quantum Mechanics), unify and
extract the most fundamental laws or axioms of
perception, and develop their mathematical
foundations.
Decode the computational and information-theoreti
c efficiency behind human visual perception, and
integrate such knowledge into digital and
computational decision and optimization
algorithms.
Develop novel mathematical tools and theories
arising from such studies, and apply them to
other scientific and engineering fields, such as
data visualization/mining pattern
recognition/learning/coding multiscale modeling.

9
Is It Risky for a Young Mathematician to Study
Perception?

Quoting Albert Einstein
The object of all science, whether natural
science or psychology , is to co-ordinate our
experiences and to bring them into a logical
system.
- from Space and Time in Pre-Relativity
Physics, May, 1921.
To me, as for Special or General Relativities,
being logical necessarily means being
mathematical
Cause-effect modeling and description (e.g.,
learning theory)
Formulating basic psychological/perceptual laws
(e.g. Weber)
Simulating brain computation using computers
Verifying existing data, and furnishing
reasonable predictions.
etc.

Illposedness of
Visual Perception
Root of Illposedness
Solutions to Illposedness

11
Vision is an Inverse Computer Graphics Problem
Dan Kersten (1997)

3-D Computer Graphics (Hollywood animated
movies)
G Geometric configuration of a 3-D scene (bike,
table,)
R Material surface property reflectance field.
I ILLUMINANCE (incident lights, light source
and type).
Goal Generate a 2-D image U, which looks exactly
like the image that one sees if facing such a
real 3-D scene.
Visual Perception is an Inverse C.G. Problem
(Kersten)
Given a 2-D image U, one attempts to reconstruct
its 3-D world G, R, I, viewing position
angle, etc.

12
Vision is an illposed Inverse Problem (A-1)

A. Geometry is not invertible depth or range is
lost !
Mathematical Model (1) Projective Imaging
P (x1, x2, x3) ? (x1, x2) is not invertible!
For any given 2-D curve g2, there are infinitely
many 3-D curves g3, so that P(g3) g2 .
Example.

g3 (cos t, sin t, t )
g2 is just like the projection of a circle
13
Vision is an illposed Inverse Problem (A-2)

Mathematical Model (2) Perspective Imaging

a hyperbola
a parabola
an ellipse
Imaging plane (or retina)
All three different types of curves are imaged as
a 2-D circle !
14
Vision is an illposed Inverse Problem (B)

B. The Reflectance-Illuminance entanglement.

R
I
identical images !
different 3-D scenes
R
I
15
But vision still makes sense,

after millions of years evolution ,
which implies that
The human vision system is a well developed
system of software and hardware, which can solve
this highly ill-posed inverse problem efficiently
and robustly.
Fundamental Questions
What kind of regularization techniques the human
vision system employs to conquer the
illposedness?
What kind of features or variables to regularize ?

16
Deterministic Model Tikhonov Regularization

Tikhonov conditioning technique for inverse
problems
forward data generating U0 F(X).
backward (inverse) reconstruction of X
min d F(X), U0 R(X),
where d is a suitable discrepancy metric, and R
is the regularizing/conditioning term.
Example. (Cleaning and sharpening of astronomical
images)
U0 h X n . (h atmosphere blurring n
white noise)
Such an inverse problem is typically solved by

17
Tikhonov Meets Bayes Perception as Inference
Bayesian Perception X (geometry G,
reflectance R, ).
Gibbs energy formula Prob (y) 1/Z exp ( -
Ey / k T ). Thus, in terms of energies, we
have
which leads to Tikhonov Inversion !
Conclusion It is our a priori knowledge of the
world (i.e., Prob X ) that regularizes our
visual perception!
18
A Priori Knowledge (Common Sense) of the World
Regularizes Vision

G Knowledge of curve and shape geometry
I Knowledge of light sources illuminance
(sun, lamps, indoor or outdoor, )
R Knowledge of materials (wood, bricks, ) and
surface reflectance (metal shines and water
sparkles, )
Q Knowledge of the viewers (often standing
perpendicularly to the ground, viewing more
horizontally, several feet away for indoor
scenes, )

19
Example I Prior Knowledge on Curves
Boundaries

Straight lines length(g), or 1-D Hausdorff.
Eulers elastica (Mumford 1994, Chan-Kang-Shen
2002)
Piecewise elastica (Shah 2001)
which allows corners or hinges (i.e. the
discrete set S) along the curve.

20
Example II. Prior Knowledge on Material
Reflectance
Reflectance distribution R(x) in the visual field
W
W

The Mumford-Shah 1989 free-boundary model
Many challenging free-boundary problems.
Connections to interface motions (such as the
mean curvature motion).

curve energy
21
Deeper Question What to Learn How

How does the vision system subconsciously choose
what knowledge to learn and store, out of massive
visual data in daily life?
For such knowledge, to which degree of regularity
or compression that the vision system decides
to process, in order to achieve maximum
efficiency and robustness?
How to mathematically model (or quantify) such
activities?

Partial answer by Nobel Laureate (1972) Gerald
Edelman Neural Darwinism (Basic Books, Inc.,
1987) The vision system in each individual
operates as a selective system resembling
natural section in evolution, but operating by
different mechanisms.
22

Topological Regularization
Principle of Generic Viewpoint
vs.
Theorem of Transversality in
Differential Topology

23
Transversal Phenomenon
3-D world
projection
B
A
C
Digital or Biological 2-D Retina
and are transversal, in the sense of
differential topology and are not
tangent at any point
24
Transversal Is Generic Stability Universality

Suppose that (2, 3 are dimensions, and P is the
projection)
are transversal. Then for any perturbation
to , within an e
distance under a suitable smoothness norm,
are still transversal.
Theorem (Milnor) Let M and N be two smooth
manifolds, and
P N submanifold. Then any smooth mapping f M
? N can be
approximated arbitrarily closely by one which is
transversal to P.

Stability or Openness
Universality or Denseness
P
M
N
25
Principle of Generic Viewpoint in Vision

Generic Viewpoint (Nakayama Shimojo, Science
1992 Freeman, Nature 1994)
What we see is generic w.r.t. the viewpoint !
Mathematical Model
Suppose we see an image
Then
where denote small perturbations of
the as, controlled by e,
and D is a vision motivated measure, metric or
distance.
In terms of curves, D can be based on
invariants of differential topology
smoothness, corners, connected components,
branching,

26
How G.V. Principle Regularizes Perception 3
Examples

How the Generic Viewpoint Principle regularizes
visual perception (conditioning the
illposedness)
Set up preference/biases among all possible
scenes.
(Uniqueness) Lead to topologically unique
solutions.
(Stability) Lead to the stability of perceptual
solutions.

27
Example 1 Curve Perception
A pitchfork
Two curves

Given the image g , which 3-D scene (curve)
satisfies the G.V. Principle?
Answer G1. For G2, when the viewing angle q
is perturbed a little bit, the image would be
which is not diffeomorphic to g (since
branching degree d and number of connected
components c are invariants). Take the metric D,
for example,
D difference in d difference in c other
visual metric terms.

OR
28
Example 2 Shape Perception
A painted planar plate
a uniform V-wing

Given the image u, which 3-D scenario satisfies
the G. V. Principle?
Answer Planar Plate. If the viewer moves a
little bit (i.e. viewing angle q ?qe), the image
of the V-wing scene would become
The two 2-D shapes are NOT diffeomorphic since
corner is an invariant. (This is because if f is
a diffeomorphism, then the Jacobian Jf is
non-singular, and a corner can never be smoothed
out leading to a straight line.

29
Example 3 shape perception Simian or Human
Dan Kersten, Vision Psychology, UMN
u
v
Two human faces
Two simian faces
d
d0

For image u, we see two human faces. No doubt.
If we move the face contours closer till touching
(image v), do we still percept two human faces?
or two simians? Psychologists show that most of
us percept the latter. WHY? My explanation based
on G. V. Principle
In 3-D, give the two faces a slight distance
difference (from the viewer). Then the
perception of two human faces violates the G.V.
Principle, while the perception of two (100
transparent except along the outlines) simian
faces satisfies the G.V. Principle, or the
Stability Principle. Classical Interpretation
is based on Gestalt.

Geometric Regularization
Perception and Roles of
Curvature

31
Why Curvature (A) Architectural Evidence
St. Louis Arch, USA
Eiffel Tower, Paris
MITs Big Dome, Cambridge, USA

If human vision were blind to curvature,
architects would not have bothered to build these
beautiful structures ! (Disclaimer All
pictures are Internet downloads, not taken by the
author.)

32
Why Curvature (B) Anatomic Evidence

The work of Nobel Laureates (1981) David Hubel
Torstern Wiesel (1962)

Visual cortical surface
In my opinion, this provides the most intimate
evidence of visual processing of the curvature
information. WHY ? Because Curvature Spatial
Organization of Orientations k dq/ds !
Down to the interior
Quantized array of cortical visual neurons and
their oriented receptive fields
33
Why Curvature (C) Heuristic Mathematical
Argument

Taylor expansion for a curve
Question Which orders do matter to human visual
perception?
Answer Like Newtonian mechanics, only up to
the second order.
Heuristic Mathematical Argument
? First order matters since we can
detect rotation!
? Second order matters since we have a
strong sense of concavity/convexity/inflecti
on.
? Third order?

s
34
What Do All These Mean?

The proceeding evidences clearly demonstrate the
significance, feasibility, legitimacy of
curvature processing in visual perception. So
what? It implies to
Employ curvature as a deterministic regularizer
for solving numerous ill-posed inverse problems
in image and vision analysis ( Mumfords proposal
of Eulers elastica regularizer (1994), also
Masnou-Morel (1998, ICIP ), Chan-Kang-Shen,
(2002, SJAP ), Esedoglu-Shen (2002, EJAP ) )
Encode curvature in data structure and
organization (e.g., Candes-Donohos curvelets
(1999 2002, IEEE Trans. IP ))
Develop and study curvature-based mathematical
models (e.g., Chan-Shens CDD 3rd nonlinear PDE
model (2001, JVCIR ), Esedoglu-Shens proposal of
Mumford-Shah-Euler model (2002, EJAP ) Euclidean
and affine invariant scale spaces
(Alvarez-Guichard-Lions-Morel, 1993
Calabi-Olver-Tannenbaum, 1996) mean curvature
motions (Evans-Spruck, 1991)).

We focus on this point.
35
Perceptual Interpolation of Missing Boundaries
Occluding object
p
q
p
q

Problem Find a curve g (t), 0lttlt1, which passes
through the two endpoints p and q , and looks
natural.
Vision background Most objects in our material
world are not transparent. Occlusion is
universal. Human vision must be able to integrate
the dissected information (by suitable
interpolation), in order to successfully perceive
the world.
Why regularization is needed (1)
(non-uniqueness) otherwise too many curves to
choose from Brownian paths, say (2) to
regularize is to properly model being natural.

36
Second Order Polynomial Regularization
Modeling Being Natural

Let us try polynomial regularization
But which order is generically sufficient?
Count the constraints in 2-D, totally 22116
conditions.
Theorem. As long as , there exists a
unique parabolic interpolant in the form of

This solution echoes the earlier claim of second
order sufficiency
37
Second Order Geometric Regularization Eulers
Elastica
Mumford 1994

Variational approach under these constraints, to
minimize
called Eulers elastica. Energy was studied by
Euler (1744) to
model the steady shape of a thin and torsion
free elastica rod.
The equilibrium equation
which is an elliptic integral, and the solution
can be expressed by
elliptic functions (Mumford 1994).

38
Image Interpolation or Inpainting

Inpainting is an artistic way of saying Image
Interpolation (as first used in IP by Bertalmio
et al (SIGGRAPH, 2000))
Partial image information loss is very common
Occlusion caused by non-transparent objects
Data loss in wireless transmission
Cracks in ancient paintings due to pigment
aging/weather
Insufficient number of image acquisition sensors.
etc

Example I Occlusion
Example II Cracks
Read Shen (SIAM News, 36(5), Inpainting and
Fundamental Problem of IP, 2003)
39
Chan-Shens Inpainting Model via BV Regularizer
Chan-Shen (SIAM J. Appl. Math., 62(3), 2001)
Existence is guaranteed, but uniqueness is not.
The TV inpainting model
BV regularizer
least square (for uniform Gaussian noise)
The associated formal Euler-Lagrange equation on
W
Fidelity Index
with Neumann adiabatic condition along the
boundary of W.
Formally looks very similar to Rudin-Osher-Fatemi
s denoising model !
40
Chan-Shens Model for Inpainting Noisy Blurry
Images
Suppose KGt, is the Gaussian kernel. Then, the
model gives a good inverting of heat diffusion.
Without the BV regularization, backward diffusion
is notoriously ill-posed.
Chan-Shen (AMS Contemporary Math., 2002)
movie forever
41
The BV Regularizer is Insufficient for Inpainting
(Kanisza, Nitzberg-Mumford, Chan-Shen)
Long distance connection is too expensive for
the TV cost ! Cheaper to simply break it. We
need curvature!
42
Lifting Curve Regularity to Image Regularity

Using the level-sets of an image, we can lift a
curve model to an image model (formally
theoretical study by Bellettini, et al., 1992)
Connection to the mean curvature flow
(Evans-Spruck, 1991)

Curvature of level sets
43
Elastica Inpainting Model Curvature Incorporated
Chan-Kang-Shen (SJAP, 2002 also ref. to
Masnou-Morel, 1998 )

Theorem (associated E.-L. PDE).
The gradient descent flow is given by
where V is called the flux field , with proper
boundary conditions.

44
Elastica Inpainting Nonlinear Tranport CDD
Chan-Kang-Shen (SJAP, 2002 )

Transport along the isophotes (level sets)
Curvature driven diffusion (CDD) across the
isophotes
Conclusion
Elastica inpainting unifies the earlier work
of Bertalmio, Sapiro, Caselles, and Ballester
(SIGGRAPH, 2000) on transport based inpainting,
and that of Chan and Shen (JVCIR, 2001) on CDD
inpainting (motivated by human visual perception).

t
Tangential component
n
Normal component
level sets
45
Elastica Inpainting. I. Smoother Completion

Effect 1 as b/a increases,
connection becomes smoother.

Chan-Kang-Shen (SJAP, 2002 )
46
Elastica Inpainting. II. Long Distance is Cheaper

Effect 2 as b/a increases,
long distance connection gets cheaper.

For more theoretical and computational (4th order
nonlinear!) details, please see Chan-Kang-Shen
(SJAP, 63(2), 2002).
47
Inpainting Regularized by Mumford-Shah
Regulerizer
Chan-Shen (SJAP, 2000), Tsai-Yezzi-Willsky
(2001), Esedoglu-Shen (EJAP, 2002)

The Mumford-Shah (1989) image model was initially
designed for the segmentation application

Mumford-Shah based inpainting is to minimize
Inpainting domain
possible blurring
A free boundary optimization problem.
48
Mumford-Shah Inpainting Algorithm
Esedoglu-Shen (Europ. J. Appl. Math., 2002)

For the current guess of edge completion G, find
u to minimize

? equivalent to solving the elliptic equation
on W\G

This updated guess of u then guides the motion
of G

R-
R
Jump across G of the roughness measure
M. C. Motion
We can then benefit from the level-set
implementation by Chan-Vese.
49
M.-S. Inpainting via G-Convergence Approximation
Esedoglu-Shen (Europ. J. Appl. Math., 2002)
The G-convergence approximation of
Ambrosio-Tortorelli (1990)
z1
z0
Esedoglu-Shen shows that inpainting is the
perfect market for G-convergence
edge G is approximated by a signature function z.
50
Leading to Simple Elliptic Implementation
Esedoglu-Shen (2002)
The associated equilibrium PDEs are two coupled
elliptic equations for u and z, with Neuman
boundary conditions
which can be solved numerically by any efficient
elliptic solver.
51
Applications Disocclusion and Text Removal
Esedoglu-Shen (2002)
inpainted u
the edge signature z
inpainted u
Inpainting domain
52
Insufficiency of Mumford-Shah Regularity for
Inpainting
Defect I Artificial corners
Defect II Fail to realize the Connectivity Princi
ple, like BV.
53
Esedoglu-Shens Proposal Mumford-Shah-Euler
Esedoglu-Shen (2002)

Idea change the straight-line curve model
embedded in the
Mumford-Shah model to Eulers elastica
The G-convergence approximation (conjecture) of
De Giorgi (1991)

For the technical and computational details,
please see Esedoglu-Shen.
54
Inpainting Based on Mumford-Shah-Euler Regularity
Esedoglu-Shen (2002)

Issues
Very costly 4th order PDE
Many local minima
Without approximation, it is
difficult to implement geometry

55
Conclusion for Curvature Based Regularization

Curvature processing has its anatomical vision
foundation
Curvature processing has its cognitive/perceptual
foundation
Curvature based regularization is necessary for
Image analysis and coding
Image processing and modeling
Image computation algorithms and implementation
schemes
Curvature imposes both theoretical
computational challenges non-quadratic objective
functionals nonlinear 3rd and 4th order PDEs
insufficient theory for wellposedness (existence,
uniqueness, definition domains, etc.)

Statistical Regularization
Gibbs Fields and Learning

57
Stochastic View of Images Random Fields
Geman-Geman (1984), Grenander (1993), Zhu-Mumford
(1997)

An observed image u is treated as a randomly
drawn sample.
The sample space is an ensemble of images with
its own (often unknown) distribution.
An ensemble equipped with a probability
distribution m or p naturally leads to a random
field (R. F.) on the image domain.
Two distinguished features of such stochastic
view
We are not interested in the pixelwise details of
a particular image, rather, the key statistical
features of the R. F. associated.
The R. F. is assumed to be ergodic for a
typical sample image u, spatial statistics
converge to ensemble/field statistics.
In this view, image regularization is built into
the distribution m .

58
Stochastic View of Images Examples
(Shen-Jung, 2003)
(Shen-Jung, 2003)
Reaction-diffusion spots
Reaction-diffusion stripes
Wood texture (Internet download)
Crops (Internet download)
59
Stochastic Regularization Gibbs Fields
Geman-Geman (1984), Grenander (1993), Zhu-Mumford
(1997)

Stochastic regularity is encoded or specified by
the random field distribution p (u ) .
If p is a uniform distribution, then there is not
much tangible information associated to the
images, or in Shannons language, the information
content is the lowest (equivalently, no
regularity is in presence as the entropy reaches
highest).
In Grenanders (founder of Pattern Theory, Brown
U) language, textures are built from basic
building elements (atoms), and these atoms,
like molecules, are bound together by local
regularity energies, leading to informative
images.
Thus Gibbs Fields Model seems natural to
characterize such stochastic regularity Gibbs
Canonical Ensemble/Formula

regularity energy
visual temperature
partition function
60
Regularization Visual Potentials and Their Duals

What can a box of air (molecules) tell us
Microscopically never stops fluctuating (lacking
regularity)
Macroscopically there are a few key feature
potentials
Temperature T
Pressure p
Chemical potentials m
These potentials have their dual (additive)
variables
Energy E, dual to the inverse temperature b1/kT.
Volume V, dual to the pressure p.
Mole numbers N, dual to the chemical potential m.
Gibbs Generalized Ensemble Model says,
Prob (a micro state) 1/Z .exp(- b E. - b p
V. b m N. ).
Conclusion to apply Gibbs regularity in image
analysis, one needs to properly identify visually
meaningful potentials and their duals. (These
feature parameters will regularize images.)

61
Example of Gibbs Canonical Images Binary Ising
Images

Isings Model (1925, for ferromagnetic phase
transition originally)
Binary images on the Cartesian lattice Z2.
uu(i,j)1 or 1 (spin up or down).
In an external (biasing) magnetic field H , the
energy associated to each observed field u is
J internal energy of magnetic dipole (neighbor
pair)
a, b, c representing general pixels,
neighboring.
But generic (natural) images are not generated
from such clean physics. Challenge is to develop
good models for (generalized) energies, and
properly model short-range or long-range visual
interactions.

62
What Features to Regularize Visual Filters
Zhu-Mumford (1997)

Treat the human vision system as a system of
linear filters T (F1 , F2 , , FN ).
Each filter Fn is characterized by its special
capability in resolving a particular orientation
q with a particular spatial frequency w at a
particular scale s . That is, Fns are
parametrized by the feature vector ( q, w, s ).
Basic Assumption
To human vision, the random image fields are
completely describable (at least in satisfactory
approximation) by the filter response Tu. All
other features are blindly filtered out, and
treated visually insignificant. This is the
hidden rule of visual regularization!!
Justification (from Statistical Mechanics)
Though the dimension of the phase space of a box
of air molecules is huge, in equilibrium such a
system can be accurately described by three
feature parameters temperature, pressure, and
volume.

63
From Visual Filters to Potentials Maximum
Entropy Learning of Zhu-Mumford
Zhu-Mumford (1997)

Following the proceeding basic assumption, a
Gibbs image model would be ONLY based on the
statistics of the filter outputs Tu ( F1
u, F2 u, , FN u ).
But how exactly? Zhu-Mumfords MEL Model/Scheme
Each filter output vn Fn u is by itself a
random field.
In the ideal case, suppose we do know the random
field distributions qn ( vn ), n 1N. Then,
these can be used as a set of constraints on the
original Gibbs image u. That is, pp(u) should
lead to Prob( vn ) qn ( vn ), n 1N.
Under these set of constraints, one can use Gibbs
variational formulation of Statistical Mechanics
to find the unique Gibbs field that maximizes the
entropy, which necessarily takes the form of
In reality, the spatial structure of vn is often
ignored and each is treated as a field of
i.i.d.s. Thus the joint p.d.f qn is a direct
tensor product, and is replaced by its 1-D
histogram.

Diracs delta
64
Learning of Regularization Will Be Momentous

A brief conclusion
Statistical regularization is often achieved
through visually meaningful filters and filtering
processes.
The Gibbs field model can be learned based on the
empirical statistics (e.g. histograms) of these
filter outputs of a typical image sample, and the
ergodicity assumption is thus crucial in such
learning processes.
Maximum Entropy based Learning Processes (Melp)
will play an increasingly important role in
vision and pattern analysis.

Psychological Regularization
Role of Webers Law A Case Study

66
Webers Law
Shen (Physica D, 175, 2003)

Webers Law in Psychology
Let u denote the mean field of a background
(sound/light), and du the intensity increment
just detectable by human perception (ears/eyes)
or the so-called JND just-noticeable-difference
. Then du/u constant.
First qualitatively described by German
physiologist E. H. Weber in 1834 Later
formulated quantitatively by the great
experimental psychologist G. T. Fechner in 1858.
Search your own experience for validating Webers
Law
In a fully packed stadium high bgsound u gt have
to cry loud to be effectively heard by other
folks (i.e., du has to be high as well)
???? (An ancient Chinese idiom) translated to
In a night with a bright full moon, the stars
always look scarce.

67
Is Webers Law Psychological or Physiological ?
James Keener (Math. Physiology, 1998)

Jackie Shens Theorem Any commonly shared
psychological phenomenon (among most of the
6,000,000,000 people on this planet) has to be
physiological.
Webers Law expresses the light adaptive
capability of the frontal end of the entire
vision system the two retinas.
Without Webers Law, the retinas would not be
able to operate over a wide range of light
intensities from several photons to bright solar
light, since the membrane potential V of a
neuron cell has a saturated maximum value.
Webers Law is the result of a feedback mechanism
(Tranchina-Perskin, 1988) of the retina system,
which is implemented by the biochemical
physiology (ion channels) of the photoreceptors
(McNaughton, 1990).

68
Webers Law Regularization Can Respect Real
Perception
Shen (Physica D, vol. 175, 2003)

What does Webers Law have to do with
regularization?
Before Shen (2003), all variational image
regularizers in image and vision analysis are
defined by either the Sobolev norm
as in the Linear Filtering Theory, and the
Mumford-Shah model (1989), or the Total Variation
(TV) Radon measure
as in the Rudin-Osher-Fatemi model (Physica D,
1992).
The fundamental assumption of such regularizers
is that human visual sensitivity to small
fluctuations (or irregularities) depends on
nothing else but themselves. But this is
inappropriate according to Webers Law !

69
Weberized Regularization and Applications
Shen (Physica D, vol. 175, 2003)

Thus Shen (2003) proposed to Weberize (a word
conveniently coined) the conventional Sobolev or
TV regularizers to
or,
For example, the Weberized TV denoising and
deblurring model (for additive Gaussian noise)
would be to minimize the energy
where the light intensity field (or image) u is
non-negative.

70
Existence and Uniqueness of Weberization
Shen (Physica D, vol. 175, 2003)

Admissible space of u
Existence Theorem
Assume that and . Then there exists at
least one minimizer in the admissible space D.
Uniqueness Theorem
Assume that u z(x) in D is a minimizer and
at each pixel x. Then z(x) is
unique.

71
Weberized TV Restoration An Example
Profile of one horizontal slice
from noisy image
after Weberized TV restoration
72
One-Sentence Conclusion of Todays Talk

Regularization is crucial for visual perception,
and presents numerous challenges as well as
opportunities for further statistical and
mathematical modeling.

That is all, folks
Thank you for your patience!

Jackie
74
Acknowledgments

School of Mathematics and IMA, University of
Minnesota (UMN).
Tony Chan, Stan Osher, Luminita Vese, Selim
Esedoglu (UCLA) S.-H. Kang (U. Kentucky)
Yoon-Mo Jung (UMN).
Gil Strang (my Ph.D. advisor, MIT) for his
vision, guide, and support on research.
David Mumford and Stu Geman (Division Appl Math,
Brown U).
Dan Kersten and Paul Schrater (Psychology EECS,
UMN).
S. Masnou and J.-M. Morel (France) G. Sapiro and
M. Bertalmio (EECS, UMN).
Fadil Santosa, Peter Olver, Hans Othmer, Bob
Gulliver, Willard Miller, Doug Arnold, Mitch
Luskin (Colleagues at Math, UMN).
National Science Foundations (NSF), Program of
Applied Mathematics Office of Navy Research
(ONR).
All the generous support and warm words from
Jayant Shah, Andrea Bertozzi, David Donoho, James
Murray, Rachid Deriche.