A Young Mathematician - PowerPoint PPT Presentation

1 / 74
About This Presentation
Title:

A Young Mathematician

Description:

Title: SIAM Conf. Imaging Science, 2001 Author: Jackie Shen Last modified by: Jackie Shen Created Date: 6/25/2001 9:45:05 PM Document presentation format – PowerPoint PPT presentation

Number of Views:198
Avg rating:3.0/5.0
Slides: 75
Provided by: Jacki80
Category:

less

Transcript and Presenter's Notes

Title: A Young Mathematician


1
  • A Young Mathematicians Reflection on
  • Vision
  • Illposedness Regularizations
  • Jackie (Jianhong) Shen
  • School of Mathematics
  • University of Minnesota, Minneapolis, MN 55455
  • Workshop on Regularization in Statistics
  • Banff International Research Station, Canada
    September 6-11, 2003

2
Abstract
  • Vision, the perception of the 3-D world from its
    2-D partial projections onto the left and right
    retinas, is fundamentally an illposed inverse
    problem. But after millions of years' of
    evolution, human vision has become astonishingly
    accurate and satisfactory. How could it have
    become such a remarkable inverse problem solver,
    and what are the hidden (or subconscious)
    regularization techniques it employs?
  • This talk attempts to reflect and shed some
    light on these billion-dollar or Nobel-level
    questions (the works of several Nobel Laureates
    will be mentioned indeed), based on the limited
    but unique experience and philosophy of a young
    mathematician. In particular, we discuss the
    topological, geometric, statistical/Bayesian, and
    psychological regularization techniques of visual
    perception.
  • I owe deep appreciation to my former advisors,
    mentors, and teachers leading me to the Door of
    Vision Professors Gil Strang, Tony Chan, Stan
    Osher, David Mumford, Stu Geman, and Dan Kersten.

3
Agenda
  • An Abstract View of Vision Imaging and
    Perception
  • Illposedness of Visual Perception Root and
    Solutions
  • Conscious or Subconscious Regularizations
  • Topological Regularization Generic Viewpoint
    Principle
  • Geometric Regularization Perception Role of
    Curvature
  • Statistical Regularization Gibbs Fields and
    Learning
  • Psychological Regularization Role of Webers
    Law.

4
Dedication
  • Dedicated to all pioneering mathematicians in
    Mathematical Image and Vision Analysis (Miva),
  • on whose shoulders we the younger generations are
    standing,
  • and on whose shoulders, we the younger must think
    deeper, speak louder, and look further beyond
  • - Jackie Shen

5
  • An Abstract View of Vision
  • Passive Imaging
  • Active Visual Perception

6
Biological or Digital (Passive) Imaging Process
Lattice (or continuum) of photoreceptors
q viewing position angle
u 2-D image on the biological or digital retina
Passive Imaging Process
A 3-D world scene
Optical imaging process (human vision or digital
camera) can be modeled as a function (or
operator)
Here, 2 and 3 indicate the dimension of the
spatial variables. G and R denote the
configuration (scene geometry) and the
reflectance. All the as are parameters such as
I and q.
7
Active Visual Perception
Lattice (or continuum) of photoreceptors
q viewing position angle
u 2-D image on the biological or digital retina
Active Visual Perception
Perception is to reconstruct the 3-D world
(geometry, topology, material surface
properties, light source, etc.) from the
observed 2-D image
A 3-D world scene
8
A Quick Overview of Mathematicians Missions
  • Develop mathematical formulations for all the
    important psychological and cognitive discoveries
    in visual perception.
  • As in geometry and physics (Hamiltonian,
    Statistical or Quantum Mechanics), unify and
    extract the most fundamental laws or axioms of
    perception, and develop their mathematical
    foundations.
  • Decode the computational and information-theoreti
    c efficiency behind human visual perception, and
    integrate such knowledge into digital and
    computational decision and optimization
    algorithms.
  • Develop novel mathematical tools and theories
    arising from such studies, and apply them to
    other scientific and engineering fields, such as
    data visualization/mining pattern
    recognition/learning/coding multiscale modeling.

9
Is It Risky for a Young Mathematician to Study
Perception?
  • Quoting Albert Einstein
  • The object of all science, whether natural
    science or psychology , is to co-ordinate our
    experiences and to bring them into a logical
    system.
  • - from Space and Time in Pre-Relativity
    Physics, May, 1921.
  • To me, as for Special or General Relativities,
    being logical necessarily means being
    mathematical
  • Cause-effect modeling and description (e.g.,
    learning theory)
  • Formulating basic psychological/perceptual laws
    (e.g. Weber)
  • Simulating brain computation using computers
  • Verifying existing data, and furnishing
    reasonable predictions.
  • etc.

10
  • Illposedness of
  • Visual Perception
  • Root of Illposedness
  • Solutions to Illposedness

11
Vision is an Inverse Computer Graphics Problem
Dan Kersten (1997)
  • 3-D Computer Graphics (Hollywood animated
    movies)
  • G Geometric configuration of a 3-D scene (bike,
    table,)
  • R Material surface property reflectance field.
  • I ILLUMINANCE (incident lights, light source
    and type).
  • Goal Generate a 2-D image U, which looks exactly
    like the image that one sees if facing such a
    real 3-D scene.
  • Visual Perception is an Inverse C.G. Problem
    (Kersten)
  • Given a 2-D image U, one attempts to reconstruct
    its 3-D world G, R, I, viewing position
    angle, etc.

12
Vision is an illposed Inverse Problem (A-1)
  • A. Geometry is not invertible depth or range is
    lost !
  • Mathematical Model (1) Projective Imaging
  • P (x1, x2, x3) ? (x1, x2) is not invertible!
  • For any given 2-D curve g2, there are infinitely
    many 3-D curves g3, so that P(g3) g2 .
  • Example.

g3 (cos t, sin t, t )
g2 is just like the projection of a circle
13
Vision is an illposed Inverse Problem (A-2)
  • Mathematical Model (2) Perspective Imaging

a hyperbola
a parabola
an ellipse
Imaging plane (or retina)
All three different types of curves are imaged as
a 2-D circle !
14
Vision is an illposed Inverse Problem (B)
  • B. The Reflectance-Illuminance entanglement.

R
I
identical images !
different 3-D scenes
R
I
15
But vision still makes sense,
  • after millions of years evolution ,
  • which implies that
  • The human vision system is a well developed
    system of software and hardware, which can solve
    this highly ill-posed inverse problem efficiently
    and robustly.
  • Fundamental Questions
  • What kind of regularization techniques the human
    vision system employs to conquer the
    illposedness?
  • What kind of features or variables to regularize ?

16
Deterministic Model Tikhonov Regularization
  • Tikhonov conditioning technique for inverse
    problems
  • forward data generating U0 F(X).
  • backward (inverse) reconstruction of X
  • min d F(X), U0 R(X),
  • where d is a suitable discrepancy metric, and R
    is the regularizing/conditioning term.
  • Example. (Cleaning and sharpening of astronomical
    images)
  • U0 h X n . (h atmosphere blurring n
    white noise)
  • Such an inverse problem is typically solved by

17
Tikhonov Meets Bayes Perception as Inference
Bayesian Perception X (geometry G,
reflectance R, ).
Gibbs energy formula Prob (y) 1/Z exp ( -
Ey / k T ). Thus, in terms of energies, we
have
which leads to Tikhonov Inversion !
Conclusion It is our a priori knowledge of the
world (i.e., Prob X ) that regularizes our
visual perception!
18
A Priori Knowledge (Common Sense) of the World
Regularizes Vision
  • G Knowledge of curve and shape geometry
  • I Knowledge of light sources illuminance
    (sun, lamps, indoor or outdoor, )
  • R Knowledge of materials (wood, bricks, ) and
    surface reflectance (metal shines and water
    sparkles, )
  • Q Knowledge of the viewers (often standing
    perpendicularly to the ground, viewing more
    horizontally, several feet away for indoor
    scenes, )

19
Example I Prior Knowledge on Curves
Boundaries
  • Straight lines length(g), or 1-D Hausdorff.
  • Eulers elastica (Mumford 1994, Chan-Kang-Shen
    2002)
  • Piecewise elastica (Shah 2001)
  • which allows corners or hinges (i.e. the
    discrete set S) along the curve.

20
Example II. Prior Knowledge on Material
Reflectance
Reflectance distribution R(x) in the visual field
W
W
  • The Mumford-Shah 1989 free-boundary model
  • Many challenging free-boundary problems.
  • Connections to interface motions (such as the
    mean curvature motion).

curve energy
21
Deeper Question What to Learn How
  • How does the vision system subconsciously choose
    what knowledge to learn and store, out of massive
    visual data in daily life?
  • For such knowledge, to which degree of regularity
    or compression that the vision system decides
    to process, in order to achieve maximum
    efficiency and robustness?
  • How to mathematically model (or quantify) such
    activities?

Partial answer by Nobel Laureate (1972) Gerald
Edelman Neural Darwinism (Basic Books, Inc.,
1987) The vision system in each individual
operates as a selective system resembling
natural section in evolution, but operating by
different mechanisms.
22
  • Topological Regularization
  • Principle of Generic Viewpoint
  • vs.
  • Theorem of Transversality in
  • Differential Topology

23
Transversal Phenomenon
3-D world
projection
B
A
C
Digital or Biological 2-D Retina
and are transversal, in the sense of
differential topology and are not
tangent at any point
24
Transversal Is Generic Stability Universality
  • Suppose that (2, 3 are dimensions, and P is the
    projection)
  • are transversal. Then for any perturbation
    to , within an e
  • distance under a suitable smoothness norm,
  • are still transversal.
  • Theorem (Milnor) Let M and N be two smooth
    manifolds, and
  • P N submanifold. Then any smooth mapping f M
    ? N can be
  • approximated arbitrarily closely by one which is
    transversal to P.

Stability or Openness
Universality or Denseness
P
M
N
25
Principle of Generic Viewpoint in Vision
  • Generic Viewpoint (Nakayama Shimojo, Science
    1992 Freeman, Nature 1994)
  • What we see is generic w.r.t. the viewpoint !
  • Mathematical Model
  • Suppose we see an image
  • Then
  • where denote small perturbations of
    the as, controlled by e,
  • and D is a vision motivated measure, metric or
    distance.
  • In terms of curves, D can be based on
    invariants of differential topology
  • smoothness, corners, connected components,
    branching,

26
How G.V. Principle Regularizes Perception 3
Examples
  • How the Generic Viewpoint Principle regularizes
    visual perception (conditioning the
    illposedness)
  • Set up preference/biases among all possible
    scenes.
  • (Uniqueness) Lead to topologically unique
    solutions.
  • (Stability) Lead to the stability of perceptual
    solutions.

27
Example 1 Curve Perception
A pitchfork
Two curves
  • Given the image g , which 3-D scene (curve)
    satisfies the G.V. Principle?
  • Answer G1. For G2, when the viewing angle q
    is perturbed a little bit, the image would be
  • which is not diffeomorphic to g (since
    branching degree d and number of connected
    components c are invariants). Take the metric D,
    for example,
  • D difference in d difference in c other
    visual metric terms.

OR
28
Example 2 Shape Perception
A painted planar plate
a uniform V-wing
  • Given the image u, which 3-D scenario satisfies
    the G. V. Principle?
  • Answer Planar Plate. If the viewer moves a
    little bit (i.e. viewing angle q ?qe), the image
    of the V-wing scene would become
  • The two 2-D shapes are NOT diffeomorphic since
    corner is an invariant. (This is because if f is
    a diffeomorphism, then the Jacobian Jf is
    non-singular, and a corner can never be smoothed
    out leading to a straight line.

29
Example 3 shape perception Simian or Human
Dan Kersten, Vision Psychology, UMN
u
v
Two human faces
Two simian faces
d
d0
  • For image u, we see two human faces. No doubt.
  • If we move the face contours closer till touching
    (image v), do we still percept two human faces?
    or two simians? Psychologists show that most of
    us percept the latter. WHY? My explanation based
    on G. V. Principle
  • In 3-D, give the two faces a slight distance
    difference (from the viewer). Then the
    perception of two human faces violates the G.V.
    Principle, while the perception of two (100
    transparent except along the outlines) simian
    faces satisfies the G.V. Principle, or the
    Stability Principle. Classical Interpretation
    is based on Gestalt.

30
  • Geometric Regularization
  • Perception and Roles of
  • Curvature

31
Why Curvature (A) Architectural Evidence
St. Louis Arch, USA
Eiffel Tower, Paris
MITs Big Dome, Cambridge, USA
  • If human vision were blind to curvature,
    architects would not have bothered to build these
    beautiful structures ! (Disclaimer All
    pictures are Internet downloads, not taken by the
    author.)

32
Why Curvature (B) Anatomic Evidence
  • The work of Nobel Laureates (1981) David Hubel
    Torstern Wiesel (1962)

Visual cortical surface
In my opinion, this provides the most intimate
evidence of visual processing of the curvature
information. WHY ? Because Curvature Spatial
Organization of Orientations k dq/ds !
Down to the interior
Quantized array of cortical visual neurons and
their oriented receptive fields
33
Why Curvature (C) Heuristic Mathematical
Argument
  • Taylor expansion for a curve
  • Question Which orders do matter to human visual
    perception?
  • Answer Like Newtonian mechanics, only up to
    the second order.
  • Heuristic Mathematical Argument
  • ? First order matters since we can
    detect rotation!
  • ? Second order matters since we have a
  • strong sense of concavity/convexity/inflecti
    on.
  • ? Third order?

s
34
What Do All These Mean?
  • The proceeding evidences clearly demonstrate the
    significance, feasibility, legitimacy of
    curvature processing in visual perception. So
    what? It implies to
  • Employ curvature as a deterministic regularizer
    for solving numerous ill-posed inverse problems
    in image and vision analysis ( Mumfords proposal
    of Eulers elastica regularizer (1994), also
    Masnou-Morel (1998, ICIP ), Chan-Kang-Shen,
    (2002, SJAP ), Esedoglu-Shen (2002, EJAP ) )
  • Encode curvature in data structure and
    organization (e.g., Candes-Donohos curvelets
    (1999 2002, IEEE Trans. IP ))
  • Develop and study curvature-based mathematical
    models (e.g., Chan-Shens CDD 3rd nonlinear PDE
    model (2001, JVCIR ), Esedoglu-Shens proposal of
    Mumford-Shah-Euler model (2002, EJAP ) Euclidean
    and affine invariant scale spaces
    (Alvarez-Guichard-Lions-Morel, 1993
    Calabi-Olver-Tannenbaum, 1996) mean curvature
    motions (Evans-Spruck, 1991)).

We focus on this point.
35
Perceptual Interpolation of Missing Boundaries
Occluding object
p
q
p
q
  • Problem Find a curve g (t), 0lttlt1, which passes
    through the two endpoints p and q , and looks
    natural.
  • Vision background Most objects in our material
    world are not transparent. Occlusion is
    universal. Human vision must be able to integrate
    the dissected information (by suitable
    interpolation), in order to successfully perceive
    the world.
  • Why regularization is needed (1)
    (non-uniqueness) otherwise too many curves to
    choose from Brownian paths, say (2) to
    regularize is to properly model being natural.

36
Second Order Polynomial Regularization
Modeling Being Natural
  • Let us try polynomial regularization
  • But which order is generically sufficient?
  • Count the constraints in 2-D, totally 22116
    conditions.
  • Theorem. As long as , there exists a
    unique parabolic interpolant in the form of

This solution echoes the earlier claim of second
order sufficiency
37
Second Order Geometric Regularization Eulers
Elastica
Mumford 1994
  • Variational approach under these constraints, to
    minimize
  • called Eulers elastica. Energy was studied by
    Euler (1744) to
  • model the steady shape of a thin and torsion
    free elastica rod.
  • The equilibrium equation
  • which is an elliptic integral, and the solution
    can be expressed by
  • elliptic functions (Mumford 1994).

38
Image Interpolation or Inpainting
  • Inpainting is an artistic way of saying Image
    Interpolation (as first used in IP by Bertalmio
    et al (SIGGRAPH, 2000))
  • Partial image information loss is very common
  • Occlusion caused by non-transparent objects
  • Data loss in wireless transmission
  • Cracks in ancient paintings due to pigment
    aging/weather
  • Insufficient number of image acquisition sensors.
    etc

Example I Occlusion
Example II Cracks
Read Shen (SIAM News, 36(5), Inpainting and
Fundamental Problem of IP, 2003)
39
Chan-Shens Inpainting Model via BV Regularizer
Chan-Shen (SIAM J. Appl. Math., 62(3), 2001)
Existence is guaranteed, but uniqueness is not.
The TV inpainting model
BV regularizer
least square (for uniform Gaussian noise)
The associated formal Euler-Lagrange equation on
W
Fidelity Index
with Neumann adiabatic condition along the
boundary of W.
Formally looks very similar to Rudin-Osher-Fatemi
s denoising model !
40
Chan-Shens Model for Inpainting Noisy Blurry
Images
Suppose KGt, is the Gaussian kernel. Then, the
model gives a good inverting of heat diffusion.
Without the BV regularization, backward diffusion
is notoriously ill-posed.
Chan-Shen (AMS Contemporary Math., 2002)
movie forever
41
The BV Regularizer is Insufficient for Inpainting
(Kanisza, Nitzberg-Mumford, Chan-Shen)
Long distance connection is too expensive for
the TV cost ! Cheaper to simply break it. We
need curvature!
42
Lifting Curve Regularity to Image Regularity
  • Using the level-sets of an image, we can lift a
    curve model to an image model (formally
    theoretical study by Bellettini, et al., 1992)
  • Connection to the mean curvature flow
    (Evans-Spruck, 1991)

Curvature of level sets
43
Elastica Inpainting Model Curvature Incorporated
Chan-Kang-Shen (SJAP, 2002 also ref. to
Masnou-Morel, 1998 )
  • Theorem (associated E.-L. PDE).
  • The gradient descent flow is given by
  • where V is called the flux field , with proper
    boundary conditions.

44
Elastica Inpainting Nonlinear Tranport CDD
Chan-Kang-Shen (SJAP, 2002 )
  • Transport along the isophotes (level sets)
  • Curvature driven diffusion (CDD) across the
    isophotes
  • Conclusion
  • Elastica inpainting unifies the earlier work
    of Bertalmio, Sapiro, Caselles, and Ballester
    (SIGGRAPH, 2000) on transport based inpainting,
    and that of Chan and Shen (JVCIR, 2001) on CDD
    inpainting (motivated by human visual perception).

t
Tangential component
n
Normal component
level sets
45
Elastica Inpainting. I. Smoother Completion
  • Effect 1 as b/a increases,
  • connection becomes smoother.

Chan-Kang-Shen (SJAP, 2002 )
46
Elastica Inpainting. II. Long Distance is Cheaper
  • Effect 2 as b/a increases,
  • long distance connection gets cheaper.

For more theoretical and computational (4th order
nonlinear!) details, please see Chan-Kang-Shen
(SJAP, 63(2), 2002).
47
Inpainting Regularized by Mumford-Shah
Regulerizer
Chan-Shen (SJAP, 2000), Tsai-Yezzi-Willsky
(2001), Esedoglu-Shen (EJAP, 2002)
  • The Mumford-Shah (1989) image model was initially
    designed for the segmentation application

Mumford-Shah based inpainting is to minimize
Inpainting domain
possible blurring
A free boundary optimization problem.
48
Mumford-Shah Inpainting Algorithm
Esedoglu-Shen (Europ. J. Appl. Math., 2002)
  • For the current guess of edge completion G, find
    u to minimize

? equivalent to solving the elliptic equation
on W\G
  • This updated guess of u then guides the motion
    of G

R-
R
Jump across G of the roughness measure
M. C. Motion
We can then benefit from the level-set
implementation by Chan-Vese.
49
M.-S. Inpainting via G-Convergence Approximation
Esedoglu-Shen (Europ. J. Appl. Math., 2002)
The G-convergence approximation of
Ambrosio-Tortorelli (1990)
z1
z0
Esedoglu-Shen shows that inpainting is the
perfect market for G-convergence
edge G is approximated by a signature function z.
50
Leading to Simple Elliptic Implementation
Esedoglu-Shen (2002)
The associated equilibrium PDEs are two coupled
elliptic equations for u and z, with Neuman
boundary conditions
which can be solved numerically by any efficient
elliptic solver.
51
Applications Disocclusion and Text Removal
Esedoglu-Shen (2002)
inpainted u
the edge signature z
inpainted u
Inpainting domain
52
Insufficiency of Mumford-Shah Regularity for
Inpainting
Defect I Artificial corners
Defect II Fail to realize the Connectivity Princi
ple, like BV.
53
Esedoglu-Shens Proposal Mumford-Shah-Euler
Esedoglu-Shen (2002)
  • Idea change the straight-line curve model
    embedded in the
  • Mumford-Shah model to Eulers elastica
  • The G-convergence approximation (conjecture) of
    De Giorgi (1991)

For the technical and computational details,
please see Esedoglu-Shen.
54
Inpainting Based on Mumford-Shah-Euler Regularity
Esedoglu-Shen (2002)
  • Issues
  • Very costly 4th order PDE
  • Many local minima
  • Without approximation, it is
  • difficult to implement geometry

55
Conclusion for Curvature Based Regularization
  • Curvature processing has its anatomical vision
    foundation
  • Curvature processing has its cognitive/perceptual
    foundation
  • Curvature based regularization is necessary for
  • Image analysis and coding
  • Image processing and modeling
  • Image computation algorithms and implementation
    schemes
  • Curvature imposes both theoretical
    computational challenges non-quadratic objective
    functionals nonlinear 3rd and 4th order PDEs
    insufficient theory for wellposedness (existence,
    uniqueness, definition domains, etc.)

56
  • Statistical Regularization
  • Gibbs Fields and Learning

57
Stochastic View of Images Random Fields
Geman-Geman (1984), Grenander (1993), Zhu-Mumford
(1997)
  • An observed image u is treated as a randomly
    drawn sample.
  • The sample space is an ensemble of images with
    its own (often unknown) distribution.
  • An ensemble equipped with a probability
    distribution m or p naturally leads to a random
    field (R. F.) on the image domain.
  • Two distinguished features of such stochastic
    view
  • We are not interested in the pixelwise details of
    a particular image, rather, the key statistical
    features of the R. F. associated.
  • The R. F. is assumed to be ergodic for a
    typical sample image u, spatial statistics
    converge to ensemble/field statistics.
  • In this view, image regularization is built into
    the distribution m .

58
Stochastic View of Images Examples
(Shen-Jung, 2003)
(Shen-Jung, 2003)
Reaction-diffusion spots
Reaction-diffusion stripes
Wood texture (Internet download)
Crops (Internet download)
59
Stochastic Regularization Gibbs Fields
Geman-Geman (1984), Grenander (1993), Zhu-Mumford
(1997)
  • Stochastic regularity is encoded or specified by
    the random field distribution p (u ) .
  • If p is a uniform distribution, then there is not
    much tangible information associated to the
    images, or in Shannons language, the information
    content is the lowest (equivalently, no
    regularity is in presence as the entropy reaches
    highest).
  • In Grenanders (founder of Pattern Theory, Brown
    U) language, textures are built from basic
    building elements (atoms), and these atoms,
    like molecules, are bound together by local
    regularity energies, leading to informative
    images.
  • Thus Gibbs Fields Model seems natural to
    characterize such stochastic regularity Gibbs
    Canonical Ensemble/Formula

regularity energy
visual temperature
partition function
60
Regularization Visual Potentials and Their Duals
  • What can a box of air (molecules) tell us
  • Microscopically never stops fluctuating (lacking
    regularity)
  • Macroscopically there are a few key feature
    potentials
  • Temperature T
  • Pressure p
  • Chemical potentials m
  • These potentials have their dual (additive)
    variables
  • Energy E, dual to the inverse temperature b1/kT.
  • Volume V, dual to the pressure p.
  • Mole numbers N, dual to the chemical potential m.
  • Gibbs Generalized Ensemble Model says,
  • Prob (a micro state) 1/Z .exp(- b E. - b p
    V. b m N. ).
  • Conclusion to apply Gibbs regularity in image
    analysis, one needs to properly identify visually
    meaningful potentials and their duals. (These
    feature parameters will regularize images.)

61
Example of Gibbs Canonical Images Binary Ising
Images
  • Isings Model (1925, for ferromagnetic phase
    transition originally)
  • Binary images on the Cartesian lattice Z2.
  • uu(i,j)1 or 1 (spin up or down).
  • In an external (biasing) magnetic field H , the
    energy associated to each observed field u is
  • J internal energy of magnetic dipole (neighbor
    pair)
  • a, b, c representing general pixels,
    neighboring.
  • But generic (natural) images are not generated
    from such clean physics. Challenge is to develop
    good models for (generalized) energies, and
    properly model short-range or long-range visual
    interactions.

62
What Features to Regularize Visual Filters
Zhu-Mumford (1997)
  • Treat the human vision system as a system of
    linear filters T (F1 , F2 , , FN ).
  • Each filter Fn is characterized by its special
    capability in resolving a particular orientation
    q with a particular spatial frequency w at a
    particular scale s . That is, Fns are
    parametrized by the feature vector ( q, w, s ).
  • Basic Assumption
  • To human vision, the random image fields are
    completely describable (at least in satisfactory
    approximation) by the filter response Tu. All
    other features are blindly filtered out, and
    treated visually insignificant. This is the
    hidden rule of visual regularization!!
  • Justification (from Statistical Mechanics)
  • Though the dimension of the phase space of a box
    of air molecules is huge, in equilibrium such a
    system can be accurately described by three
    feature parameters temperature, pressure, and
    volume.

63
From Visual Filters to Potentials Maximum
Entropy Learning of Zhu-Mumford
Zhu-Mumford (1997)
  • Following the proceeding basic assumption, a
    Gibbs image model would be ONLY based on the
    statistics of the filter outputs Tu ( F1
    u, F2 u, , FN u ).
  • But how exactly? Zhu-Mumfords MEL Model/Scheme
  • Each filter output vn Fn u is by itself a
    random field.
  • In the ideal case, suppose we do know the random
    field distributions qn ( vn ), n 1N. Then,
    these can be used as a set of constraints on the
    original Gibbs image u. That is, pp(u) should
    lead to Prob( vn ) qn ( vn ), n 1N.
  • Under these set of constraints, one can use Gibbs
    variational formulation of Statistical Mechanics
    to find the unique Gibbs field that maximizes the
    entropy, which necessarily takes the form of
  • In reality, the spatial structure of vn is often
    ignored and each is treated as a field of
    i.i.d.s. Thus the joint p.d.f qn is a direct
    tensor product, and is replaced by its 1-D
    histogram.

Diracs delta
64
Learning of Regularization Will Be Momentous
  • A brief conclusion
  • Statistical regularization is often achieved
    through visually meaningful filters and filtering
    processes.
  • The Gibbs field model can be learned based on the
    empirical statistics (e.g. histograms) of these
    filter outputs of a typical image sample, and the
    ergodicity assumption is thus crucial in such
    learning processes.
  • Maximum Entropy based Learning Processes (Melp)
    will play an increasingly important role in
    vision and pattern analysis.

65
  • Psychological Regularization
  • Role of Webers Law A Case Study

66
Webers Law
Shen (Physica D, 175, 2003)
  • Webers Law in Psychology
  • Let u denote the mean field of a background
    (sound/light), and du the intensity increment
    just detectable by human perception (ears/eyes)
    or the so-called JND just-noticeable-difference
    . Then du/u constant.
  • First qualitatively described by German
    physiologist E. H. Weber in 1834 Later
    formulated quantitatively by the great
    experimental psychologist G. T. Fechner in 1858.
  • Search your own experience for validating Webers
    Law
  • In a fully packed stadium high bgsound u gt have
    to cry loud to be effectively heard by other
    folks (i.e., du has to be high as well)
  • ???? (An ancient Chinese idiom) translated to
    In a night with a bright full moon, the stars
    always look scarce.

67
Is Webers Law Psychological or Physiological ?
James Keener (Math. Physiology, 1998)
  • Jackie Shens Theorem Any commonly shared
    psychological phenomenon (among most of the
    6,000,000,000 people on this planet) has to be
    physiological.
  • Webers Law expresses the light adaptive
    capability of the frontal end of the entire
    vision system the two retinas.
  • Without Webers Law, the retinas would not be
    able to operate over a wide range of light
    intensities from several photons to bright solar
    light, since the membrane potential V of a
    neuron cell has a saturated maximum value.
  • Webers Law is the result of a feedback mechanism
    (Tranchina-Perskin, 1988) of the retina system,
    which is implemented by the biochemical
    physiology (ion channels) of the photoreceptors
    (McNaughton, 1990).

68
Webers Law Regularization Can Respect Real
Perception
Shen (Physica D, vol. 175, 2003)
  • What does Webers Law have to do with
    regularization?
  • Before Shen (2003), all variational image
    regularizers in image and vision analysis are
    defined by either the Sobolev norm
  • as in the Linear Filtering Theory, and the
    Mumford-Shah model (1989), or the Total Variation
    (TV) Radon measure
  • as in the Rudin-Osher-Fatemi model (Physica D,
    1992).
  • The fundamental assumption of such regularizers
    is that human visual sensitivity to small
    fluctuations (or irregularities) depends on
    nothing else but themselves. But this is
    inappropriate according to Webers Law !

69
Weberized Regularization and Applications
Shen (Physica D, vol. 175, 2003)
  • Thus Shen (2003) proposed to Weberize (a word
    conveniently coined) the conventional Sobolev or
    TV regularizers to
  • or,
  • For example, the Weberized TV denoising and
    deblurring model (for additive Gaussian noise)
    would be to minimize the energy
  • where the light intensity field (or image) u is
    non-negative.

70
Existence and Uniqueness of Weberization
Shen (Physica D, vol. 175, 2003)
  • Admissible space of u
  • Existence Theorem
  • Assume that and . Then there exists at
    least one minimizer in the admissible space D.
  • Uniqueness Theorem
  • Assume that u z(x) in D is a minimizer and
    at each pixel x. Then z(x) is
    unique.

71
Weberized TV Restoration An Example
Profile of one horizontal slice
from noisy image
after Weberized TV restoration
72
One-Sentence Conclusion of Todays Talk
  • Regularization is crucial for visual perception,
    and presents numerous challenges as well as
    opportunities for further statistical and
    mathematical modeling.

73
  • That is all, folks
  • Thank you for your patience!

Jackie
74
Acknowledgments
  • School of Mathematics and IMA, University of
    Minnesota (UMN).
  • Tony Chan, Stan Osher, Luminita Vese, Selim
    Esedoglu (UCLA) S.-H. Kang (U. Kentucky)
    Yoon-Mo Jung (UMN).
  • Gil Strang (my Ph.D. advisor, MIT) for his
    vision, guide, and support on research.
  • David Mumford and Stu Geman (Division Appl Math,
    Brown U).
  • Dan Kersten and Paul Schrater (Psychology EECS,
    UMN).
  • S. Masnou and J.-M. Morel (France) G. Sapiro and
    M. Bertalmio (EECS, UMN).
  • Fadil Santosa, Peter Olver, Hans Othmer, Bob
    Gulliver, Willard Miller, Doug Arnold, Mitch
    Luskin (Colleagues at Math, UMN).
  • National Science Foundations (NSF), Program of
    Applied Mathematics Office of Navy Research
    (ONR).
  • All the generous support and warm words from
  • Jayant Shah, Andrea Bertozzi, David Donoho, James
    Murray, Rachid Deriche.
Write a Comment
User Comments (0)
About PowerShow.com