Fundamentals linking discrete and continuous
approaches to computer vision - A topological view
Date: Monday June
18, 2007
Time: 1:30 - 5:30p
Location: Greenway Room D, Hyatt
Regency Hotel
Presenter: Leo Grady
In association with CVPR 2007
Movie - We show
how the methods developed here can be used to translate the
Mumford-Shah functional to a graph, which can be minimized to generate
a clustering algorithm
Description:
Continuous and combinatorial methods of image analysis have
developed in computer vision largely along separate lines of research.
The primary difference between a combinatorial and continuous
algorithm for computer vision is whether or not an image is treated as
a large set of regular samples approximating a continuum domain or as
a set of discrete objects. Although the former approach leads to
mathematical exposition and analysis in terms of partial differential
equations and the latter to graph theoretic methods, both toolsets for
analysis may be derived from the common language of topology. In
personal experience with members of the computer vision field, there
appears to be a misconception that combinatorial and continuum
mechanics are wholly separate from each other and that there can
therefore be no meaningful cross-pollination of ideas between
continuum and combinatorial algorithms. The primary goal of this short
course will be to clarify this confusion and rebuild a common
framework from primary principles that accommodates both
approaches. The structure of this short course will be to start from
the standpoint of topology (specifically algebraic topology) and show
that the mechanisms underlying the structure of continuous and
combinatorial PDEs is entirely analogous. In addition, the differing
perspectives often give many new interpretations of older concepts.
The goal is to give attendees working with disparate mathematical
tools the ability to translate continuum methods to a combinatorial
formulation and vice versa in order to facilitate idea exchange. The
latter part of the course will take practical examples from the
computer vision literature and show how they fit into this common
setting. A running theme of the short course is that neither continuum
nor combinatorial methods has any inherit primacy in computer vision
(or physics) and both mathematical traditions are rich enough to
support most concepts. Therefore, more justification should be given
to why a new idea is better expressed in a continuum or combinatorial
language. Most of the examples given throughout the short course will
be in the area of image segmentation, since this is the presenter's
particular area of focus.
Outline:
The course
consists of three sections: Theory, Example and Practice.
1)
Introduction
Course goals are outlined and a
history is provided of the mathematical treatment of space.
2)
Theory
We begin by revisiting integrals and
continuous PDEs, adopting a particular topological viewpoint of the
underlying mechanisms. We then review the theory of
combinatorial PDEs (i.e., cochains defined on chains) and show that
the underlying structures in continuous and combinatorial methods have
the same behavior and properties. The conclusion is that in
order for combinatorial PDEs to behave like their continuous
counterparts, care must be taken to identify continuous quantities
with combinatorial cells of appropriate dimension. The theory
section is divided into the following subsections
a) Integrals
b)
Classification of physical laws
c)
Combinatorial spaces
d) Combinatorial
functions
e) Conclusions
3) Example
This section is devoted to detailing the depth of
connection in structure of continuous and combinatorial
theories. We explore several, disparate ideas in continuous PDE
theory and show with concrete examples how the behavioral properties
are preserved exactly with the corresponding combinatorial analogues.
Additionally, we give examples for which the combinatorial analogue
provides new interpretations of traditionally continuous
concepts. The example section is divided into the following
subsections
a) Helmholtz decomposition
b) Stokes Theorem
c) Methods
of solution
d) Diffusion equation
e) The Laplacian
4) Practice
We now show that there is a practical benefit to
understanding the above theory in the applications of computer vision
and pattern recognition. We begin by interpreting older
algorithms from the viewpoint given above, drawing a common theme that
image content is employed as material properties of the domain.
We then apply the knowledge from the previous two sections in
developing the Random Walker segmentation method and in showing how to
generalize the shortest path problem to 3D (and consequently,
intelligent scissors/live wire). Finally, we take the viewpoint
of "translating" a continuous PDE to a combinatorial PDE and show how
to solve the classical Mumford-Shah PDE on a general graph. The
practice sections is divided into the following subsections
a) Images as constitutive
b)
Random walker image segmentation
c) Generalized
shortest paths
d) Combinatorial
Mumford-Shah
5) Conclusion