NEURAL DYNAMICS OF 3-D SURFACE PERCEPTION:
FIGURE-GROUND SEPARATION AND LIGHTNESS PERCEPTION
Frank Kelly and Stephen Grossberg
Perception and Psychophysics, in press
APPENDIX AND PARAMETER TABLE
Note: This equation file loads very slowly on the web, sometimes taking almost 5 minutes. If speed is important, consider downloading the PDF or G'zipped Postscript versions instead.
A.1 General Introduction
This section describes the FACADE model's BCS and FCS equations. These
equations are similar to those in Grossberg and McLoughlin (1997) and
Gove, Grossberg and Mingolla (1995) but there are several
refinements: LGN, simple cell and complex stages better sharpen the
responses of cells to boundaries. The boundary grouping process
incorporates inhibitory feedback from bipoles at other positions and
orientations that helps to break T-junctions. The remaining stages
are unchanged, except again for minor parameter differences. See
Grossberg and McLoughlin (1997) for a more complete discussion of the
basic equations.
A.2 LGN ON and OFF Channels
The model LGN discounts the illuminant and computes Weber-law
modulated and normalized estimates of image contrasts above an
adaptation level. The LGN ON activities
and OFF
activities
are described by on-center off-surround
and off-center on-surround networks, respectively; that obey membrane,
or shunting, equations (Grossberg, 1973, 1983; Hodgkin, 1964):
and
where the decay constant
, the upper and lower
activity bounds are
and the center
and surround
are defined by
Gaussian kernels:
where
and
To give the ON and OFF signals the same total strength, we use
where the width of center and
surround are described by
.
At equilibrium:
and
The difference of these ON and OFF activities is computed to generate opponent output signals:
The output is rectified using
A.3 Simple Cells
Model simple cells respond to oriented contrasts in the image. They respond to a prescribed contrast polarity. Even-symmetric and odd-symmetric simple cell receptive fields centered on the two-dimensional location (i,j) and of orientation k were defined using even and odd Gabor kernels:
and
where
and
For vertical cells at Scale 1 :
.
For horizontal cells at Scale 1 :
.
For vertical cells at the larger Scale 2 :
.
For horizontal cells at Scale 2 :
.
These parameters are slightly smaller than those in the
Grossberg and McLoughlin (1997) implementation to make edges
slightly sharper.
The extent of the receptive fields of the cells at each scale are as follows:
For Scale 1 :
, and for Scale 2 :
.
A.4 Complex Cells
Model complex cells pool signals from like-oriented simple cells that are sensitive to opposite contraxt polarities. They also pool left and right eye input to compute binocular disparity. Different cell sizes, or scales, can compute different disparity ranges. The two scales are repeated at the complex cell stage. Scale 1 contains two pools of disparity-sensitive cells (near-zero disparity and a disparity of 3 pixels). Scale 2 contains three pools of such cells (near-zero disparity, a disparity of 3 pixels, and a disparity of 7 pixels). Two fields of monocular complex cells are used to represent the left and right images. The dynamics of the complex cell stage are defined mathematically as follows:
where:
The inhibitory signal function
where,
for all scales and disparities.
is the excitatory input formed by the binocular
filter:
The binocular complex cell receptive fields are as follows: For scale 1, L, R = 20 ; for scale 2, L, R = 28 . Kernels satisfy:
and
The inhibitory connections between complex cells tuned to different disparities (possibly at different scales) obey:
for
.
The strength of the inhibitory connections between complex cells
depends on the scale and disparity of the pool of cells in question.
For cells at Scale 1 whose disparity is
d :
For cells at Scale 2 with disparity d :
.
In (A19),
determines the shift in the center of the Gaussians
between disparities d and e.
Also
where for Scale 1,
whereas for Scale 2:
A.4.1 Horizontal Complex Cells
Cells sensitive to horizontal boundaries are spatially sharpened using an on-center off-surround network:
where
and
. The Gaussian kernels
are:
and
where
. For Scale 1,
. For Scale 2 :
.
At equilibrium, after rectification to generate output signals, we
find:
A.5 Hypercomplex Cells : Spatial Competition
Hypercomplex cells carry out spatial and orientational competition in response to complex cell inputs. The spatial competition models the neural process of endstopping. The spatial competition uses an on-center off-surround network that includes both excitatory and inhibitory shunting feedback from the bipole cells, rather than just the additive excitatory feedback used by Grossberg and McLoughlin (1997):
At equilibrium:
where
T= 0.0000189 ,
and the feedback terms
are defined as follows:
and
where
for both scales.
Term
provides positive feedback from like-oriented bipole
cells (see Section A6) to help
complete boundaries, whereas
sums bipole cell outputs across orientations and space
to provide negative feedback with which to supress weaker boundaries.
This orientational competition works
across space to break the stems of T-junctions from their tops.
The feedback signal functions f and g are linear, with gains
that depend upon scale. Thus for Scale 1,
, and
, whereas for Scale 2,
,
and
. The feedforward
excitatory term is:
with Gaussian kernels
The inhibitory term is:
where
For Scale 1 and Scale 2,
.
The size of the kernels is
for both
Scales 1 and 2. The output signals from this stage are
.
A.5.1. Hypercomplex Cells : Orientational Competition
The hypercomplex competition between orientations obeys:
where
and F(x) = 20x .
This inhibitory feedback term F(x) comes from those monocular
FIDO cells which represent nearer depths (i.e., smaller disparities) as defined by
equation (A64). The excitatory input is
where the amount of inhibition between orientations
is
The inhibitory input is
with
For both Scales:
and
.
Solving at equilibrium and rectifying yields the output signals
A.6 Cooperative Bipole Cells
The bipole cells initiate boundary grouping by collecting oriented signals from two oriented branches of their receptive fields. Bipole cell activity satisfies:
where
bounds each
branch's activity:
and D = 0.1. The output threshold,
, in h
helps ensure that both lobes are active
before a bipole cell fires :
Here
. Terms
and
in (A39) correspond to two oriented branches of the bipole cell receptive
field:
and
where orientation R is perpendicular to orientation r,
a = 1.0, b = 2.0 . The subfraction of mutually perpendicular
input prevents colinear grouping from crossing regions that contain
other, non-colinear contrasts. For Scale 1 and Scale 2:
.
The bipole cell's receptive fields are implemented as in Gove et al.
(1995).
In particular, the branches of the bipole cell
receptive field obey
a Gaussian weighting operator:
Term
modulates filter values
based on their distance from the bipole's center, where
is
the optimal distance from the center:
In (A45), we set
and
.
In the current implementation the input images were 164 x 204 pixels in size
and
was chosen to make the bipoles 40 pixels in length.
Term
favors tangent values closer to the orientation of
the bipole's main axis:
where
.
Term
measures the similarity of the orientation of a point
and the angle formed by the tangent at that point. The
tangent defines the optimal orientation for that point and filter
element orientations closer to this optimal value will have greater
strength than those at larger angular separations:
with
.
Bipole Cell Feedback to Hypercomplex Cells : Orientational Competition
Unlike in Gove et al. (1995), the bipole-to-hypercomplex cell feedback is calculated directly from the bipole cell output, as in equations (A25) - (A28).
A.7 Monocular Filling-in and Monocular FIDOs
The monocular FIDO ON cell activities
and OFF cell
activities
diffuse the FCS outputs
and
, respectively, from the monocular preprocessing stage. Boundary outputs
create resistive barriers to the diffusion process. Filling-in obeys the
following equations (Grossberg and Todorovic, 1988):
and
where N consists of the four nearest neighbors to a cell and where the boundary-dependent diffusion coefficient obeys
where
and
. The boundary term
Thus any large boundary value at the nearest neighbor positions reduces the diffusion coefficient and thereby blocks filling-in. At equilibrium:
and
A.7.1 Output from Monocular FIDOs
Outputs from the monocular FIDOs generate both boundary pruning and surface pruning signals. In order to generate such signals at the contours of filled-in regions, the filled-in activities are processed by a contrast-sensitive on-center off-surround network as follows:
and
where
and
. The excitatory
input
has the on-center kernel
where
. The inhibitory
input
has the off-surround kernel
where S = 0.181 and
.
At equilibrium:
and
These contrast-sensitive signals were then subtracted and rectified to generate double-opponent output signals:
and
A.8 Boundary Pruning Signals to Hypercomplex Cells
In order to transform unoriented FIDO activities into boundary pruning signals at oriented hypercomplex cell activities in equation (A33), they are processed by oriented filters:
where
and
with odd-symmetric kernels
and even-symmetric kernels
All parameters for these FCS simple cells are the same as for the BCS simple cells in Section A3.
A.9 Surface Pruning Signals to the Binocular FIDOs
The binocular FIDOs receive inputs from both the left and right eye monocular FIDOs and the monocular preprocessing stage. Inputs from the monocular preprocessing stage are excitatory and are binocularly matched at the binocular FIDOs. Inputs from the monocular FIDOs are inhibitory surface pruning signals. Both excitatory and inhibitory signals are combined binocularly via the following equations:
and
where
and
. The excitatory term
matches left and right monocular preprocessing
signals:
and
The Binocular FIDOs also receive inhibitory surface pruning inputs from monocular FIDO cells that represent smaller disparities:
and
where
and
are the monocular FIDO outputs
in (A62) and (A63). The values of the saturation terms
and
are chosen to enable the monocular FIDO outputs to inhibit
monocular preprocessing signals and thereby prevent filling-in of
occluded regions at the Binocular FCS. Solving at steady state and rectifying yields:
and
The diffusive spread of binocular FIDO activity is defined by the following equations:
and
where N is the set of nearest neighbors, M = 0.1, and the boundary gating term is defined by
where
and
In (A79), the boundary signals
are enriched by adding the boundaries Z of nearer objects to the boundaries of farther objects,
thereby preventing occluded regions of the binocular FIDO from filling-in and
giving a percept of transparency where none exists. Thus:
where
is defined by
(A51). Solving at equilibrium yields:
and
These equations were solved at equilibrium using the Y12M package
(Zlatev, Wasniewski & Schaumburg, 1981) because
solving these equations using 4th-order Runge-Kutta (step size 0.0000025)
or adaptive step Runge-Kutta was computationally intractable. The Y12M
package uses an approximation algorithm to calculate
final FCS values based on filling-in.
Unfortunately one of the problems with this approximation is that it allows boundaries to leak color. This problem was solved by increasing the
strength of boundaries by setting
in (A79) to 100,000.
In the Munker-White simulation, because of the
sparsity of filling-in signals at the far depth binocular FCS (Figure 15f)
relative to the area these signals must fill in, we set
and
Equilibrium opponent ON - OFF and OFF - ON values are calculated as follows:
and
These are the binocular FIDO activities that are plotted in the simulations.
The parameters used in the simulations are listed below. Note: Older browsers may substitute for the Greek letters. Please download the postscript or PDF versions for complete accuracy.
|
LGN ON and OFF Channels |
|||
|
a 1 |
100 |
||
|
U1 |
50 |
||
|
L1 |
50 |
||
|
A1 |
1.0 |
||
|
A2 |
1.03361 |
||
|
s c |
0.5 |
||
|
s s |
1.5 |
||
|
Simple Cells |
|||
|
Scale 1 |
Vertical Cells |
Horizontal Cells |
|
|
s pk |
1.0 |
0.75 |
|
|
s qk |
0.75 |
1.0 |
|
|
Scale 2 |
Vertical Cells |
Horizontal Cells |
|
|
s pk |
1.25 |
1.0 |
|
|
s qk |
1.0 |
1.25 |
|
|
Complex Cells |
|||
|
a 2 |
0.01 |
||
|
b |
15.0 |
||
|
U2 |
1.0 |
||
|
L2 |
1.0 |
||
|
G e |
0.2 |
||
|
Scale 1 |
Scale 2 |
||
|
Add |
2.5 |
2.5 |
|
|
Ade |
1.0 |
1.0 |
|
|
Ad,mon |
0.25 |
0.15 |
|
|
m dd |
0.05 |
0.05 |
|
|
m de |
0.05 |
0.05 |
|
|
m d,mon |
0.075 |
0.075 |
|
|
m cdd |
0.015 |
0.015 |
|
|
m sdd |
0.5 |
0.5 |
|
|
Horizontal Complex Cells |
|||
|
a 3 |
0.1 |
||
|
U3 |
1.0 |
||
|
L3 |
1.0 |
||
|
A1 |
1.0 |
||
|
A2 |
1.0 |
||
|
Scale 1 |
Scale 2 |
||
|
m c |
1.5 |
0.5 |
|
|
m s |
0.06 |
0.06 |
|
|
Hypercomplex Cells: Spatial Competition |
|||
|
a 4 |
0.01 |
||
|
U4 |
1.0 |
||
|
L4 |
1.0 |
||
|
T |
0.0000189 |
||
|
Scale 1 |
Scale 2 |
||
|
s c |
1.0 |
1.0 |
|
|
s s |
2.0 |
2.0 |
|
|
Hypercomplex cells: Orientational Competition |
|||
|
a 5 |
1.0 |
||
|
U5 |
1.0 |
||
|
L5 |
1.0 |
||
|
Scale 1 |
Scale 2 |
||
|
C |
1.0 |
1.0 |
|
|
S |
1.5 |
1.5 |
|
|
s c |
0.5 |
0.5 |
|
|
s s |
0.75 |
0.75 |
|
|
Bipole Cells |
|||
|
D |
0.1 |
||
|
G |
0.1 |
||
|
A |
1.0 |
||
|
B |
2.0 |
||
|
Z |
1.0 |
||
|
r |
0.0 |
||
|
s g |
7.0 |
||
|
s k |
0.15 |
||
|
s r |
0.15 |
||
|
Monocular FIDO |
|||
|
Mm |
0.1 |
||
|
d |
100,000.0 |
||
|
k |
1.0 |
||
|
e |
1000.0 |
||
|
Output from Monocular FIDOs |
|||
|
a 8 |
0.01 |
||
|
L8 |
1.0 |
||
|
U8 |
1.0 |
||
|
C |
0.0398 |
||
|
s c |
2.0 |
||
|
S |
0.181 |
||
|
s s |
3.0 |
||
|
Binocular FIDO |
|||
|
a 9 |
0.01 |
||
|
U9 |
0.5 |
||
|
L9 |
50.0 |
||
|
Mb |
0.1 |
||
|
d |
100,000.0 |
||
|
k |
1.0 |
||
|
e |
1000.0 |
||
TABLE 1. Model Parameters.