Stephen Grossberg

Recent Publications and Technical Reports

Abstracts with links to papers available on-line

2003


  • Brown, J.W., Bullock, D., and Grossberg, S. (2003). How laminar frontal cortex and basal ganglia circuits interact to control planned and reactive saccades. Neural Networks, in press.

    How does the brain learn to balance between reactive and planned behaviors? The basal ganglia and frontal cortex together allow animals to learn planned behaviors that acquire rewards when prepotent reactive behaviors are insufficient. This paper proposes a new model, called TELOS, to explain how laminar circuitry of the frontal cortex, exemplified by the frontal eye fields, interacts with the basal ganglia, thalamus, superior colliculus, and inferotemporal and parietal cortices to learn and perform reactive and planned eye movements. The model is formulated as fourteen computational hypotheses. These specify how strategy priming and action planning (in cortical layers III, Va and VI) are dissociated from movement execution (in layer Vb), how the basal ganglia help to choose among and gate competing plans, and how a visual stimulus may serve either as a movement target or as a discriminative cue to move elsewhere. The direct, indirect and hyperdirect pathways through the basal ganglia are shown to enable complex gating functions, including deferred execution of selected plans, and switching among alternative sensory-motor mappings. Notably, the model can learn and gate the use of a What-to-Where transformation that enables spatially invariant object representations to selectively excite spatially coded movement plans. Model simulations show how dopaminergic reward and non-reward signals guide monkeys to learn and perform saccadic eye movements in the fixation, single saccade, overlap, gap, and delay (memory-guided) saccade tasks. Model cell activation dynamics quantitatively simulate seventeen established types of dynamics exhibited by corresponding real cells during performance of these tasks.

    Preliminary version appears as Boston University Technical Report CAS/CNS TR-2000-023. Available in PDF (BroBulGro2003NN.pdf)(606Kb).


  • Grossberg, S. (2003). How does the cerebral cortex work? Development, learning, attention, and 3D vision by laminar circuits of visual cortex. Behavioral and Cognitive Neuroscience Reviews, in press.

    A key goal of behavioral and cognitive neuroscience is to link brain mechanisms to behavioral functions. The present article describes recent progress towards explaining how the visual cortex sees. Visual cortex, like many parts of perceptual and cognitive neocortex, is organized into six main layers of cells, as well as characteristic sub-lamina. Here it is proposed how these layered circuits help to realize processes of development, learning, perceptual grouping, attention, and 3D vision through a combination of bottom-up, horizontal, and top-down interactions. A key theme is that the mechanisms which enable development and learning to occur in a stable way imply properties of adult behavior. These results thus begin to unify three fields: infant cortical development, adult cortical neurophysiology and anatomy, and adult visual perception. The identified cortical mechanisms promise to generalize to explain how other perceptual and cognitive processes work.

    Preliminary version appears as Boston University Technical Report CAS/CNS TR-2003-005. Available in PDF (Gro2003BCNR.pdf)(829Kb).


  • Grossberg, S. (2003). Resonant neural dynamics of speech perception. Journal of Phonetics, in press.

    What is the neural representation of a speech code as it evolves in time? How do listeners integrate temporally distributed phonemic information across hundreds of milliseconds, even backwards in time, into coherent representations of syllables and words? What sorts of brain mechanisms encode the correct temporal order, despite such backwards effects, during speech perception? How does the brain extract rate-invariant properties of variable-rate speech? This article describes an emerging neural model that suggests answers to these questions, while quantitatively simulating challenging data about audition, speech and word recognition. This model includes bottom-up filtering, horizontal competitive, and top-down attentional interactions between a working memory for short-term storage of phonetic items and a list categorization network for grouping sequences of items. The conscious speech and word recognition code is suggested to be a resonant wave of activation across such a network, and a percept of silence is proposed to be a temporal discontinuity in the rate with which such a resonant wave evolves. Properties of these resonant waves can be traced to the brain mechanisms whereby auditory, speech, and language representations are learned in a stable way through time. Because resonances are proposed to control stable learning, the model is called an Adaptive Resonance Theory, or ART, model.

    Preliminary version appears as Boston University Technical Report CAS/CNS TR-2002-008. Available in PDF (Gro2003TIPSPhonetics.pdf)(829Kb).


  • Grossberg, S., Govindarajan, K.K., Wyse, L.L. , and Cohen, M.A. (2003). ARTSTREAM: A neural network model of auditory scene analysis and source segregation. Neural Networks , in press.

    Abstract: Multiple sound sources often contain harmonics that overlap and may be degraded by environmental noise. The auditory system is capable of teasing apart these sources into distinct mental objects, or streams. Such an "auditory scene analysis" enables the brain to solve the cocktail party problem. A neural network model of auditory scene analysis, called the ARTSTREAM model, is presented to propose how the brain accomplishes this feat. The model clarifies how the frequency components that correspond to a given acoustic source may be coherently grouped together into a distinct streams based on pitch and spatial location cues. The model also clarifies how multiple streams may be distinguished and separated by the brain. Streams are formed as spectral-pitch resonances that emerge through feedback interactions between frequency-specific spectral representations of a sound source and its pitch. First, the model transforms a sound into a spatial pattern of frequency-specific activation across a spectral stream layer. The sound has multiple parallel representations at this layer. A sound's spectral representation activates a bottom-up filter that is sensitive to the harmonics of the sound's pitch. This filter activates a pitch category which, in turn, activates a top-down expectation that is also sensitive to the harmonics of the pitch. Resonance develops when the spectral and pitch representations mutually reinforce one another. Resonance provides the coherence that allows one voice or instrument to be tracked through a noisy multiple source environment. Spectral components are suppressed if they do not match harmonics of the top-down expectation that is read-out by the selected pitch, thereby allowing another stream to capture these components, as in the "old-plus-new heuristic" of Bregman. Multiple simultaneously occurring spectral-pitch resonances can hereby emerge. These resonance and matching mechanisms are specialized versions of Adaptive Resonance Theory, or ART, which clarifies how pitch representations can self-organize during learning of harmonic bottom-up filters and top-down expectations. The model also clarifies how spatial location cues can help to disambiguate two sources with similar spectral cues. Data are simulated from psychophysical grouping experiments, such as how a tone sweeping upwards in frequency creates a bounce percept by grouping with a downward sweeping tone due to proximity in frequency, even if noise replaces the tones at their intersection point. Illusory auditory percepts are also simulated, such as the auditory continuity illusion of a tone continuing through a noise burst even if the tone is not present during the noise, and the scale illusion of Deutsch whereby downward and upward scales presented alternately to the two ears are regrouped based on frequency proximity, leading to a bounce percept. Since related sorts of resonances have been used to quantitatively simulate psychophysical data about speech perception, the model strengthens the hypothesis that ART-like mechanisms are used at multiple levels of the auditory system. Proposals for developing the model to explain more complex streaming data are also provided.

    Key words: auditory scene analysis, streaming, cocktail party problem, pitch perception, spatial localization, neural network, resonance, adaptive resonance theory, ART, spectral-pitch resonance.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-03-017. Available in PDF ( GroGovWysCoh2003.pdf )(1.53Mb).


  • Grossberg, S. and Howe, P.D.L. (2003). A laminar cortical model of stereopsis and three-dimensional surface perception. Vision Research, 43, 801-829.

    Abstract: A laminar cortical model of stereopsis and later stages of 3D surface perception is developed and simulated. The model describes how initial stages of monocular and binocular oriented filtering interact with later stages of 3D boundary formation and surface filling-in in the lateral geniculate nucleus (LGN) and cortical areas V1, V2, and V4. In particular, it details how interactions between layers 4, 3B, and 2/3A in V1 and V2 contribute to stereopsis, and clarifies how binocular and monocular information combine to form 3D boundary and surface representations. Along the way, the model modifies and significantly extends the disparity energy model. Neural explanations are given for psychophysical data concerning: contrast variations of dichoptic masking and the correspondence problem, the effect of interocular contrast differences on stereoacuity, Panum's limiting case, the Venetian blind illusion, stereopsis with polarity-reversed stereograms, da Vinci stereopsis, and various lightness illusions. By relating physiology to psychophysics, the model provides new functional insights and predictions about laminar cortical architecture.

    Keywords: Cortical model; Depth perception; Stereopsis; Surface perception; Cortical layers; Lightness perception; Monocular binocular interactions

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2002-02 . Available in PDF (GroHow02VisRes.pdf)(1.84MB).


  • Grossberg, S. and Seitz, A., (2003). Laminar Development of Receptive Fields, Maps, and Columns in Visual Cortex: The Coordinating Role of the Subplate. Cerebral Cortex, in press.

    How is development of cortical maps in V1 coordinated across cortical layers to form cortical columns? Previous neural models propose how maps of orientation (OR), ocular dominance (OD), and related properties develop in V1. These models show how spontaneous activity, before eye opening, combined with correlation learning and competition, can generate maps similar to those found in vivo. These models have not discussed laminar architecture or how cells develop and coordinate their connections across cortical layers. This is an important problem since anatomical evidence shows that clusters of horizontal connections form, between iso-oriented regions, in layer 2/3 before being innervated by layer 4 afferents. How are orientations in different layers aligned before these connections form? Anatomical evidence demonstrates that thalamic afferents wait in the subplate for weeks before innervating layer 4. Other evidence shows that ablation of the cortical subplate interferes with the development of OR and OD columns. The model proposes how the subplate develops OR and OD maps, which then entrain and coordinate the development of maps in other lamina. The model demonstrates how these maps may develop in layer 4 by using a known transient subplate-to-layer 4 circuit as a teacher. The model subplate also guides the early clustering of horizontal connections in layer 2/3, and the formation of the interlaminar circuitry that forms cortical columns. It is shown how layer 6 develops and helps to stabilize the network when the subplate atrophies. Finally the model clarifies how BDNF manipulations may influence cortical development.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-02-006. Available in PDF (SeitzGrossbergCerCor.pdf)(5.85Mb)


           Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2001-12. Available in PDF (RaiGro03CerCor.pdf)(293KB)


    2002

     

    Abstract: This article further develops the FACADE neural model of 3-D vision and figure-ground perception to quantitatively explain properties of the McCollough effect. The model proposes that many McCollough effect data result from visual system mechanisms whose primary function is to adaptively align, through learning, boundary and surface representations that are positionally shifted, due to the process of binocular fusion. For example, binocular boundary representations are shifted by binocular fusion relative to monocular surface representations, yet the boundaries must become positionally aligned with the surfaces to control binocular surface capture and filling-in. The model also includes perceptual reset mechanisms that use habituative transmitters in opponent processing circuits. Thus the model shows how McCollough effect data may arise from a combination of mechanisms that have a clear functional role in biological vision. Simulation results with a single set of parameters quantitatively fit data from thirteen experiments that probe the nature of achromatic/chromatic and monocular/binocular interactions during induction of the McCollough effect. The model proposes how perceptual learning, opponent processing, and habituation at both monocular and binocular surface representations are involved, including early thalamocortical sites. In particular, it explains the anomalous McCollough effect utilizing these multiple processing sites. Alternative models of the McCollough effect are also summarized and compared with the present model.

     

    Key Words: Color perception, Binocular vision, Perceptual learning, Visual cortex, Aftereffects, Boundary segmentation, Surface representation, McCollough effect, FACADE model

     

    Preliminary version appears as Boston University Technical Report, CAS/CNS TR-01-004. Available in PDF (GroHwaMin02.pdf)(254Kb) and in Gzipped Postscript (GroHwaMin02.ps.gz)(171Kb).


    2001

    ·         Granger, E., Rubin, M. , Grossberg, S. , and Lavoie, P. (2001). A what-and-where fusion neural network for recognition and tracking of multiple radar emitters. Neural Networks, in press.

    Abstract: A neural network recognition and tracking system is proposed for classification of radar pulses in autonomous Electronic Support Measure systems. Radar type information is combined with position-specific information from active emitters in a scene. Type-specific parameters of the input pulse stream are fed to a neural network classifier trained on samples of data collected in the field. Meanwhile, a clustering algorithm is used to separate pulses from different emitters according to position-specific parameters of the input pulse stream. Classifier responses corresponding to different emitters are separated into tracks, or trajectories, one per active emitter, allowing for more accurate identification of radar types based on multiple views of emitter data along each emitter trajectory. Such a What-and-Where fusion strategy is motivated by a similar subdivision of labor in the brain. The fuzzy ARTMAP neural network is used to classify streams of pulses according to radar type using their functional parameters. Simulation results obtained with a radar pulse data set indicate that fuzzy ARTMAP compares favorably to several other approaches when performance is measured in terms of accuracy and computational complexity. Incorporation into fuzzy ARTMAP of negative match tracking (from ARTMAP-IC) facilitated convergence during training with this data set. Other modifications improved classification of data that include missing input pattern components and missing training classes. Fuzzy ARTMAP was combined with a bank of Kalman filters to group pulses transmitted from different emitters based on their position-specific parameters, and with a module to accumulate evidence from fuzzy ARTMAP responses corresponding to the track defined for each emitter. Simulation results demonstrate that the system provides a high level of performance on complex, incomplete and overlapping radar data.

    Key Words: radar, electronic support measures, pattern recognition, data fusion, neural network, ARTMAP, Kalman filter.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-029. Available in PDF (GraRubGroLav01.pdf)(514Kb) and in Gzipped Postscript (GraRubGroLav01.ps.gz)(271Kb).


    ·         Grossberg, S., Mingolla, E., and Viswanathan, L. (2001). Neural dynamics of motion integration and segmentation within and across apertures. Vision Research, in press.

    Abstract: A neural model is developed of how motion integration and segmentation processes, both within and across apertures, compute global motion percepts. Figure-ground properties, such as occlusion, influence which motion signals determine the percept. For visible apertures, a line's terminators do not specify true line motion. For invisible apertures, a line's intrinsic terminators create veridical feature tracking signals. Sparse feature tracking signals can be amplified before they propagate across position and are integrated with ambiguous motion signals within line interiors. This integration process determines the global percept. It is the result of several processing stages: Directional transient cells respond to image transients and input to a directional short-range filter that selectively boosts feature tracking signals with the help of competitive signals. Then a long-range filter inputs to directional cells that pool signals over multiple orientations, opposite contrast polarities, and depths. This all happens no later than cortical area MT. The directional cells activate a directional grouping network, proposed to occur within cortical area MST, within which directions compete to determine a local winner. Enhanced feature tracking signals typically win over ambiguous motion signals. Model MST cells which encode the winning direction feed back to model MT cells, where they boost directionally consistent cell activities and suppress inconsistent activities over the spatial region to which they project. This feedback accomplishes directional and depthful motion capture within that region. Model simulations include the barberpole illusion, motion capture, the spotted barberpole, the triple barberpole, the occluded translating square illusion, motion transparency and the chopsticks illusion. Qualitative explanations of illusory contours from translating terminators and plaid adaptation are also given.

    Key Words: motion integration, motion segmentation, motion capture, aperture problem, feature tracking, MT, MST, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-004. Available in PDF (GroMinLav01.pdf)(4.3Mb), HTML (GroMinLav01.html)(1.8Mb) and in Gzipped Postscript (GroMinLav01.ps.gz)(3.5Mb).


    ·         Grossberg, S. and Williamson, J.R. (2001). A neural model of how horizontal and interlaminar connections of visual cortex develop into adult circuits that carry out perceptual groupings and learning. Cerebral Cortex, 11, 37-58.

    Abstract: A neural model suggests how horizontal and interlaminar connections in visual cortical areas V1 and V2 develop within a laminar cortical architecture and give rise to adult visual percepts. The model suggests how mechanisms that control cortical development in the infant lead to properties of adult cortical anatomy, neurophysiology and visual perception. The model clarifies how excitatory and inhibitory connections can develop stably by maintaining a balance between excitation and inhibition. The growth of long-range excitatory horizontal connections between layer 2/3 pyramidal cells is balanced against that of short-range disynaptic interneuronal connections. The growth of excitatory on-center connections from layer 6-to-4 is balanced against that of inhibitory interneuronal off-surround connections. These balanced connections interact via intracortical and intercortical feedback to realize properties of perceptual grouping, attention and perceptual learning in the adult, and help to explain the observed variability in the number and temporal distribution of spikes emitted by cortical neurons. The model replicates cortical point spread functions and psychophysical data on the strength of real and illusory contours. The on-center, off-surround layer 6-to-4 circuit enables top-down attentional signals from area V2 to modulate, or attentionally prime, layer 4 cells in area V1 without fully activating them. This modulatory circuit also enables adult perceptual learning within cortical area V1 and V2 to proceed in a stable way.

    Keywords:None

    On-line version available from Cerebral Cortex (GroWil2000.pdf) (499Kb) in PDF format.


    ·         Pack, C., Grossberg, S. and Mingolla, E., (2001). A neural model of smooth pursuit control and motion perception by cortical area MST. Journal of Cognitive Neuroscience, in press.

    Abstract: Smooth pursuit eye movements are eye rotations that are used to maintain fixation on a moving target. Such rotations complicate the interpretation of the retinal image, because they nullify the retinal motion of the target, while generating retinal motion of stationary objects in the background. This poses a problem for the oculomotor system, which must track the stabilized target image, while suppressing the optokinetic reflex, which would move the eye in the direction of the retinal background motion, which is opposite to the direction in which the target is moving. Similarly, the perceptual system must estimate the actual direction and speed of moving objects in spite of the confounding effects of the eye rotation. This paper proposes a neural model to account for the ability of primates to accomplish these tasks. The model simulates the neurophysiological properties of cell types found in the superior temporal sulcus of the macaque monkey, specifically the medial superior temporal (MST) region. These cells process signals related to target motion, background motion, and receive an efference copy of eye velocity during pursuit movements. The model focuses on the interactions between cells in the ventral and dorsal subdivisions of MST, which are hypothesized to process target velocity and background motion, respectively. The model explains how these signals can be combined to explain behavioral data about pursuit maintenance and perceptual data from human studies, including the Aubert-Fleischl phenomenon and the Filehne Illusion, thereby clarifying the functional significance of neurophysiological data about these MST cell properties. It is suggested that the connectivity used in the model may represent a general strategy used by the brain in analyzing the visual world.

    Keywords: smooth pursuit, eye movements, visual cortex, MST, motion, optokinetic nystagmus, target tracking, perception

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-023. Available in HTML (PacGroMin.JCN00.html)(89Kb), PDF (PacGroMin.JCN00.pdf)(172Kb), or in Gzipped Postscript (PacGroMin.JCN00.ps.gz)(452Kb).


    ·         Raizada, R. and Grossberg, S. (2001). Context-sensitive bindings by the laminar circuits of V1 and V2: A unified model of perceptual grouping, attention, and orientation contrast. Visual Cognition, in press.

    Abstract: A detailed neural model is presented of how the laminar circuits of visual cortical areas V1 and V2 implement context-sensitive binding processes such as perceptual grouping and attention. The model proposes how specific laminar circuits allow the responses of visual cortical neurons to be determined not only by the stimuli within their classical receptive fields, but also to be strongly influenced by stimuli in the extra-classical surround. This context-sensitive visual processing can greatly enhance the analysis of visual scenes, especially those containing targets that are low contrast, partially occluded, or crowded by distractors. We show how interactions of feedforward, feedback and horizontal circuitry can implement several types of contextual processing simultaneously, using shared laminar circuits. In particular, we present computer simulations which suggest how top-down attention and preattentive perceptual grouping, two processes that are fundamental for visual binding, can interact, with attentional enhancement selectively propagating along groupings of both real and illusory contours, thereby showing how attention can selectively enhance object representations. These simulations also illustrate how attention may have a stronger facilitatory effect on low contrast than on high contrast stimuli, and how pop-out from orientation contrast may occur. The specific functional roles which the model proposes for the cortical layers allow several testable neurophysiological predictions to be made. The results presented here simulate only the boundary grouping system of adult cortical architecture. However, we also discuss how this model contributes to a larger neural theory of vision which suggests how intracortical and intercortical feedback help to stabilize development and learning within these cortical circuits. Although feedback plays a key role, fast feedforward processing is possible in response to unambiguous information. Model circuits are capable of synchronizing quickly, but context-sensitive persistence of previous events can influence how synchrony develops. Although these results focus on how the interblob cortical processing stream controls boundary grouping and attention, related modeling of the blob cortical processing stream suggests how visible surfaces are formed, and modeling of the motion stream suggests how transient responses to scenic changes can control long-range apparent motion and also attract spatial attention.

    Keywords: visual cortex, attention, grouping, orientation contrast, cortical layers, V1, V2, feedback, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-008. Available in PDF (RaiGro01.pdf) (197Kb) and G'zipped Postscript (RaiGro01.ps.gz) (441Kb). Mac browsers may have difficulty with the equations but the document should print properly.


    2000

    ·         Ajemian, R. , Bullock, D., and Grossberg, S. (2000). Kinematic coordinates in which motor cortical cells encode movement direction. Journal of Neurophysiology, 84, 2191-2203.

    Abstract: During goal-directed reaching in primates, a sensorimotor transformation generates a dynamical pattern of muscle activation. Within the context of this sensorimotor transformation, a fundamental question concerns the coordinate systems in which individual cells in the primary motor cortex (MI) encode movement direction. This article develops a mathematical framework that computes, as a function of the coordinate system in which an individual cell is hypothesized to operate, the spatial preferred direction (pd) of that cell as the arm configuration and hand location vary. Three coordinate systems are explicitly modeled: Cartesian spatial, shoulder-centered, and joint angle. The computed patterns of spatial pds are distinct for each of these three coordinate systems, and experimental approaches are described which can capitalize upon these differences to compare the empirical adequacy of each coordinate hypothesis. One particular experiment involving curved motion (Hocherman and Wise 1991) was analyzed from this perspective. Out of the three coordinate systems tested, the assumption of joint angle coordinates best explained the observed cellular response properties. The mathematical framework developed in this paper can also be used to design new experiments that are capable of disambiguating between a given set of specified coordinate hypotheses.

    Keywords:None

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-021. Available in PDF AjeBulGro00.pdf (221Kb), and Gzipped postscript AjeBulGro00.ps.gz (683Kb).


    ·         Grossberg, S. (2000a). The complementary brain: Unifying brain dynamics and modularity. Trends in Cognitive Sciences, 4, 233-246.

    Abstract: How are our brains functionally organized to achieve adaptive behavior in a changing world? This article presents one alternative to the computer metaphor suggesting that brains are organized into independent modules. Evidence is reviewed that brains are organized into parallel processing streams with complementary properties. Hierarchical interactions within each stream and parallel interactions between streams create coherent behavioral representations that overcome the complementary deficiencies of each stream and support unitary conscious experiences. This perspective suggests how brain design reflects the organization of the physical world with which brains interact. Examples from perception, learning, cognition, and action are described, and theoretical concepts and mechanisms by which complementarity is accomplished are presented.

    Keywords: modularity, What and Where processing, visual cortex, motor cortex, reinforcement, recognition, attention, learning, expectation, volition, speech, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-003. Available in HTML Gro00a.html (151Kb) and PDF Gro00a.pdf (210Kb).


    ·         Grossberg, S. Grossberg, S. (2000b). The imbalanced brain: From normal behavior to schizophrenia. Biological Psychiatry, 48, 81-98.

    Abstract: An outstanding problem in psychiatry concerns how to link discoveries about the pharmacological, neurophysiological, and neuroanatomical substrates of mental disorders to the abnormal behaviors that they control. A related problem concerns how to understand abnormal behaviors on a continuum with normal behaviors. During the past few decades, neural models have been developed of how normal cognitive and emotional processes learn from the environment, focus attention and act upon motivationally important events, and cope with unexpected events. When arousal or volitional signals in these models are suitably altered, they give rise to symptoms that strikingly resemble negative and positive symptoms of schizophrenia, including flat affect, impoverishment of will, attentional problems, loss of a theory of mind, thought derailment, hallucinations, and delusions. The present article models how emotional centers of the brain, such as the amygdala, interact with sensory and prefrontal cortices (notably ventral, or orbital, prefrontal cortex) to generate affective states, attend to motivationally salient sensory events, and elicit motivated behaviors. When such emotional centers become depressed, negative symptoms of schizophrenia emerge in the model. Such emotional centers are modeled as opponent affective processes, such as fear and relief, whose response amplitude and sensitivity are calibrated by an arousal level and chemical transmitters that slowly inactivate, or habituate, in an activity-dependent way. These opponent processes exhibit an Inverted-U whereby behavior become depressed if the arousal level is chosen too large or too small. The negative symptoms are due to the way in which the depressed opponent process interacts with other circuits throughout the brain.

    Keywords:schizophrenia, arousal, prefrontal cortex, amygdala, opponent process, neural networks

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-018. Available in HTML Gro.BioPsych2000.html (91Kb), PDF Gro.BioPsy2000.pdf (342Kb), and and Gzipped postscript Gro.BioPsy2000.ps.gz (489Kb).


    ·         Grossberg, S. (2000c). How hallucinations may arise from brain mechanisms of learning, attention, and volition. Invited article for the Journal of the International Neuropsychological Society, 6, 579-588.

    Abstract: This article suggests how brain mechanisms of learning, attention, and volition may give rise to hallucinations during schizophrenia and other mental disorders. The article suggests that normal learning and memory are stabilized through the use of learned top-down expectations. These expectations learn prototypes that are capable of focusing attention upon the combinations of features that comprise conscious perceptual experiences. When top-down expectations are active in a priming situation, they can modulate or sensitize their target cells to respond more effectively to matched bottom-up information. They cannot, however, fully activate these target cells. These matching properties are shown to be essential towards stabilizing the memory of learned representations. The modulatory property of top-down expectations is achieved through a balance between top-down excitation and inhibition. The learned prototype is the excitatory on-center in this top-down network. Phasic volitional signals can shift the balance between excitation and inhibition to favor net excitatory activation. Such a volitionally-mediated shift enables top-down expectations, in the absence of supportive bottom-up inputs, to cause conscious experiences of imagery and inner speech, and thereby to enable fantasy and planning activities to occur. If these volitional signals become tonically hyperactive during a mental disorder, the top-down expectations can give rise to conscious experiences in the absence of bottom-up inputs and volition. These events are compared with data about hallucinations. The article predicts where these top-down expectations and volitional signals may act in the laminar circuits of visual cortex, and by extension in other sensory and cognitive neocortical areas, and how the level of abstractness of learned prototypes may covary with the abstractness of hallucinatory content. A similar breakdown of volition may lead to declusions of control in the motor system.

    Key Words: hallucinations, learned expectations, attention, learning, adaptive resonance theory

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-020. Available in HTML, Gro99.hall.html PDF Gro99.hall.pdf (6.7Mb), and Gzipped postscript Gro99.hall.ps.gz (4.9Mb).


    ·         Grossberg, S. (2000d). Linking mind to brain: The mathematics of biological intelligence. Notices of the American Mathematical Society, 47, 1361-1372.

    Abstract: How our brains give rise to our minds is one of the most intriguing questions in all of science. We are now living in a particularly interesting time to consider this question. This is true because, during the last decade, mathematical models about how the brain works have finally succeeded in quantitatively simulating the experimentally recorded dynamics of individual cells in identified brain circuits and the behaviors that these circuits control. The models that have led to these successes incorporate qualitatively new ideas about how the brain is organized to achieve the remarkable flexibility and power of biological intelligence. These advances represent significant challenges and opportunities for mathematicians for several reasons. One obvious reason is that the models themselves are interesting mathematical objects. These models are typically defined by high-dimensional dynamical systems in which several types of nonlinear feedback operate across multiple spatial and temporal scales. They represent systems which are capable of autonomously adapting, or self-organizing, in response to a rapidly changing and unpredictable world.

    Keywords: None

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-016. Available in HTML Gro00.AMS.html (126Kb) and PDF Gro00.AMS.pdf (112Kb).


    ·         Grossberg, S. and Myers, C.W. (2000) The resonant dynamics of speech perception: Interword integration and duration-dependent backward effects. Psychological Review, 4, 735-767.

    Abstract: How do listeners integrate temporally distributed phonemic information into coherent representations of syllables and words? During fluent speech perception, variations in the durations of speech sounds and silent pauses can produce different perceived groupings. For example, increasing the silence interval between the words ``gray chip'' may result in the percept ``great chip'', whereas increasing the duration of fricative noise in ``chip'' may alter the percept to ``great ship'' (Repp et al., 1978). The ARTWORD neural model quantitatively simulates such context-sensitive speech data. In ARTWORD, sequential activation and storage of phonemic items in working memory provides bottom-up input to unitized representations, or list chunks, that group together sequences of items of variable length. The list chunks compete with each other as they dynamically integrate this bottom-up information. The winning groupings feed back to provide top-down support to their phonemic items. Feedback establishes a resonance which temporarily boosts the activation levels of selected items and chunks, thereby creating an emergent conscious percept. Because the resonance evolves more slowly than working memory activation, it can be influenced by information presented after relatively long intervening silence intervals. The same phonemic input can hereby yield different groupings depending on its arrival time. Processes of resonant transfer and competitive teaming help determine which groupings win the competition. Habituating levels of neurotransmitter along the pathways that sustain the resonant feedback lead to a resonant collapse that permits the formation of subsequent resonances.

    Keywords: speech perception, word recognition, consciousness, adaptive resonance, context effects, consonant perception, neural network, silence duration, working memory, categorization, clustering.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-001. Available in HTML GroMye00.html (350Kb), PDF GroMye00.pdf (504Kb) and Gzipped postscript GroMye00.ps.gz (303Kb).


    ·         Grossberg, S. and Paine, R.W.(2000). A neural model of corticocerebellar interactions during attentive imitation and predicitve learning of sequential handwriting movements. Neural Networks, 13, 999-1046.

    Abstract: Much sensory-motor behavior develops through imitation, as during the learning of handwriting by children (Burns, 1962; Freeman, 1914; Iacoboni et al., 1999). Such complex sequential acts are broken down into distinct motor control synergies, or muscle groups, whose activities overlap in time to generate continuous, curved movements that obey an inverse relation between curvature and speed. How are such complex movements learned through attentive imitation? Novel movements may be made as a series of distinct segments that may be quite irregular both in space and time, but a practiced movement can be made smoothly, with a continuous, often bell-shaped, velocity profile. How does learning of sequential movements transform reactive imitation into predictive, automatic performance? A neural model is summarized here which suggests how parietal, frontal, and motor cortical mechanisms, such as difference vector encoding, interact with adaptively-timed, predictive cerebellar learning during movement imitation and predictive performance (Grossberg & Paine, 2000). To initiate movement, visual attention shifts along the shape to be imitated and generates vector movement using motor cortical cells. During such an imitative movement, cerebellar Purkinje cells with a spectrum of delayed response profiles sample and learn the changing directional information and, in turn, send that learned information back to the cortex and eventually to the muscle synergies involved. If the imitative movement deviates from an attentional focus around a shape to be imitated, the visual system shifts attention, and may make an eye movement back to the shape, thereby providing corrective directional information to the arm movement system. This imitative movement cycle repeats until the corticocerebellar system can accurately drive the movement based on memory alone. A cortical working memory buffer transiently stores the cerebellar output and releases it at a variable rate, allowing speed scaling of learned movements which is limited by the rate of cerebellar memory readout. Movements can be learned at variable speeds if the density of the spectrum of delayed cellular responses in the cerebellum varies with speed. Learning at slower speeds facilitates learning at faster speeds. Size can be varied after learning while keeping the movement duration constant (isochrony). Context-effects arise from the overlap of cerebellar memory outputs. The model is used to simulate key psychophysical and neural data about learning to make curved movements, including a decrease in writing time as learning progresses; generation of unimodal, bell-shaped velocity profiles for each movement synergy; size and speed scaling with preservation of the letter shape and the shapes of the velocity profiles; an inverse relation between curvature and tangential velocity; and a Two-Thirds Power Law relation between angular velocity and curvature.

    Keywords:Handwriting, learning, imitation, cerebellum, frontal cortex, working memory, motor control

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-009. Available in PDF GroPai00.pdf (988Kb) and Gzipped postscript GroPai00.ps.gz (745Kb).


    ·         Grossberg, S. and Raizada, R. (2000). Contrast-sensitive perceptual grouping and object-based attention in the laminar circuits of primary visual cortex. Vision Research, 40, 1413-1432.

    Recent neurophysiological studies have shown that primary visual cortex, or V1, does more than passively process image features using the feedforward filters suggested by Hubel and Wiesel. It also uses horizontal interactions to group features preattentively into object representations, and feedback interactions to selectively attend to these groupings. All neocortical areas, including V1, are organized into layered circuits. We present a neural model showing how the layered circuits in areas V1 and V2 enable feedforward, horizontal, and feedback interactions to complete perceptual groupings over positions that do not receive contrastive visual inputs, even while attention can only modulate or prime positions that do not receive such inputs. Recent neurophysiological data about how grouping and attention occur and interact in V1 are simulated and explained, and testable predictions are made. These simulations show how attention can selectively propagate along an object grouping and protect it from competitive masking, and how contextual stimuli can enhance or suppress groupings in a contrast-sensitive manner.

    Keywords: attention, grouping, cortical layers, V1, V2, feedback, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-008. Available in PDF (GroRai99.pdf) (394Kb), and available in gzip'ed postscript (GroRai99.ps.gz) (186Kb).


    ·         Kelly, F.J. and Grossberg, S. (2000). Neural dynamics of 3-D surface perception: Figure-ground separation and lightness perception. Perception & Psychophysics, 62, 1596-1619.

    Abstract:This article develops the FACADE theory of three-dimensional (3-D) vision to simulate data concerning how two-dimensional (2-D) pictures give rise to 3-D percepts of occluded and occluding surfaces. The theory suggests how geometrical and contrastive properties of an image can either cooperate or compete when forming the boundary and surface representations that subserve conscious visual percepts. Spatially long-range cooperation and short-range competition work together to separate boundaries of occluding figures from their occluded neighbors, thereby providing sensitivity to T-junctions without the need to assume that T-junction "detectors" exist. Both boundary and surface representations of occluded objects may be amodally completed, while the surface representations of unoccluded objects become visible through modal processes. Computer simulations include Bregman-Kanizsa figure-ground separation, Kanizsa stratification, and various lightness percepts, including the Munker-White, Benary cross, and checkerboard percepts.

    Keywords: Amodal Completion, Depth Perception, Figure-Ground Perception, Lightness, Visual Cortex, Neural Network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-026. Note that the Appendix equations and Parameter Table are separate files due to size constraints. The paper is available in HTML KelGro2000.PP.html, PDF KelGro2000p.pdf, and Gzipped postscript KelGro2000p.ps.gz. The Appendix Equations are available in HTML KelGro2000.html, PDF KelGro2000.pdf, and Gzipped postscript KelGro2000.ps.gz. The Parameter Table is available separately in HTML table1.html, PDF table1.pdf, and Gzipped postscript table1.ps.gz.


    ·         Okatan, M. and Grossberg, S. (2000). Frequency-dependent synaptic potentiation, depression, and spike timing induced by Hebbian pairing in cortical pyramidal neurons. Neural Networks, 13, 699-708.

    Abstract: Experiments by Markram and Tsodyks (1996) have suggested that Hebbian pairing in cortical pyramidal neurons potentiates or depresses the transmission of a subsequent presynaptic spike train at steady-state depending on whether the spike train is of low frequency or high frequency, respectively. The frequency above which pairing induced a significant decrease in steady-state synaptic efficacy was as low as about 20 Hz and this value depends on such synaptic properties as probability of release and time constant of recovery from short-term synaptic depression. These characteristics of cortical synapses have not yet been fully explained by neural models, notably the decreased steady-state synaptic efficacy at high presynaptic firing rates. This article suggests that this decrease in synaptic efficacy in cortical synapses was not observed at steady-state, but rather during a transition period preceding it whose duration is frequency-dependent. It is shown that the time taken to reach steady-state may be frequency-dependent, and may take considerably longer to occur at high than low frequencies. As a result, the pairing-induced decrease in synaptic efficacy at high presynaptic firing rates helps to localize the firing of the postsynaptic neuron to a short time interval following the onset of high frequency presynaptic spike trains. This effect may "speed up the time scale" in response to high frequency bursts of spikes, and may contribute to rapid synchronization of spike firing across cortical cells that are bound together by associatively learned connections. Key Words: synaptic potentiation, synaptic depression, frequency-dependent synaptic plasticity, cortical pyramidal cells, Hebbian pairing, cortical synchronization

    Key Words: synaptic potentiation, synaptic depression, frequency-dependent synaptic plasticity, cortical pyramidal cells, Hebbian pairing, cortical synchronization

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-2000-003. Available in HTML (OkaGro01.htm) (300Kb), PDF (OkaGro01.pdf) (4.295Mb) and in Gzipped Postscript (OkaGro01.ps.gz)(5.253Mb).


    ·         Ross, Grossberg, S. and Mingolla, E. (2000). Visual cortical mechanisms of perceptual grouping: Interacting layers, networks, columns, and maps. Neural Networks, 13, 571-588.

    Abstract: The visual cortex has a laminar organization whose circuits form functional columns in cortical maps. How this laminar architecture supports visual percepts is not well understood. A neural model proposes how the laminar circuits of V1 and V2 generate perceptual groupings that maintain sensitivity to the contrasts and spatial organization of scenic cues. The model can decisively choose which groupings cohere and survive, even while balanced excitatory and inhibitory interactions preserve contrast-sensitive measures of local boundary likelihood or strength. In the model, excitatory inputs from LGN activate layers 4 and 6 of V1. Layer 6 activates an on-center off-surround network of inputs to layer 4. Together these layer 4 inputs preserve analog sensitivity to LGN input contrasts. Layer 4 cells excite pyramidal cells in layer 2/3 which activate monosynaptic long-range horizontal excitatory connections between layer 2/3 pyramidal cells, and short-range disynaptic inhibitory connections mediated by smooth stellate cells. These interactions support inward perceptual grouping between two or more boundary inducers, but not outward grouping from a single inducer. These boundary signals feed back to layer 4 via the layer 6-to-4 on-center off-surround network. This {\em folded feedback} joins cells in different layers into functional columns while selecting winning groupings. Layer 6 in V1 also sends top-down signals to LGN using an on-center off-surround network, which suppresses LGN cells that do not receive feedback, while selecting, enhancing, and synchronizing activity of those that do. The model is used to simulate psychophysical and neurophysiological data about perceptual grouping, including various Gestalt grouping laws.

    Keywords: visual cortex, perceptual grouping, cortical layers, cortical columns, cortical maps, cortical feedback, illusory contours, V1, V2, LGN

    This paper is only available in print. Requests for a reprint of CAS/CNS-TR-98-023 can be sent to amos@cns.bu.edu. Please include street address.

    1999

    ·         Baloch, A.A., Grossberg, S. Mingolla, E., and Nogueira, C.A.M. (1999). A neural model of first-order and second-order motion perception and magnocellular dynamics. Journal of the Optical Society of America A, 16, 953-978.

    Abstract: A neural model of motion perception simulates psychophysical data concerning first-order and second-order motion stimuli, including the reversal of perceived motion direction with distance from the stimulus (Gamma display), and data about directional judgments as a function of relative spatial phase or spatial and temporal frequency. Many other second-order motion percepts that have been ascribed to a second non-Fourier processing stream can also be explained in the model by interactions between ON and OFF cells within a single, neurobiologically interpreted magnocellular processing stream. Yet other percepts may be traced to interactions between form and motion processing streams, rather than to processing within multiple motion processing streams. The model hereby explains why monkeys with lesions of of the parvocellular layers, but not the magnocellular layers, of the lateral geniculate nucleus (LGN) are capable of detecting the correct direction of second-order motion, why most cells in area MT are sensitive to both first-order and second-order motion, and why after APB injection selectively blocks retinal ON bipolar cells, cortical cells are sensitive only to the motion of a moving bright bar's trailing edge. Magnocellular LGN cells show relatively transient responses while parvocellular LGN cells show relatively sustained responses. Correspondingly, the model bases its directional estimates on the outputs of model ON and OFF transient cells that are organized in opponent circuits wherein antagonistic rebounds occur in response to stimulus offset. Center-surround interactions convert these ON and OFF outputs into responses of lightening and darkening cells that are sensitive both to direct inputs and to rebound responses in their receptive field centers and surrounds. The total pattern of activity increments and decrements is used by subsequent processing stages (spatially short-range filters, competitive interactions, spatially long-range filters, and directional grouping cells) to determine the perceived direction of motion.

    Keywords: first-order motion, second-order motion, Gamma display, visual cortex, lateral geniculate nucleus, magnocellular processing, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-96-030. Available in gzip'ed postscript (101Kb). The figure files are available in 3 separate files. Figure file 1. Size in gzip'ed postscript (1,494Kb) Figure file 2. Size in gzip'ed postscript (372Kb) Figure file 3. Size in gzip'ed postscript (611Kb)


    ·         Boardman, I., Grossberg, S., Myers, C., and Cohen, M. (1999). Neural dynamics of perceptual order and context effects for variable-rate speech syllables. Perception & Psychophysics, 61, 1477-1500.

    Abstract: How does the brain extract invariant properties of variable-rate speech? A neural model, called PHONET, is developed to explain aspects of this process and, along the way, data about perceptual context effects. For example, in consonant vowel (CV) syllables such as /ba/ and /wa/, an increase in the duration of the vowel can cause a switch in the percept of the preceding consonant from /w/ to /b/ (Miller and Liberman, 1979). The frequency extent of the initial formant transitions of fixed duration also influences the percept (Schwab, Sawusch, and Nusbaum, 1981). PHONET quantitatively simulates over 98% of the variance in these data using a single set of parameters. The model also qualitatively explains many data about other perceptual context effects. In the model, C and V inputs are filtered by parallel auditory streams that respond preferentially to transient and sustained properties of the acoustic signal before being stored in parallel working memories. A lateral inhibitory network of onset- and rate-sensitive cells in the transient channel extracts measures of frequency transition rate and extent. Greater activation of the transient stream can increase the processing rate in the sustained stream via a cross-stream automatic gain control interaction. The stored activities across these gain-controlled working memories provide a basis for rate-invariant perception, since the transient-to-sustained gain control tends to preserve the relative activities across the transient and sustained working memories as speech rate changes. Comparisons with alternative models tested suggest the fit can not be attributed to the simplicity of the data. Brain analogs of model cell types are described. Key words: context effects, CV syllable, formant transition, neural network, phonetic perception, speech perception, vowel, consonant, sustained cells, transient cells, transition duration, transition rate, working memory.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-004. Available in PDF BoaGroMyeCoh99.pdf and Gzipped postscript BoaGroMyeCoh99.ps.gz (5.4Mb).


    ·         Brown, J., Bullock, D., and Grossberg, S. (1999). How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues. Journal of Neuroscience, 19, 10502-10511.

    Abstract: After classically conditioned learning, dopaminergic cells in the substantia nigra pars compacta (SNc) respond immediately to unexpected conditioned stimuli (CS) but omit formerly seen responses to expected unconditioned stimuli, notably rewards. These cells play an important role in reinforcement learning. A neural model explains the key neurophysiological properties of these cells before, during, and after conditioning, as well as related anatomical and neurophysiological data about the pedunculo-pontine tegmental nucleus (PPTN), lateral hypothalamus, ventral striatum, and striosomes. The model proposes how two parallel learning pathways from limbic cortex to the SNc, one devoted to excitatory conditioning (through the ventral striatum, ventral pallidum, and PPTN) and the other to adaptively timed inhibitory conditioning (through the striosomes), control SNc responses. The excitatory pathway generates CS-induced excitatory SNc dopamine bursts. The inhibitory pathway prevents dopamine bursts in response to predictable reward-related signals. When expected rewards are not received, striosomal inhibition of SNc that is unopposed by excitation results in a phasic drop in dopamine cell activity. The adaptively timed inhibitory learning uses an intracellular spectrum of timed responses that is proposed to be similar to adaptively timed cellular mechanisms in the hippocampus and the cerebellum. These mechanisms are proposed to include metabotropic glutamate receptor-mediated Ca2+ spikes that occur with different delays in striosomal cells. A dopaminergic burst in concert with a Ca2+ spike is proposed to potentiate inhibitory learning. The model provides a biologically predictive alternative to temporal difference (TD) conditioning models and explains substantially more data than alternative models.

    Keywords: dopamine, substantia nigra, reward, basal ganglia, conditioning, pedunculopontine tegmental nucleus, lateral hypothalamus, striosomes, adaptive timing

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-99-011. Download this paper as: PDF (BroBulGro99.pdf) or Gzipped Postscript (BroBulGro99.ps.gz) (152Kb).


    ·         Gancarz, G. and Grossberg, S. (1999). A neural model of saccadic eye movement control explains task-specific adaptation. Vision Research, 39, 3123-3143.

    Abstract: Multiple brain learning sites are needed to calibrate the accuracy of saccadic eye movements. This is true because saccades can be made reactively to visual cues, attentively to visual or auditory cues, or planned in response to memory cues using visual, parietal, and prefrontal cortex, as well as superior colliculus, cerebellum, and reticular formation. The organization of these sites can be probed by displacing a visual target during a saccade. The resulting adaptation typically shows incomplete and asymmetric transfer between different tasks. A neural model of saccadic system learning is developed to explain these data, as well as data about saccadic coordinate changes.

    Keywords: saccades, learning, superior colliculus, cerebellum, parietal cortex, prefrontal cortex, reticular formation, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-024. Download this paper as a Gzipped Postscript GanGro99.ps.gz (138Kb).

     


     

    ·         Grossberg, S. (1999). How does the cerebral cortex work? Learning, attention and grouping by the laminar circuits of visual cortex. Spatial Vision, 12, 163-186.

    Abstract: The organization of neocortex into layers is one of its most salient anatomical features. These layers include circuits that form functional columns in cortical maps. A major unsolved problem concerns how bottom-up, top-down, and horizontal interactions are organized within cortical layers to generate adaptive behaviors. This article models how these interactions help visual cortex to realize: (1) the binding process whereby cortex groups distributed data into coherent object representations; (2) the attentional process whereby cortex selectively processes important events; and (3) the developmental and learning processes whereby cortex shapes its circuits to match environmental constraints. New computational ideas about feedback systems suggest how neocortex develops and learns in a stable way, and why top-down attention requires converging bottom-up inputs to fully activate cortical cells, whereas perceptual groupings do not.

    Key Words: no key words

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-97-023. Available in HTML spvis5.html


    ·         Grossberg, S.(1999). The link between brain learning, attention, and consciousness. Consciousness and Cognition, 8, 1-44.

    Abstract: The processes whereby our brains continue to learn about a changing world in a stable fashion throughout life are proposed to lead to conscious experiences. These processes include the learning of top-down expectations, the matching of these expectations against bottom-up data, the focusing of attention upon the expected clusters of information, and the development of resonant states between bottom-up and top-down processes as they reach an attentive consensus between what is expected and what is there in the outside world. It is suggested that all conscious states in the brain are resonant states, and that these resonant states trigger learning of sensory and cognitive representations. The models which summarize these concepts are therefore called Adaptive Resonance Theory, or ART, models. Psychophysical and neurobiological data in support of ART are presented from early vision, visual object recognition, auditory streaming, variable-rate speech perception, somatosensory perception, and cognitive-emotional interactions, among others. It is noted that ART mechanisms seem to be operative at all levels of the visual system, and it is proposed how these mechanisms are realized by known laminar circuits of visual cortex. It is predicted that the same circuit realization of ART mechanisms will be found in the laminar circuits of all sensory and cognitive neocortex. Concepts and data are summarized concerning how some visual percepts may be visibly, or modally, perceived, whereas amodal percepts may be consciously recognized even though they are perceptually invisible. It is also suggested that sensory and cognitive processing in the What processing stream of the brain obey top-down matching and learning laws that are often complementary to those used for spatial and motor processing in the brain's Where processing stream. This enables our sensory and cognitive representations to maintain their stability as we learn more about the world, while allowing spatial and motor representations to forget learned maps and gains that are no longer appropriate as our bodies develop and grow from infanthood to adulthood. Procedural memories are proposed to be unconscious because the inhibitory matching process that supports these spatial and motor processes cannot lead to resonance.

    Keywords: Learning, expectation, attention, adaptive resonance, neural network, procedural memory, consciousness, object recognition, speech perception.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-97-018. Availabe in PDF (Gro.concog98.pdf) and Gzipped postscript (Gro.concog98.ps.gz) (170Kb).


     

    Abstract: How does the visual cortex combine information from both eyes to generate perceptual representations of object surfaces? Important clues about this process may be derived from data about the perceived brightnesses of surface regions under binocular viewing conditions, including data about binocular brightness summation in response to ganzfelds, the U-shaped data of Fechner's Paradox that violates binocular brightness summation, and the effects of different combinations of monocular and binocular contours and surface luminance differences on threshold sensitivity to monocular flashes of light. How to reconcile these apparently contradictory data properties has been a severe challenge to previous models, and none has explained them all. The present article quantitatively simulates them all by further developing the FACADE vision model. Key model processes discount the illuminant and compute image contrasts in each monocular channel using shunting on-center off-surround networks; binocularly fuse these discounted monocular signals using shunting on-center off-surround networks with nonlinear excitatory and inhibitory signals; and use these binocularly fused activities to trigger filling-in of a binocular surface representation that represents perceived surface brightness. Previous models that have suggested explanations of subsets of these data are discussed.

    Keywords: brightness perception, binocular vision, Fechner's paradox, ganzfeld, neural networks, FACADE theory, BCS, FCS

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-018. Available in PDF (GroKel99.pdf) and Gzip'ed postscript GroKel99.ps.gz (549Kb). Appendix Equation file is currently a separate document: Available in PDF (GroKel99b.pdf) and G'zipped postscript. GroKel99b.ps.gz (25Kb).

     


    ·         Grossberg, S., Mingolla, E., and Pack C. (1999). A neural model of motion processing and visual navigation by cortical area MST. Cerebral Cortex, 9, 878-895.

    Abstract Cells in the dorsal medial superior temporal cortex (MSTd) process optic flow generated by self-motion during visually-guided navigation. A neural model shows how interactions between well-known neural mechanisms (log polar cortical magnification, Gaussian motion-sensitive receptive fields, spatial pooling of motion-sensitive signals, and subtractive extraretinal eye movement signals) lead to emergent properties that quantitatively simulate neurophysiological data about MSTd cell properties and psychophysical data about human navigation. Model cells match MSTd neuron responses to optic flow stimuli placed in different parts of the visual field, including position invariance, tuning curves, preferred spiral directions, direction reversals, average response curves, and preferred locations for stimulus motion centers. The model shows how the preferred motion direction of the most active MSTd cells can explain human judgments of self-motion direction (heading), without using complex heading templates. The model explains when extraretinal eye movement signals are needed for accurate heading perception, and when retinal input is sufficient, and how heading judgments depend on scene layouts and rotation rates.

    Keywords: optic flow, heading, pursuit, cortical magnification

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-97-015. Available in HTML GroMinPac99.html. Available in PDF GroMinPac99.pdf, and available in gzip'ed postscript GroMinPac99.ps.gz (1,125Kb).


    ·         Grossberg, S. and Williamson, J.R. (1999). A self-organizing neural system for learning to recognize textured scenes. Vision Research, 39, 1385-1406.

    Abstract: A self-organizing ARTEX model is developed to categorize and classify textured image regions. ARTEX specializes the FACADE model of how the visual cortex sees, and the ART model of how temporal and prefrontal cortices interact with the hippocampal system to learn visual recognition categories and their names. FACADE processing generates a vector of boundary and surface properties, notably texture and brightness properties, by utilizing multi-scale filtering, competition, and diffusive filling-in. Its context-sensitive local measures of textured scenes can be used to recognize scenic properties that gradually change across space, as well as abrupt texture boundaries. ART incrementally learns recognition categories that classify FACADE output vectors, class names of these categories, and their probabilities. Top-down expectations within ART encode learned prototy,pes that pay attention to expected visual features. When novel visual information creates a poor match with the best existing category prototype, a memory search selects a new category with which classify the novel data. ARTEX is compared with psychophysical data, and is bench marked on classification of natural textures and synthetic aperture radar images. It outperforms state-of-the-art systems that use rule-based, backpropagation, and K-nearest neighbor classifiers.

    Keywords: pattern recognition; boundary segmentation; surface representation; filling-in; texture classification; neural network; adaptive resonance theory

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-97-001. To download the paper in PDF (GroWil99.pdf) or in Gzipped Postscript GroWil99.ps.gz(1.2Mb).


    ·         Mingolla, E., Ross, W., and Grossberg, S., (1999) A neural network for enhancing boundaries and surfaces in synthetic aperture radar images. Neural Networks,12, 499-511.

    Abstract: A neural network system for boundary segmentation and surface representation, inspired by a new local-circuit model of visual processing in the cerebral cortex, is used to enhance images of range data gathered by a synthetic aperture radar (SAR) sensor. Boundary segmentation is accomplished by an improved Boundary Contour System (BCS) model which completes coherent boundaries that retain their sensitivity to image contrasts and locations. A Feature Contour System (FCS) model compensates for local contrast variations and uses the compensated signals to diffusively fill-in surface regions within the BCS boundaries. Image noise pixels that are not supported by BCS boundaries are hereby eliminated. More generally, BCS/FCS processing normalizes input dynamic range, reduces noise, and enhances contrasts between surface regions. BCS/FCS processing hereby makes structures such as motor vehicles, roads, and buildings more salient to human observers than in original imagery. The new BCS model improves image enhancement with significant reductions in processing time and complexity over previous BCS applications. The new system also outperforms several established techniques for image enhancement.

    Keywords:Synthetic aperture radar, neural network, image enhancement, boundary segmentation, diffusion

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-032. Note: This paper is difficult to download due to the large size of the figure file. To download only the paper in PDF (MinRosGro99.pdf) or in Gzipped Postscript MinRosGro99.ps.gz(55Kb). To download the figures as a separate large file in PDF MinRosGro99.figs.pdf or in Gzipped Postscript MinRosGro99.figs.ps.gz(2,998Kb). Send mail requests to Robin Amos for a paper copy.


    1998

    ·         Cisek, P., Grossberg, S. and Bullock, D. (1998). A Cortico-Spinal Model of Reaching and Proprioception under Multiple Task Constraints. Journal of Cognitive Neuroscience, 10, 425-444.

    Abstract: A model of cortico-spinal trajectory for voluntary reaching movements is developed to functionally interpret a broad range of behavioral, physiological, and anatomical data. The model simulates how arm movements achieve their remarkable effficiency and accuracy in response to widely varying positional, speed, and force constraints. A key issue in arm movement control is how the brain copes with such a wide range of movement contexts. The model suggests how the brain may set automatic and volitional gating mechanisms to vary the balance of static and dynamic feedback information to guide the movement command and to compensate for external forces. For example, with increasing movement speed, the system shifts from a feedback position controller to a feedforward trajectory generator with superimposed dynamics compensation. Simulations of the model illustrate how it reproduces the effects of elastic loads on fast movements, endpoint errors in Coriolis fields, and several effects of muscle tendon vibration, including tonic and antagonist vibration reflexes, position and movement illusions, effects of obstructing the tonic vibration reflex, and reaching undershoots caused by antagonist vibration.

    Keywords: no keywords

    This paper available by mail only. Contact Robin Amos.


    • Gancarz, G. and Grossberg, S. (1998). A neural model of the saccade generator in the reticular formation. Neural Networks, 11, 1159-1174.

    Abstract: A neural model is developed of the neural circuitry in the reticular formation that is used to generate saccadic eye movements. The model simulates the behavior of identified cell types--such as long-lead burst neurons, short-lead excitatory and inhibitory burst neurons, omnipause neurons, and tonic neurons-- under many experimental conditions. Simulated phenomena include: saccade staircases, duration and amplitude of cell discharges for saccades of variable amplitude, component stretching to achieve straight oblique saccades, saturation of saccade velocity after saturation of saccade amplitude in response to high stimulation frequencies, tradeoffs between saccade velocity and duration to generate constant saccade amplitude, conservation of saccade amplitude in response to sufficiently brief stimulation of omnipause neurons, and high velocity smooth eye movements evoked by high levels of electrical stimulation of the superior colliculus. Previous saccade generator models have not explained this range of data. These models have also invoked mechanisms for which no neurophysiological evidence has been forthcoming, such as resetable integrators, perfect integrators, or target position movement commands. The present model utilizes only known reticular formation neurons. It suggests that a key part of the feedback loop within the saccade generator is realized by inhibitory feedback from short-lead to long-lead burst neurons, in response to excitatory feedforward signals from long-lead to short-lead burst neurons. When this property is combined with opponent interactions between agonist and antagonist muscle-controlling neurons, and motor error, or vector, inputs from the superior colliculus and other saccade-controlling brain regions, all of the above data can be explained. Taken together, these components generate a saccade reset cycle whereby activation of long-lead burster neurons inhibits omnipause neurons and thereby disinhibits short-lead excitatory burst neurons. The excitatory short-lead burst neurons can then respond to excitatory inputs from the long-lead burst neurons. Outputs from the excitatory short-lead burst neurons are integrated by the tonic cells while they also inhibit the long-lead burst neurons via inhibitory burst interneurons. When this inhibition is complete, the omnipause neurons are disinhibited. The omnipause neurons can then, once again, inhibit the short-lead burst neurons, whose inhibition of the long-lead burst neurons is thereby removed. The saccadic cycle can then begin again. In response to sustained electrical input, this cycle generates a staircase of identical saccades whose properties match the data much better than the staircases proposed by alternative models. A comparative analysis of the hypotheses and predictive capabilities of other saccade generator models is provided.

    Keywords: saccade generator, reticular formation, burst neurons, saccade staircase, neural network

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-98-014. Available in Gzipped postscript (2.35MB).


    Grossberg, S. and Pessoa, L. (1998). Texture segregation, surface representation, and figure-ground separation. Vision Research, 38, 2657-2684.

    Abstract: A widespread view is that most of texture segregation can be accounted for by differences in the spatial frequency content of texture regions. Evidence from both psychophysical and physiological studies indicate, however, that beyond these early filtering stages, there are stages of 3-D boundary segmentation and surface representation that are used to segregate textures. Chromatic segregation of element-arrangement patterns - as studied by Beck and colleagues - cannot be completely explained by the filtering mechanisms previously employed to account for achromatic segregation. An element arrangement pattern is composed of two types of elements that are arranged differently in different image regions (e.g., vertically on top and diagonally on bottom). FACADE theory mechanisms that have previously been used to explain data about 3-D vision and figure-ground separation are here used to simulate chromatic texture segregation data, including data with equiluminant elements on dark or light homogenous backgrounds, or backgrounds composed of vertical and horizontal dark or light stripes, or horizontal notched stripes. These data include the fact that segregation of patterns composed of red and blue squares decreases with increasing luminance of the interspaces. Asymmetric segregation properties under 3-D viewing conditions with the equiluminant elements close or far are also simulated. Two key model properties are a spatial impenetrability property that inhibits boundary grouping across regions with noncolinear texture elements, and a boundary-surface consistency property that uses feedback between boundary and surface representations to eliminate spurious boundary groupings and separate figures from their backgrounds.

    Keywords: figure-ground, perception, texture segregation, 3-D vision, grouping, boundary, surface, color vision, neural networks, filling-in

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-96-025. Available in gzip'ed postscript GroPes97.ps.gz (198Kb).


    1997

    The percepts known variously as the line motion illusion, motion induction, and transformational apparent motion have attracted a great deal of experimental interest, since they sensitively probe interactions between preattentive and attentive vision processes. The present article develops a neural model that qualitatively explains essentially all the data reported thus far, and quantitatively simulates key illustrative percepts. The model suggests how these data arise from neural mechanisms of preattentive boundary and surface formation, long-range apparent motion, form-motion interactions, and spatial attention. Direct cortical V1 $\rightarrow$ MT motion mechanisms and interstream V1 $\rightarrow$ V2 $\rightarrow$ MT form-motion interactions play a key role in model explanations. It is also proposed how preattentive long-range motion mechanisms can engage spatial attention as part of the motion capture process that solves the aperture problem.

    Preliminary version appears as Boston University Technical Report, CAS/CNS-TR-96-020. Available in PDF BalGro97.pdf (340Kb) and Gzip'ed postscript BalGro97.ps.gz (171Kb).


    ·         Grossberg, S., and Cohen, M. (1997). Parallel auditory filtering by sustained and transient channels separates coarticulated vowels and consonants IEEE Transactions on Speech and Audio Processing, 5, 301-318.

    Abstract: A neural model of peripheral auditory processing is described and used to separate features of coarticulated vowels and consonants After preprocessing of speech via a filterbank, the model splits into two parallel channels, a sustained channel and a transient channel. The sustained channel is sensitive to relatively stable parts of the speech waveform, notably synchronous properties of the vocalic portion of the stimulus. It extends the dynamic range of eighth nerve filters using coincidence detectors that combine operations of raising to a power, rectification, delay, multiplication, time averaging, and preemphasis. The transient channel is sensitive to critical features at the onsets and offsets of speech segments. It is built up from fast excitatory neurons that are modulated by slow inhibitory interneurons. These units are combined over high- frequency and low-frequency ranges using operations of rectification, normalization, mulUplicative gating, and opponent processing. Detectors sensitive to frication and to onset or offset of stop consonants and vowels are described. Model properties are characterized by mathematical analysis and computer simulations. Neural analogs of model cells in the cochlear nucleus and inferior colliculus are noted, as are psychophysical data about perception of CV syllables that may be explained by the sustained-transient channel hypothesis. The proposed sustained and transient processing seems to be an auditory analog of the sustained and transient processing that is known to occur in vision.


    Paper copies of these and Grossberg's other papers available from: Robin Amos
    Maintained by Susanne Daley
    June 18,2003