Computational Modeling of Orientation Tuning Dynamics in Monkey Primary Visual Cortex

advertisement
Computational Modeling of Orientation Tuning Dynamics in
Monkey Primary Visual Cortex
M. C. Pugh, D. L. Ringach, R. Shapley and M. J. Shelley
August 20, 1999
to appear in the Journal of Computational Neuroscience
please address correspondence to:
Dr. M. C. Pugh, Department of Mathematics, University of Pennsylvania
209 South 33rd Street, Philadelphia, PA 19104-6395
Phone: (215) 898-8472, FAX: (215) 573-4063, Email: mpugh@math.upenn.edu
D. L. Ringach, Psychology Department, University of California at Los Angeles, Los Angeles,
CA 90095
R. Shapley, Center for Neural Science, New York University, NY, NY 10003
M. J. Shelley, Courant Inst. for Mathematical Sciences, NYU, NY, NY 10012
Abstract
In the primate visual pathway, orientation tuning of neurons is first observed in the primary
visual cortex. The LGN cells that comprise the thalamic input to V1 are not orientation tuned,
but some V1 neurons are quite selective. Two main classes of theoretical models have been
offered to explain orientation selectivity: feedforward models, in which inputs from spatially
aligned LGN cells are summed together by one cortical neuron; and feedback models, in which
an initial weak orientation bias due to convergent LGN input is sharpened and amplified by
intracortical feedback. Recent data on the dynamics of orientation tuning, obtained by a crosscorrelation technique, may help to distinguish between these classes of models. To test this
possibility, we simulated the measurement of orientation tuning dynamics on various receptive
field models, including a simple Hubel-Wiesel type feedforward model: a linear spatio-temporal
filter followed by an integrate-and-fire spike generator. The computational study reveals that
simple feedforward models may account for some aspects of the experimental data, but fail to explain many salient features of orientation tuning dynamics in V1 cells. A simple feedback model
of interacting cells is also considered. This model is successful in explaining the appearance of
Mexican-hat orientation profiles, but other features of the data continue to be unexplained.
Keywords: Cortical dynamics, orientation tuning, monkey, primary visual cortex, layers.
1
Introduction
How visual cells in the primary visual cortex (area V1) become selective to the orientation of a visual
stimulus is an important question in visual neuroscience (see, for example, Das (1996); Reid and
Alonso (1996); Sompolinsky and Shapley (1997)). The question is how V1 cells become strongly
tuned for orientation since it is known that cells in the thalamus are not. Hubel and Wiesel, who
first discovered the property of orientation tuning in V1 cells, proposed a simple circuitry to explain
orientation tuning (Hubel and Wiesel, 1962, 1968). They suggested that the inputs from LGN cells
with spatially aligned receptive fields are summed together by one cortical neuron. This spatial
alignment of receptive fields generates a preference for a particular orientation in the cortical cell.
We refer to this, and similar proposals, as feedforward models. An alternate class of models are
feedback (or recurrent) models, which suggest that the feedforward tuning is initially weak and only
provides a small orientation bias to the cortical cell. This bias is then amplified and sharpened
by intracortical feedback (Wörgötter et al., 1991; Douglas et al., 1995; Somers et al., 1995; BenYishai et al., 1995; Maex and Orban, 1996; Carandini and Ringach, 1997). Clearly, these models
are not mutually exclusive; there is a continuum of models depending on the relative strength of the
feedforward and feedback connections (Ben-Yishai et al., 1995).
In this work we study recent data on the dynamics of orientation tuning obtained by a crosscorrelation technique (Ringach et al., 1997b). The method used for measuring tuning dynamics is a
modification of the cross-correlation method and shares many of its properties (de Boer and Kuyper,
1968; Marmarelis and Marmarelis, 1978; Jones and Palmer, 1987; Victor, 1992; Reid et al., 1997).
We believe such data may help to determine where the cortical circuitry is best represented along
the continuum between feedforward models and feedback models.
As shown below, there are clear differences of dynamics between cell classes in V1. The aim of
this paper is to consider these qualitative features, rather than fitting models to data. To achieve this
goal, we simulated the experimental protocol on various types of receptive field models, including a
simple Hubel-Wiesel type feedforward model: a spatio-temporal filter followed by an integrate-andfire spike generator. The computational study reveals that simple feedforward models may account
for some aspects of the experimental data, but fail to explain many salient features, including the
Mexican-hat profile of the orientation distributions and the active inhibition seen at off-optimal
orientations (“cross-orientation” inhibition). We then studied a feedback network of interacting
cells. A similar model is used in the earlier work of Ben-Yishai et al. (1995). We find that the model
with lateral coupling better approximates the behavior of some V1 neurons. However, some aspects
of the experimental data remain unexplained.
This paper is organized as follows. We first describe the experiment and the most salient features
of the data. A theoretical and computational study of the feed-forward model is then presented. We
then study of the behavior of a small network of interacting oriented neurons. Finally, we discuss
the relevance of our findings to the modeling of orientation tuning and cortical function.
2
The Experiment
The visual stimuli used are made up of sinusoidal luminance gratings,
I(x, y; θ, φ) = A (1 + sin [ ω(x cos θ − y sin θ) − φ ]) .
(1)
Here, (x, y) ∈ [−1, 1]×[−1, 1] represents coordinates in visual space and θ and φ are fixed parameters
whose values range from −90◦ to 90◦ and from 0◦ to 360◦ respectively. The angle θ corresponds to
the orientation of the grating in space while φ represents a spatial phase relative to the origin. The
variable ω represents the spatial frequency of the grating. A is the mean luminance of the grating
and 0 ≤ ≤ 1 is its contrast. Images of uniform luminance, A, with no spatial structure ( = 0),
will be referred to as “blanks”.
The visual stimulus used in the experiments of Ringach et al. (1997a) consists of a rapid sequence
of randomly selected images, each of which is either a “blank” or a grating at one of N possible
angles: θ1 = −90◦ , θ2 = −90◦ + ∆θ, . . . ,θN = −90◦ + (N − 1)∆θ, and one of M possible phases:
φ1 = 0◦ , . . . , φM = 0◦ +(M −1)∆φ; where ∆θ = 180◦/N and ∆φ = 360◦/M . Blanks are represented
by a dummy orientation with index N + 1; θN+1 = blank.
A random sequence of images, {Ii }, is generated from an associated random sequence of orientation angles and phases {(θn(i) , φm(i))} where n(i) is chosen from {1, . . . , N + 1} with probability
1/(N + 1) and m(i) is chosen from {1, . . . , M } with probability 1/M . θ and φ are piecewise constant
functions in time, changing every T ms (the duration of one “frame” in the movie). In the physiology
experiments T was 16.6 ms, N = 18, and M = 8.
The responses of cells to the visual stimulus are recorded and analyzed using the following
cross-correlation technique. First, a correlation time τ ≥ 0 is selected. An (N + 1) × M array of
counters, C, is zeroed. We read through the recorded spike train. If a spike occurred at time t we
find the orientation, θi , and spatial phase, φj , of the grating shown at time t − τ . We then increment
the corresponding counter, C(i, j; τ ), by one. If the image was a blank, then i = N + 1 and j is
chosen randomly from 1 . . . M . We apply this procedure to each spike in the spike train. In this
way, all the spikes are represented in the counter array.
P
P
Averaging over spatial phases, we define P r(θi ; τ ) = j C(i, j; τ )/ i,j C(i, j; τ ). P r(θi ; τ ) is
the relative frequency at which a grating with orientation θi appeared in the stimulus sequence
τ ms before a spike. Because the relative frequency approximates the probability distribution, in
the following we call P r(θ; τ ) the “probability” that a grating at angle θ appeared τ ms before a
spike. The dynamics of orientation tuning are studied by measuring how P r(θi ; τ ) evolves with the
correlation time τ . Blanks provide a dynamic baseline to which the responses to gratings can be
compared. In our interpretation, if P r(θi ; τ ) > P r(blank; τ ) , for a fixed τ , then the orientation θi
is excitatory relative to the blank. Similarly, if P r(θi ; τ ) < P r(blank; τ ) then θi is inhibitory.
How will P r(θi ; τ ) behave as a function of τ ? For τ = 0, for example, we do not expect to see
any influence of the visual stimulus on the output train of action potentials since there is a time
delay between the retina and the evoked activity of cortical cells. Therefore, P r(θ; 0) should be a
uniform (flat) distribution. Similarly, for very large correlation times τ we also expect to obtain a
uniform distribution for P r(θ; τ ); this is because cells have only finite memory. For intermediate
values of τ one would expect to see a smooth peaked distribution which represents the preference
3
of the cell for particular orientations. There will be one value of τ for which the cell will be better
tuned than at any other time; we will refer to this value as the “optimal τ ”.
Experimental Results
In this section we describe some of the salient features of the experimental results in Ringach et al.
(1997a, 1998) that we would like the mathematical models to explain. No obvious differences were
found between the dynamics of simple and complex cells. Both classes of neurons exhibited the range
of behaviors described below. The perspective plots in Fig. 1 illustrate typical results obtained from
orientation selective cells in monkey V1 at different values of the correlation time τ , with τ given
beside each graph. Each plot represents the measurements from a single cell. The graphs within the
perspective plots show the distributions P r(θ; τ ). The dashed line represents the baseline given by
the probability of a blank, P r(blank; τ ). The range of values for the correlation times τ was chosen
to bracket the response period for each cell. The x-axis represents the orientation θ between −90◦
and 90◦ , and the y-axis represents the value of P r(θ; τ ). The vertical scale bar in the lower graph
of each perspective plot has a height of 1/(N + 1), which is the value for a uniform distribution on
the orientation domain.
4C α
2+3
(a)
0
90
85
57
65
41
95
65
75
49
50
45
105
73
85
57
115
81
95
65
60
55
125
89
105
73
135
97
115
81
70
(e)
125
89
65
4B
(d)
97
80
-90
4A
(c)
75
1/(N+1)
0
5
(b)
75
49
55
65
41
40
33
45
33
55
35 ms
25 ms
35 ms
25 ms
45 ms
-90
0
90
-90
0
90
-90
0
90
-90
0
90
Orientation (deg)
Figure 1:
Examples of orientation tuning dynamics from monkey V1 cells. (a ) Orientation tuning dynamics from a cell in
layer 4Cα. The graphs within the perspective plots show the distributions P r(θ; τ) at different values of τ. The dashed line
represents the baseline given by the probability of a blank, P r(blank; τ). The x-axis represents the orientation θ between
−90◦ and 90◦ , and the y-axis represents the value of P r(θ; τ). The vertical scale bar in the lower graph of each perspective
plot has a height of 1/(N + 1), which is the value for a uniform distribution on the orientation domain; (b) Orientation
dynamics from a cell in layer 2+3 ; (c) Orientation dynamics from a cell in layer 5; (d ) Orientation tuning from a cell in
layer 4A. The profile at 57 ms is “transparent” with the profile at 65 msec indicated by a dotted line; (e) Orientation tuning
from a cell in layer 4B.
Fig. 1(a) illustrates the response obtained from a cell located in layer 4Cα, one of the main
input layers of V1. For τ < 40 ms and τ > 75 ms the distributions are uniform. The distribution
develops into a unimodal distribution and peaks at τ = 55 ms. The distribution’s peak occurs at
the same angle as the preferred angle of the cell as found using conventional drifting grating stimuli.
Notice that, in this case, the probability of the orientation orthogonal to the preferred one and the
probability of a “blank” behave in a very similar way. In fact, this behavior does not hold for all
layer 4C cells; there are cells for which the blanks and the orthogonal orientation do not have similar
dynamics. More intricate dynamical responses are observed outside layer 4C (Ringach et al., 1997b);
see Figs. 1(b–e).
4
Consider the responses depicted in Fig. 1(b). For τ < 40 ms and τ > 90 ms the distributions
are uniform. The response develops and peaks at τ = 57 ms. The distribution’s peak occurs at the
same angle as the preferred angle of the cell as found using conventional drifting grating stimuli.
The distribution relaxes to a uniform profile around τ = 73 ms, at which time the blank is more
probable than any angle. We observe an “inversion” of the probability profile at τ = 81 ms. At this
time the probability of the preferred orientation of the cell is lower than that of any other angle;
instead, the most probable angle τ = 81 ms before a spike is the angle orthogonal to the preferred
angle. Many cells show such “inversions”, and this is one of the properties we would like the models
to explain.
Another important aspect of the behavior shown in Fig. 1(b) is the dynamics of the probability
of a “blank”. Initially, for τ < 57 ms, the probability of a blank behaves in a very similar way to
the probability of the orientation orthogonal to the preferred angle. However, this is not the case
for larger values of τ . For example, for 57 ms ≤ τ ≤ 73 ms the probability of an angle that is not
close to the preferred angle is less than the probability of a “blank”. This suggests active inhibition
at off-optimal orientations. We note that this behavior also “inverts” at τ = 81 ms. An explanation
for the dynamics of this baseline is another goal of the mathematical modeling.
The results shown in Fig. 1(c) show a response pattern similar to that seen in Fig. 1(b). A
unimodal distribution develops and peaks at τ = 65 ms followed by a slight “inversion” of the
distribution at τ > 115 ms. However, we also see that for 75 ms ≤ τ ≤ 105 ms the distribution is
clearly bimodal; it has local maxima at the preferred orientation and at the orientation orthogonal
to it and has local minima flanking the preferred orientation. We refer to these profiles as “Mexicanhat” shaped — observe the shape at τ = 75 ms. Such distributions are characteristic of many of the
well-tuned cells found in V1. The dynamics of the “blank” baseline in Fig. 1(c) is quite similar to
that in Fig. 1(b). In particular, for 65 ms ≤ τ ≤ 85 ms off-preferred angles appear to be inhibited –
their probabilities are below the blank baseline.
In Fig. 1(d) we present another example of a cell that develops a Mexican-hat distribution
profile. Early in the dynamics, for τ = 41 ms, the distribution is quite broad. The distribution
then sharpens and by τ = 57 ms the cell is much better tuned. A Mexican-hat profile is evident
at τ = 65 ms. Finally, in Fig. 1(e) we present another example of a well-tuned cell that develops a
Mexican-hat profile at τ = 95 ms and which later, at τ = 115 ms, exhibits an “inversion”.
The dynamical features shown in Fig. 1 are typical of V1 cells. The goal of this study is
to explore what kind of models may explain the phenomena of “inversions”, the development of
Mexican-hat shaped profiles, and the dynamics of the “blank” baseline.
A Feed-forward model
We first present a feed-forward model that has been considered extensively in the literature as a
good candidate to explain the spatio-temporal properties of simple cells (Movshon et al., 1978; Citron
and Emerson, 1983; Jones and Palmer, 1987; McLean and Palmer, 1989; DeAngelis et al., 1993a;
McLean et al., 1994; Reid et al., 1997). It consists of a linear spatio-temporal filter followed by an
5
integrate-and-fire spike generator with leakage:
Z t ZZ
dv
(t) = −λ(v(t) − vr ) + DC +
H(x, y, t − s)I(x, y, s) dx dy ds.
dt
0
(2)
H(x, y, t) is the impulse response of a linear spatio-temporal filter, corresponding to a linearized
model of the input from the LGN, incorporating inputs from both on-center and off-center cells
(Palmer and Davis, 1981; Ferster, 1986; Heeger, 1992b,a). I(x, y, s) is the image shown at time
s and λ is the rate of leakage (1/λ is the membrane time constant). The leakage has been set
to zero in most of the computations presented here; although we do discuss the effects of having
nonzero leakage in various cases. DC represents DC input and vr is the resting potential. A spike
is generated when the membrane voltage variable v(t) reaches the firing threshold of −50 mV, the
voltage is then reset to vr = −70 mV (Connors et al. (1982); McCormick et al. (1985); Somers et al.
(1995); M et al. (1996)). In the model we also mimic the effects of an inhibitory reversal potential:
the voltage is not allowed to decrease past a lower barrier of −90 mV (see appendix).
We assume that the kernel H is space-time separable, hence
Z t
dv
(t) = −λ(v(t) − vr ) + DC +
G(t − s)r(s)ds.
dt
0
(3)
where r(s) is the spatial dot product of the image at time s and a spatial kernel,
ZZ
r(s) =
K(x, y; θK , φK ) I(x, y, s) dx dy.
(4)
The spatial kernel is taken to be a windowed Gabor function (Marcelja, 1980; Daugman, 1985),
d2
K(x, y; θK , φK ) = K0 W (d) e− L2 sin( ω(x cos θK − y sin θK ) − φK ).
(5)
This two-dimensional spatial kernel is parametrized by an orientation angle θK and phase φK (the
preferred orientation and phase of the simulated cell). In the above, ω is the spatial frequency of
the carrier, and d2 = x2 + y2 . The function is windowed to reflect that in the experiment the stimuli
are shown in a circular patch: W (d) = 1 if d < 1 and 0 otherwise. The length-scale L determines
the rate of decay away from the center of the receptive field. Fig. 2(a) shows the Gabor function
when ω = 3π, θK = 0, φK = 0◦ and Lω = 4.2. We use these parameters for the rest of the article.
(Ringach et al., 1997c) found Gabor-like receptive fields with multiple subfields in monkey V1.
In our simulations we use a biphasic temporal kernel:

 ae (t/τ )5 e−t/τ
3
G(t) =
 ae (t/τ )5 e−t/τ − ai (t−.05) e−(t−.05)/τ
τ
for 0 ≤ t ≤ .05
for t ≥ .05
(6)
where ae = 1.67, ai = 16.7, τ = .01 sec (see Fig. 2(b)). The temporal kernel is chosen so that
R∞
0 G(t) dt = 1 and max(G)/ min(G) ∼ −5, with its maximum occurring at 50 ms, its minimum
around 80 ms, and G decaying to zero around 150 ms (Gielen et al., 1981). Such a choice of G
introduces both time lag and memory effects. G models a magnocellular cell; it has both a positive
and a negative response and has a positive mean.
6
(a)
(b)
1
30
20
0
10
0
-10
-1
-1
0
Space
0
1
100
200
300
Time (ms)
Figure 2:
Spatial and temporal filters of the feedforward model. (a ) The spatial kernel is the windowed Gabor function
used in the simulations (see Eqn. (5)). Plotted here is a cross-section of the kernel, at y = 0, −1 ≤ x ≤ 1. (b) The temporal
kernel is as given in Eqn (6).
1
Response
0.8
0.6
0.4
0.2
0
-90
-45
0
45
90
Orientation (deg)
Figure 3: Orientation tuning of the feedforward spatial kernel. The graph shows a plot of r0◦ (θ). It shows the orientation
tuning of the LGN input in the feedforward model.
Interesting behaviors and analytical results can already be observed in a system whose temporal
kernel is a δ-function. In this case, the evolution equation is local in time,
dv
(t) = −λ(v(t) − vr ) + DC + r(t).
dt
(7)
As shown in the Appendix, in this situation the cross-correlation functions can be studied analytically. We find that leakage can cause sharpening in the cross-correlation functions. Also, in the
absence of leakage there is instantaneous decorrelation; P r(θ, τ ) becomes flat for all τ > 0.
Computational Results for the Feed-forward Model
We now present results for the feedforward model. To consider a specific example, assume a spatial
kernel K(x, y; θK , φK ) as shown in Fig. 2(a). This kernel is odd-symmetric about x = 0. For the
visual stimulus we choose a new grating pattern every T = 17 ms. The gratings, described by
Eqn. (1), are selected from a family of gratings with 60 equally spaced orientation angles and 6
equally spaced spatial phases. The number of angles and phases is essentially arbitrary; there is no
biological limitation on how close two angles or phases can be before their difference is imperceptible.
For a fixed phase φ, we first compute the spatial inner product of a grating with the spatial kernel:
ZZ
rφ (θ) = A
K(x, y; 0◦, 0◦ ) sin ( ω(x cos θ − y sin θ) − φ ) dx dy.
(8)
Because the kernel K is odd-symmetric, rφ(θ) = cos(φ)r0◦ (θ). Fig. 3 shows r0◦ , on [−90◦ , 90◦). To
abbreviate, in the sequel we refer to the spatial response component rφ (θ) as simply the “response”.
7
The LGN input does not, on average, provide DC input because the six response functions r0 (θ),
r60◦ (θ), r120◦ (θ), r180◦ (θ), r240◦ (θ), and r300◦ (θ) sum together to zero on [−90◦ , 90◦). It follows that
the expected value of the LGN input at any time is zero. DC input is provided by the term DC.
The response scales as the product AK0 ; the product of the contrast, the mean luminance, and
P
the kernel “gain” K0 (see Eqn. 5). We choose K0 = 1.2013 so that
r0◦ (θi ) = N when A = = 1.
This choice is arbitrary and does not affect the results. We then fix the mean luminance A and vary
the contrast to yield firing rates in the experimentally-observed range: between 1 and 20 spikes/sec.
Given a random sequence of images, the voltage equation (3) is numerically integrated in time.
We use a modified second-order Runge-Kutta method (M. Shelley, unpublished results) with 1 ms
time-steps. The initial voltage is the resting potential vr = −70 mV. When the voltage reaches
an upper threshold of −50 mV, it is reset to vr and the spike is registered by being included in
the cross-correlation counters C(i, j; τ ) for τ between 1 and 340 ms. The voltage is not allowed to
decrease below −90 mV.
Fig. 4 shows the cross-correlation functions as a function of the correlation time τ . The crosscorrelation function P r(θ; τ ) rises and peaks at about 55 ms, then falls, returning to baseline at
75 ms, followed by an “inversion” between 75 ms and 115 ms. As shown in Fig. 1(b), “inversions”
are sometimes observed in the experimental data. We also considered temporal kernels different
from (6). We found that strictly positive kernels led to no inversion. The kernel (6) has a positive
mean with significantly more “positive” area than negative. We considered kernels with less of an
imbalance between positive and negative area, as well as kernels with zero mean. In such cases,
the inversion was sometimes replaced by a second, smaller, peak rising and falling between 75 ms
and 115 ms. This is caused by the visual stimulus containing images that are exactly out of phase
relative to the preferred phase.
If the system leading to the measurement were completely linear then P r(θ; τ ) would be a uniform distribution. (The averaging over phases would completely flatten P r(θ; τ ).) The resetting of
the voltage after a spike event is the nonlinearity in the model that causes P r(θ; τ ) to be nonuniform.
95
85
75
65
55
45
35
25
1/(N+1)
15 ms
0
-90
0
90
Orientation (deg)
Figure 4:
Orientation tuning dynamics of a feedforward model. The plotting conventions are the same as in Fig. 1. The
simulation was stopped when 200,000 spikes were collected. There was no leakage and no DC input, λ = 0, DC = 0, and
A = 994.6. This resulted in a mean interspike time of 108 ms (9.24 spikes/sec) with a standard deviation of 167 ms.
8
The dashed line corresponds to the baseline — the probability of having shown a “blank” τ ms
before a spike. The baseline dynamics coincide with the dynamics of the cross-correlation function
at the angle orthogonal to the preferred orientation, P r(90◦, τ ). This is expected because the dot
product between the spatial kernel and any grating orthogonal to the preferred orientation equals
zero, which is also the value generated by a “blank”.
The voltage equation (3) has three terms; the first term is a leakage term, the second term is
DC input, and the third term is LGN input. In the absence of leakage, there are two mechanisms
to adjust the mean firing rate — the DC input and the product A. Intuitively, if one holds A
fixed and increases the DC input, then one is decreasing the number of spikes that are correlated
to the visual stimulus. For this reason, one would expect that the cross-correlation functions would
show less tuning the stronger the DC input. We test this expectation by first fixing the contrast
= 1/2 and choosing the mean luminance A so that in the absence of DC input the mean firing rate
is around 1 spike per second; the DC input is then increased. The results for four values of the DC
component, resulting in mean firing rates of 15, 5.1, 2.4, and 1.1 spikes per second, are presented in
Fig. 5.
The cross-correlation functions from the four computations all show the same qualitative dynamics shown in Fig. 4. A peak rises at the preferred orientation, falls, and is followed by an
inversion. However, both the maximum height of the peak and the maximum depth of the inversion
decrease as the DC input increases. In Fig. 5(a), we plot P r(0◦, τ ) as a function of the correlation
time τ , for the four DC inputs. The peak achieves its maximum at 53 ms. The larger the DC input,
the smaller the peak. This suggests a loss of tuning. To confirm this, we plot P r(θ, 53) as a function
of angle θ in Fig. 5(b). Almost all correlation to the preferred angle 0◦ is lost at the highest level of
DC input, as expected.
Another way to vary the firing rate is to hold the DC input fixed and to then vary the quantity
A. Maximal tuning is expected for DC = 0; this is when the spikes are maximally correlated to
the visual stimulus. We found that fixing DC 6= 0 and then varying A yields the same phenomena.
The leakage λ was again set to zero. Four values of A were selected resulting in mean firing rates
of 23, 15, 7.1, and 1.1 spikes per second. The ratio of the smallest value of A to the largest value is
.14. So if the largest value corresponded to contrast = 1, then the smallest contrast was = .14.
The cross-correlation functions have qualitative dynamics similar to those shown in Fig. 4.
Fig. 6(a) shows P r(0◦; τ ) as a function of τ . This figure demonstrates that P r(0◦; 50) increases as
the firing rate decreases (as A decreases), suggesting a sharpening. To confirm this sharpening, in
Fig. 6(b) we plot P r(θ; 50) as a function of θ. For almost all firing rates, the peak is followed by an
inversion. Fig. 6(c) depicts P r(θ; 84) as a function of θ. A value of τ = 84 ms was chosen as being
the time at which the inversion was largest at the 15 spikes/second simulation. The inversion is seen
in all but the 1.1 spikes/sec simulation.
The above simulations were done with no leakage, λ = 0. We find that if one fixes A and DC
and then increases the leakage, and plots the analogues of the figures in Figs. 6, one sees the same
behavior with firing rate: the greater the leakage (and hence the lower the firing rate) the greater
the sharpening.
9
(a)
0.05
°
Pr(0 ;τ)
0.04
0.03
0.02
0.01
0
100
200
300
τ (msec)
(b)
0.05
Pr( θ;53)
0.04
0.03
0.02
0.01
0
-90
-45
0
45
90
Orientation (deg)
Figure 5: Dependence of dynamics on DC input when A = 270 is fixed. Four values of DC are selected, DC =
0, 40, 100, 300, resulting in mean firing rates of 1.1, 2.4, 5.1, and 15 spikes per second. There is no leakage, λ = 0. (a )
The curves represent the dynamics at the preferred orientation angle, P r(0◦ , τ) at the four different DC input levels. The
peak height decreases monotonically as the firing rate increase (as the DC input increases). (b) Orientation probability
distributions for the time at which tuning is greatest, 53 ms, as a function of DC input.
(b)
(c)
0.04
0.03
0.02
0.03
0.02
0.01
0.01
0.01
0
0
0.02
Pr(θ;84)
0.05
0.04
Pr(θ;51)
0.05
°
Pr(0 ;τ)
(a)
0
100
Time (ms)
200
-90
-45
0
45
Orientation (deg)
90
0
-90
-45
0
45
90
Orientation (deg)
Figure 6: Dependence of dynamics on mean spike rate. There is no leakage and no DC input, λ = 0, DC = 0. The mean
firing rate is increased by increasing A. Four values of A = 1934, 1371, 828.6, 276.2 are selected, resulting in mean firing
rates of 1.1, 7.1, 15, and 23 spikes per second. (a ) The curves represent the dynamics at the preferred orientation angle,
P r(0◦ , τ), for the four different mean spike rate levels. As the spike rate decreases the peak height increases. (b) Orientation
probability distributions for the time at which tuning is greatest, 51 ms, as a function of mean spike rate. (c) Orientation
probability distributions for the time at which an inversion occurs, 84 ms, at the different mean rates elicited by the four
different levels of A. Except for the lowest mean rate, all the distributions have the same inversion of response at θ = 0◦.
The computations presented above used a spatial kernel that is anti-symmetric about the y-axis:
φK = 0 in formula (5). If the spatial kernel is not anti-symmetric, then the response to a blank
and the response to a grating at the orthogonal orientation are not constrained to be equal to each
other: rφ (90◦) 6= rφ (blank). One may ask if this would make it possible to observe different behaviors
between the dynamics of “blanks” and of the orthogonal orientation in the cross-correlation functions
(as seen in the experimental data).
We tested this possibility by performing simulations using spatial kernels with φK = π/4, 2π/4, . . . 7π/4.
Here, the response rφ(90◦ ) to the orthogonal angle can differ from the response to a blank by up
to 1.1% (for φK = 5π/4). This variation is relatively small — the response to the preferred angle
10
differs from the response to a blank by 48%. Thus, while we found that other spatial kernels may
result in the baseline and orthogonal orientations having different dynamics the differences are slight
and far from the magnitude of the effect seen in the experimental data. Also, the experimental data
normally shows that, initially, all orientations are excitatory relative to the “blank”, while late in
the response many of the off-optimal orientations are below the “blank” baseline and appear to
be actively inhibited (Ringach et al. (1997a)). We did not observe this sort of dynamics in our
computations.
We conclude that a simple feedforward model is able to account for some of the features seen in
the dynamical experimental data. In particular, the “inversions” seen in the orientation distributions
may be a consequence of biphasic temporal impulse responses of the input signals. As magnocellular
LGN cells usually have biphasic temporal responses (Gielen et al., 1981), this implies that one should
observe “inversions” in layer 4Cα, which receives a strong magnocellular input. Interestingly, the
experimental data show that “inversions” are not seen in layer 4Cα or 4Cβ (Ringach et al., 1997b).
Thus, the feedforward model predicts a type of response in layer 4Cα which is not actually seen.
The simulations of the feed-forward model show that the probability of the angle orthogonal to
the preferred orientation and the probability of a “blank” behave in essentially the same way. This
behavior is seen in some of the V1 data, such as the cell in layer 4C shown in Fig. 1(a). However, as
depicted in Fig. 1(b–e), other cells do not show this type of behavior. Furthermore, the feed-forward
model also fails to generate the Mexican-hat profiles seen in many cells outside layer 4C.
A Network Model with Lateral Coupling
We now study a network of interacting neurons similar to the one proposed by Ben-Yishai et
al. (1995), Somers et al. (1995), and by Carandini and Ringach (1997). We consider a ring of
model cells in which each neuron is assumed to have a different preferred angle, and the population
of neurons spans the entire orientation range. The connections are all-to-all. As in Carandini and
Ringach (1997), we assume that each neuron has both excitatory and inhibitory effects on its neighbors — the population is not divided into excitatory and inhibitory subpopulations (see Fig. 7). We
make the further assumption that the strengths of the connections are a function of the difference
between neurons’s preferred angles; we did not consider connections that depend on the preferred
phases of the neurons or the physical positions of the neurons in V1. Labeling neurons by their
preferred angles, the strength of the excitatory (inhibitory) connection between two neurons, ae,
satisfies ae (θj , θk ) = ae(θj − θk ).
Figure 7:
The recurrent feedback network model. One layer of neurons interact with each other via excitatory (solid) and
inhibitory (dashed) connections. The preferred orientation for each neuron is indicated schematically with an oriented bar.
The connections are all-to-all; only the connections into the 0 ◦ neuron are shown here.
11
The voltage equations for the network are a natural generalization of the integro-differential
equation for a single neuron (3). The voltage of the kth neuron satisfies:
dvk
(t) = −λ(vk (t) − vr ) + DC
dt
Z t
ZZ
+
G(t − s)
K(x, y; θk , φk ) I(x, y, s) dx dy ds
0
+ Ce
X
Z
ae (θk − θj )
(10)
t
Ge(t − s) δ(s − ts (j)) ds
(11)
0
j
+ Ci
(9)
X
Z
ai (θk − θj )
t
Gi(t − s) δ(s − ts (j)) ds.
(12)
0
j
Ce and Ci are positive constants that reflect both the balance between excitatory lateral input and
inhibitory lateral input as well as the strength of lateral coupling relative to the LGN input. The
temporal kernel, G, and spatial kernel, K, are the same as in the feedforward model. All neurons
have the same resting potential, vr = −70 mV, are reset to vr when reaching the upper threshold
vth = −50 mV, and cannot decrease past the lower barrier of −90 mV. The same stimulus I(x, y, t)
is “shown” to all neurons, which are assumed to have spatially overlapping receptive fields. However,
since each neuron has a different preferred orientation, they will respond in different ways to the
stimulus.
The feedback connections between neurons are assigned temporal dynamics to provide a rough
model of synaptic transmission. This is done with temporal kernels Ge (t) and Gi(t) for the lateral
connections: Ge(t) = 20.84 (t/.0004)5e−t/.0004 and Gi(t) = 4.17 (t/.002)5e−t/.002. Both have been
R∞
chosen so that 0 Ge,i(t) dt = 1. We take the inhibitory connections to be slightly slower than the
excitatory connections (Somers et al., 1995; Ferster et al., 1996); Ge has its peak at 2 ms while Gi
has its peak at 10 ms.
Fig. 8(a) shows these temporal kernels. They are convolved against spike trains, represented as
a train of delta functions: ts (j) is the string of times at which the jth neuron has produced spike
events.
Finally, the excitatory effect of the jth neuron’s spike train on the kth neuron is weighted by
ae (θk − θj ) > 0 and the inhibitory effect is weighted by ai (θk − θj ) < 0. The functions ae and ai are
chosen to give a stronger weight to neurons with similar preferred orientations and a weaker weight
to neurons with dissimilar preferred orientations. Fig. 8(b) shows the lateral orientation kernels;
they have Gaussian shapes. We choose ai to be broader than ae (Somers et al., 1995; Ben-Yishai
et al., 1995).
In equation (9–12), the following quantities have been normalized:
Z
G(s) ds = 1
N ZZ
X
K(x, y; 0◦ , 0◦) sin [ω(x cos θk − y sin θk )] dx dy = N
k=1
Z
Z
Ge(s) ds = 1
Gi(s) ds = 1
X
ae (θk − θj ) = 1
j
X
j
12
ai (θk − θj ) = −1.
(a)
Gi (t), Ge (t)
400
300
200
100
0
0
10
20
30
40
Time (ms)
(b)
ai (θ), ae (θ)
0.05
0
-0.01
-90
-45
0
45
90
Orientation (deg)
Figure 8:
The temporal and spatial weighting of cortical excitation and inhibition in the feedback network model. (a ) The
lateral time kernels for excitation, Ge (t), and inhibition, Gi (t). These are graphs of synaptic current as a function of time
after a spike elicited a synaptic potential. The narrow and tall curve is Ge (t) and the shorter and broader curve is Gi (t).
(b) The orientation dependence of cortical excitation and inhibition are graphed as relative strength vs angle. Excitation
is plotted as positive, and is narrower than inhibition. Both are Gaussian functions centered at the preferred angle of the
2
neuron. For 16 neurons in a ring, ae (θ) = .5641 e−(θ/11.25)
2
and ai (θ) = −.1418 e−(θ/45) .
Equation (9–12) is of the form
dvk
dt
= −λ · leakage + DC + A · feed-forward
+Ce · lateral excitation + Ci · lateral inhibition,
(13)
(14)
Because of the above normalization, the relevant parameters are λ, A, Ce , and Ci. In the computations presented here there is no leakage (λ = 0). We verified that the observed phenomena persisted
in the presence of leakage. If Ce = Ci we call the network “balanced”. We define the “lateral
coupling ratio” as the ratio between the LGN input and the lateral coupling: (Ce + Ci)/(2A). We
performed computations for 16 neurons in a ring, with gratings at 80 possible angles and 6 possible
phases. The neurons all have the same preferred phase φk = 0◦ .
First we present orientation dynamics obtained in a balanced network with lateral coupling
ratio of .2451, which we found produces maximal sharpening. Fig. 9 shows the cross-correlation
functions as a function of the correlation time τ . This figure is to be compared to Fig. 4, which is
for the strictly feed-forward model. In both figures, the baseline and the orthogonal angle have the
same dynamics. Consider the probability distribution of both models at τ = 55 ms. The peak is
taller and narrower in Fig. 9 than in Fig. 4. At τ = 75 ms, there is a “Mexican-hat” type profile
in Fig. 9, which is absent in Fig. 4. The two figures show different dynamics for τ = 85 . . . 95 ms:
there is an inversion in Figure 4 that is absent in Figure 9.
Fig. 10(a) contrasts the cross-correlation functions of the feed-forward model at τ = 54 ms and
the network model at τ = 54 ms. 54 ms is optimal for both the feed-forward computation and the
13
97
87
77
67
57
47
37
27
1(N+1)
0
17 ms
-90
0
90
Orientation (deg)
Figure 9:
Orientation tuning dynamics of the feedback network model. The plotting conventions are the same as in Figs. 1
and 4. There is no leakage or DC input, λ, DC = 0. The visual stimulus has A = 416.2. The network is balanced,
Ce = Ci = 102, with a lateral coupling ratio of 0.245. The mean interspike time is 366 ms, mean firing rate 2.7 spikes per
second. A total of 148,000 spikes were sampled.
(a)
0.05
Pr(θ,54)
0.04
0.03
0.02
0.01
0
(b)
Pr(θ;75)
0.03
0.02
0.01
0
-90
-45
0
45
90
Orientation (deg)
Figure 10: Comparisons of orientation probability distributions for the feedforward and feedback models for the same
simulation as in Figure 9. (a ) Orientation probability distributions for feedback and feedforward models at their respective
peak times. Solid: P r(0◦ ; 54), feedback model. Dashed: P r(0◦ ; 54), strictly feed-forward. (b) Orientation probability
distributions P r(θ; 75) for feedback and feedforward models at 75 ms, time at which a Mexican-hat profile is seen in the
feedback model’s response. Solid: feedback model. Dashed: strictly feed-forward.
network computation in that P r(0◦ ; τ ) < P r(0◦ ; 54) for all other correlation times τ . There is a clear
sharpening in the profile; P r(0◦ ; 54) − P r(blank, 54) = .0337 versus P r(0◦; 54) − P r(blank, 54) =
.0420. The probability of the preferred orientation occurring 54 ms before a spike event has increased
by 25%. Fig. 10(b) compares the cross-correlation functions at τ = 75 ms. There is a “Mexicanhat” shaped distribution profile in the simulation of the feedback network. We considered a range
of different times-scales in the lateral coupling Ge(t) and Gi (t), including faster inhibition than
excitation. We found that this Mexican-hat shaped distribution is robust. The time-scales do affect
the degree of tuning; the maximum height of the peak and the time it occurs at are affected.
Next we present the effects of increasing the amount of lateral coupling in a balanced network,
Ce = Ci . We do this by holding A fixed and increasing Ce = Ci . Fig. 11 shows mean spike rate
14
Firing Rate (imp/sec)
16
14
12
10
8
6
4
2
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Lateral Coupling Ratio
Figure 11: Effects of lateral coupling in the feedback network model on mean firing rate. Seven runs with λ = DC = 0,
A = 416.2. Balanced network, Ce = Ci , lateral coupling ratios 0, .1225, .2451, .3063, .3676, .4901, .6127. The mean spike
rate increases as lateral coupling ratio is increased.
as a function of the lateral coupling ratio. As the lateral coupling ratio increases, the firing rate
increases. This is a nonlinear dynamic effect that results from the different time-scales of excitation
and inhibition in the temporal kernels Ge and Gi . Specifically, since the network is balanced, every
spike event will have both an inhibitory effect and an excitatory effect; the strengths are equal,
but the excitation is faster. This leads to a net increase of the mean firing rate. Increased lateral
coupling increases the mean firing rate in the ring model and is accompanied by a sharpening of
tuning (see below). This is to be contrasted with the feed-forward model where a decrease in the
mean firing rate is accompanied by sharpening.
0.05
0.03
°
Pr(0 ;τ)
0.04
0.02
0.01
0
0
100
200
τ (sec)
Effects of lateral coupling in a balanced feedback network model on orientation tuning dynamics. P r(0◦ ; τ)
is plotted as a function of τ. The lateral coupling ratios for these curves are: Solid:= 0,(feedforward); Dashed:= 0.1225;
Dot-dashed: = 0.2451; Dotted: = 0.3676; This is for the same simulations as in Figure 11, so the mean firing rates may be
gauged from that figure. The peak height of this function is an indicator of the maximal sharpness of orientation tuning.
Figure 12:
Fig. 12 shows P r(0◦, τ ) for four of the computations shown in Fig. 11. It is found that lateral
coupling provides sharpening. However, we find that when the ratio of lateral coupling to feedforward
becomes much larger, the degree of sharpening begins to decrease. From the four runs presented in
the figure, the largest degree of sharpening occurs for the lateral coupling ratio of .2451, the value
used in Figs. 9 and 10.
The feedback network model considered develops a Mexican-hat profile similar to that observed
in some V1 cells. However, the magnitude of the effect in the experimental data (see Fig. 1(c–
e)) is larger than the one generated by the model (Fig. 9). Furthermore, in the feedback network
model, as in the feedforward model, the probability of the angle orthogonal to the preferred one
behaves in a similar way as the probability of a “blank”. The feedback network model fails to show
15
“cross-orientation” inhibition effects seen in the data. We considered broader lateral inhibition,
including kernels with cross-orientation inhibition. We found that this did not cause the blanks and
the orthogonal angle to have different dynamics. The Mexican-hat profile persists, but is weaker.
1
Pr(θ;58)
0.8
0.6
0.4
0.2
0
-90
0
-45
45
90
Orientation (deg)
Figure 13: This graph presents normalized orientation probability distributions, at the time of maximum tuning, for
four different network computations, with different ratios of excitation to inhibition in the cortical lateral coupling. All
simulations have λ = DC = 0, and A = 416.2. In order to compare the half-widths, the curves are normalized by first
subtracting the probability of the orthogonal orientation and then scaled to have a maximum of one. The values used were
zero lateral coupling (dotted); Ce = Ci = 102, a balanced network (dashed); Ci = 3Ce = 306, net inhibitory (dot-dashed);
and Ci = 12Ce = 612, even more inhibitory (solid curve).
Up to this point we have calculated the responses of networks which had balanced excitation
and inhibition: Ce = Ci . We found somewhat better agreement with data in networks with stronger
inhibition than excitation, Ce < Ci. For these calculations, we first fixed the ratio Ce/Ci. We
then varied Ce and Ci , thus varying the lateral coupling ratio. We find sharpening is largest for a
particular coupling ratio. Fig. 13 presents the cross-correlation functions, at the time of maximum
tuning, for four different network computations where Ce and Ci were chosen in this manner. As
in Fig. 12, the balanced network Ce = Ci 6= 0 exhibits more sharpening than the feed-forward
model Ce = Ci = 0. Furthermore, it shows that there is even more sharpening in the networks with
3Ce = Ci, and 12Ce = Ci . For the feed-forward model, the maximum value of P r(0◦, τ )−P r(90◦ , τ )
is 0.0278. For the 12Ce = Ci computation, this maximum of P r(0◦, τ ) − P r(90◦, τ ) is 0.0424; i.e.,
lateral coupling which is mainly inhibitory significantly increases the probability of the preferred
orientation occurring before a spike.
Discussion
In this study, we evaluated previous models of cortical cells by calculating their orientation tuning
dynamics and comparing the result to experimental data. We first considered a well-studied model
of cortical simple cells, consisting of a linear spatio-temporal filter and a static integrate-and-fire
spike generator. This kind of model has been shown to explain many properties of simple cells
when the cortex is under a constant level of luminance and contrast gain control (Movshon et al.,
1978; Citron and Emerson, 1983; Jones and Palmer, 1987; McLean and Palmer, 1989; Albrecht and
Geisler, 1991; Maex and Orban, 1991; Reid et al., 1991; Tolhurst and Dean, 1991; DeAngelis et al.,
1993a,b; McLean et al., 1994; Reid et al., 1997). Therefore, it was of special interest to investigate
if such model can account for the phenomena observed in the dynamics of orientation tuning.
We found that this is not the case. The model failed in that (i ) it did not generate the Mexican-
16
hat profiles observed in some of the responses, (ii ) it was unable to generate situations where the
dynamics of the “blank” and that of the orthogonal orientation differed substantially, and (iii ) could
not be made to generate the active inhibition seen at off-optimal orientations in the late part of the
orientation tuning distributions. The model, however, exhibited “inversions”. This is somewhat
puzzling because “inversions” are not seen in layer 4C (Ringach et al., 1997b). Thus the feedforward
model, while capturing an aspect of the responses seen outside layer 4C, actually predicts an effect
that is not seen in this layer. It appears, therefore, that the behavior of cells in layer 4C (in particular
those in 4Cα which receive biphasic temporal input from the LGN magnocellular layers) are not
explained by the feedforward model. One possibility is that 4Cα cells do not fit the description of
a space-time separable feedforward kernel, as some of these neurons are known to be directionally
selective (Hawken et al., 1988).
In the second part of the paper we considered a class of models that has been proposed in the
literature as a putative mechanism for generating sharp orientation selectivity in the cortex (Andrews, 1965, 1967; Ben-Yishai et al., 1995; Somers et al., 1995; Douglas et al., 1995; Carandini and
Ringach, 1997). This class of models are described as a network model with “lateral” feedback,
in which neurons with similar preferred orientations excite each other while neurons with different
orientations preferences inhibit each other. We found that such a model generates orientation distributions which are more similar to those observed in the experimental data. In particular, the
model generates unimodal distributions early in the response (for τ < 50 msec) and Mexican-hat
shaped distributions later in the response (for τ > 60 msec). However, it is clear that the magnitude
of the Mexican-hat profiles is somewhat smaller in the simulations than in the real data. While
the feedback model seems to be a step in the right direction, there are additional features of the
data that remain unexplained. The network model, like the feedforward model, fails to provide an
adequate explanation for the fact that, in most cases, the orthogonal orientation and the “blank”
behave differently. In appears that in order to explain the full range of dynamical effects seen in the
data we need to refine our ideas and models of neuronal interactions in V1 cortical circuits.
As a next step in this direction we plan to measure experimentally the responses of LGN cells
when stimulated with the image sequences used in the experiments of Ringach et al (1997). It might
be the case that the failure of the models we considered is a consequence of a poor mathematical
representation of the LGN input to the cortex. By measuring the dynamics of LGN cells, and
incorporating this data into our models, we hope to advance towards our goal of establishing the
relative role of thalamic and cortical signals in the generation of orientation tuning.
Acknowledgments: M. P. was partially supported by an NSF post-doctoral fellowship and
NSF-DMS–9709128. D. L. R. and R. S. were supported by NSF-IBN-9720305, NIH-EY01472, and a
Sloan Foundation grant to NYU’s Program in Theoretical Neuroscience. We thank M. J. Hawken,
D. McLaughlin, D. Tranchina, and J. Wielaard for their valuable comments and criticisms.
17
References
Albrecht DG, Geisler WS (1991) Motion selectivity and the contrast-response function of simple
cells in the visual cortex. Vis Neurosci 7:531–546.
Andrews DP (1965) Perception of contours in the central fovea. Nature 205:1218–1220.
Andrews DP (1967) Perception of contour orientation in the central fovea. Part I. Short lines. Vis
Res 7:975–997.
Ben-Yishai R, Bar-Or RL, Sompolinksy H (1995) Theory of orientation tuning in the visual cortex.
Proc Natl Acad of Sci USA 92:3844–3848.
Carandini M, Ringach DL (1997) Predictions of a recurrent model of orientation selectivity. Vis Res
37:3061–3071.
Citron MC, Emerson RC (1983) White noise analysis of cortical directional selectivity in cat. Brain
Res 279:271–7.
Connors BW, Gutnick MJ, Prince DA (1982) Electrophysiological properties of neocortical neurons
in vitro. J Neurophysiol 48:1302–1320.
Das A (1996) Orientation in visual cortex: a simple mechanism emerges. Neuron 16:477–480.
Daugman JG (1985) Uncertainty relation for resolution in space, spatial frequency, and orientation
optimized by two-dimensional visual cortical filters. J Opt Soc Am A 2(7):1160–9.
de Boer E, Kuyper P (1968) Triggered correlation. IEEE Trans on Biomedical Engineering 15:169–
179.
DeAngelis GC, Ohzawa I, Freeman RD (1993a) Spatiotemporal organization of simple-cell receptive
fields in the cat’s striate cortex. ii. linearity of temporal and spatial summation. J Neurophysiol
69:1118–35.
DeAngelis GC, Ohzawa I, Freeman RD (1993b) Spatiotemporal organization of simple-cell receptive
fields in the cat’s striate cortex. i. general characteristics and postnatal development. J Neurophysiol 69:1091–117.
Douglas RJ, Koch C, Mahowald M, Martin KA, Suarez HH (1995) Recurrent excitation in neocortical
circuits. Science 269:981–985.
Ferster D (1986) Orientation selectivity of synaptic potentials in neurons of cat primary visual cortex.
J Neurosci 6:1284–1301.
Ferster D, Chung S, Wheat H (1996) Orientation selectivity of thalamic input to simple cells of cat
visual cortex. Nature 380:249–252.
Gielen CC, van Gisbergen JA, Vendrik AJ (1981) Characterization of spatial and temporal properties
of monkey LGN Y-cells. Biol Cyb 40(3):157–170.
Gut A (1988) Stopped random walks : limit theorems and applications. Springer Verlag, New York.
Hawken MJ, Parker AJ, Lund JS (1988) Laminar organization and contrast sensitivity of directionselective cells in the striate cortex of the old world monkey. J Neurosci 8:3541–3548.
18
Heeger DJ (1992a) Half-squaring in responses of cat striate cells. VISUAL NEUROSCI 9(5):427–443.
Heeger DJ (1992b) Normalization of cell responses in cat striate cortex. VISUAL NEUROSCI
9(2):181–197.
Hubel DH, Wiesel TN (1962) Receptive fields, binocular interaction and functional architecture of
cat’s visual cortex. J Physiol Lond 160:106–154.
Hubel DH, Wiesel TN (1968) Receptive fields and functional architecture of monkey striate cortex.
J Physiol Lond 195:215–245.
Jones JP, Palmer LA (1987) The two-dimensional spatial structure of simple receptive fields in the
cat striate cortex. J Neurophysiol 58:1187–1258.
M MC, Mechler F, Leonard CS, Movshon JA (1996) Spike train encoding by regular-spiking cells of
the visual cortex. J Neurophysiol 76(5):3425–3441.
Maex R, Orban GA (1991) Subtraction inhibition combined with a spiking threshold accounts for
cortical direction selectivity. Proc Natl Acad Sci USA 88(9):3549–53.
Maex R, Orban GA (1996) Model circuit of spiking neurons generating directional selectivity in
simple cells. J Neurophysiol 75:1515–1545.
Marcelja S (1980) Mathematical description of the responses of simple cortical cells. J Opt Soc Am
70(11):1297–1300.
Marmarelis PN, Marmarelis VZ (1978) Analysis of Physiological Systems: The White Noise Approach. NY: Plenum Press.
McCormick DA, Connors BW, Lighthall JW (1985) Comparative electrophysiology of pyramidal
and sparsely spiny stellate neurons of the neocortex. J Neurophysiol 54:782–806.
McLean J, Palmer LA (1989) Contribution of linear spatiotemporal receptive field structure to
velocity selectivity of simple cells in area 17 of cat. Vis Res 29:675–9.
McLean J, Raab S, Palmer LA (1994) Contribution of linear mechanisms to the specification of local
motion by simple cells in areas 17 and 18 of the cat. Vis Neurosci 11:271–94.
Movshon JA, Thompson ID, Tolhurst DJ (1978) Spatial summation in the receptive fields of simple
cells in the cat’s striate cortex. J Physiol Lond 283:53–77.
Palmer LA, Davis TL (1981) Receptive-field structure in cat striate cortex. J Neurophysiol 46(2):260–
76.
Reid RC, Alonso JM (1996) The processing and encoding of information in the visual cortex. Current
Opinion in Neurobiology 6:475–480.
Reid RC, Soodak RE, Shapley RM (1991) Directional selectivity and spatiotemporal structure of
receptive fields of simple cells in cat striate cortex. J Neurophysio 66:505–29.
Reid RC, Victor JD, Shapley RM (1997) The use of m-sequences in the analysis of visual neurons:
linear receptive field properties. Vis Neurosci 14:1015–1027.
19
Ringach DL, Hawken MJ, Shapley R (1997a) Dynamics of excitatory and inhibitory mechanisms
shaping the orientation tuning of neurons in V1. In Society for Neuroscience Annual Meeting.
Ringach DL, Hawken MJ, Shapley R (1997b) Dynamics of orientation tuning in macaque primary
visual cortex. Nature 387:281–284.
Ringach DL, Hawken MJ, Shapley R (1998) Spatial-phase dependent and independent response
components of oriented neurons in macaque V1. In Invest. Ophthal. & Vis. Sci. (Suppl.), volume 39,
p. S683.
Ringach DL, Sapiro G, Shapley R (1997c) A subspace reverse correlation technique for the study of
visual neurons. Vis Res 37:2455–2464.
Somers DC, Nelson SB, Sur M (1995) An emergent model of orientation selectivity in cat visual
cortical simple cells. J Neurosci 269:5448–5465.
Sompolinsky H, Shapley R (1997) New perspectives on the mechanisms for orientation selectivity.
Current Opinion in Neurobiology 7:514–522.
Tolhurst DJ, Dean AF (1991) Evaluation of a linear model of directional selectivity in simple cells
of the cat’s striate cortex. Vis Neurosci 6(5):421–8.
Victor JD (1992) Nonlinear systems analysis in vision: overview of kernel methods. In Pinter R,
Nabet B (eds.), Nonlinear vision: Determination of neural receptive fields, function and networks,
volume 1, pp. 1–37, CRC Press, Cleveland, Ohio.
Wörgötter F, Niebur E, Koch C (1991) Quantification and comparison of cell properties in cat’s
striate cortex determined by different types of stimuli. J Neurophysiol 66:1163–1176.
Appendix
Effects of leakage, phases, temporal kernels
Equation with δ-function forcing and excitatory responses
We consider the feedforward model (3) where the temporal kernel G is a δ-function. The membrane
voltage is then the solution of
dv
(t) = −λ(v(t) − vr ) + r(t),
(15)
dt
with the resting potential, vr , the upper voltage threshold, and lower barrier as described for the
feed-forward model. We assume the DC input is zero. The forcing of the voltage equation, r(t), is
piecewise constant in time, allowing us to solve the equation explicitly between spike events. We
denote the times at which the forcing changes by the sequence t0 , t1 , . . . , tn , . . .. For t ∈ [tn , tn+1 ),
(
v(t̃n ) + r(tn )(t − t̃n )
if λ = 0
v(t) =
r(tn )−λ(v(t̃n)−vr )
−λ(t−t̃n)
v(t̃n ) +
1−e
if λ 6= 0
λ
Here, t̃n = max(tn , tspike) where tspike is the time of the last spike and v(tspike ) = vr .
20
As before we use a spatial kernel (5) with preferred angle θK = 0◦ and preferred phase φK = 0◦.
Consider first the equation with non-negative responses r(θ) ≥ 0. This case arises if the phase of
the gratings is fixed and such that they yield only nonnegative responses.
As we are considering the case of N possible angles and one fixed phase, there are N +1 possible
values for r(tn ). If there is no leakage, λ = 0, or if there is weak leakage, 0 < λ ≤ min{r(tn )}/(−vr ),
one can view the voltage’s evolution before a spike as a directed random walk (i.e., steps of positive
or zero length). The spike occurs at the first time the random walk crosses the upper threshold,
after which the state is reset to the resting potential. This is a first passage-time problem.
We first consider the “instantaneous” correlation function, P r(θ; 0). In constructing this function, a histogram is made by considering the angles of the images that drove the voltage past the
threshold. As the voltage is immediately reset once it crosses the threshold, the voltage at time
t − ∆t must have been below the threshold. Hence, the “instantaneous” correlation function must
exclude all angles that generate negative steps. In other words, images which yield r < 0 would be
excluded, as would images which yield responses too weak to overcome the leakage. One can prove
the following:
Theorem: Let the voltage be governed by equation (15).
No leakage (see, for example, Gut (1988)) If λ = 0 and {r(tn )} ≥ 0, then the instantaneous crosscorrelation function is determined by the spatial response function r(θ),
r(θ)
P r(θ; 0) = P
r(θi )
Furthermore, the process immediately decorrelates, i.e.,
P r(θ; k∆t) =
1
N +1
k > 0.
Weak leakage If 0
<
λ
≤
min{r(tn )}/(−vr ),
then the instantaneous
cross-correlation function is determined by the response function, the upper threshold, the resting
potential, and the leakage:
r(θ) + λvr
P r(θ; 0) = P
(r(θi ) + λvr )
Again, the process immediately decorrelates: P r(θ; τ ) = 1/(N + 1) for τ > 0.
The theorem implies that weak leakage causes a “sharpening” of the instantaneous orientation
distribution. Such leakage-induced sharpening is observed generally in all our numerical simulations.
However, the above case is the only case for which we can prove sharpening occurs. In addition,
we note that the leakage-induced sharpening is obtained by translating r(θ) by a factor λvr and
normalizing. The profile does not change its convexity by such a transformation. Thus, leakage
effects cannot be responsible for the formation of a Mexican hat profile.
The theorem applies to the case where the leakage is weak relative to the forcing. This is to
be contrasted with the case where the leakage is strong relative to the forcing. Fig. 14(a) shows a
pair of numerically computed instantaneous correlation functions, as well as the normalized response
function for this case. The exclusion of angles which yield weak responses is clear. Fig. 14(b) shows
computational evidence that strong leakage causes a loss of instantaneous decorrelation.
21
(a)
0.08
Pr λ=0,1 (θ;0)
0.06
0.04
0.02
0
(b)
Pr λ=0,1 (θ;∆t)
0.019
0.018
0.017
0.016
0.015
-90
-45
0
45
90
Orientation (deg)
Figure 14: All images shown are in phase, φi = 0◦ . (a ) The effect of leakage on orientation tuning bandwidth. Solid:
normalized response function, crosses: P rλ=0 (θ; 0), the instantaneous correlation function for the no leakage case; dashed:
P rλ=1 (θ; 0), the instantaneous correlation function for the strong leakage case. (b) The effect of leakage is to introduce
correlation. Solid: P rλ=0 (θ; ∆t) the cross-correlation function for τ = ∆t and λ = 0, dashed: P rλ=1 (θ; ∆t) the crosscorrelation function for strong leakage, λ = 1, dotted: 1/(N + 1) Here, A = 260.2, with 300, 000 spikes sampled, mean firing
rate of 3.58 spikes/sec.
Equation with δ-function forcing and both excitatory and inhibitory responses
If we allow again gratings to have random spatial phases then the K can also produce inhibitory
responses. In the absence of leakage, the voltage evolves like an undirected random walk with, at
most, M N + 1 possible step lengths. If the average response over all spatial phases is zero for each
angle and if there is no lower barrier for the voltage, then the random walk has no positive drift.
The presence of a lower barrier introduces a effective positive drift even when the average response
is zero.
The possibility of inhibitory responses causes the loss of instantaneous decorrelation for τ > 0
even when there is no leakage. This is seen in a simple example: assume there are only two possible
responses +1 if the angle shown is 0◦ and −1 if the angle is not 0◦ . For such responses, both the
instantaneous and the τ = ∆t cross-correlation functions are 1 at 0◦ and zero elsewhere.
(a)
(b)
(c)
(d)
(e)
(f)
Response
0.1
0.05
0
-90
0
90
-90
0
90
Orientation (deg)
Figure 15: The images shown can be at any of six equally spaced spatial phases. Plotted here are the phase-angle
correlation functions, C(θi , φj ; ∆t), as a function of the orientations θi for the six phases φj . Each C has been normalized
to sum to one; allowing them to be interpreted as the relative frequency that an image at angle θ and phase φ occurs before
a spike event. Here, A = 1040.9, leakage present, λ = .6, and 160, 500 spikes sampled.
22
To gain insight into the type of effects one can expect when both excitatory and inhibitory
responses are present, we plot the normalized counts in C(θi , φj ) for different values of the phase
φj as a function of the orientations θi . We call these “phase-angle correlation functions”. The
phase-angle correlation functions for τ = ∆t are non-zero for almost all angles and all phases (see
Fig. 15). If (θ0 , φ0 ) has a non-zero probability of having been shown the time step before a spike,
this means that however negative the response rφ0 (θ0 ) might have been, there was a larger positive
response at the next time step, since otherwise there would have been no way to cross the voltage
threshold and generate a spike. Hence, in the absence of leakage, the only way for the τ = ∆t
phase-angle correlation function to be zero for (θ0 , φ0 ) is if −rφ0 (θ0 ) ≥ Rmax , where Rmax is the
largest possible positive response. In such a case, the angle θ0 would be excluded from the φ0 th
phase-angle cross-correlation function for τ = ∆t. As the responses shown in Figure 3 are symmetric
in φ, the only exclusion is (θ0 , φ0 ) = (0, 180◦). In the presence of leakage, angles near zero can also
be excluded from the 180◦ phase-angle cross-correlation function. Such behavior is shown in Figure
15(D).
This observation generalizes: if there is a very large negative response to a certain image, such
that −rφ0 (θ0 ) ≥ kRmax , then this angle will be excluded from the φ0 th phase-angle cross-correlation
function for correlation times τ = 0, . . . , k∆t. Furthermore, one cannot have non-zero probability
of showing an image at angle θ for τ = k∆t before a spike and yet have a zero probability for the
same angle at τ = (k + 1)∆t before a spike. Thus, as the correlation time τ increases the number of
angles for which there is zero probability decreases.
23
Download