Linear and Nonlinear Spectral Integration in Type IV Neurons of the Dorsal Cochlear Nucleus. II. Predicting Responses With the Use of Nonlinear Models
Israel Nelken1,
Peter J. Kim2, and
Eric D. Young2
1 Department of Physiology, The Hebrew University
Hadassah Medical School, Jerusalem 91120, Israel; and 2 Department of Biomedical Engineering and Center for Hearing Sciences, The Johns Hopkins University, Baltimore, Maryland 21205
 |
ABSTRACT |
Nelken, Israel, Peter J. Kim, and Eric D. Young. Linear and nonlinear spectral integration in type IV neurons of the dorsal cochlear nucleus. II. Predicting responses with the use of nonlinear models. J. Neurophysiol. 78: 800-811, 1997. Two nonlinear modeling methods were used to characterize the input/output relationships of type IV units, which are one principal cell type in the dorsal cochlear nucleus (DCN). In both cases, the goal was to derive predictive models, i.e., models that could predict the responses to other stimuli. In one method, frequency integration was estimated from response maps derived from single tones and simultaneous pairs of tones presented over a range of frequencies. This model combined linear integration of energy across frequency and nonlinear interactions of energy at different frequencies. The model was used to predict responses to noisebands with varying width and center frequency. In almost all cases, predictions using two-tone interactions were better than linear predictions based on single-tone responses only. In about half the cases, reasonable quantitative fits were achieved. The fits were best for noisebands with narrow bandwidth and low sound levels. In the second nonlinear method, the spectrotemporal receptive field (STRF) was derived from responses to broadband stimuli. The STRF could account for some qualitative features of the responses to broad noisebands and spectral notches embedded in broad noisebands. Quantitatively, however, the STRFs failed to predict the responses of type IV units even to simple broadband noise stimuli. For narrowband stimuli, the STRF failed to predict even qualitative features (such as excitatory and inhibitory frequency bands). The responses of DCN type IV units presumably result from interactions of two inhibitory sources, a strong one that is preferentially activated by narrowband stimuli and a weaker one that is preferentially activated by broadband stimuli. The results presented here suggest that the STRF measures effects related to the broadband inhibition, whereas two-tone interactions measure mostly effects related to narrowband inhibition. This explains why models based on two-tone interactions predict the responses to narrow noisebands much better then models based on STRFs. It is concluded that a minimal stimulus set for characterizing type IV units must contain both broadband and narrowband stimuli, because each stimulus class by itself activates only partially the integration mechanisms that shape the responses of type IV units. Similar conclusions are expected to hold in other parts of the auditory system: when characterizing a complex auditory unit, it is necessary to use a range of stimuli to ensure that all integration mechanisms are activated.
 |
INTRODUCTION |
The dorsal cochlear nucleus (DCN) in unanesthetized animals displays striking nonlinear response characteristics (Nelken and Young 1994
, 1997
; Spirou and Young 1991
). As a result, it has been difficult to predict the responses of DCN principal cells to stimuli with arbitrary spectral shape. This paper focuses on type IV units, which are one response type recorded from DCN principal cells. Tones and narrow bands of noise evoke inhibitory responses in type IV units over a broad range of frequencies and sound levels (Spirou and Young 1991
; Young and Brownell 1976
); from these responses, one would predict that broadband noise (BBN) stimuli would also evoke inhibitory responses. However, the responses to BBN in type IV units are in fact predominantly excitatory. Other examples of nonlinear behavior, in which the response to the sum of two stimuli is far different from the sum of the responses to the two stimuli presented separately, have been documented (Nelken and Young 1997
; Spirou and Young 1991
).
To understand the nature of the representation of the acoustic surround in DCN, and in other similarly complex auditory structures, it is desirable to have general predictive models of the response characteristics of type IV units. That is, it is desirable to have a model that will predict the neurons' responses to arbitrary acoustic stimuli. Such a model avoids the necessity of experimentally evaluating every new stimulus of interest and also provides a necessary basis for a comprehensive theoretical understanding of auditory signal processing in the brain.
Two approaches to such models have been taken. One approach is based on developing a physiologically based model that directly represents the neural interactions that shape the response properties of the structure. Qualitative (Nelken and Young 1994
) and semiquantitative (Blum et al. 1995
; Reed and Blum 1995
) models of DCN have been developed that account for its known properties. However, this approach is not general and can be applied to a structure only after extensive experimental studies. The alternative is to develop models that capture the important aspects of the input/output processing in a structure, without requiring a detailed knowledge of its internal processing mechanisms (e.g., Marmarelis and Marmarelis 1978
). A variety of quasilinear and nonlinear modeling techniques has been applied to the auditory system (e.g., Aertsen et al. 1981
; Backoff and Clopton 1991
; Eggermont et al. 1983b
; Nelken et al. 1994a
,b
; Schreiner and Calhoun 1994
; Shamma et al. 1995
; Wickesberg et al. 1984
). The predictive power of these methods has been evaluated to a limited extent. In some cases, good predictions are obtained, e.g., predictions of responses to four-tone complexes from responses to single- and two-tone complexes (Nelken et al. 1994b
) and predictions of responses to arbitrary spectral shapes from a linear superposition of rippled-spectra stimuli of different ripple frequencies (Shamma and Versnel 1995
). However, other studies have emphasized poor results of prediction attempts, particularly when the stimuli used to derive the model differ substantially from the test stimuli (Bonham et al. 1996
; Eggermont et al. 1983a
).
The DCN is a promising test system for the development of nonlinear input/output modeling methods because of the detailed descriptions available of its internal synaptic organization and of the response properties of its inputs. In this paper, we consider the application of two nonlinear modeling methods to the DCN, one based on single- and two-tone response maps (Nelken et al. 1994a
,b
) and the other on the spectrotemporal receptive field (STRF) (Aertsen and Johannesma 1981
). Both methods should capture some nonlinearities in the Wiener-Volterra sense. We emphasize predictions in which the levels of the stimuli used to develop the models and the stimuli used to test them are similar. Thus we emphasize the spectral integration capabilities of the models rather than their ability to accurately predict behavior as a function of level. For both methods, the crucial assumption for applicability is that the neglected higher-order terms are small relative to the terms that are kept in the model.
We show that the characterizations derived from these two methods differ, as do the predictive abilities of the two methods; each method is only able to predict responses to stimuli similar in spectral bandwidth to the stimuli used to develop the model. This result can be interpreted in terms of the current qualitative model of DCN synaptic organization, because the stimuli used in the two modeling methods activate different populations of inhibitory interneurons in DCN, and therefore different aspects of the nonlinear behavior of DCN type IV units.
 |
METHODS |
Animal preparation and protocol
The methods have been described in detail elsewhere (Nelken and Young 1997
). Briefly, single-unit recordings were made in DCN of decerebrate cats with the use of tipped platinum-iridium microelectrodes. Responses to best-frequency (BF) tones and BBN were used to classify units as type I, II, III, I/III, or IV (Young 1984
). This paper reports on responses of type IV units only.
Acoustic stimuli
Sound was delivered from an electrostatic driver to the left (ipsilateral) ear through a closed acoustic system connected to a hollow ear bar. Acoustic calibrations at the eardrum were performed for each animal by sweeping a tone between 0.04 and 40 kHz and measuring the resulting sound pressure level with a probe tube near the tympanic membrane. All stimuli (except for the pseudorandom noise used for the STRF, described below) were 200 ms long with 10 ms rise-fall times and a 1-s repetition period. Because of the length of the total paradigm, responses of any one unit were obtained only to subsets of the stimuli described below.
Two types of stimuli were used to develop predictive models of unit responses: 1) single tones and two-tone pairs across a range of frequencies were used to construct response maps, and 2) pseudorandom BBN was used for characterization of units' responses with the STRF (Eggermont and Smith 1990
).
Data for tone and two-tone response maps were collected with the use of tones from a fixed frequency table with 19 entries (from 0.5-1.5 octave below to 0.5-1.5 octave above BF). These 19 tones were presented individually and in tone pairs with all possible combinations of the 19 frequencies. Presentations were in random order.
Data for the STRF were obtained as responses to periodic pseudorandom BBN. Two segments of noise were used; these are the same as used by Kim and Young (1994)
and their generation is described in that paper. The noise consisted of 32,768 samples and was played at a sampling rate of 10, 20, 40, or 100 kHz, depending on the BF of the unit; the sampling rate was chosen to keep unit BF between one-sixth and one-third of the sampling rate, equivalent to one- to two-thirds of the aliasing rate. The noise was presented through a 16-bit D/A converter as a continuous sound of 100-200 periods.
When units were held long enough, the measures described above were repeated at another attenuation level, 10-30 dB above the first. Between each series of stimuli, BF tone and BBN rate-level functions were obtained to verify the stability of the unit. Because the design of the experiment required comparison of rate data obtained over periods of minutes to hours, unit stability was important. Recordings were not made from units unless spike isolation and triggering were clear and without artifact. Changes in unit properties over long periods of time still occurred in a minority of units (4 of 25 on which this report is based), but these occurred mostly for BBN at high levels, above those used for the comparisons in this paper. These units were therefore included in the sample analyzed here.
Responses to bands of noise of various bandwidths (Fig. 1A), to filtered noise (Fig. 1B), and to notch noise (Fig. 1C) were used to test the predictions of the modeling methods. The filtered noise was generated by filtering a pseudorandom BBN with a head-related transfer function (HRTF) for a cat (Rice et al. 1992
). The HRTF is the transfer function from free field to a point near the eardrum; when a stimulus with this spectral shape is presented through a closed sound system, the result simulates free-field presentation of a BBN from a particular direction in space. The HRTF stimulus was stored in digital form and played through a D/A converter at a variable sampling rate chosen to place the prominent spectral notch in the stimulus (arrow) at various frequencies relative to unit BF.

View larger version (13K):
[in this window]
[in a new window]
| FIG. 1.
Spectra of 3 noise stimuli used to test predictions of models. A: noiseband of bandwidth BW; center frequency is arithmetic middle of band, as shown. B: noise filtered with cat head-related transfer function (HRTF), shown at D/A sampling rate of 100 kHz. Arrow: prominent spectral null or notch, which was moved to various frequencies above and below unit's best frequency (BF) by changing sampling rate of D/A converter. HRTF, obtained from Rice et al. (1992) , applies to sound originating at 15° ipsilateral azimuth and 30° elevation in 1 cat. C: notch noise generated as described in text. NW: notch width. Notch was usually centered linearly on BF of unit.
|
|
The notch noise was generated by band-pass filtering (24 dB/octave) two independent BBNs with a low cutoff frequency of half the desired notch width and a high cutoff frequency equal to BF. The resulting noisebands were modulated in quadrature to the BF of the unit and added, generating a stationary noiseband with total bandwidth of 2 × BF and with a notch of the desired width ~30 dB deep centered on BF. Noisebands were generated similarly, by low-pass filtering (48 dB/octave) two independent BBNs with cutoff frequency equal to half the bandwidth and modulating the resulting bands to the desired center frequency. In later experiments, noise stimuli were generated digitally: the spectrum of the stimulus was generated in the frequency domain by filling the real and imaginary parts of the Fourier transform of the signal with independent Gaussian pseudorandom deviates (passbands) or zeros (stopbands). The resulting complex spectrum was then inverse transformed back into the time domain. The transform length was 32,768 points, giving a frequency resolution of 5 Hz at a sampling frequency of ~162 kHz. A 16-bit D/A converter was used to transform the digital samples into an analog waveform; stimulus level was set by attenuating the analog waveform after D/A conversion. Because the Nyquist rate (>80 kHz) was above the audible range of cats, no antialiasing filter was used. Frequency bands generated in this way had much steeper slopes than the noisebands generated with the use of the analog modulation technique. Nevertheless, the responses to corresponding stimuli were found to be similar.
Linear and nonlinear summation of tones
The single- and two-tone response map data were used to predict responses to noisebands and notch-noise stimuli. Two models were used: a first-order model in which the response to a noise is predicted as the sum of the responses to tones within the passband of the noise, and a second-order model in which the summation is taken over all single- and two-tone combinations within the passband. Because the responses of DCN type IV units change dramatically with sound level, it is necessary to choose the level of the tones to be comparable with that of the noise; in this case, the tone level was set so that the tone power was equal to the power in a narrow noiseband of width bw Hz, where the spectrum level of the narrow noiseband was the same as in the passband of the noiseband being predicted; bw is called the reference bandwidth below and was usually 200 Hz.
For the first-order model, let fL and fH be the lower and upper edge of the noise passband to be predicted and f0 < f1 < ··· fn < fn+1 be the tone frequencies chosen such that
then the predicted response R1noiseband was computed by
|
(1)
|
where Rspontaneous is the spontaneous rate, R(fk) is the response to the tone of frequency fk, and the weights wk account for the fact that the frequencies are usually logarithmically, rather than linearly, spaced
That is, each R(fk) is assumed to represent the response rate to noise energy in a frequency band of width wk centered on fk. The factor wk/bw in Eq. 1 corrects the tone response rate R(fk) for the fact that the tone is used to approximate a bandwidth wk of the noise signal, not the reference bandwidth bw, for which the level of the tone is appropriate. Note that we assume that it is driven rates that summate linearly, not total rates; that is, spontaneous rate is treated as if it were an independent input to the neuron. This is theoretically required, because the first- and higher-order terms in any functional expansion can only give zero output to zero input. A formula similar to Eq. 1 was used by Spirou and Young (1991)
, except that arbitrary scaling was used instead of division by bw.
Previous work has shown that a first-order model like Eq. 1 does not work in the DCN, because of the nonlinear properties of type IV units (Spirou and Young 1991
). The two-tone data can be used to derive a second-order correction factor c(f1,f2) that captures higher-order interactions between tones of frequencies f1 and f2 as (Nelken et al. 1994a
,b
)
|
(2)
|
where R(f1,f2) is the response rate to the two-tone combination of f1 and f2, and R(f1) and R(f2) are the rates to the tones individually, as above. The spontaneous rate is added to compensate for the fact that it is subtracted twice in the R(fi). Note that c(f1,f2) = 0 if the two tones combine linearly in the sense of Eq. 1. The nonlinear terms c(f1,f2) were computed from the single- and two-tone data with the use of Eq. 2. The second-order prediction R2noiseband was obtained from R1noiseband by adding terms derived from c(f1,f2) as
|
(3)
|
where
The summation is taken over the region fL < fj < fH and fL < fk < fH and the weighting factor wjk is defined similarly to wk in R1noiseband and D1. The need for the diagonal term D1 is explained in the APPENDIX.
The factor bw/(fH
fL) in Eq. 3 was developed empirically. An intuitive explanation for this correction factor is as follows: on the one hand the neuron sums up roughly n inputs where n is proportional to the integration bandwidth of the neuron; on the other hand, the summation defining C2 has contributions from about n2 two-tone contributions. The correction factor bw(fH
fL) is roughly proportional to 1/n, because it measures the width of the noiseband in units of bw and so fixes this discrepancy.
The theory presented above can be extended to predict the responses to noisebands as function of their level. The second-order prediction in this case is
|
(4)
|
Where L is the level of the noiseband, Lref is the reference level (at which the 2-tone data were presented), and the other terms are those computed for the prediction of the noiseband at the reference level. The dependence of the various first- and second-order terms on the level is derived from Eq. A1. The first-order terms (R1noiseband and D1) depend only on the level at a single frequency, and therefore scale linearly with level. The second-order term (C2) depends on products of tone levels at two frequencies, and therefore scales quadratically with level. Note that for L = Lref, Eq. 4 reduces to Eq. 3. These rate-level predictions were compared with the measured rate-level functions, which were always taken for noisebands centered at the BF.
Note that although the terms "first- and second-order predictions" are used here, it is the power spectrum of the stimulus that is used as the input to the prediction formulas. These predictions are therefore actually second order and fourth order when considered as functions of the stimulus waveform. It follows that the predictions based on the STRF are actually comparable with the first-order predictions with the use of the formulation above (Eq. 1), although they are usually considered to be second order (in terms of the stimulus waveform).
STRF calculations
STRFs were computed with the use of the same algorithms as in Kim and Young (1994)
. An STRF is a function of time and frequency that measures the average spectral density of the stimulus as a function of time before action potentials from a neuron. A peristimulus time histogram was computed from the responses to the pseudorandom BBN. For each bin of the peristimulus time histogram, the time-frequency distribution of the sound segment (10-20 ms) just preceding it was computed (the Wigner distribution was used here) (Eggermont and Smith 1990
; Kim and Young 1994
), and those distributions were averaged over all bins with weights proportional to the number of spikes per bin of the peristimulus time histogram. This is equivalent to averaging the Wigner distribution of the sound preceding each spike, as in Eq. 7 of Kim and Young (1994)
. The expected time-frequency distribution in the absence of correlation between spikes and sound was subtracted to account for small spectral irregularities in the noise; the result of the subtraction is the STRF. The expected distribution was computed as the STRF for a sequence of Poisson spikes, uncorrelated with the pseudorandom BBN. The STRF can be thought of as representing the average spectrotemporal "triggering event" for the unit and for the stimulus used to compute it (Aertsen et al. 1981
).
 |
RESULTS |
This paper is based on the responses of 25 type IV units to the two-tone stimulus paradigm. The responses of these units to other stimuli are described in the companion paper, which treats regions of linearity in type IV responses (Nelken and Young 1997
). The STRFs of nine type IV units were collected; three of these units were also tested with two-tone stimuli and noisebands. The results are presented in two stages. First it will be shown that two-tone stimuli can be used to model some nonlinear phenomena, mostly for narrowband stimuli. Second it will be shown that the STRF predicts some aspects of responses to broadband, but not narrowband, stimuli.
Two-tone stimuli can be used to predict the responses to narrow noisebands
Examples of two-tone response planes for three different units are shown in Fig. 2. The axes of each plane are the two frequencies composing the stimulus. The firing rate is represented by gray levels, according to the scale to the right of each plane. The line plot below each plane shows the responses to single tones on the same frequency axis. Figure 2A shows a two-tone response plane for one unit at a low level, close to the maximum of the BF rate-level function. Significant responses occur only when the two-tone combination contains near-BF tones, resulting in the cross pattern in the figure. In Fig. 2A, the responses along the diagonal (where the 2 tones are identical and behave as a single tone with a level 6 dB higher) are almost the same as the single-tone responses off the diagonal, because the former is measured at a level slightly above and the latter slightly below the maximum of the BF rate-level function. The nonlinear interaction term c(f1,f2) (Fig. 2D) is essentially zero except when both tones are very close to BF, where it is negative.

View larger version (47K):
[in this window]
[in a new window]
| FIG. 2.
A-C: examples of 2-tone responses from 3 units. In each part, 2-tone responses R(f1,f2) are shown at left as matrix of gray-scale values, with scale shown at right. Arrow: spontaneous discharge rate. Line plots: responses to single-tone R(f1) at same sound level; spontaneous rate is given by horizontal line. Note that 2-tone matrixes are symmetrical around major diagonal, because frequency combination f1,f2 is identical to frequency combination f2,f1 and each frequency combination was presented only once. Responses on major diagonal, where f1 = f2, were presented as single tone, with sound level increased by 6 dB, corresponding to in-phase addition of 2 identical tones. Each frequency combination was presented once. D-F: nonlinear interaction terms c(f1,f2), Eq. 2, for the 3 examples in A-C.
|
|
Figure 2B shows, for a different unit, a two-tone response plane at a higher level, on the descending limb of the BF rate-level function. In this case, the responses on the diagonal are noticeably smaller than the single-tone responses, because of the nonlinear rate-level function. In this case, c(f1,f2) (Fig. 2E) shows a larger range with significant interactions. Figure 2C shows a two-tone response plane at still higher levels for a third unit. Here the single-tone response is inhibitory at most frequencies. Note, however, that the two-tone responses (in Fig. 2F) show definite facilitation, for example the white spots near frequencies of 6.13 and 6.88 kHz, neither of which is excitatory when presented by itself. The two-tone response planes at low and medium levels invariably had the shape of a cross, showing that two-tone interactions occur mainly between frequency components that are close to each other and to BF (Fig. 2, D and E). At high levels, however, the shape of the two-tone response plane was highly variable (Fig. 2F).
First- and second-order predictions for the responses to narrow noisebands (Fig. 1A) were generated from the tone responses with the use of Eqs. 1 and 3. The noisebands had fixed bandwidth and spectral level, but varied in center frequency. The plots in Fig. 3 show discharge rate versus center frequency. The examples in Fig. 3 cover the range of goodness of fit that can be achieved with single- and two-tone models. In Fig. 3A, the first-order fit (· · ·) is generally too large. The second-order fit (- - -) follows closely the measured response (
), except for some oscillations at center frequencies between 4 and 5 kHz. This oscillation may be the result of noise in the measurements. Figure 3B is another example of a good fit. Figure 3C shows a borderline case. Although the second-order prediction correctly identifies the main peak of the response and its bandwidth, the prediction for the maximal rate is too large by a factor of ~2.

View larger version (22K):
[in this window]
[in a new window]
| FIG. 3.
Responses to noisebands (Fig. 1A) as center frequency is varied ( ). First-order predictions (R1 in Eq. 1, · · ·) and 2nd-order predictions (R2 in Eq. 3, - - -) calculated from tone and 2-tone responsemaps are superimposed. Values of d given in each plot are for 2nd-order prediction. Examples are ordered by quality. A and B are good fits, C is medium fit, D and E are bad fits. Unit BFs, noise bandwidths, and noise spectrum levels: (A) 4.23 and 0.4 kHz, level = 10 dB; (B) 2.62 and 1.6 kHz, level = 20 dB; (C) 17.7 and 1.6 kHz, level = 4 dB; (D) 5.9 and 2.4 kHz, level = 5 dB; (E) 17.8 and 2 kHz, level = 10 dB.
|
|
Figure 3, D and E, shows examples of bad fits. In Fig. 3D, the first-order prediction is too small most of the time. It is also qualitatively wrong, showing a dip where the actual response has a peak. The second-order correction fixes the qualitative discrepancy, but the correction is too large. As a result, the second-order prediction does show a peak at about the right frequency, but the rate at the peak is much too large. Finally, Fig. 3E shows a case in which there was a significant but weak response over a large frequency range. The first-order prediction oscillates wildly above and below the measured response. The second-order correction often has the right sign, but is too large at most frequencies. This creates the same pattern as in Fig. 3D: for frequencies at which the linear prediction is too low, the second-order prediction tends to be too large and vice versa.
The quality of the fits of the measured and predicted data were quantified with the use of an objective measure d equivalent to the
2 of the difference between the measured and predicted functions; if the difference between two functions is entirely due to noise, then d should be near 1, and d increases as the difference between the functions increases. This measure is described in detail elsewhere (Nelken and Young 1997
). In the predictions made here, cases with d ~ 1 were rare. This is probably the result of two factors: first, the different character of the predicting stimuli (2-tone complexes) and the predicted stimuli (noisebands); and second, the noise in the estimates of the second-order contributions (they are computed as the difference between 3 measured rates, increasing their SD by a factor of ~1.7). Fits with d < 30 were clearly good, as judged subjectively; fits with d > 300 were clearly bad. In between, there was a large group of results for which the objective d value and the subjective judgment did not correlate well (e.g., Fig. 3, C vs. D, both of which have similar d values). Values of d for the second-order fits are shown in Fig. 3, A-E.
A total of 73 two-tone response maps was recorded from the 25 type IV units. From these, 428 predictions were made. Fits with d ~ 1 were found for 18.5% of the cases (d < 2: 79 of 428). About half the cases had d < 30, showing reasonable fits (242 of 428, 57%). Another third of the cases lay in the middle range, 30 < d < 300, where correlation between d and subjective fit quality was weak (128 of 428, 30%). Finally, 13% of the cases had large d values (d > 300: 58 of 428).
Figure 4A shows a scatter plot of d against the bandwidth of the test noiseband; the bandwidth is measured in octaves re the reference noise bandwidth used in the prediction (bw in Eqs. 1 and 3, usually 200 Hz). The plot shows a tendency for the fit to degrade with bandwidth. In Fig. 4B, the same data are shown, but now bandwidth is measured re the BF of the unit. Now the plot is wedge shaped, showing larger scatter when fitting bandwidths that are comparable with BF. That is, the fit can be good or bad if the test noiseband is wide compared with BF, but the fit is always bad if the test bandwidth is wide compared with the reference bandwidth.

View larger version (36K):
[in this window]
[in a new window]
| FIG. 4.
A: scatter plot of d vs. test noise bandwidth for measured and predicted responses to test noisebands of varying center frequency (data as in Fig. 3). Bandwidth is measured relative to reference bandwidth used in predictions (bw in Eqs. 1 and 3, usually 200 Hz). B: d as function of noise bandwidth, measured in octaves re BF. C: d values at low and high spectrum levels for those cases in which 2 levels were measured (usually 30 dB apart). Line shows where d values are equal. Note that most of data lie above this line. Most extreme departure from line is case in which firing rate at higher level was almost completely depressed by both tones and noisebands, giving rise to almost perfect prediction. D: scatter plot of d values for 1st-order (Eq. 1) and 2nd-order (Eq. 3) predictions. Line shows where d values are equal; 2nd-order prediction improved fit in almost all cases.
|
|
Figure 4C shows a scatter plot of d values for predicting the responses to the same noise bandwidth at two different levels. There is a correlation between the fits at the two levels: units with bad fits at low levels tended to show bad fits at higher levels also. The line through the scatter is the equality line. In the majority of the cases, the fit at the higher level was worse than the fit at the lower level (P = 0.0025, 1-tailed Wilcoxon test).
An unresolved question in nonlinear modeling of auditory neurons is whether the second-order kernels actually improve prediction quality. For example, Wickesberg et al. (1984)
showed that for units in the anteroventral cochlear nucleus, the second-order Wiener kernels did not improve response predictions in some cases. For the method presented here, the second-order prediction was almost always better than the first-order prediction, as judged by the d measure. In Fig. 4D, the scatter plot of the d values for first- and second-order predictions lies almost entirely below the diagonal. The d value in some cases decreased by 3 orders of magnitude.
In addition to predicting the responses to bands at a fixed level with varying width and center frequency, it is possible to predict, with the use of similar formulas, the responses of type IV units to noisebands of varying level (Eq. 4). Such predictions are shown in Fig. 5. Figure 5, A and B, shows cases in which the second-order formula, based on response plots obtained at one sound level, actually predicts the nonmonotonicity of the type IV units. Figure 5C shows a case in which the second-order prediction is monotonic over the relevant level range, but the fit at low levels (below the peak of the rate-level function) is excellent. Finally, Fig. 5D shows a case in which the prediction was nonmonotonic, but the quantitative fit is bad
threshold is too low and the maximum firing rate of the unit is too high. As expected, there was strong correlation between cases in which the fixed-level predictions (i.e., Fig. 3) were good and cases in which the rate-level predictions were good. Twenty-one units were tested with at least one noiseband rate-level function; for a large majority (19 of 21) the rate-level prediction was successful (at least at low levels, as in Fig. 5C) at some bandwidths. Nonmonotonicity was predicted in 16 of 21 of the units. Only cases in which the two-tone data were measured close to the peak of the rate-level function (as in Fig. 2, A and B) showed reasonable predictions for noiseband rate-level functions, and the predictions were best for narrower noisebands (200-800 Hz). These results show that, in some circumstances, the two-tone response maps capture not only information about spectral integration at a fixed level but also the information needed to describe the behavior of type IV units as a function of level.

View larger version (22K):
[in this window]
[in a new window]
| FIG. 5.
Responses to noisebands as function of level ( ). Noisebands were always arithmetically centered on unit BF. First-order (· · ·) and 2nd-order (- - -) predictions (Eq. 4) are superimposed. A: good fit with 2nd-order prediction between threshold and level at which unit is strongly inhibited. B: good fit of 2nd-order prediction between threshold and ~10 dB re level of peak rate. C: good fit of both 1st- and 2nd-order predictions below peak of rate-level function; in this case, 2nd-order prediction did not show nonmonotonicity. D: bad fit of both 1st- and 2nd-order predictions over full level range. Qualitatively, however, 2nd-order fit does predict nonmonotonicity of rate-level function.
|
|
STRFs of DCN type IV units
STRFs were computed for nine type IV units, usually at multiple levels. Figure 6 shows three examples of type IV STRFs. The STRF is interpreted as the mean triggering event for spikes, in the sense that the STRF is an estimate of the average power spectrum in the stimulus as a function of time preceding spikes. A triggering event may be either an increase in the level of some frequencies in the noise (white regions in Fig. 6) or a decrease in the level of other frequencies (black regions in Fig. 6), or both. Spike latency is the time from the triggering event to the spike, which occurs at time 0 on the ordinate of the STRF plots.

View larger version (60K):
[in this window]
[in a new window]

View larger version (60K):
[in this window]
[in a new window]

View larger version (59K):
[in this window]
[in a new window]
| FIG. 6.
Examples of spectrotemporal receptive fields (STRFs) for type IV units. STRFs are shown in rectangles with the use of gray scale; scale is given at right. Because average stimulus spectrum has been subtracted, STRF would be constant near 0 in absence of any correlated response from neuron. White regions: peaks of energy (excitatory regions). Dark regions: valleys (inhibitory regions). Stimulus frequency is on abscissa and time preceding spike discharges is on ordinate. A: narrow excitatory region at 12.5 kHz with somewhat wider inhibitory flanks. Note repetition of excitation at longer latencies. B: wide excitatory region centered on 6 kHz with wide inhibitory region at longer times. C: medium-width excitatory region near 10 kHz that lasts for a long time, and flanking inhibitory bands.
|
|
Figure 6A shows a unit that had a strong, narrow excitatory region at a latency of 3-4 ms. It was surrounded on three sides (that is, at higher and lower frequencies and on BF at longer latency) by an inhibitory region. This is the most commonly seen pattern of STRF. A similar arrangement, although weaker, appears again at longer latencies.
It is suspected that the second, longer-latency repeat of the excitatory/inhibitory regions in Fig. 6A is in fact an artifact due to regularities in the spike discharge of the type IV unit (Backoff and Clopton 1991
; Kipke et al. 1991
; Parham and Kim 1992
). To check this possibility, the autocorrelation was computed for this unit; the interval between the two peaks in Fig. 6A (~1.5 ms) was within the refractory period, so the second peak is not produced by simple regularity in this unit. However, multipeaked STRFs were observed in three other units and, in those cases, the autocorrelation showed a significant peak at the repeat period of the STRF. Thus, in the majority of cases, repetitions in the STRF result from regularity of firing of the unit; that is, they are related to the intrinsic properties of the unit rather than to spectral integration mechanisms. A possible mechanism to explain the exceptional case in Fig. 6A is that the unit has an intrinsic oscillatory rhythm at the right frequency (~600 Hz), which is not observed ordinarily in spike discharges because of refractoriness. However, in those cases in which a spike was not discharged at the usual short latency (~4 ms) from the favorable triggering event, the triggering event still evoked the intrinsic oscillation, which increased the firing probability 1.5 ms later.
In general, the STRF shows very strong correlation along the temporal dimension, which causes vertical stripes to appear. The vertical stripes result from the use of a finite-length noise segment to compute the STRF. They correspond to the peaks and valleys of the nonflat power spectrum of the specific noise segments used. Although some of this effect is reduced by subtracting the expected STRF under the no-correlation condition, obviously the compensation is not full.
Another configuration of excitatory and inhibitory regions is shown in Fig. 6B. Here at suprathreshold levels the excitatory region is very wide, extending from ~4.5 to 7.5 kHz. It also has a weaker excitation at lower and higher frequencies, extending essentially across the whole frequency axis of the STRF. At low frequencies, the latency of the weak excitatory region increases slightly as frequency decreases, which may be a correlate of the traveling wave in the cochlea. It might be hypothesized that such a wide excitatory region would be manifested in the tone responses of the unit. Figure 7A shows the single-tone response map of this unit. This plot shows discharge rate as a function of frequency at eight sound levels. The horizontal lines are spontaneous rate; inhibitory regions, where rate is less than spontaneous, are shaded and excitatory regions are shaded in black. The unit does indeed have an excitatory region near threshold (
80 dB), which has a width similar to that of the strong excitatory region in the STRF; however, above 10 dB re threshold, the unit gave predominantly inhibitory responses to tones. Thus there is no qualitative correspondence between the tone response map and the STRF.

View larger version (49K):
[in this window]
[in a new window]
| FIG. 7.
Tone response maps and STRF response maps for 2 type IV units. A and C: tone response maps consists of rate-vs.-frequency plots at 8 or 9 attenuations; 0-dB attenuation is ~100 dB SPL, but actual sound level varies with frequency according to acoustic calibration (not shown). Horizontal lines: spontaneous rates; common rate scale is shown at bottom left. Inhibitory regions are shaded in gray and excitatory regions in black. B and D: STRF plots shows frequency marginals for STRFs computed from responses at 6 attenuations (0-dB attenuation corresponds to spectrum level of ~40 dB SPL). Frequency marginals are computed by averaging STRF along time dimension. Solid lines: frequency marginals that include BF excitatory peak in STRF but not (refractory) inhibitory effect at longer latencies; dashed lines include long-latency inhibitory effect. Marginals computed from latencies as follows: (B) solid lines, 4.5-7 ms, dashed lines, 4.5-10 ms; (D) solid lines, 3-6 ms, dashed lines, 3-8 ms. Ordinate scale for STRF is arbitrary, but is same at different stimulus levels.
|
|
Figure 6C shows a third configuration of excitation and inhibition, which is to some extent intermediate between the first two. There is a relatively wide short-latency excitatory region that seems to be restricted at longer latencies by inhibitory regions on either side of the BF.
The STRFs are a sensitive measure of DCN units' response characteristics in that STRFs vary considerably from unit to unit but, in most units, their shapes stay relatively constant as sound level changes. Figure 7 shows a comparison of tone response maps (Fig. 7, A and C) with STRFs (Fig. 7, B and D) for two units. The STRFs are shown as frequency marginals, which are computed by averaging the STRFs along the time axis over a range of latencies; they show the frequency tuning in the STRF, with variations in time averaged out. The solid lines show frequency marginals computed over latencies that include the principal excitatory peak, but not the inhibitory valleys at longer latency. The dashed lines show frequency marginals computed over latencies that cover the whole response area, including both the excitatory peak and the subsequent inhibitory valleys. The dashed lines therefore show the average deviation of the STRF from background across all latency values. The example shown in Fig. 7B is from the same unit as Fig. 6B. The broad excitatory area is clear in the solid lines; the dashed lines show that the inhibitory input centered at 5.7 kHz has a strong effect on the cell, producing a narrow net inhibitory input just below BF at high levels. The example in Fig. 7D is more typical of the type IV units in our sample in that it has an excitatory region centered at BF, an inhibitory region above BF, and a general shape that does not change with stimulus level. All STRFs had excitatory regions centered on BF, except for one case, where inhibitory regions only were seen at high levels (~20 dB above threshold).
Most type IV STRFs (7 of 9) showed significant inhibitory regions. Inhibitory regions at latencies just longer than excitatory regions (e.g., Fig. 6, A and B) can sometimes be attributed to refractoriness (Kim and Young 1994
). Inhibitory regions at frequencies just above or just below BF were also observed in most units (6 of 8); the case in Figs. 6B and 7B is one exception. The cases in Figs. 6, A and C, and 7D are more typical. These off-BF inhibitory regions probably result from neural interactions. In four of six cases with both BF excitation and off-BF inhibition, the inhibitory region had a slightly longer latency than the excitatory region, which is consistent with a multisynaptic inhibitory pathway. Very small off-BF inhibitory regions are observed in auditory nerve fibers (Kim and Young 1994
), but these are too small to explain the kind of inhibitory effects shown in Fig. 6.
Prediction of responses to other stimuli based on the STRF
The STRF has limited predictive power in the sense that it cannot be used to describe the responses of DCN units to most other stimuli. It can be seen in Fig. 7 that the STRF fails to predict the responses of most type IV units to tones. In fact, except for the case in Fig. 7B and one other in which the STRF was entirely inhibitory, the STRF predicts excitatory effects of BF tones at all levels; this occurs because the largest component of type IV STRFs at all levels is the excitatory peak near BF. By contrast, type IV units show inhibitory responses to BF tones at suprathreshold levels. This failure of the STRF to predict inhibitory responses to tones extends also to the prediction of the responses to narrow noisebands, because they give essentially the same responses as do tones (Nelken and Young 1997
).
It can be shown theoretically that the STRF can be used to predict rate responses to stimuli with arbitrary spectra by multiplying the frequency marginal of the STRF by the spectrum of the stimulus and summing across frequency. The ability of STRFs to predict the responses to broadband stimuli was tested with the use of this calculation. STRF predictions were compared only qualitatively with unit responses, because shifting and scaling were necessary to make the ranges of the predictions approximately equal to those of the measured rates. Four units were tested with BBN filtered to have the spectrum of a cat HRTF (Fig. 1B). In these cases, the bandwidth of the excitatory peak in the STRF was a good but qualitative predictor of the sensitivity of the unit to the spectral notch. The unit in Fig. 6B was rather insensitive to the center frequency of the notch, whereas units with a narrow peak in their STRF (as in Fig. 6A) were much more sensitive to the frequency of the notch relative to their BFs. For two additional units, the STRF failed to predict responses to spectral notches embedded in BBN (Fig. 1C). These units were unusual for DCN type IV units in that they were inhibited by BBN. The STRF failed in these cases by predicting an excitatory response to the noise. Thus the STRF has limited ability as a predictor of responses to wideband stimuli.
 |
DISCUSSION |
Nonlinear modeling methods
The arsenal of nonlinear modeling methods is rather limited at present. The most widely tested methods for nonlinear modeling of auditory units have been second-order reverse-correlation-based methods, used either in the time or frequency domains (Wiener kernel) (Wickesberg et al. 1984
) or as the STRF (Aertsen and Johannesma 1981
; Backoff and Clopton 1991
; Eggermont et al. 1983b
). In all cases, some function of the waveform just preceding a spike is averaged over all spike occurrences. Various forms of STRF have been used, but all are related to the Fourier transform of the second-order Wiener kernel across one time dimension, giving the average time-frequency distribution of energy preceding a spike.
The usefulness of the STRF for predicting responses has been found to be limited (Eggermont et al. 1983a
). There are two problems: first, the STRF assumes a stationary system and is generated from steady-state stimuli, whereas the responses of auditory neurons to interesting stimuli are not stationary. That problem is bypassed in this paper by attempting to predict only average discharge rate behavior and not detailed temporal aspects of responses. This is appropriate in DCN, because average discharge rate captures reasonably well the temporal modulation of units' responses (Young and Brownell 1976
). The second problem is that there is no reason to assume a priori that the nonlinearity of an auditory neuron is limited to second-order terms in the Wiener-Volterra sense. As is discussed below, the results of this paper suggest that the nonlinearities are of higher than second order in the DCN. Therefore, although the full Wiener-Volterra series would represent the unit completely, there is no reason to think that truncating it at the second order would give a useful approximation (Johnson 1980
). Unfortunately, estimating higher-order kernels is technically difficult.
The alternative approach to characterizing nonlinear neural systems is to build physiological models based on the internal organization of the system. In the case of the DCN, such models implement some or all of the known anatomic and physiological facts about DCN type IV units, and then test their ability to predict the responses to various stimuli. Three such models have been suggested in the past (Arle and Kim 1991
; Blum et al. 1995
; Pont and Damper 1991
; Reed and Blum 1995
). Although these models show some promise, they have not yet demonstrated predictive power for the kinds of stimuli analyzed here.
Tone and two-tone predictions
First-order predictions of responses to noisebands with the use of summation of responses to single tones have been found to have very limited power (Spirou and Young 1991
). In this paper, we show that two-tone responses can be used to improve the quality of these predictions, but that the improvement depends on the bandwidth of the noise and also on its level (Fig. 4). The second-order predictions almost always improve the fit qualitatively, but sometimes overcompensate for the errors in the single-tone fits (e.g., Fig. 3D). Presumably, this behavior reflects the fact that higher-order nonlinear terms are necessary to fully model the unit's responses. In addition, there are indications that some of the problems in the second-order prediction are caused by noise in the two-tone responses, because only one repetition of each combination of parameters was presented and the two-tone contributions are computed as differences of these noisy values.
In the companion paper (Nelken and Young 1997
), we argue that the major source of nonlinearity in type IV responses is the inhibitory input from type II units. The limited success that was achieved with second-order predictions is probably due to the fact that type II units are relatively weakly activated in the parameter range used. This follows from the fact that the tone levels used for the single- and two-tone response maps were usually at the peak of the rate-level function or on its descending limb, which is where type II units are just beginning to fire (Young and Voigt 1981
). Assuming that type II units are the major source of nonlinearity, the fact that the predictions deteriorated with increasing bandwidth (Fig. 4A) can be explained as resulting from an increasing difference between the narrowband predictor, which activates the type II units, and the broadband test response, which does not.
The ability of the second-order predictions to capture the nonmonotonicity of the rate-level function of narrow noisebands (Fig. 5) may be explained with a similar argument. Because the nonlinearity introduced by the type II unit is weak, it can be approximated by parabolic term, as in Eq. 4. At higher levels the rate-level function is no longer parabolic
in fact, the units usually shut down completely. As a result, the predictions based on two-tone measurements cannot predict the rate-level functions at higher levels. It is nevertheless noteworthy that one measurement, near the peak of the rate-level function, is sometimes able to capture the quantitative features of the whole rate-level functions from threshold to peak and to the inhibitory area.
STRF predictions
In contrast to the two-tone predictions, the STRFs of type IV units are computed with the use of responses to broadband stimuli. They do show qualitative agreement with the responses of some type IV units to notch stimuli, but are unable to predict the responses to narrowband stimuli (Fig. 7). In this sense, they complement the two-tone data, which are taken with the use of narrowband stimuli and are useful for predicting the responses to some narrowband stimuli. The main use of the STRF may be in defining and classifying the large variability in properties of type IV units (Nelken and Young 1994
; Spirou and Young 1991
; Young and Brownell 1976
).
The STRF results contain one puzzle: the extent of the inhibitory regions seen in the STRFs is much narrower than is expected from direct measurements of inhibitory bandwidths with broadband notch-noise stimuli, where the inhibitory bandwidth can be up to 1 octave re BF (median value ~0.5 octave) (Nelken and Young 1994
). With broadband stimuli, type II units are minimally activated and the inhibition observed with notch-noise stimuli probably comes from a second inhibitory interneuron, the wideband inhibitor. The wideband inhibitor's inhibition of type IV units is hypothesized to be weak (Nelken and Young 1994
), and the STRF suffers from a narrow dynamic range (Kim and Young 1994
). Thus the explanation for the difference in inhibitory bandwidth may be that only the central, strongest portion of the wideband inhibitor's input is reflected in the STRF. Alternatively, the narrow inhibitory inputs in the STRF may be the weak remains of type II inhibition or of inhibition from other unknown sources.
The STRF data also support the conclusion that the nonlinearity of type IV units is of higher than second order in the Wiener-Volterra sense. Although the general position and extent of excitatory and inhibitory domains in the STRF are consistent with the responses to broadband stimuli, quantitative predictions fail. Higher-order kernels are therefore needed to achieve quantitative agreement with the data, even for broadband stimuli.
Implications for auditory modeling
The main advantage of the modeling approach used in this study is that there are no free parameters in the prediction formulas, and therefore no training. The formulas are completely general, and they are made specific to DCN type IV units by plugging in the measured responses to a selected family of sounds. However, the results presented here show that it is very important to select correctly a family of sounds that captures the whole complexity of the integration mechanisms of the unit.
In this paper, two families of sounds were used: two-tone stimuli and BBN. Although this made it possible to characterize the responses of type IV units in some parameter ranges, these stimuli turned out to be insufficient for a full description of the responses of type IV units. One probable cause for this failure is that neither family activated both of the inhibitory sources on type IV units at the same time. Therefore the responses to sounds that activate both sources were not predicted correctly.
A possible set of sounds that might cover better the relevant parameter regimes would be composed of noisebands and pairs of noisebands of various bandwidths and separations. At the limiting case of zero bandwidth, pure tones would be used. This set is four-dimensional (2 coordinates of frequency and 2 coordinates of bandwidth) and therefore is difficult to characterize with fine sampling of all four coordinates. A judicious choice of bandwidths, with interpolation in between, must be made to make the use of this set practical.
This paper uses formal modeling techniques similar in spirit to the Wiener-Volterra approach. It does, however, spread a wider net, by using a large number of stimuli and testing essentially all the possible interrelations between them. The choice of the stimuli is based on previous physiological knowledge. It is hoped that in this way the nonlinear behavior can be explained better and parameter regimes that are still not well understood be identified. The results may be used in this way to direct further experiments and suggest new families of sounds that may be used to study DCN spectral integration mechanisms. Ultimately, of course, physiological models will be needed to provide complete explanation of DCN responses.
 |
ACKNOWLEDGEMENTS |
The comments of Prof. Haim Sompolinsky were helpful in preparing this manuscript.
This work was supported by National Institute of Deafness and Other Communications Disorders Grant DC-00115 and by a grant from the Israeli Academy of Sciences.
 |
APPENDIX |
In this appendix, mathematical derivations related to the prediction formulas are outlined. The basic model used here is the following
|
(A1)
|
Where I(f) is the stimulus spectrum, R[I(f)] is the predicted rate in response to the stimulus with spectrum I(f), K1 is called the first-order kernel, and K2 is called the second-order kernel.
For a first-order model, in which the second-order kernel is null, single tones are sufficient. Single tones are delta functions in the frequency domain, and so
|
(A2)
|
Note that we assume here that the units of the tone level are fixed so that the level used is equal to 1. In this case, the kernel K1(f) is just the single-tone response R(f) minus the spontaneous rate; by approximating the integral in Eq. A1 with sums and adding the frequency weighting factors, one gets Eq. 1 in the text.
For the second-order model, in which both kernels are assumed to be nonzero, both single-tone and two-tone data are necessary. The following combinations are used
Here the first combination describe single tones at the nominal level, the second describes single tones that are 6 dB louder (these will be the tones on the diagonal of the 2-tone response map), and the third describes combinations of different frequencies.
The predicted responses to these stimuli are, respectively
From these three equations, it is possible to extract the kernels
|
(A3)
|
In the text (Eq. 2), K2(fi,fj) is denoted c(fi,fj). Note that the first-order kernel K1(f) is different in the first- and second-order models (compare Eqs. A2 and A3); for the second-order model, the first-order kernel is corrected by half the diagonal of the second-order kernel. In Eqs. 3 and 4 in the text, this is expressed by correcting the first-order prediction R1 with the term D1. It can be shown that this term exactly cancels the contribution from the diagonal in the double integral that defines the second-order term. In principle, the prediction could be performed without correcting the first-order kernel, provided that the diagonal was set to zero in the double integral. However, this approach would not work when predicting the responses to noisebands of varying level (as in Eq. 4). In this case, the corrected first-order kernel is required.
In practice, it turned out that the second-order correction is very often in the right direction but too large. The discrepancy increased with bandwidth, and it turned out that over the population, the increase in discrepancy was proportional to the bandwidth being predicted. This finding led to the introduction of thefactor bw/(fH
fL) to scale the second-order correction. The surprising finding is that the proportionality factor is not significantly different from 1, although there is no a priori reason for that. In the physics literature, it is known that infinite series may sometimes be summed by multiplying a lower-order term in the series by a scaling factor. Further investigation of this finding may bring additional insight into the spectral integration mechanisms of type IV units.
 |
FOOTNOTES |
Address for reprint requests: I. Nelken, Dept. of Physiology, Hebrew University
Hadassah Medical School, PO Box 12272, Jerusalem 91120, Israel.
Received 30 August 1996; accepted in final form 15 April 1997.
 |
REFERENCES |
-
AERTSEN, A.M.H.J.,
JOHANNESMA, P.I.M.
The spectro-temporal receptive field.
Biol. Cybern.
42: 133-143, 1981.[Medline]
-
AERTSEN, A.M.H.J.,
OLDERS, J.H.J.,
JOHANNESMA, P.I.M.
Spectro-temporal receptive fields of auditory neurons in the grassfrog. III. Analysis of the stimulus-event relation for natural stimuli.
Biol. Cybern.
39: 195-209, 1981.[Medline]
-
ARLE, J. E.,
KIM, D. O.
Simulations of cochlear nucleus neural circuitry: excitatory-inhibitory response-area types I-IV.
J. Acoust. Soc. Am.
90: 3106-3121, 1991.[Medline]
-
BACKOFF, P. M.,
CLOPTON, B. M. A
spectrotemporal analysis of DCN single unit responses to wideband noise in guinea pig.
Hear. Res.
53: 28-40, 1991.[Medline]
-
BLUM, J. J.,
REED, M. C.,
DAVIES, J. M. A
computational model for signal processing by the dorsal cochlear nucleus. II. Responses to broadband and notch noise.
J. Acoust. Soc. Am.
98: 181-191, 1995.[Medline]
-
BONHAM, B. H.,
CALHOUN, B. M.,
SCHREINER, C. E.
Cells in the primary auditory cortex of the cat are not simple.
Abstr. Assoc. Res. Otolaryngol.
19: 113, 1996.
-
EGGERMONT, J. J.,
AERTSEN, A.M.H.J.,
JOHANNESMA, P.I.M.
Prediction of the responses of auditory neurons in the midbrain of the grass frog based on the spectro-temporal receptive field.
Hear. Res.
10: 191-202, 1983a.[Medline]
-
EGGERMONT, J. J.,
JOHANNESMA, P.I.M.,
AERTSEN, A.M.H.J.
Reverse-correlation methods in auditory research.
Q. Rev. Biophys.
16: 341-414, 1983b.[Medline]
-
EGGERMONT, J. J.,
SMITH, G. M.
Characterizing auditory neurons using the Wigner and Rihacek distributions: a comparison.
J. Acoust. Soc. Am.
87: 246-259, 1990.[Medline]
-
JOHNSON, D. H.
Applicability of white-noise nonlinear system analysis to the peripheral auditory system.
J. Acoust. Soc. Am.
68: 876-884, 1980.[Medline]
-
KIM, P. J.,
YOUNG, E. D.
Comparative analysis of spectro-temporal receptive fields, reverse correlation functions, and frequency tuning curves of auditory-nerve fibers.
J. Acoust. Soc. Am.
95: 410-422, 1994.[Medline]
-
KIPKE, D. R.,
CLOPTON, B. M.,
ANDERSON, D. J.
Shared-stimulus driving and connectivity in groups of neurons in the dorsal cochlear nucleus.
Hear. Res.
55: 24-38, 1991.[Medline]
-
MARMARELIS, P. Z.,
MARMARELIS, V. Z.
In: Analysis of Physiological Systems: The White-Noise Approach.,
. New York: Plenum, 1978
-
NELKEN, I.,
PRUT, Y.,
VAADIA, E.,
ABELES, M.
Population responses to multifrequency sounds in the cat auditory cortex: one- and two-parameter families of sounds.
Hear. Res.
72: 206-222, 1994a.[Medline]
-
NELKEN, I.,
PRUT, Y.,
VAADIA, E.,
ABELES, M.
Population responses to multifrequency sounds in the cat auditory cortex: four-tone complexes.
Hear. Res.
72: 223-236, 1994b.[Medline]
-
NELKEN, I.,
YOUNG, E. D.
Two separate inhibitory mechanisms shape the responses of dorsal cochlear nucleus type IV units to narrowband and wideband stimuli.
J. Neurophysiol.
71: 2446-2462, 1994.[Abstract/Free Full Text]
-
NELKEN, I.,
YOUNG, E. D.
Linear and nonlinear spectral integration in type IV neurons of the dorsal cochlear nucleus. I. Regions of linear interaction.
J. Neurophysiol.
78: 790-799, 1997.[Abstract/Free Full Text]
-
PARHAM, K.,
KIM, D. O.
Analysis of temporal discharge characteristics of dorsal cochlear nucleus neurons of unanesthetized decerebrate cats.
J. Neurophysiol.
67: 1247-1263, 1992.[Abstract/Free Full Text]
-
PONT, M. J.,
DAMPER, R. I. A
computational model of afferent neural activity from the cochlear to the dorsal acoustic stria.
J. Acoust. Soc. Am.
89: 1213-1228, 1991.[Medline]
-
REED, M. C.,
BLUM, J. J. A
computational model for signal processing by the dorsal cochlear nucleus. I. Responses to pure tones.
J. Acoust. Soc. Am.
97: 425-438, 1995.[Medline]
-
RICE, J. J.,
MAY, B. J.,
SPIROU, G. A.,
YOUNG, E. D.
Pinna-based spectral cues for sound localization in cat.
Hear. Res.
58: 132-152, 1992.[Medline]
-
SCHREINER, C. E.,
CALHOUN, B. M.
Spectral envelope coding in cat primary auditory cortex: properties of ripple transfer functions.
Auditory Neurosci.
1: 39-61, 1994.
-
SHAMMA, S. A.,
VERSNEL, H.
Ripple analysis in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary spectral profiles.
Auditory Neurosci.
1: 255-279, 1995.
-
SHAMMA, S. A.,
VERSNEL, H.,
KOWALSKI, N.
Ripple analysis in ferret primary auditory cortex. I. Response characteristics of single units to sinusoidally rippled spectra.
Auditory Neurosci.
1: 233-254, 1995.
-
SPIROU, G. A.,
YOUNG, E. D.
Organization of dorsal cochlear nucleus type IV unit response maps and their relationship to activation by bandlimited noise.
J. Neurophysiol.
65: 1750-1768, 1991.
-
WICKESBERG, R. E.,
DICKSON, J. W.,
GIBSON, M. M.,
GEISLER, C. D.
Wiener kernel analysis of responses from anteroventral cochlear nucleus neurons.
Hear. Res.
14: 155-174, 1984.[Medline]
-
YOUNG, E. D.
Response characteristics of neurons of the cochlear nuclei.
In: Hearing Science, Recent Advances,
edited by
and C. I. Berlin
. San Diego, CA: College Hill, 1984, p. 423-460
-
YOUNG, E. D.,
BROWNELL, W. E.
Responses to tones and noise of single cells in dorsal cochlear nucleus of unanesthetized cats.
J. Neurophysiol.
39: 282-300, 1976.[Abstract/Free Full Text]
-
YOUNG, E. D.,
VOIGT, H. F.
The internal organization of the dorsal cochlear nucleus.
In: Neuronal Mechanisms of Hearing,
edited by
J. Syka,
and L. Aitkin
. New York: Plenum, 1981, p. 127-133