The frequency dependence of phase locking in the auditory nerve influences various auditory coding mechanisms. The decline of phase locking with increasing frequency is commonly described by a low-pass filter. This study compares fitted low-pass filter parameters with the actual rate of phase locking decline. The decline is similar across studies and only 40 dB per decade, corresponding to the asymptotic decline of a second order filter.

The central prerequisite for encoding temporal fine structure (TFS) in the auditory system is the phase locking of auditory nerve (AN) fibers to the frequency of a stimulus (Moore, 2008). This phase-locking is usually quantified by vector strength (VS) (Goldberg and Brown, 1969). The detailed behavior of vector strength across frequency in humans remains unclear and estimates of a so called “upper frequency limit” vary substantially from 1.5 to 10 kHz (Verschooten , 2019). Establishing the vector strength of phase locking in the AN as a function of frequency would be highly informative for elucidating various auditory coding mechanisms from pitch perception to binaural hearing.

It is well known that vector strength in single fibers of the AN declines above a certain frequency (Rose , 1967; Johnson, 1980; Joris , 1994) that varies across species (Weiss and Rose, 1988a). The frequency above which vector strength becomes insignificant is often referred to as the upper frequency limit (Palmer and Russell, 1986; Verschooten , 2015; Joris and Verschooten, 2013). Weiss and Rose (1988a) criticized the comparison of the upper frequency limit among different species stating: “This metric obviously depends upon the method of detection and is generally the highest frequency for which the experimenter detected synchronization in the measurements.” The upper frequency limit is therefore ill-defined. Any attempt to define the upper limit for this continually decreasing function depends entirely on the noise floor [Michael Heinz in Verschooten (2019)]. Despite these shortcomings, the ill-defined characterization of the upper frequency limit is still often used in the auditory literature (Liu , 2006; Verschooten , 2015; Joris and Verschooten, 2013; Verschooten , 2019).

To avoid the above-mentioned shortcomings and to provide a clear definition, Weiss and Rose (1988a) described the frequency dependence using a low-pass filter cascade with the frequency response
G ( f ) = g ( 1 + ( f f 0 ) 2 ) n ,
(1)
where
f 0 = f corner 2 1 / n 1 .
(2)
Here f is the frequency, f corner the corner frequency, n the order, and g the gain at f = 0 Hz. In this filter design, a first-order filter is applied n-times. Therefore, the −3 dB point of this filter (f0) needs to be adjusted by Eq. (2) to guarantee that the −3 dB point of the filter cascade is at f corner independent of n.

To estimate the filter order, Weiss and Rose (1988a) compensated for the different corner frequencies across species and fitted a regression line to the five highest frequency points [two from Johnson (1980) and three from Weiss and Rose (1988b)], resulting in a decline of 106 dB/decade. This line is reproduced as the dotted black line in Fig. 1. Overall, this decline was related to a filter order of 4–6 by Weiss and Rose (1988a). However, using a cascaded low-pass filter of order n = 5 with a corner frequency f corner = 2500 does not account for the vector strength, e.g., of the cat data from Johnson (1980). Figure 1 clearly shows that the transition band of the filter is too wide and the nominal decline of 100 dB/decade is reached only at much higher frequencies.

Fig. 1.

Maximum vector strength for tones at the center frequency of the measured AN fiber as a function of frequency. Circles are data from the cat (Johnson, 1980). Corner frequency (green +) and decline (dotted black line) were estimated by Weiss and Rose (1988a). The transition band of the cascade low-pass filter [see Eq. (1)] with f corner = 2500 Hz and n = 5 (solid blue line) is too wide to account for the data.

Fig. 1.

Maximum vector strength for tones at the center frequency of the measured AN fiber as a function of frequency. Circles are data from the cat (Johnson, 1980). Corner frequency (green +) and decline (dotted black line) were estimated by Weiss and Rose (1988a). The transition band of the cascade low-pass filter [see Eq. (1)] with f corner = 2500 Hz and n = 5 (solid blue line) is too wide to account for the data.

Close modal

Despite these discrepancies to physiological data, the filter is prevalent in computational auditory models (see Table 1 for an overview). Some of the studies [e.g., Bernstein and Trahiotis (1996)] motivate their filter order based on the nominal order suggested by Weiss and Rose (1988a), arguably requiring a very low corner frequency of 425 Hz to obtain a sufficiently steep roll-off at higher frequencies. Others, e.g., Heinz (2001), instead use a higher filter order for the effect of the inner hair cell (IHC), whereby the combination with their other stages (outer hair cell, synapse) reproduces the phase-locking roll-off from Johnson (1980).

Table 1.

Low-pass filter contributing to the decline of AN phase locking as a function of frequency reported by several publications.

Order (n) f corner [f0] type
Weiss and Rose (1988a)   4–6  2500 Hz (cat)  cascade 
Lopez-Poveda and Eustaquio-Martín (2006)      
Verschooten (2015)   (3)    decline of 60 dB/dec. 
Peterson and Heil (2020)   depends on best frequency  Butterworth 
Bernstein and Trahiotis (1996)   425 Hz  cascade 
Breebaart (2001a)   770 Hz  cascade 
Heinz (2001) a   1500 Hzb [4800 Hz]  cascade 
Zhang (2001) a   1200 Hz [3800 Hz]  cascade 
Zilany (2009) a   1000 Hz [3000 Hz]c  cascade 
Order (n) f corner [f0] type
Weiss and Rose (1988a)   4–6  2500 Hz (cat)  cascade 
Lopez-Poveda and Eustaquio-Martín (2006)      
Verschooten (2015)   (3)    decline of 60 dB/dec. 
Peterson and Heil (2020)   depends on best frequency  Butterworth 
Bernstein and Trahiotis (1996)   425 Hz  cascade 
Breebaart (2001a)   770 Hz  cascade 
Heinz (2001) a   1500 Hzb [4800 Hz]  cascade 
Zhang (2001) a   1200 Hz [3800 Hz]  cascade 
Zilany (2009) a   1000 Hz [3000 Hz]c  cascade 
a

This filter is applied for only the effect of the inner hair cell (IHC).

b

According to Eq. (2). Heinz (2001) reported f corner  2500 Hz for the AN phase locking including their synapse stage.

c

Used in following versions up to Bruce (2018).

The two goals of the current study were therefore: (1) To revisit the fitting of a low-pass filter to vector strength data and (2) to quantify the decline of AN phase locking as a function of frequency.

To determine the decline of AN phase locking with increasing frequency, we propose three functions which we fit to three data sets each: (1) The cascade low-pass filter in Eq. (1) introduced by Weiss and Rose (1988a), in the following referred to as CASCADE. (2) A Butterworth low-pass filter, in the following referred to as BUTTERW, which has a fast transition from passband to the true decay of the filter order. In this case, the frequency response is given by
G ( f ) = g ( 1 + ( f f corner ) 2 n ) .
(3)
Note that the only difference to the frequency response of CASCADE is that the exponent/order n is not to the power of the whole denominator as in Eq. (1). (3) A constant decline archived by fitting a broken-stick function to the data in double logarithmic space,
G ̂ ( f ) / dB = 20 log 10 ( G ( f ) ) = { m log 10 ( f b / f ) + g for f f b , g for f < f b ,
(4)
where g determines the gain at low frequencies and m the slope of the function after the knee point f b. The resulting function is in the following referred to as LINEAR and is plotted with the other functions in Fig. 2. LINEAR is also defined by f corner the −3 dB point by adjusting the knee point f b.
Fig. 2.

Different filter functions to be fitted to the data with g = 1, n = 4. CASCADE is from Eq. (1), BUTTERW from Eq. (3), and LINEAR based on Eq. (4). The top panel shows the filter gain or VS across frequency. The lower panel shows the slope as a function of frequency.

Fig. 2.

Different filter functions to be fitted to the data with g = 1, n = 4. CASCADE is from Eq. (1), BUTTERW from Eq. (3), and LINEAR based on Eq. (4). The top panel shows the filter gain or VS across frequency. The lower panel shows the slope as a function of frequency.

Close modal

As data sets, we used vector strength derived from single-unit neuron recordings: from AN fibers in Johnson (1980) from cats, as well as from AN fibers in Heeringa (2020) from gerbils and in Palmer and Russell (1986) from guinea pigs. As the error metric, we chose the root mean square error (RMSE) in the logarithmic space. The RMSE for all three functions describing the vector strength in the three data sets was calculated as a function of order n, gain g, and corner frequency f corner. The minimum RMSE fit was derived by a grid search, the range and step size of the individual parameters are shown in Table 2. For the calculations, matlab (MathWorks, Natick, MA) was used. By definition, the filter order is an integer number but the filter equations produce meaningful output also for any positive non-integer number n. As the focus of the present study is on describing the data, some non-integer values for n were included.

Table 2.

Range and step size for the grid search for the minimum RMSE fit.

Range Step
Order n  1–16  1a 
Gain g  0.6–1  0.01 
f corner  1000–3000 Hz  10 Hz 
Range Step
Order n  1–16  1a 
Gain g  0.6–1  0.01 
f corner  1000–3000 Hz  10 Hz 
a

Smaller if necessary.

For eight of the nine fitted functions (3 types × 3 data sets), a global best fit was obtained within the abovementioned parameter range, i.e., not at the lowest or highest parameter values. The only exception is CASCADE, which showed no local minimum for the order in Johnson (1980). The best solutions are shown above the respective data sets in the left column in Fig. 3. The right column of Fig. 3 shows the RMSE across order n for the three functions and three data sets. LINEAR results in the lowest decline estimates followed by BUTTERW, followed by CASCADE. For CASCADE the best fit low-pass filter order is 3 for Palmer and Russell (1986) and 4 for (Heeringa , 2020). For these two data sets the roll-off within the shown range is about 40 dB/dec, as the cascade filters have not reached their maximum roll-off. For CASCADE, the RMSE declines across n for the Johnson (1980) data set and shows no local minimum for n 16. This decrease is caused by the data point with the highest frequency and lowest vector strength. For all data sets BUTTERW provides the lowest RMSE solutions. Fits were similar when only data with frequencies > 1.8 kHz or > 3 kHz were considered to estimate the roll-off.

Fig. 3.

(Left) The three panels show the vector strength for different data sets and the minimum RMSE solution for the three filter functions (color code). (Right) RMSE across filter order n for the three different functions (color code) and three different data sets.

Fig. 3.

(Left) The three panels show the vector strength for different data sets and the minimum RMSE solution for the three filter functions (color code). (Right) RMSE across filter order n for the three different functions (color code) and three different data sets.

Close modal

The decline in vector strength at high frequencies is much flatter than implied by the nominal low-pass filter order. Especially for the cascade filter, the slope at intermediate frequencies is different from the asymptotic values (cf. Fig. 2). Linear fits of the high-frequency roll-off suggest a steepness of about 40 dB/dec, corresponding to the asymptotic steepness of a second order low-pass filter.

As the low-pass characteristic of the phase locking decline is the outcome of a sequential interaction of several processing stages, the use of a cascade filter is a logical means for a biologically feasible simulation of the underlying processes. The low-pass filter order of this type is at least 3 and even much larger orders provide good fits. The roll-off within the range shown is generally no more than about 60 dB/dec, still considerably lower than the 106 dB/dec derived by Weiss and Rose (1988a). While it produced generally accurate fits, the biologically less plausible filters had lower RMS errors in all three data sets.

While the corner frequency appears to differ between different species, Palmer and Russell (1986) and Weiss and Rose (1988a) concluded that there is a similar decline across species. The similarity in decline across species is unsurprising, considering that the free parameters of the resistor-capacitor (RC) low-pass filter can only influence the corner frequency but not the filter order (Lopez-Poveda and Eustaquio-Martín, 2006). On the other hand, Altoè (2018) showed that the voltage-dependent activation of the K+ channels in the IHC enhances the phase-locking properties within a small resonant frequency range. The decline above the resonant frequency range is consequently steeper than with a conventional RC low-pass filter.

Without investigating specific mechanisms in the IHC-AN complex, this study only analyses the function of VS with respect to frequency that can be used in functional models. Thus, we do not claim that any of the filters we employ correspond to the physiological processes. From the fitted parameter (see Fig. 2), a small difference in slope can be deduced, with the corner frequency varying between species, as was reported by Weiss and Rose (1988a). This finding suggests that the common practice of using the same filter order in models of the human auditory system as determined for animal models [e.g., Bernstein and Trahiotis (1996)] is plausible.

The decline of vector strength across frequency is particularly important for understanding the interaural phase difference (IPD) encoding mechanism. Phase-locking information is a strict prerequisite, as it is the only available cue for IPD encoding. In normal hearing humans and for pure tones, it is well established that the sensitivity to IPD decreases rapidly at frequencies above  1300 Hz (Brughera , 2013; Klug and Dietz, 2022). A common hypothesis is that this decline in IPD sensitivity is caused by the decline in phase locking of the AN fibers (Joris and Verschooten, 2013; Verschooten , 2019). In computational models of human IPD perception, the decline of AN phase locking usually contributes considerably, or even exclusively, to the decline in IPD sensitivity (Breebaart , 2001b; Brughera , 2013; Bouse , 2019). However, the decline in vector strength in the AN ( m 60 dB/dec) is not sufficient to account for the dramatic decline of IPD sensitivity in humans occurring in the narrow range between 1300 and 1500 Hz ( m 150 dB/dec), as reported in Klug and Dietz (2022). Therefore, the high-frequency limit of IPD sensitivity is a less suitable correlate for the AN phase locking “limit” than previously thought (Verschooten , 2019; Bernstein and Trahiotis, 1996) and another effect must contribute to limiting IPD sensitivity.

The nominal decline of 80 to 120 dB/dec reported by Weiss and Rose (1988a) does not accurately describe the decline of vector strength in AN fibers across frequency. In the biophysically plausible and widely used cascade filter, the slope changes slowly with frequency, so the slope at intermediate frequencies differs substantially from its asymptote. The relevant vector strength values decline with approximately 40 dB/dec—the asymptotic decline of a second order filter.

We thank Amarins Heeringa for sharing her data. Furthermore, we are grateful to Henri Pöntynen for his assistance with formulations. This project received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation Programme (Grant Agreement No. 716800).

1.
Altoè
,
A.
,
Pulkki
,
V.
, and
Verhulst
,
S.
(
2018
). “
The effects of the activation of the inner-hair-cell basolateral K+ channels on auditory nerve responses
,”
Hear. Res.
364
,
68
80
.
2.
Bernstein
,
L. R.
, and
Trahiotis
,
C.
(
1996
). “
The normalized correlation: Accounting for binaural detection across center frequency
,”
J. Acoust. Soc. Am.
100
(
6
),
3774
3784
.
3.
Bouse
,
J.
,
Vencovský
,
V.
,
Rund
,
F.
, and
Marsalek
,
P.
(
2019
). “
Functional rate-code models of the auditory brainstem for predicting lateralization and discrimination data of human binaural perception
,”
J. Acoust. Soc. Am.
145
(
1
),
1
15
.
4.
Breebaart
,
J.
,
van de Par
,
S.
, and
Kohlrausch
,
A.
(
2001a
). “
Binaural processing model based on contralateral inhibition. I. Model structure
,”
J. Acoust. Soc. Am.
110
(
2
),
1074
1088
.
5.
Breebaart
,
J.
,
van de Par
,
S.
, and
Kohlrausch
,
A.
(
2001b
). “
Binaural processing model based on contralateral inhibition. II. Dependence on spectral parameters
,”
J. Acoust. Soc. Am.
110
(
2
),
1089
1104
.
6.
Bruce
,
I. C.
,
Erfani
,
Y.
, and
Zilany
,
M. S.
(
2018
). “
A phenomenological model of the synapse between the inner hair cell and auditory nerve: Implications of limited neurotransmitter release sites
,”
Hear. Res.
360
,
40
54
.
7.
Brughera
,
A.
,
Dunai
,
L.
, and
Hartmann
,
W. M.
(
2013
). “
Human interaural time difference thresholds for sine tones: The high-frequency limit
,”
J. Acoust. Soc. Am.
133
(
5
),
2839
2855
.
8.
Goldberg
,
J. M.
, and
Brown
,
P. B.
(
1969
). “
Response of binaural neurons of dog superior olivary complex to dichotic tonal stimuli: Some physiological mechanisms of sound localization
,”
J. Neurophysiol.
32
(
4
),
613
636
.
9.
Heeringa
,
A. N.
,
Zhang
,
L.
,
Ashida
,
G.
,
Beutelmann
,
R.
,
Steenken
,
F.
, and
Köppl
,
C.
(
2020
). “
Temporal coding of single auditory nerve fibers is not degraded in aging gerbils
,”
J. Neurosci.
40
(
2
),
343
354
.
10.
Heinz
,
M. G.
,
Colburn
,
H. S.
, and
Carney
,
L. H.
(
2001
). “
Evaluating auditory performance limits: I. One-parameter discrimination using a computational model for the auditory nerve
,”
Neural Comput.
13
(
10
),
2273
2316
.
11.
Johnson
,
D. H.
(
1980
). “
The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones
,”
J. Acoust. Soc. Am.
68
(
4
),
1115
1122
.
12.
Joris
,
P. X.
,
Carney
,
L. H.
,
Smith
,
P. H.
, and
Yin
,
T. C.
(
1994
). “
Enhancement of neural synchronization in the anteroventral cochlear nucleus. I. Responses to tones at the characteristic frequency
,”
J. Neurophysiol.
71
(
3
),
1022
1036
.
13.
Joris
,
P. X.
, and
Verschooten
,
E.
(
2013
). “
On the limit of neural phase locking to fine structure in humans
,” in
Basic Aspects of Hearing
, edited by
B. C. J.
Moore
,
R. D.
Patterson
,
I. M.
Winter
,
R. P.
Carlyon
, and
H. E.
Gockel
(
Springer
,
New York)
, pp.
101
108
.
14.
Klug
,
J.
, and
Dietz
,
M.
(
2022
). “
Frequency dependence of sensitivity to interaural phase differences in pure tones
,”
J. Acoust. Soc. Am.
152
(
6
),
3130
3141
.
15.
Liu
,
L.-F.
,
Palmer
,
A. R.
, and
Wallace
,
M. N.
(
2006
). “
Phase-locked responses to pure tones in the inferior colliculus
,”
J. Neurophysiol.
95
(
3
),
1926
1935
.
16.
Lopez-Poveda
,
E. A.
, and
Eustaquio-Martín
,
A.
(
2006
). “
A biophysical model of the inner hair cell: The contribution of potassium currents to peripheral auditory compression
,”
J. Assoc. Res. Otolaryngol.
7
,
218
235
.
17.
Moore
,
B. J. C.
(
2008
). “
The role of temporal fine structure processing in pitch perception, masking, and speech perception for normal-hearing and hearing-impaired people
,”
J. Assoc. Res. Otolaryngol.
9
,
399
–406.
18.
Palmer
,
A. R.
, and
Russell
,
I. J.
(
1986
). “
Phase-locking in the cochlear nerve of the guinea-pig and its relation to the receptor potential of inner hair-cells
,”
Hear. Res.
24
(
1
),
1
15
.
19.
Peterson
,
A. J.
, and
Heil
,
P.
(
2020
). “
Phase locking of auditory nerve fibers: The role of lowpass filtering by hair cells
,”
J. Neurosci.
40
(
24
),
4700
4714
.
20.
Rose
,
J. E.
,
Brugge
,
J. F.
,
Anderson
,
D. J.
, and
Hind
,
J. E.
(
1967
). “
Phase-locked response to low-frequency tones in single auditory nerve fibers of the squirrel monkey
,”
J. Neurophysiol.
30
(
4
),
769
793
.
21.
Verschooten
,
E.
,
Robles
,
L.
, and
Joris
,
P. X.
(
2015
). “
Assessment of the limits of neural phase-locking using mass potentials
,”
J. Neurosci.
35
(
5
),
2255
2268
.
22.
Verschooten
,
E.
,
Shamma
,
S.
,
Oxenham
,
A.
,
Moore
,
B.
,
Joris
,
P.
,
Heinz
,
M.
, and
Plack
,
C.
(
2019
). “
The upper frequency limit for the use of phase locking to code temporal fine structure in humans: A compilation of viewpoints
,”
Hear. Res.
377
,
109
–121.
23.
Weiss
,
T.
, and
Rose
,
C.
(
1988a
). “
A comparison of synchronization filters in different auditory receptor organs
,”
Hear. Res.
33
(
2
),
175
179
.
24.
Weiss
,
T.
, and
Rose
,
C.
(
1988b
). “
Stages of degradation of timing information in the cochlea: A comparison of hair-cell and nerve-fiber responses in the alligator lizard
,”
Hear. Res.
33
(
2
),
167
174
.
25.
Zhang
,
X.
,
Heinz
,
M. G.
,
Bruce
,
I. C.
, and
Carney
,
L. H.
(
2001
). “
A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression
,”
J. Acoust. Soc. Am.
109
(
2
),
648
670
.
26.
Zilany
,
M. S. A.
,
Bruce
,
I. C.
,
Nelson
,
P. C.
, and
Carney
,
L. H.
(
2009
). “
A phenomenological model of the synapse between the inner hair cell and auditory nerve: Long-term adaptation with power-law dynamics
,”
J. Acoust. Soc. Am.
126
(
5
),
2390
2412
.