There is increasing evidence that the auditory system shows frequency selectivity for spectro-temporal modulations. A recent study of the authors has shown spectro-temporal modulation masking patterns that were in agreement with the hypothesis of spectro-temporal modulation filters in the human auditory system [Oetjen and Verhey (2015). J. Acoust. Soc. Am. 137(2), 714–723]. In the present study, that experimental data and additional data were used to model this spectro-temporal frequency selectivity. The additional data were collected to investigate to what extent the spectro-temporal modulation-frequency selectivity results from a combination of a purely temporal amplitude-modulation filter and a purely spectral amplitude-modulation filter. In contrast to the previous study, thresholds were measured for masker and target modulations with opposite directions, i.e., an upward pointing target modulation and a downward pointing masker modulation. The comparison of this data set with previous corresponding data with the same direction from target and masker modulations indicate that a specific spectro-temporal modulation filter is required to simulate all aspects of spectro-temporal modulation frequency selectivity. A model using a modified Gabor filter with a purely temporal and a purely spectral filter predicts the spectro-temporal modulation masking data.

2.
Baer
,
T.
, and
Moore
,
B. C. J.
(
1993
). “
Effects of spectral smearing on the intelligibility of sentences in noise
,”
J. Acoust. Soc. Am.
94
,
1229
1241
.
3.
Bernstein
,
J.
,
Mehraei
,
G.
,
Shamma
,
S. A.
,
Gallun
,
F.
,
Theodoroff
,
S.
, and
Leek
,
M.
(
2013
). “
Spectrotemporal modulation sensitivity as a predictor of speech intelligibility for hearing-impaired listeners
,”
J. Am. Acad. Audiol.
24
,
293
306
.
6.
Biebel
,
U.
, and
Langner
,
G.
(
2002
). “
Evidence for interactions across frequency channels in the inferior colliculus of awake chinchilla
,”
Hear. Res.
169
,
151
168
.
7.
Chi
,
T.
,
Gao
,
Y.
,
Guyton
,
M. C.
,
Ru
,
P.
, and
Shamma
,
S. A.
(
1999
). “
Spectro-temporal modulation transfer functions and speech intelligibility
,”
J. Acoust. Soc. Am.
106
,
2719
2732
.
8.
Dau
,
T.
,
Kollmeier
,
B.
, and
Kohlrausch
,
A.
(
1997
). “
Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers
,”
J. Acoust. Soc. Am.
102
,
2892
2905
.
9.
Dau
,
T.
,
Verhey
,
J.
, and
Kohlrausch
,
A.
(
1999
). “
Intrinsic envelope fluctuations and modulation-detection thresholds for narrow-band noise carriers
,”
J. Acoust. Soc. Am.
106
,
2752
2760
.
10.
deCharms
,
R. C.
,
Blake
,
D. T.
, and
Merzenich
,
M. M.
(
1998
). “
Optimizing sound features for cortical neurons
,”
Science
280
,
1439
1443
.
11.
Depireux
,
D.
,
Simon
,
J.
,
Klein
,
D.
, and
Shamma
,
S. A.
(
2001
). “
Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex
,”
J. Neurophysiol.
85
,
1220
1234
.
12.
Drullman
,
R.
,
Festen
,
J. M.
, and
Plomp
,
R.
(
1994
). “
Effects of temporal envelope smearing on speech recognition
,”
J. Acoust. Soc. Am.
95
,
1053
1064
.
14.
Escabi
,
M. A.
, and
Schreiner
,
C. E.
(
2002
). “
Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain
,”
J. Neurosci.
22
,
4114
4131
.
15.
Ewert
,
S. D.
, and
Dau
,
T.
(
2000
). “
Characterizing frequency selectivity for envelope fluctuations
,”
J. Acoust. Soc. Am.
108
,
1181
1196
.
16.
Ewert
,
S. D.
,
Verhey
,
J. L.
, and
Dau
,
T.
(
2002
). “
Spectro-temporal processing in the envelope-frequency domain
,”
J. Acoust. Soc. Am.
112
,
2921
2931
.
18.
Glasberg
,
B. R.
, and
Moore
,
B. C. J.
(
1990
). “
Derivation of auditory filter shapes from notched-noise data
,”
Hear. Res.
47
,
103
108
.
19.
Houtgast
,
T.
(
1989
). “
Frequency selectivity in amplitude-modulation detection
,”
J. Acoust. Soc. Am.
85
,
1676
1680
.
20.
Houtgast
,
T.
,
Steeneken
,
H.
, and
Plomp
,
R.
(
1980
). “
Predicting speech intelligibility in rooms from the modulation transfer function. I. General room acoustics
,”
Acustica
46
,
60
72
.
21.
Jørgensen
,
S.
, and
Dau
,
T.
(
2011
). “
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing
,”
J. Acoust. Soc. Am.
130
,
1475
1487
.
22.
Kleinschmidt
,
M.
(
2003
). “
Localized spectro-temporal features for automatic speech recognition
,” in
Proceedings of Eurospeech
, pp.
2573
2576
.
25.
Kowalski
,
N.
,
Depireux
,
D. A.
, and
Shamma
,
S. A.
(
1996
). “
Analysis of dynamic spectra in ferret primary auditory cortex: I. Characteristics of single unit responses to moving ripple spectra
,”
J. Neurophysiol.
76
,
3503
3523
.
26.
Langner
,
G.
, and
Schreiner
,
C. E.
(
1988
). “
Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
,”
J. Neurophysiol.
60
,
1799
1822
.
27.
Levitt
,
H.
(
1971
). “
Transformed up-down methods in psychoacoustics
,”
J. Acoust. Soc. Am.
49
,
467
477
.
28.
Mesgarani
,
N.
,
Cheung
,
C.
,
Johnson
,
K.
, and
Chang
,
E. F.
(
2014
). “
Phonetic feature encoding in human superior temporal gyrus
,”
Science
343
,
1006
1010
.
29.
Mesgarani
,
N.
,
David
,
S.
,
Fritz
,
J.
, and
Shamma
,
S. A.
(
2008
). “
Phoneme representation and classification in primary auditory cortex
,”
J. Acoust. Soc. Am.
123
,
899
909
.
30.
Mesgarani
,
N.
, and
Shamma
,
S. A.
(
2007
). “
Denoising in the domain of spectrotemporal modulations
,”
EURASIP J. Audio Speech Music Process.
2007
,
42357
.
31.
Meyer
,
B.
, and
Kollmeier
,
B.
(
2011
). “
Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition
,”
Speech Commun.
53
,
753
767
.
32.
Miller
,
L. M.
,
Escabi
,
M. A.
,
Read
,
H. L.
, and
Schreiner
,
C. E.
(
2002
). “
Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex
,”
J. Neurophysiol.
87
,
516
527
.
34.
Moore
,
B. C. J.
,
Füllgrabe
,
C.
, and
Sek
,
A.
(
2008
). “
Estimation of the center frequency of the highest modulation filter
,”
J. Acoust. Soc. Am.
125
,
1075
1081
.
35.
Nagel
,
K.
, and
Doupe
,
A. J.
(
2008
). “
Organizing principles of spectro-temporal encoding in the avian primary auditory area field L
,”
Neuron
58
,
938
955
.
37.
Noordhoek
,
I.
, and
Drullman
,
R.
(
1997
). “
Effect of reducing temporal intensity modulations on sentence intelligibility
,”
J. Acoust. Soc. Am.
101
,
498
502
.
38.
Oetjen
,
A.
, and
Verhey
,
J. L.
(
2015
). “
Spectro-temporal modulation masking patterns reveal frequency selectivity
,”
J. Acoust. Soc. Am.
137
,
714
723
.
39.
Rodriguez
,
F. A.
,
Chen
,
C.
,
Read
,
H. L.
, and
Escabi
,
M. A.
(
2010a
). “
Neural modulation tuning characteristics scale to efficiently encode natural sound statistics
,”
J. Neurosci.
30
,
15969
15980
.
40.
Rodriguez
,
F. A.
,
Read
,
H. L.
, and
Escabi
,
M. A.
(
2010b
). “
Spectral and temporal modulation tradeoff in the inferior colliculus
,”
J. Neurophysiol.
103
,
887
903
.
41.
Saoji
,
A. A.
, and
Eddins
,
D. A.
(
2007
). “
Spectral modulation masking patterns reveal tuning to spectral envelope frequency
,”
J. Acoust. Soc. Am.
122
,
1004
1013
.
42.
Schädler
,
M. R.
,
Meyer
,
B.
, and
Kollmeier
,
B.
(
2012
). “
Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition
,”
J. Acoust. Soc. Am.
131
,
4134
4151
.
43.
Schönwiesner
,
M.
, and
Zatorre
,
R. J.
(
2008
). “
Spectro-temporal modulation transfer function of single voxels in the human auditory cortex measured with high-resolution fMRI
,”
Proc. Natl. Acad. Sci. U.S.A.
106
,
14611
14616
.
44.
Simon
,
J. Z.
,
Depireux
,
D. A.
,
Klein
,
D. J.
,
Fritz
,
J. B.
, and
Shamma
,
S. A.
(
2007
). “
Temporal symmetry in primary auditory cortex: Implications for cortical connectivity
,”
Neural Comput.
19
,
583
638
.
45.
Steeneken
,
H. J. M.
, and
Houtgast
,
T.
(
1980
). “
A physical method for measuring speech-transmission quality
,”
J. Acoust. Soc. Am.
67
,
318
326
.
49.
Verhey
,
J.
,
Dau
,
T.
, and
Kollmeier
,
B.
(
1999
). “
Within-channel cues in comodulation masking release (CMR): Experiments and model predictions using a modulation-filterbank model
,”
J. Acoust. Soc. Am.
106
,
2733
2745
.
You do not currently have access to this content.