Ideal time-frequency (TF) masks can reject noise and improve the recognition of speech-noise mixtures. An ideal TF mask is constructed with prior knowledge of the target speech signal. The intelligibility of a processed speech-noise mixture depends upon the threshold criterion used to define the TF mask. The study reported here assessed the effect of training on the recognition of speech in noise after processing by ideal TF masks that did not restore perfect speech intelligibility. Two groups of listeners with normal hearing listened to speech-noise mixtures processed by TF masks calculated with different threshold criteria. For each group, a threshold criterion that initially produced word recognition scores between 0.56–0.69 was chosen for training. Listeners practiced with one set of TF-masked sentences until their word recognition performance approached asymptote. Perceptual learning was quantified by comparing word-recognition scores in the first and last training sessions. Word recognition scores improved with practice for all listeners with the greatest improvement observed for the same materials used in training.

1.
Anzalone
,
M. C.
,
Calandruccio
,
L.
,
Doherty
,
K. A.
, and
Carney
,
L. H.
(
2006
). “
Determination of the potential benefit of time-frequency gain manipulation
,”
Ear Hear.
27
,
480
492
.
2.
Bolia
,
R. S.
,
Nelson
,
T. W.
,
Ericson
,
M. A.
, and
Simpson
,
B.
(
2000
). “
A speech corpus for multitalker communications research
,”
J. Acoust. Soc. Am.
107
,
1065
1066
.
3.
Brungart
,
D. S.
,
Chang
,
P. S.
,
Simpson
,
B. D.
, and
Wang
,
D. L.
(
2006
). “
Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
,”
J. Acoust. Soc. Am.
120
,
4007
4018
.
4.
Burk
,
M. H.
, and
Humes
,
L. E.
(
2007
). “
Effects of training on speech-recognition performance in noise using lexically hard words
,”
J. Speech Lang. Hear. Res.
50
,
25
40
.
5.
Burk
,
M. H.
, and
Humes
,
L. E.
(
2009
). “
Effects of long-term training on aided speech-recognition performance in noise in older adults
,”
J. Speech Lang. Hear. Res.
51
,
759
771
.
6.
Burk
,
M. H.
,
Humes
,
L. E.
,
Amos
,
N.
, and
Strauser
,
L.
(
2006
). “
Effect of training on word-recognition performance in noise for young normal-hearing and older hearing-impaired listeners
,”
Ear Hear.
27
,
263
278
.
7.
Choi
,
S.
,
Kirk
,
K.
,
Talavage
,
T.
,
Krull
,
V.
,
Smalt
,
C.
, and
Baker
,
S.
(
2009
). “
Effects of training format on perceptual learning of spectrally degraded voice,s
J. Acoust. Soc. Am.
125
,
2526
.
8.
Eddins
,
D. A.
, and
Li
,
C.
(
2012
). “
Psychometric properties of the coordinate response measure corpus with various types of background interference
,”
J. Acoust. Soc. Am.
131
,
EL177
EL183
.
9.
Helfer
,
K. S.
, and
Freyman
,
R. L.
(
2008
). “
Aging and speech-on-speech masking
,”
Ear Hear.
29
,
87
98
.
10.
Kim
,
G.
,
Lu
,
Y.
,
Hu
,
Y.
, and
Loizou
,
P. C.
(
2009
). “
An algorithm that improves speech intelligibility in noise for normal-hearing listeners
,”
J. Acoust. Soc. Am.
126
,
1486
1494
.
11.
Kjems
,
U.
,
Boldt
,
J. B.
,
Pedersen
,
M. S.
,
Lunner
,
T.
, and
Wang
,
D. L.
(
2009
). “
Role of mask pattern in intelligibility of ideal binary-masked noisy speech
,”
J. Acoust. Soc. Am.
126
,
1415
1426
.
12.
Li
,
N.
, and
Loizou
,
P. C.
(
2008
). “
Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
,”
J. Acoust. Soc. Am.
123
,
1673
1682
.
14.
Moore
,
B. C. J.
, and
Glasberg
,
B. R.
(
1987
). “
Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns
,”
Hear Res.
28
,
209
225
.
15.
Narayanan
,
A.
, and
Wang
,
D. L.
(
2010
). “
Robust speech recognition from binary masks
,”
J. Acoust. Soc. Am.
128
,
EL217
EL222
.
16.
Plomp
,
R.
, and
Mimpen
,
A. M.
(
1979
). “
Speech-reception threshold for sentences as a function of age and noise level
,”
J. Acoust. Soc. Am.
66
,
1333
1342
.
17.
Sweetow
,
R.
, and
Palmer
,
C. V.
(
2005
). “
Efficacy of individual auditory training in adults: A systematic review of the evidence
,”
J. Am. Acad. Audiol.
16
,
494
504
.
18.
Turner
,
C. W.
(
2006
). “
Hearing loss and the limits of amplification
,”
Audiol. Neuro-Otol.
11
,
Suppl. 1
,
2
5
.
19.
Tyler
,
R. S.
,
Witt
,
S. A.
,
Dunn
,
C. C.
, and
Wang
,
W.
(
2010
). “
Initial development of a spatially separated speech-in-noise and localization training program
,”
J. Am. Acad. Audiol.
21
,
390
403
.
20.
Wang
,
D. L.
(
2005
). “
On ideal binary mask as the computational goal of auditory scene analysis
,” in
Speech Separation by Humans and Machines
, edited by
P.
Divenyi
(
Kluwer Academic
,
Norwell, MA
), pp.
181
197
.
21.
Wang
,
D. L.
(
2008
). “
Time-frequency masking for speech separation and its potential for hearing aid design
,”
Trends Amplif.
12
,
332
353
.
22.
Wang
,
D. L.
,
Kjems
,
U.
,
Pedersen
,
M. S.
,
Boldt
,
J. B.
, and
Lunner
,
T.
(
2008
). “
Speech perception of noise with binary gains
,”
J. Acoust. Soc. Am.
124
,
2303
2307
.
23.
Wang
,
D. L.
,
Kjems
,
U.
,
Pedersen
,
M. S.
,
Boldt
,
J. B.
, and
Lunner
,
T.
(
2009
). “
Speech intelligibility in background noise with ideal binary time-frequency masking
,”
J. Acoust. Soc. Am.
125
,
2336
2347
.
You do not currently have access to this content.