Overlap-masking degrades speech intelligibility in reverberation [R. H. Bolt and A. D. MacDonald, J. Acoust. Soc. Am.21(6), 577580 (1949)]. To reduce the effect of this degradation, steady-state suppression has been proposed as a preprocessing technique [Arai et al, Proc. Autumn Meet. Acoust. Soc. Jpn., 2001; Acoust. Sci. Tech.23(8), 229232 (2002)]. This technique automatically suppresses steady-state portions of speech that have more energy but are less crucial for speech perception. The present paper explores the effect of steady-state suppression on syllable identification preceded by /a/ under various reverberant conditions. In each of two perception experiments, stimuli were presented to 22 subjects with normal hearing. The stimuli consisted of mono-syllables in a carrier phrase with and without steady-state suppression and were presented under different reverberant conditions using artificial impulse responses. The results indicate that steady-state suppression statistically improves consonant identification for reverberation times of 0.7 to 1.2s. Analysis of confusion matrices shows that identification of voiced consonants, stop and nasal consonants, and bilabial, alveolar, and velar consonants were especially improved by steady-state suppression. The steady-state suppression is demonstrated to be an effective preprocessing method for improving syllable identification by reducing the effect of overlap-masking under specific reverberant conditions.

1.
Arai
,
T.
, and
Greenberg
,
S.
(
1997
). “
The temporal properties of spoken Japanese are similar to those of English
,”
Proc. ESCA Eurospeech
, Vol.
2
, pp.
1011
1014
.
2.
Arai
,
T.
, and
Greenberg
,
S.
(
1998
). “
Speech intelligibility in the presence of cross-channel spectral asynchrony
,”
Proc. IEEE ICASSP
, Vol.
2
, pp.
933
936
.
3.
Arai
,
T.
,
Pavel
,
M.
,
Hermansky
,
H.
, and
Avendano
,
C.
(
1999
). “
Syllable intelligibility for temporally filtered LPC cepstral trajectories
,”
J. Acoust. Soc. Am.
105
(
5
),
2783
2791
.
4.
Arai
,
T.
,
Kinoshita
,
K.
,
Hodoshima
,
N.
,
Kusumoto
,
A.
, and
Kitamura
,
T.
(
2001
). “
Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments
,”
Proc. Autumn Meet. Acoust. Soc. Jpn.
, Vol.
1
, pp.
449
450
(in Japanese).
5.
Arai
,
T.
,
Kinoshita
,
K.
,
Hodoshima
,
N.
,
Kusumoto
,
A.
, and
Kitamura
,
T.
(
2002
). “
Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments
,”
Acoust. Sci. & Tech.
23
(
4
),
229
232
.
6.
Avendano
,
C.
, and
Hermansky
,
H.
(
1996
). “
Study on the dereverberation of speech based on temporal envelope filtering
,”
Proc. ESCA ICSLP
, Vol.
2
, pp.
889
892
.
7.
Bolt
,
R. H.
, and
MacDonald
,
A. D.
(
1949
). “
Theory of speech masking by reverberation
,”
J. Acoust. Soc. Am.
21
(
6
),
577
580
.
8.
Crandell
,
C. C.
, and
Smaldino
,
J. J.
(
2000
). “
Classroom acoustics for children with normal hearing and with hearing impairment
,”
J. Lang. Speech Hear. Services Schools
31
(
4
),
362
370
.
9.
Drullman
,
R.
,
Festen
,
J. M.
, and
Plomp
,
R.
(
1994
). “
Effect of temporal envelope smearing on speech reception
,”
J. Acoust. Soc. Am.
95
(
2
),
1053
1064
.
10.
Duquesnoy
,
A. J.
, and
Plomp
,
R.
(
1980
). “
Effect of reverberation and noise on the intelligibility of sentences in cases of presbyacusis
,”
J. Acoust. Soc. Am.
68
(
2
),
537
544
.
11.
Elliot
,
L. L.
(
1982
). “
Effects of noise on perception of speech by children and certain handicapped individuals
,”
J. Sound Vib.
16
(
12
),
10
14
.
12.
Finitzo-Hieber
,
T.
, and
Tillman
,
T.
(
1978
). “
Room acoustics effects on monosyllabic word discrimination ability for normal and hearing-impaired children
,”
J. Speech Hear. Res.
21
(
3
),
440
458
.
13.
Flanagan
,
J. L.
,
Johnston
,
J. D.
,
Zahn
,
R.
, and
Elko
,
G. W.
(
1985
). “
Computer-steered microphone arrays for sound transmission in large rooms
,”
J. Acoust. Soc. Am.
78
(
5
),
1508
1518
.
14.
Furui
,
S.
(
1986
). “
On the role of spectral transition for speech perception
,”
J. Acoust. Soc. Am.
80
(
4
),
1016
1025
.
15.
Gordon-Salant
,
S.
(
1986
). “
Recognition of natural and time/intensity altered CVs by young and elderly subjects with normal hearing
,”
J. Acoust. Soc. Am.
80
(
6
),
1599
1607
.
16.
Greenberg
,
S.
, and
Arai
,
T.
(
2004
). “
What are the essential cues for understanding spoken language?
IEICE Trans. Inf. Syst.
87-D
(
5
),
1059
1070
.
17.
Hermansky
,
H.
, and
Morgan
,
N.
(
1994
). “
RASTA processing of speech
,”
IEEE Trans. Speech Audio Process.
2
(
4
),
578
589
.
18.
Hodoshima
,
N.
,
Arai
,
T.
, and
Kusumoto
,
A.
(
2002
). “
Enhancing temporal dynamics of speech to improve intelligibility in reverberant environments
,”
Proc. Forum Acusticum Sevilla
.
19.
Houtgast
,
T.
, and
Steeneken
,
H. J. M.
(
1985
). “
A review of MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
,”
J. Acoust. Soc. Am.
77
(
3
),
1069
1077
.
20.
Kinoshita
,
K.
,
Nakatani
,
T.
, and
Miyoshi
,
M.
(
2005
). “
Efficient blind dereverberation framework for automatic speech recognition
,”
Proc. ISCA Interspeech
, pp.
3145
3148
.
21.
Kirk
,
R. E.
(
1995
).
Experimental Design: Procedures for the Behavioral Sciences
(
Brooks/Cole
, Pacific Grove, CA).
22.
Kitamura
,
T.
,
Kinoshita
,
K.
,
Arai
,
T.
,
Kusumoto
,
A.
, and
Murahara
,
Y.
(
2000
). “
Designing modulation filters for improving speech intelligibility in reverberant environments
,”
Proc. ICSLP
, Vol.
3
, pp.
586
589
.
23.
Knudsen
,
V. O.
(
1929
). “
The hearing of speech in auditoriums
,”
J. Acoust. Soc. Am.
1
(
1
),
56
82
.
24.
Kusumoto
,
A.
,
Arai
,
T.
,
Takahashi
,
M.
, and
Murahara
,
Y.
(
2000
). “
Modulation enhancement of speech as a preprocessing for reverberant chambers with the hearing-impaired
,”
Proc. IEEE ICASSP
, Vol.
2
, pp.
853
856
.
25.
Kusumoto
,
A.
,
Arai
,
T.
,
Kinoshita
,
K.
, and
Hodoshima
,
N.
(
2005
). “
Modulation enhancement of speech by preprocessing for improving intelligibility in reverberant environment
,”
Speech Commun.
45
(
2
),
101
113
.
26.
Langhans
,
T.
, and
Strube
,
H. W.
(
1982
). “
Speech enhancement by nonlinear multiband envelope filtering
,”
Proc. IEEE ICASSP
, Vol.
7
, pp.
156
159
.
27.
Miyoshi
,
M.
, and
Kaneda
,
Y.
(
1988
). “
Inverse filtering of room acoustics
,”
IEEE Trans. Acoust., Speech, Signal Process.
36
(
2
),
145
152
.
28.
Nábělek
,
A. K.
, and
Donahue
,
A. M.
(
1984
). “
Perception of consonants in reverberation by native and non-native listeners
,”
J. Acoust. Soc. Am.
75
(
2
),
632
634
.
29.
Nábělek
,
A. K.
, and
Pickett
,
J. M.
(
1974
). “
Monaural and binaural speech perception through hearing aids under noise and reverberation
,”
J. Speech Hear. Res.
17
,
724
739
.
30.
Nábělek
,
A. K.
, and
Robinette
,
L.
(
1978
). “
Influence of precedence effect on word identification by normally hearing and hearing-impaired subjects
,”
J. Acoust. Soc. Am.
63
(
1
),
187
194
.
31.
Nábělek
,
A. K.
, and
Robinson
,
P. K.
(
1982
). “
Monaural and binaural speech perception in reverberation for listeners of various ages
,”
J. Acoust. Soc. Am.
71
(
4
),
1242
1248
.
32.
Nábělek
,
A. K.
,
Letowski
,
T. R.
, and
Tucker
,
F. M.
(
1989
). “
Reverberant overlap- and self-masking in consonant identification
,”
J. Acoust. Soc. Am.
86
(
4
),
1259
1265
.
33.
Payton
,
K. L.
,
Uchanski
,
R. M.
, and
Braida
,
L. D.
(
1994
). “
Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing
,”
J. Acoust. Soc. Am.
95
(
3
),
1581
1592
.
34.
Schroeder
,
M. R.
(
1981
). “
Modulation transfer functions: Definition and measurement
,”
Acustica
49
(
3
),
179
182
.
35.
Strange
,
W.
,
Jenkins
,
J. J.
, and
Johnson
,
T. L.
(
1983
). “
Dynamic specification of coarticulated vowels
,”
J. Acoust. Soc. Am.
74
(
3
),
695
705
.
36.
Takata
,
Y.
, and
Nábělek
,
A. K.
(
1990
). “
English consonant recognition in noise and in reverberation by Japanese and American listeners
,”
J. Acoust. Soc. Am.
88
(
2
),
663
666
.
37.
Uchanski
,
R. M.
,
Geers
,
A. E.
, and
Protopapas
,
A.
(
2002
). “
Intelligibility of modified speech for young listeners with normal and impaired hearing
,”
J. Speech Lang. Hear. Res.
45
,
1027
1038
.
You do not currently have access to this content.