Although the first two or three formant frequencies are considered essential cues for vowel identification, certain limitations of this approach have been noted. Alternative explanations have suggested listeners rely on other aspects of the gross spectral shape. A study conducted by Ito, Tsuchida, and Yano [(2001). J. Acoust. Soc. Am. 110, 1141–1149] offered strong support for the latter, as attenuation of individual formant peaks left vowel identification largely unaffected. In the present study, these experiments are replicated in two dialects of English. Although the results were similar to those of Ito, Tsuchida, and Yano [(2001). J. Acoust. Soc. Am. 110, 1141–1149], quantitative analyses showed that when a formant is suppressed, participant response entropy increases due to increased listener uncertainty. In a subsequent experiment, using synthesized vowels with changing formant frequencies, suppressing individual formant peaks led to reliable changes in identification of certain vowels but not in others. These findings indicate that listeners can identify vowels with missing formant peaks. However, such formant-peak suppression may lead to decreased certainty in identification of steady-state vowels or even changes in vowel identification in certain dynamically specified vowels.

1.
Ainsworth
,
W.
, and
Millar
,
J.
(
1972
). “
The effect of relative formant amplitude on the perceived identity of synthetic vowels
,”
Lang. Speech
15
(
4
),
328
341
.
2.
Assmann
,
P. F.
,
Nearey
,
T. M.
, and
Hogan
,
J. T.
(
1982
). “
Vowel identification: Orthographic, perceptual, and acoustic aspects
,”
J. Acoust. Soc. Am.
71
(
4
),
975
989
.
3.
Bladon
,
A.
(
1982
). “
Arguments against formants in the auditory representation of speech
,” in
The Representation of Speech in the Peripheral Auditory System
, edited by
R.
Calson
and
B.
Granström
(
Elsevier Biomedical
,
Amsterdam
), pp.
95
102
.
4.
Bladon
,
A.
(
1983
). “
Two-formant models of vowel perception: Shortcomings and enhancement
,”
Speech Commun.
2
(
4
),
305
313
.
5.
Bladon
,
R.
, and
Lindblom
,
B.
(
1981
). “
Modeling the judgment of vowel quality differences
,”
J. Acoust. Soc. Am.
69
(
5
),
1414
1422
.
6.
Chistovich
,
L. A.
, and
Lublinskaya
,
V. V.
(
1979
). “
The ‘center of gravity’ effect in vowel spectra and critical distance between the formants: Psychoacoustical study of the perception of vowel-like stimuli
,”
Hear. Res.
1
(
3
),
185
195
.
7.
Croissant
,
Y.
(
2013
). “
mlogit: Multinomial logit model
,” https://CRAN.R-project.org/package=mlogit, r package version 0.2-4 (Last viewed April 18, 2018).
8.
Delattre
,
P.
,
Liberman
,
A. M.
,
Cooper
,
F. S.
, and
Gerstman
,
L. J.
(
1952
). “
An experimental study of the acoustic determinants of vowel color: Observations on one-and two-formant vowels synthesized from spectrographic patterns
,”
Word
8
(
3
),
195
210
.
9.
Fox
,
R. A.
,
Jacewicz
,
E.
, and
Chang
,
C.-Y.
(
2010
). “
Auditory spectral integration in the perception of diphthongal vowels
,”
J. Acoust. Soc. Am.
128
(
4
),
2070
2074
.
10.
Glasberg
,
B. R.
, and
Moore
,
B. C.
(
1990
). “
Derivation of auditory filter shapes from notched-noise data
,”
Hear. Res.
47
(
1-2
),
103
138
.
11.
Hillenbrand
,
J.
,
Getty
,
L. A.
,
Clark
,
M. J.
, and
Wheeler
,
K.
(
1995
). “
Acoustic characteristics of american english vowels
,”
J. Acoust. Soc. Am.
97
(
5
),
3099
3111
.
12.
Hillenbrand
,
J. M.
, and
Houde
,
R. A.
(
2003
). “
A narrow band pattern-matching model of vowel perception
,”
J. Acoust. Soc. Am.
113
(
2
),
1044
1055
.
13.
Hillenbrand
,
J. M.
,
Houde
,
R. A.
, and
Gayvert
,
R. T.
(
2006
). “
Speech perception based on spectral peaks versus spectral shape
,”
J. Acoust. Soc. Am.
119
(
6
),
4041
4054
.
14.
Hillenbrand
,
J. M.
, and
Nearey
,
T. M.
(
1999
). “
Identification of resynthesized/hvd/utterances: Effects of formant contour
,”
J. Acoust. Soc. Am.
105
(
6
),
3509
3523
.
15.
Ito
,
M.
,
Tsuchida
,
J.
, and
Yano
,
M.
(
2001
). “
On the effectiveness of whole spectral shape for vowel perception
,”
J. Acoust. Soc. Am.
110
(
2
),
1141
1149
.
16.
Kakusho
,
O.
,
Hirato
,
H.
,
Kato
,
K.
, and
Kobayashi
,
T.
(
1971
). “
Some experiments of vowel perception by harmonic synthesizer
,”
Acta Acust. united Ac.
24
(
4
),
179
190
, available at https://www.ingentaconnect.com/content/dav/aaua/1971/00000024/00000004/art00003#expand/collapse.
17.
Kiefte
,
M.
,
Enright
,
T.
, and
Marshall
,
L.
(
2010
). “
The role of formant amplitude in the perception of/i/and/u
,”
J. Acoust. Soc. Am.
127
(
4
),
2611
2621
.
18.
Kiefte
,
M.
, and
Kluender
,
K. R.
(
2005
). “
The relative importance of spectral tilt in monophthongs and diphthongs
,”
J. Acoust. Soc. Am.
117
(
3
),
1395
1404
.
19.
Kiefte
,
M.
, and
Kluender
,
K. R.
(
2008
). “
Absorption of reliable spectral characteristics in auditory perception
,”
J. Acoust. Soc. Am.
123
(
1
),
366
376
.
20.
Kiefte
,
M.
, and
Nearey
,
T. M.
(
2017
). “
Modeling consonant-context effects in a large database of spontaneous speech recordings
,”
J. Acoust. Soc. Am.
142
(
1
),
434
443
.
21.
Kiefte
,
M.
,
Nearey
,
T. M.
, and
Assmann
,
P. F.
(
2013
). “
Vowel perception in normal speakers
,” in
Handbook of Vowels and Vowel Disorders
(
Routledge Press
,
London
), Vol.
2
, p.
160
.
22.
Klatt
,
D. H.
(
1980
). “
Software for a cascade/parallel formant synthesizer
,”
J. Acoust. Soc. Am.
67
(
3
),
971
995
.
23.
Klatt
,
D.
(
1982
). “
Prediction of perceived phonetic distance from critical-band spectra: A first step
,” in
Proceedings of the IEEE International Conference on ICASSP'82 Acoustics, Speech, and Signal Processing
, May 3–5, Paris, France, pp.
1278
1281
.
24.
Lindqvist-Gauffin
,
J.
, and
Pauli
,
S.
(
1968
). “
The role of relative spectrum levels in vowel perception
,
” Speech Trans. Lab. Quart. Prog. Status Reports
9
(
4
),
12
.
25.
Maddox
,
W. T.
,
Molis
,
M. R.
, and
Diehl
,
R. L.
(
2002
). “
Generalizing a neuropsychological model of visual categorization to auditory categorization of vowels
,”
Percept. Psychophys.
64
(
4
),
584
597
.
26.
Molis
,
M. R.
(
2005
). “
Evaluating models of vowel perception
,”
J. Acoust. Soc. Am.
118
(
2
),
1062
1071
.
27.
Nearey
,
T. M.
(
1989
). “
Static, dynamic, and relational properties in vowel perception
,”
J. Acoust. Soc. Am.
85
(
5
),
2088
2113
.
28.
Nearey
,
T. M.
(
1990
). “
The segment as a unit of speech perception
,”
J. Phon.
18
(
3
),
347
373
.
29.
Nearey
,
T. M.
(
1997
). “
Speech perception as pattern recognition
,”
J. Acoust. Soc. Am.
101
(
6
),
3241
3254
.
30.
Nearey
,
T. M.
, and
Assmann
,
P. F.
(
1986
). “
Modeling the role of inherent spectral change in vowel identification
,”
J. Acoust. Soc. Am.
80
(
5
),
1297
1308
.
31.
Peterson
,
G. E.
, and
Barney
,
H. L.
(
1952
). “
Control methods used in a study of the vowels
,”
J. Acoust. Soc. Am.
24
(
2
),
175
184
.
32.
R Core Team
(
2017
). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/ (Last viewed December 28, 2019).
33.
Rand
,
T. C.
(
1974
). “
Dichotic release from masking for speech
,”
J. Acoust. Soc. Am.
55
(
3
),
678
680
.
34.
Roberts
,
B.
, and
Summers
,
R. J.
(
2019
). “
Dichotic integration of acoustic-phonetic information: Competition from extraneous formants increases the effect of second-formant attenuation on intelligibility
,”
J. Acoust. Soc. Am.
145
(
3
),
1230
1240
.
35.
Rosner
,
B. S.
, and
Pickering
,
J. B.
(
1994
).
Vowel Perception and Production
(
Oxford University Press
,
Oxford, UK
).
36.
Shannon
,
C. E.
(
1948
). “
A mathematical theory of communication
,”
Bell Syst. Technical J.
27
,
379
423
.
37.
Yu
,
D.
, and
Deng
,
L.
(
2014
).
Automatic Speech Recognition: A Deep Learning Approach
(
Springer
,
New York)
.
38.
Zahorian
,
S. A.
, and
Jagharghi
,
A. J.
(
1993
). “
Spectral-shape features versus formants as acoustic correlates for vowels
,”
J. Acoust. Soc. Am.
94
(
4
),
1966
1982
.

Supplementary Material

You do not currently have access to this content.