The perception of any given sound is influenced by surrounding sounds. When successive sounds differ in their spectral compositions, these differences may be perceptually magnified, resulting in spectral contrast effects (SCEs). For example, listeners are more likely to perceive /ɪ/ (low F1) following sentences with higher F1 frequencies; listeners are also more likely to perceive /ɛ/ (high F1) following sentences with lower F1 frequencies. Previous research showed that SCEs for vowel categorization were attenuated when sentence contexts were spoken by different talkers [Assgari and Stilp. (2015). J. Acoust. Soc. Am. 138(5), 3023–3032], but the locus of this diminished contextual influence was not specified. Here, three experiments examined implications of variable talker acoustics for SCEs in the categorization of /ɪ/ and /ɛ/. The results showed that SCEs were smaller when the mean fundamental frequency (f0) of context sentences was highly variable across talkers compared to when mean f0 was more consistent, even when talker gender was held constant. In contrast, SCE magnitudes were not influenced by variability in mean F1. These findings suggest that talker variability attenuates SCEs due to diminished consistency of f0 as a contextual influence. Connections between these results and talker normalization are considered.

1.
Assgari
,
A. A.
(
2018
). “
Assessing the relationship between talker normalization and spectral contrast effects in speech perception
,” Doctoral dissertation,
University of Louisville
, Louisville, Kentucky.
2.
Assgari
,
A. A.
, and
Stilp
,
C. E.
(
2015
). “
Talker information influences spectral contrast effects in speech categorization
,”
J. Acoust. Soc. Am.
138
(
5
),
3023
3032
.
3.
Assmann
,
P. F.
,
Nearey
,
T. M.
, and
Hogan
,
J. T.
(
1982
). “
Vowel identification: Orthographic, perceptual, and acoustic aspects
,”
J. Acoust. Soc. Am.
71
(
4
),
975
989
.
4.
Bates
,
D. M.
,
Maechler
,
M.
,
Bolker
,
B.
, and
Walker
,
S.
(
2014
).
lme4: Linear mixed-effects models using Eigen and S4. R package version 1
:1-17, https://cran.r-project.org/web/packages/lme4/index.html (Last viewed 3/6/2019).
5.
Boersma
,
P.
, and
Weenink
,
D.
(
2017
). “
Praat: Doing phonetics by computer
” [Computer program]. Version 5.3.61, http://www.praat.org/ (Last viewed January 1, 2014).
6.
Bosker
,
H. R.
,
Reinisch
,
E.
, and
Sjerps
,
M. J.
(
2017
). “
Cognitive load makes speech sound fast, but does not modulate acoustic context effects
,”
J. Mem. Lang.
94
,
166
176
.
7.
Bradlow
,
A. R.
, and
Pisoni
,
D. B.
(
1999
). “
Recognition of spoken words by native and non-native listeners: Talker-, listener-, and item-related factors
,”
J. Acoust. Soc. Am.
106
,
2074
2085
.
8.
Choi
,
J. Y.
,
Hu
,
E. R.
, and
Perrachione
,
T. K.
(
2018
). “
Varying acoustic-phonemic ambiguity reveals that talker normalization is obligatory in speech processing
,”
Attn., Percept., Psychophys
80
,
784
797
.
9.
Clarke
,
C. M.
, and
Garrett
,
M. F.
(
2004
). “
Rapid adaptation to foreign-accented English
,”
J. Acoust. Soc. Am.
116
(
6
),
3647
3658
.
10.
Creelman
,
C. D.
(
1957
). “
Case of the unknown talker
,”
J. Acoust. Soc. Am.
29
,
655
.
11.
Fant
,
G.
(
1973
).
Speech Sounds and Features
(
MIT Press
,
Cambridge, MA
).
12.
Fourcin
,
A.
(
1968
). “
Speech source inference
,”
IEEE Trans. Audio Electroacoust.
16
(
1
),
65
67
.
13.
Frazier
,
J. F.
,
Assgari
,
A. A.
, and
Stilp
,
C. E.
(
2019
). “
Musical instrument categorization is highly sensitive to spectral properties of earlier sounds
,”
Attn., Percept., Psychophys.
(in press).
14.
Garofolo
,
J.
,
Lamel
,
L.
,
Fisher
,
W.
,
Fiscus
,
J.
,
Pallett
,
D.
, and
Dahlgren
,
N.
(
1990
). “
DARPA TIMIT acoustic-phonetic continuous speech corpus CDROM
,” National Institute of Standards and Technology, NIST Order No. PB91-505065.
15.
Geiselman
,
R. E.
, and
Bellezza
,
F. S.
(
1976
). “
Long-term memory for speaker's voice and source location
,”
Memory Cognit.
4
(
5
),
483
489
.
16.
Goldinger
,
S. D.
(
1996
). “
Words and voices: Episodic traces in spoken word identification and recognition memory
,”
J. Exp. Psychol.
22
(
5
),
1166
1183
.
17.
Goldinger
,
S. D.
,
Pisoni
,
D. B.
, and
Logan
,
J. S.
(
1991
). “
On the nature of talker variability effects on recall of spoken word lists
,”
J. Exp. Psychol.
17
(
1
),
152
162
.
18.
Hillenbrand
,
J. M.
, and
Clark
,
M. J.
(
2009
). “
The role of f0 and formant frequencies in distinguishing the voices of men and women
,”
Attn., Percept., Psychophys.
71
(
5
),
1150
1166
.
19.
Holt
,
L. L.
(
2005
). “
Temporally nonadjacent nonlinguistic sounds affect speech categorization
,”
Psychol. Sci.
16
(
4
),
305
312
.
20.
Holt
,
L. L.
(
2006
). “
The mean matters: Effects of statistically defined nonspeech spectral distributions on speech categorization
,”
J. Acoust. Soc. Am.
120
(
5
),
2801
2817
.
21.
Huang
,
J.
, and
Holt
,
L. L.
(
2012
). “
Listening for the norm: Adaptive coding in speech categorization
,”
Front. Psychol.
3
,
10
.
22.
Kluender
,
K. R.
,
Stilp
,
C. E.
, and
Kiefte
,
M.
(
2013
). “
Perception of vowel sounds within a biologically realistic model of efficient coding
,” in
Vowel Inherent Spectral Change
, edited by
G.
Morrison
and
P.
Assmann
(
Springer
,
Berlin
), pp.
117
151
.
23.
Kuznetsova
,
A.
,
Brockhoff
,
P. B.
, and
Christensen
,
R. H. B.
(
2013
). “
lmerTest: Tests for random and fixed effects for linear mixed effect models (lmer objects of lme4 package)
,” R package version.
24.
Ladefoged
,
P.
, and
Broadbent
,
D. E.
(
1957
). “
Information conveyed by vowels
,”
J. Acoust. Soc. Am.
29
(
1
),
98
104
.
25.
Laing
,
E. J.
,
Liu
,
R.
,
Lotto
,
A. J.
, and
Holt
,
L. L.
(
2012
). “
Tuned with a tune: Talker normalization via general auditory processes
,”
Front. Psychol.
3
,
203
.
26.
Long
,
J. A.
(
2018
). “
jtools: Analysis and presentation of social scientific data
,” R package version 1.1.0, https://cran.r-project.org/web/packages/jtools/index.html (Last viewed 3/6/2019).
27.
Lotto
,
A. J.
, and
Kluender
,
K. R.
(
1998
). “
General contrast effects in speech perception: Effect of preceding liquid on stop consonant identification
,”
Attn., Percept., Psychophys.
60
(
4
),
602
619
.
28.
Magnuson
,
J. S.
, and
Nusbaum
,
H. C.
(
2007
). “
Acoustic differences, listener expectations, and the perceptual accommodation of talker variability
,”
J. Exp. Psychol.
33
(
2
),
391
409
.
29.
Mullennix
,
J. W.
, and
Pisoni
,
D. B.
(
1990
). “
Stimulus variability and processing dependencies in speech perception
,”
Percept. Psychophys.
47
(
4
),
379
390
.
30.
Mullennix
,
J. W.
,
Pisoni
,
D. B.
, and
Martin
,
C. S.
(
1989
). “
Some effects of talker variability on spoken word recognition
,”
J. Acoust. Soc. Am.
85
(
1
),
365
378
.
31.
Nygaard
,
L. C.
,
Sommers
,
M. S.
, and
Pisoni
,
D. B.
(
1994
). “
Speech perception as a talker-contingent process
,”
Psychol. Sci.
5
(
1
),
42
46
.
32.
Peterson
,
G. E.
, and
Barney
,
H. L.
(
1952
). “
Control methods used in a study of the vowels
,”
J. Acoust. Soc. Am.
24
,
175
184
.
33.
R Development Core Team
(
2018
). “
R: A language and environment for statistical computing
,” R Foundation for Statistical Computing, Vienna,
http://
www.r-project.org/ (Last viewed 3/6/2019).
34.
Rand
,
T. C.
(
1971
). “
Vocal tract size normalization in the perception of stop consonants
,”
J. Acoust. Soc. Am.
50
(
1A
),
139
.
35.
Ryalls
,
B. O.
, and
Pisoni
,
D. B.
(
1997
). “
The effect of talker variability on word recognition in preschool children
,”
Develop. Psychol.
33
(
3
),
441
452
.
36.
Sjerps
,
M. J.
,
Mitterer
,
H.
, and
McQueen
,
J. M.
(
2011
). “
Constraints on the processes responsible for the extrinsic normalization of vowels
,”
Attn., Percept., Psychophys.
73
(
4
),
1195
1215
.
37.
Sjerps
,
M. J.
,
Zhang
,
C.
, and
Peng
,
G.
(
2018
). “
Lexical tone is perceived relative to locally surrounding context, vowel quality to preceding context
,”
J. Exp. Psychol.
44
(
6
),
914
924
.
38.
Spahr
,
A. J.
,
Dorman
,
M. F.
,
Litvak
,
L. M.
,
Van Wie
,
S.
,
Gifford
,
R. H.
,
Loizou
,
P. C.
, and
Cook
,
S.
(
2012
). “
Development and validation of the AzBio sentence lists
,”
Ear Hear.
33
(
1
),
112
117
.
39.
Stilp
,
C. E.
(
2017
). “
Acoustic context alters vowel categorization in perception of noise-vocoded speech
,”
J. Assoc. Res. Otolaryngol.
18
(
3
),
465
481
.
40.
Stilp
,
C. E.
, and
Alexander
,
J. M.
(
2016
). “
Spectral contrast effects in vowel categorization by listeners with sensorineural hearing loss
,”
Proc. Mtgs. Acoust.
26
,
060003
.
41.
Stilp
,
C. E.
,
Alexander
,
J. M.
,
Kiefte
,
M.
, and
Kluender
,
K. R.
(
2010
). “
Auditory color constancy: Calibration to reliable spectral properties across nonspeech context and targets
,”
Attn., Percept., Psychophys.
72
(
2
),
470
480
.
42.
Stilp
,
C. E.
,
Anderson
,
P. W.
, and
Winn
,
M. B.
(
2015
). “
Predicting contrast effects following reliable spectral properties in speech perception
,”
J. Acoust. Soc. Am.
137
(
6
),
3466
3476
.
43.
Stilp
,
C. E.
, and
Assgari
,
A. A.
(
2017
). “
Consonant categorization exhibits a graded influence of surrounding spectral context
,”
J. Acoust. Soc. Am.
141
(
2
),
EL153
EL158
.
44.
Stilp
,
C. E.
, and
Assgari
,
A. A.
(
2018
). “
Perceptual sensitivity to spectral properties in earlier sounds during speech categorization
,”
Attn., Percept., Psychophys.
80
(
5
),
1300
1310
.
45.
Stilp
,
C. E.
, and
Assgari
,
A. A.
(
2019
). “
Natural signal statistics shift speech sound categorization
,”
Attn., Percept., Psychophys.
(in press).
46.
Theodore
,
R. M.
, and
Miller
,
J. L.
(
2010
). “
Characteristics of listener sensitivity to talker-specific phonetic detail
,”
J. Acoust. Soc. Am.
128
(
4
),
2090
2099
.
47.
Theodore
,
R. M.
,
Myers
,
E. B.
, and
Lomibao
,
J. A.
(
2015
). “
Talker-specific influences on phonetic category structure
,”
J. Acoust. Soc. Am.
138
,
1068
1078
.
48.
Watkins
,
A. J.
(
1991
). “
Central, auditory mechanisms of perceptual compensation for spectral-envelope distortion
,”
J. Acoust. Soc. Am.
90
(
6
),
2942
2955
.
49.
Watkins
,
A. J.
, and
Makin
,
S. J.
(
1994
). “
Perceptual compensation for speaker differences and for spectral-envelope distortion
,”
J. Acoust. Soc. Am.
96
(
3
),
1263
1282
.
50.
Winn
,
M. B.
, and
Litovsky
,
R. Y.
(
2015
). “
Using speech sounds to test functional spectral resolution in listeners with cochlear implants
,”
J. Acoust. Soc. Am.
137
(
3
),
1430
1442
.
You do not currently have access to this content.