Most cues to speech intelligibility lie within a narrow frequency range whose upper limit does not exceed 4 kHz. It is still unclear whether speaker-related (indexical) information is available past this limit or how speaker characteristics are distributed at frequencies within and outside the intelligibility range. Using low-pass and high-pass filtering, we examined the perceptual salience of dialect and gender cues in both intelligible and unintelligible speech. With the upper frequency limit set at 11 kHz, spontaneously produced unique utterances (n = 400) from 40 speakers were high-pass filtered with frequency cutoffs from 0.7 to 5.56 kHz and presented to listeners for dialect and gender identification and intelligibility evaluation. The same material and experimental procedures were used to probe perception of unmodified speech and of low-pass filtered speech with cutoffs from 0.5 to 1.1 kHz. Applying statistical signal detection theory analyses, we found that cues to gender were well preserved at low and high frequencies and did not depend on intelligibility, and that the redundancy of gender cues at higher frequencies reduced response bias. Cues to dialect were relatively strong at low and high frequencies; however, most of them resided in intelligible speech and were modulated by a differential intelligibility advantage of male and female speakers at low and high frequencies.
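The signal detection analysis mentioned above separates a listener's sensitivity to a cue from any bias toward one response. The abstract does not say which measures were computed, so the sketch below assumes the standard equal-variance Gaussian measures, d' for sensitivity and c for response bias, with a log-linear correction for extreme rates. The function name `sdt_measures`, the treatment of "male" as the signal category, and the example counts are all hypothetical and serve only as illustration.

```python
from statistics import NormalDist


def sdt_measures(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion_c) for a two-alternative identification task.

    Assumption: equal-variance Gaussian model. Here a "hit" could be a male
    talker labeled "male" and a "false alarm" a female talker labeled "male";
    this mapping is illustrative, not taken from the study.
    """
    # Log-linear correction: add 0.5 to each cell so that neither rate is
    # exactly 0 or 1, which would make the z-transform infinite.
    hit_rate = (hits + 0.5) / (hits + misses + 1.0)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1.0)

    z = NormalDist().inv_cdf  # inverse of the standard normal CDF

    d_prime = z(hit_rate) - z(fa_rate)                # sensitivity
    criterion_c = -0.5 * (z(hit_rate) + z(fa_rate))   # response bias
    return d_prime, criterion_c


if __name__ == "__main__":
    # Hypothetical response counts for one listener in one filter condition.
    d, c = sdt_measures(hits=45, misses=5, false_alarms=30, correct_rejections=20)
    print(f"d' = {d:.2f}, c = {c:.2f}")
```

Under this convention, a negative c indicates a tendency to choose the signal category ("male" in the hypothetical mapping), a positive c indicates the opposite tendency, and c near zero indicates little bias. A d' near zero means the two categories could not be told apart.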

