Whispered speech is a naturally produced mode of communication that lacks a fundamental frequency. Several other acoustic differences exist between whispered and voiced speech, such as speaking rate (measured as segment duration) and formant frequencies. Previous research has shown that listeners are less accurate at identifying linguistic information (e.g., identifying a speech sound) and speaker information (e.g., reporting speaker gender) from whispered speech. To further explore differences between voiced and whispered speech, acoustic differences were examined across three datasets (hVd, sVd, and ʃVd) and three speaker groups (ciswomen, transwomen, cismen). Consistent with previous studies, vowel duration was generally longer in whispered speech and formant frequencies were shifted higher, although the magnitude of these differences depended on vowel and gender. Despite the increase in duration, the acoustic vowel space area (measured either with a vowel quadrilateral or with a convex hull) was smaller in the whispered speech, suggesting that larger vowel space areas are not an automatic consequence of a lengthened articulation. Overall, these findings are consistent with previous literature showing acoustic differences between voiced and whispered speech beyond the articulatory change of eliminating fundamental frequency.

1.
Adler
,
R. K.
,
Hirsch
,
S.
, and
Pickering
,
J.
(
2018
).
Voice and Communication Therapy for the Transgender/Gender Diverse Client: A Comprehensive Clinical Guide
(
Plural Publishing
,
San Diego, CA
).
2.
Andrews
,
M. L.
, and
Schmidt
,
C. P.
(
1997
). “
Gender presentation: Perceptual and acoustical analyses of voice
,”
J. Voice
11
(
3
),
307
313
.
3.
Avery
,
J. D.
, and
Liss
,
J. M.
(
1996
). “
Acoustic characteristics of less-masculine-sounding male speech
,”
J. Acoust. Soc. Am.
99
(
6
),
3738
3748
.
4.
Bates
,
D.
,
Maechler
,
M.
,
Bolker
,
B.
, and
Walker
,
S.
(
2015
). “
Fitting linear mixed-effects models using lme4
,”
J. Stat. Softw.
67
(
1
),
1
48
.
5.
Ben-Shachar
,
M. S.
,
Makowski
,
D.
, and
Ludecke
,
D.
(
2020
). “
Compute and interpret indices of effect size, CRAN
,” https://github.com/easystats/effectsize (Last viewed 10/10/2020).
6.
Boersma
,
P.
, and
Weenink
,
D.
(
2017
). “
Praat: Doing phonetics by computer [computer program]
,” http://www.praat.org/ (Last viewed 12/6/2020).
7.
Booz
,
J. A.
(
2016
). “
Perceived gender in clear and conversational speech
,” Ph.D. thesis,
University of Utah
,
Salt Lake City, UT
.
8.
Carew
,
L.
,
Dacakis
,
G.
, and
Oates
,
J.
(
2007
). “
The effectiveness of oral resonance therapy on the perception of femininity of voice in male-to-female transsexuals
,”
J. Voice
21
,
591
603
.
9.
Eklund
,
I.
, and
Traunmüller
,
H.
(
1997
). “
Comparative study of male and female whispered and phonated versions of the long vowels of Swedish
,”
Phonetica
54
(
1
),
1
21
.
10.
Ferguson
,
S. H.
, and
Kewley-Port
,
D.
(
2007
). “
Talker differences in clear and conversational speech: Acoustic characteristics of vowels
,”
J. Speech Lang. Hear. Res.
50
(
5
),
1241
1255
.
11.
Fitch
,
W. T.
, and
Giedd
,
J.
(
1999
). “
Morphology and development of the human vocal tract: A study using magnetic resonance imaging
,”
J. Acoust. Soc. Am.
106
(
3
),
1511
1522
.
12.
Fox
,
R. A.
, and
Jacewicz
,
E.
(
2017
). “
Reconceptualizing the vowel space in analyzing regional dialect variation and sound change in American English
,”
J. Acoust. Soc. Am.
142
(
1
),
444
459
.
13.
Heeren
,
W.
, and
Heuven
,
V. V.
(
2009
). “
Perception and production of boundary tones in whispered Dutch
,” in
Proceedings of the Tenth Annual Conference of the International Speech Communication Association
, September 6–10, Brighton, UK.
14.
Hillenbrand
,
J. M.
, and
Clark
,
M. J.
(
2009
). “
The role of f(0) and formant frequencies in distinguishing the voices of men and women
,”
Atten. Percept. Psychophys.
71
(
5
),
1150
1166
.
15.
Hillenbrand
,
J. M.
,
Getty
,
L. A.
,
Clark
,
M. J.
, and
Wheeler
,
K.
(
1995
). “
Acoustic characteristics of American English vowels
,”
J. Acoust. Soc. Am.
97
(
5
),
3099
3111
.
16.
Houle
,
N.
, and
Levi
,
S. V.
(
2019
). “
Effect of phonation on perception of femininity/masculinity in transgender and cisgender speakers
,”
J. Voice
(published online).
17.
Ito
,
T.
,
Takeda
,
K.
, and
Itakura
,
F.
(
2005
). “
Analysis and recognition of whispered speech
,”
Speech Commun.
45
,
139
152
.
18.
Jacewicz
,
E.
,
Fox
,
R. A.
, and
Salmons
,
J.
(
2007
). “
Vowel duration in three American English dialects
,”
Am. Speech
82
(
4
),
367
385
.
19.
Johnson
,
K.
(
2006
). “
Resonance in an exemplar-based lexicon: The emergence of social identity and phonology
,”
J. Phon.
34
(
4
),
485
499
.
20.
Jovičić
,
S. T.
, and
Šarić
,
Z.
(
2008
). “
Acoustic analysis of consonants in whispered speech
,”
J. Voice
22
,
263
274
.
21.
Kawitzky
,
D.
, and
McAllister
,
T.
(
2020
). “
The effect of formant biofeedback on the feminization of voice in transgender women
,”
J. Voice
34
(
1
),
53
67
.
22.
Ladefoged
,
P.
, and
Broadbent
,
D. E.
(
1957
). “
Information conveyed by vowels
,”
J. Acoust. Soc. Am.
29
(
1
),
98
104
.
23.
Lass
,
N. J.
,
Hughes
,
K. R.
,
Bowyer
,
M. D.
,
Waters
,
L. T.
, and
Bourne
,
V. T.
(
1976
). “
Speaker sex identification from voiced, whispered, and filtered isolated vowels
,”
J. Acoust. Soc. Am.
59
(
3
),
675
678
.
24.
Lenth
,
R. V.
(
2018
). “
Emmeans: Estimated marginal means
,” Aka Least-squares Means, R Package Version 1(2), https://github.com/rvlenth/emmeans (Last viewed 11/23/2018).
25.
Leung
,
Y.
,
Oates
,
J.
, and
Chan Siew
,
P.
(
2018
). “
Voice, articulation, and prosody contribute to listener perceptions of speaker gender: A systematic review and meta-analysis
,”
J. Speech Lang. Hear. Res.
61
(
2
),
266
297
.
26.
McCloy
,
D. R.
(
2016
). phonR: tools for phoneticians and phonologists. R package version, 1.0-7. https://drammock.github.io/phonR/ (Last viewed 1/15/2018).
27.
Munson
,
B.
(
2007
). “
The acoustic correlates of perceived masculinity, perceived femininity, and perceived sexual orientation
,”
Lang. Speech
50
,
125
142
.
28.
Neel
,
A. T.
(
2008
). “
Vowel space characteristics and vowel identification accuracy
,”
J. Speech Lang. Hear. Res.
51
(
3
),
574
585
.
29.
Peterson
,
G. E.
, and
Barney
,
H. L.
(
1952
). “
Control methods used in a study of the vowels
,”
J. Acoust. Soc. Am.
24
(
2
),
175
184
.
30.
RStudio Team
(
2020
). “
RStudio: Integrated Development for R. RStudio
,” (PBC, Boston, MA) http://www.rstudio.com/ (Last viewed 12/14/2020).
31.
Schwartz
,
M. F.
(
1967
). “
Syllable duration in oral and whispered reading
,”
J. Acoust. Soc. Am.
41
(
5
),
1367
1369
.
32.
Schwartz
,
M. F.
, and
Rine
,
H. E.
(
1968
). “
Identification of speaker sex from isolated, whispered vowels
,”
J. Acoust. Soc. Am.
44
(
6
),
1736
1737
.
33.
Sharf
,
D. J.
(
1964
). “
Vowel duration in whispered and in normal speech
,”
Lang. Speech
7
(
2
),
89
97
.
34.
Sharifzadeh
,
H. R.
,
McLoughlin
,
I. V.
, and
Russell
,
M. J.
(
2012
). “
A comprehensive vowel space for whispered speech
,”
J. Voice
26
(
2
),
e49
e56
.
35.
Simpson
,
A. P.
(
2009
). “
Phonetic differences between male and female speech
,”
Lang. Ling. Compass
3
(
2
),
621
640
.
36.
Tartter
,
V. C.
(
1989
). “
What's in a whisper?
,”
J. Acoust. Soc. Am.
86
(
5
),
1678
1683
.
37.
Tartter
,
V. C.
(
1991
). “
Identifiability of vowels and speakers from whispered syllables
,”
Percept. Psychophys.
49
(
4
),
365
372
.
38.
Traunmüller
,
H.
(
1990
).
Analytical expressions for the tonotopic sensory scale
.
J. Acoust. Soc. Am.
88
(
1
),
97
100
..
39.
Vorperian
,
H. K.
,
Wang
,
S.
,
Chung
,
M. K.
,
Schimek
,
E. M.
,
Durtschi
,
R. B.
,
Kent
,
R. D.
, and
Gentry
,
L. R.
(
2009
). “
Anatomic development of the oral and pharyngeal portions of the vocal tract: An imaging study
,”
J. Acoust. Soc. Am.
125
(
3
),
1666
1678
.
40.
Whiteside
,
S. P.
(
1996
). “
Temporal-based acoustic-phonetic patterns in read speech: Some evidence for speaker sex differences
,”
J. Int. Phon. Assoc.
26
(
1
),
23
40
.

Supplementary Material

You do not currently have access to this content.