The vowel space area (VSA) has been studied as a quantitative index of intelligibility insofar as it captures the articulatory working space and reductions in it. Most such studies have been empirical, correlating measures of VSA with perceptual measures of intelligibility; the literature contains little mathematical analysis of the metric's properties. This paper develops the theoretical underpinnings of the VSA by presenting a detailed analysis of its statistical properties and characterizing its distribution through the moment generating function. The theoretical analysis is confirmed by a series of experiments in which empirically estimated and theoretically predicted statistics of the VSA are compared. On the Hillenbrand and TIMIT data, the theoretically predicted higher-order statistics of the VSA closely match the empirical estimates.
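To make the kind of comparison described above concrete, the following is a minimal sketch, not the paper's derivation. It assumes the VSA is computed as the shoelace-formula area of a quadrilateral whose vertices are corner-vowel (F2, F1) pairs, and that all vertex coordinates are independent Gaussian variables. Under that independence assumption, the expected signed area equals the signed area of the mean vertices, which the sketch checks against a Monte Carlo estimate. The formant means and standard deviations are hypothetical illustration values, not figures from the Hillenbrand or TIMIT data, and the function name `signed_vsa` is likewise an assumption.

```python
import numpy as np


def signed_vsa(f1, f2):
    """Signed polygon area (shoelace formula) for vertices (f2[i], f1[i]).

    Vertices must be listed in order around the polygon. Accepts 1-D arrays
    (one polygon) or 2-D arrays (one polygon per row).
    """
    f1 = np.asarray(f1, dtype=float)
    f2 = np.asarray(f2, dtype=float)
    f1_next = np.roll(f1, -1, axis=-1)  # F1 of the next vertex
    f2_next = np.roll(f2, -1, axis=-1)  # F2 of the next vertex
    return 0.5 * np.sum(f2 * f1_next - f2_next * f1, axis=-1)


# Hypothetical mean formants (Hz) for four corner vowels, listed in polygon order.
mean_f1 = np.array([300.0, 700.0, 750.0, 350.0])
mean_f2 = np.array([2300.0, 1800.0, 1100.0, 900.0])

# Hypothetical standard deviations (Hz); every coordinate is assumed independent.
sd_f1, sd_f2 = 40.0, 100.0

# Predicted mean: with independent coordinates, each product term in the
# shoelace sum has expectation equal to the product of the means.
predicted_mean = signed_vsa(mean_f1, mean_f2)

# Empirical mean: Monte Carlo estimate over simulated vowel quadrilaterals.
rng = np.random.default_rng(0)
n_draws = 100_000
f1_draws = rng.normal(mean_f1, sd_f1, size=(n_draws, 4))
f2_draws = rng.normal(mean_f2, sd_f2, size=(n_draws, 4))
empirical_mean = signed_vsa(f1_draws, f2_draws).mean()

print(f"predicted mean VSA: {predicted_mean:.1f} Hz^2")
print(f"empirical mean VSA: {empirical_mean:.1f} Hz^2")
```

The sketch covers only the first moment. Higher-order statistics depend on the distribution of sums of products of normal variables, which is where a moment generating function becomes useful, and that analysis is not reproduced here.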
