A 3D cine-MRI technique was developed based on a synchronized sampling method [Masaki et al., J. Acoust. Soc. Jpn. E20, 375379 (1999)] to measure the temporal changes in the vocal tract area function during a short utterance /aiueo/ in Japanese. A time series of head-neck volumes was obtained after 640 repetitions of the utterance produced by a male speaker, from which area functions were extracted frame-by-frame. A region-based analysis showed that the volumes of the front and back cavities tend to change reciprocally and that the areas near the larynx and posterior edge of the hard palate were almost constant throughout the utterance. The lower four formants were calculated from all the area functions and compared with those of natural speech sounds. The mean absolute percent error between calculated and measured formants among all the frames was 4.5%. The comparison of vocal tract shapes for the five vowels with those from the static MRI method suggested a problem of MRI observation of the vocal tract: data from static MRI tend to result in a deviation from natural vocal tract geometry because of the gravity effect.

1.
Adachi
,
S.
, and
Yamada
,
M.
(
1999
). “
An acoustical study of sound production in biphonic singing, Xöömij
,”
J. Acoust. Soc. Am.
105
,
2920
2932
.
2.
Alwan
,
A.
,
Narayanan
,
S.
, and
Haker
,
K.
(
1997
). “
Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part II. The rhotics
,”
J. Acoust. Soc. Am.
101
,
1078
1089
.
3.
Baer
,
T.
,
Gore
,
J. C.
,
Gracco
,
L. C.
, and
Nye
,
P. W.
(
1991
). “
Analysis of vocal tract shape and dimensions using magnetic resonance imaging: Vowels
,”
J. Acoust. Soc. Am.
90
,
799
828
.
4.
Caussé
,
R.
,
Kergomard
,
J.
, and
Lurton
,
X.
(
1984
). “
Input impedance of brass musical instruments—Comparison between experiments and numerical models
,”
J. Acoust. Soc. Am.
75
,
241
254
.
5.
Chiba
,
T.
, and
Kajiyama
,
M.
(
1942
).
The Vowels—Its Nature and Structure
(
Tokyo-Kaiseikan
,
Tokyo
).
6.
Dang
,
J.
, and
Honda
,
K.
(
1997
). “
Acoustic characteristics of the piriform fossa in models and humans
,”
J. Acoust. Soc. Am.
101
,
456
465
.
7.
Fant
,
G.
(
1960
).
Acoustic Theory of Speech Production
(
Mouton
,
The Hague
) (2nd ed.,
1970
).
8.
Foldvik
,
A. K.
,
Kristiansen
,
U.
, and
Kværness
,
J.
(
1993
). “
A time-evolving three-dimensional vocal tract model by means of magnetic resonance imaging (MRI)
,”
Proc. Eurospeech
93
,
557
559
.
9.
Foldvik
,
A. K.
,
Husby
,
O.
,
Kværness
,
J.
,
Nordli
,
I. C.
, and
Rinck
,
P. A.
(
1990
). “
MRI (Magnetic resonance Imaging) film of articulatory movements
,”
Proc. ICSLP
1
,
421
422
.
10.
Foldvik
,
A. K.
,
Kristiansen
,
U.
,
Kværness
,
J.
,
Torp
,
A.
, and
Torp
,
H.
(
1995
). “
Three-dimensional ultrasound and magnetic resonance imaging: A new dimension in phonetic resarch
,”
Proc. 12th ICPhS
4
,
46
49
.
11.
Harshman
,
R.
,
Ladefoged
,
P.
, and
Goldstein
,
L.
(
1977
). “
Factor analysis of tongue shapes
,”
J. Acoust. Soc. Am.
62
,
693
707
.
12.
Hiraishi
,
K.
,
Narabayashi
,
I.
,
Fujita
,
O.
,
Yamamoto
,
K.
,
Sagami
,
A.
,
Hisada
,
Y.
,
Saika
,
Y.
,
Adachi
,
I.
, and
Hasegawa
,
H.
(
1995
). “
Blueberry juice: preliminary evaluation as an oral contrast agent in gastrointestinal MR imaging
,”
Radiology
194
,
119
123
.
13.
Honda
,
K.
,
Takemoto
,
H.
,
Kitamura
,
T.
,
Fujita
,
S.
, and
Takano
,
S.
(
2004
). “
Exploring human speech production mechanisms by MRI
,”
IEICE Trans. Inf. Syst.
87
,
1050
1058
.
14.
Johansson
,
C.
,
Sundberg
,
J.
,
Wilbrand
,
H.
, and
Ytterbergh
,
C.
(
1983
). “
From sagittal distance to area: A study of transverse, cross-sectional area in the pharynx by means of computer tomography
,”
R. Inst. Technol. STL-QPSR
4/1983
,
39
49
.
15.
Kitamura
,
T.
,
Honda
,
K.
, and
Takemoto
,
H.
(
2005a
). “
Individual variation of the hypopharyngeal cavities and its acoustic effects
,”
Acoust. Sci. & Tech.
26
,
16
26
.
16.
Kitamura
,
T.
,
Takemoto
,
H.
,
Honda
,
K.
,
Shimada
,
Y.
,
Fujimoto
,
I.
,
Shakudo
,
Y.
,
Masaki
,
S.
,
Kuroda
,
K.
,
Oku-uchi
,
N.
, and
Senda
,
M.
(
2005b
). “
Difference in vocal tract shape between upright and supine postures: Observations by an open-type MR scanner
,”
Acoust. Sci. & Tech.
26
,
465
468
.
17.
Maeda
,
S.
(
1990
). “
Compensatory articulation during speech: evidence from the analysis and synthesis of vocal tract shapes using an articulatory model
,” in
Speech Production and Speech Modeling
, edited by
W. J.
Hardcastle
and
A.
Marchal
(
Kluwer
,
Dordrecht
), pp.
131
150
.
18.
Masaki
,
S.
,
Tiede
,
M. K.
,
Honda
,
K.
,
Shimada
,
Y.
,
Fujimoto
,
I.
,
Nakamura
,
Y.
, and
Ninomiya
,
N.
(
1999
). “
MRI-based speech production study using a synchronized sampling method
,”
J. Acoust. Soc. Jpn. (E)
20
,
375
379
.
19.
Mohammad
,
M.
,
Moore
,
E.
,
Carter
,
J. N.
,
Shadle
,
C. H.
, and
Gunn
,
S. J.
(
1997
). “
Using MRI to image the moving vocal tract during speech
,”
Proc. Eurospeech
97
,
2027
2030
.
20.
Narayanan
,
S.
,
Byrd
,
D.
, and
Kaun
,
A.
(
1999
). “
Geometry, kinematics, and acoustics of Tamil liquid consonants
,”
J. Acoust. Soc. Am.
106
,
1993
2007
.
21.
Narayanan
,
S. S.
,
Alwan
,
A. A.
, and
Haker
,
K.
(
1995
). “
An articulatory study of fricative consonants using magnetic resonance imaging
,”
J. Acoust. Soc. Am.
98
,
1325
1347
.
22.
Narayanan
,
S. S.
,
Alwan
,
A. A.
, and
Haker
,
K.
(
1997
). “
Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals
,”
J. Acoust. Soc. Am.
101
,
1064
1077
.
23.
Narayanan
,
S.
,
Nayak
,
K.
,
Lee
,
S.
,
Sethy
,
A.
, and
Byrd
,
D.
(
2004
). “
An approach to real-time magnetic resonance imaging for speech production
,”
J. Acoust. Soc. Am.
115
,
1771
1776
.
24.
Pruessmann
,
K. P.
,
Weiger
,
M.
,
Scheidegger
,
M. B.
, and
Boesiger
,
P.
(
1999
). “
SENSE: sensitivity encoding for fast MRI
,”
Magn. Reson. Med.
42
,
952
962
.
25.
Rokkaku
,
M.
,
Hashimoto
,
K.
,
Imaizumi
,
S.
,
Niimi
,
S.
, and
Kiritani
,
S.
(
1986
). “
Measurements of the Three-Dimensional Shape of the Vocal Tract Based on the Magnetic Resonance Imaging Technique
,”
Ann. Bull. RILP.
20
,
47
54
.
26.
Sechtem
,
U.
,
Pflugfelder
,
P.
, and
Higgins
,
C. B.
(
1987
). “
Quantification of cardiac function by conventional and cine magnetic resonance imaging
,”
Cardiovasc. Intervent Radiol.
10
,
365
373
.
27.
Shirai
,
K.
, and
Honda
,
M.
(
1976
). “
An articulatory model and the estimation of articulatory parameters by nonlinear regression method
,”
Electron. Commun. Jpn.
55
,
35
43
.
28.
Stone
,
M.
(
1990
). “
A three-dimensional model of tongue movement based on ultrasound and x-ray microbeam data
,”
J. Acoust. Soc. Am.
87
,
2207
2217
.
29.
Stone
,
M.
,
Davis
,
E. P.
,
Douglas
,
A. S.
,
NessAiver
,
M.
,
Gullapalli
,
R.
,
Levine
,
W. S.
, and
Lundberg
,
A.
(
2001a
). “
Modeling the motion of the internal tongue from tagged cine-MRI images
,”
J. Acoust. Soc. Am.
109
,
2974
2982
.
30.
Stone
,
M.
,
Davis
,
E. P.
,
Douglas
,
A. S.
,
NessAiver
,
M.
,
Gullapalli
,
R.
,
Levine
,
W. S.
, and
Lundberg
,
A. J.
(
2001b
). “
Modeling tongue surface contours from cine-MRI images
,”
J. Speech Lang. Hear. Res.
44
,
1026
1040
.
31.
Story
,
B. H.
,
Titze
,
I. R.
, and
Hoffman
,
E. A.
(
1996
). “
Vocal tract area functions from magnetic resonance imaging
,”
J. Acoust. Soc. Am.
100
,
537
554
.
32.
Story
,
B. H.
,
Titze
,
I. R.
, and
Hoffman
,
E. A.
(
1998
). “
Vocal tract area functions for an adult female speaker based on volmetric imaging
,”
J. Acoust. Soc. Am.
104
,
471
487
.
33.
Story
,
B. H.
,
Titze
,
I. R.
, and
Hoffman
,
E. A.
(
2001
). “
The relationship of vocal tract shape to three voice qualities
,”
J. Acoust. Soc. Am.
109
,
1651
1667
.
34.
Takemoto
,
H.
,
Kitamura
,
T.
,
Nishimoto
,
H.
, and
Honda
,
K.
(
2004
). “
A method of tooth superimposition on MRI data for accurate measurement of vocal tract shape and dimensions
,”
Acoust. Sci. & Tech.
25
,
468
474
.
35.
Yang
,
C.-S.
, and
Kasuya
,
H.
(
1994
). “
Accurate measurement of vocal tract shapes from magnetic resonance images of child, female and male subjects
,”
Proc. ICSLP
94
,
623
626
.
You do not currently have access to this content.