Normalized amplitude quotient (NAQ) is presented as a method to parametrize the glottal closing phase using two amplitude-domain measurements from waveforms estimated by inverse filtering. In this technique, the ratio between the amplitude of the ac flow and the negative peak amplitude of the flow derivative is first computed using the concept of equivalent rectangular pulse, a hypothetical signal located at the instant of the main excitation of the vocal tract. This ratio is then normalized with respect to the length of the fundamental period. Comparison between NAQ and its counterpart among the conventional time-domain parameters, the closing quotient, shows that the proposed parameter is more robust against distortion such as measurement noise that make the extraction of conventional time-based parameters of the glottal flow problematic. Experiments with breathy, normal, and pressed vowels indicate that NAQ is also able to separate the type of phonation effectively.

1.
Alku
,
P.
,
Strik
,
H.
, and
Vilkman
,
E.
(
1997
). “
Parabolic spectral parameter—A new method for quantification of the glottal flow
,”
Speech Commun.
22
,
67
79
.
2.
Alku, P., and Vilkman, E. (1994). “Estimation of the glottal pulseform based on discrete all-pole modeling,” in Proceedings of the International Conference on Spoken Language Processing 1994 (Yokohama), 1619–1622.
3.
Alku
,
P.
, and
Vilkman
,
E.
(
1996a
). “
A comparison of glottal voice source quantification parameters in breathy, normal, and pressed phonation of female and male speakers
,”
Folia Phoniatr Logop
48
,
240
254
.
4.
Alku
,
P.
, and
Vilkman
,
E.
(
1996b
). “
Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering
,”
Speech Commun.
18
,
131
138
.
5.
Carlson, B. (1986). Communication Systems (McGraw-Hill, Singapore), pp. 177–178.
6.
Carlson, R., Fant, G., Gobl, C., Granström, B., Karlsson, I., and Lin, Q. (1989). “Voice source rules for text-to-speech synthesis,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 223–226.
7.
Childers
,
D. G.
, and
Lee
,
C. K.
(
1991
). “
Vocal quality factors: Analysis, synthesis, and perception
,”
J. Acoust. Soc. Am.
90
,
2394
2410
.
8.
Dromey
,
C.
,
Stathopoulos
,
E. T.
, and
Sapienza
,
C. M.
(
1992
). “
Glottal airflow and electroglottographic measures of vocal function at multiple intensities
,”
J. Voice
6
,
44
54
.
9.
El-Jaroudi
,
A.
, and
Makhoul
,
J.
(
1991
). “
Discrete all-pole modeling
,”
IEEE Trans. Signal Process.
39
,
411
423
.
10.
Fant
,
G.
(
1993
). “
Some problems in voice source analysis
,”
Speech Commun.
13
,
7
22
.
11.
Fant, G. (1995). “The LF-model revisited. Transformations and frequency domain analysis,” Speech Transmission Laboratory, Quarterly Progress and Status Report, Royal Institute of Technology, Stockholm 2–3, pp. 119–156.
12.
Fant
,
G.
(
1997
). “
The voice source in connected speech
,”
Speech Commun.
22
,
125
139
.
13.
Fant, G., Kruckenberg, A., Liljencrants, J., and Båvegård, M. (1994). “Voice source parameters in continuous speech. Transformation of LF-parameters,” in Proceedings of the International Conference on Spoken Language Processing 1994 (Yokohama), pp. 1451–1454.
14.
Fant, G., Liljencrants, J., and Lin, Q. (1985). “A four-parameter model of glottal flow,” Speech Transmission Laboratory, Quarterly Progress and Status Report, Royal Institute of Technology, Stockholm 4, pp. 1–13.
15.
Fant, G., and Lin, Q. (1988). “Frequency domain interpretation and derivation of glottal flow parameters,” Speech Transmission Laboratory, Quarterly Progress and Status Report, Royal Institute of Technology, Stockholm 2–3, pp. 1–21.
16.
Fröhlich
,
M.
,
Michaelis
,
D.
, and
Strube
,
H.
(
2001
). “
SIM—Simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals
,”
J. Acoust. Soc. Am.
110
,
479
488
.
17.
Hertegård
,
S.
,
Gauffin
,
J.
, and
Karlsson
,
I.
(
1992
). “
Physiological correlates of the inverse filtered flow waveform
,”
J. Voice
6
,
224
234
.
18.
Holmberg
,
E. B.
,
Hillman
,
R. E.
, and
Perkell
,
J. S.
(
1988
). “
Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice
,”
J. Acoust. Soc. Am.
84
,
511
529
.
19.
Howell
,
P.
, and
Williams
,
M.
(
1992
). “
Acoustic analysis and perception of vowels in children’s and teenagers’ stuttered speech
,”
J. Acoust. Soc. Am.
91
,
1697
1706
.
20.
Monsen
,
R. B.
, and
Engebretson
,
A. M.
(
1977
). “
Study of variations in the male and female glottal wave
,”
J. Acoust. Soc. Am.
62
,
981
993
.
21.
Moore, B. C. (1982). An Introduction to the Psychology of Hearing (Academic, London), p. 82.
22.
Rothenberg
,
M.
(
1973
). “
A new inverse-filtering technique for deriving the glottal air flow waveform during voicing
,”
J. Acoust. Soc. Am.
53
,
1632
1645
.
23.
Strik
,
H.
, and
Boves
,
L.
(
1992
). “
On the relation between voice source parameters and prosodic features in connected speech
,”
Speech Commun.
11
,
167
174
.
24.
Sulter
,
A. M.
, and
Wit
,
H. P.
(
1996
). “
Glottal volume velocity waveform characteristics in subjects with and without vocal training, related to gender, sound intensity, fundamental frequency, and age
,”
J. Acoust. Soc. Am.
100
,
3360
3373
.
25.
Sundberg
,
J.
,
Andersson
,
M.
, and
Hultqvist
,
C.
(
1999
). “
Effects of subglottal pressure variation on professional baritone singers’ voice sources
,”
J. Acoust. Soc. Am.
105
,
1965
1971
.
26.
Sundberg
,
J.
,
Titze
,
I.
, and
Scherer
,
R.
(
1993
). “
Phonatory control in male singing: A study of the effects of subglottal pressure, fundamental frequency, and mode of phonation on the voice source
,”
J. Voice
7
,
15
29
.
27.
Wilks, S. S. (1962). Mathematical Statistics (Wiley, New York), p. 74.
28.
Wong
,
D. Y.
,
Markel
,
J. D.
, and
Gray
, Jr.,
A. H.
(
1979
). “
Least-squares glottal inverse filtering from acoustic speech waveforms
,”
IEEE Trans. Acoust., Speech, Signal Process.
27
,
350
355
.
This content is only available via PDF.
You do not currently have access to this content.