This paper presents an approach to determine the open phase region of a glottal cycle based on changes in the characteristics of the vocal tract system. The glottal closing phase contributes to major excitation of the vocal tract system. The opening phase affects the vocal tract system characteristics by effectively increasing the length of the tract, due to coupling of the subglottal region. To determine the glottal open region, it is necessary to estimate the vocal tract characteristics from the segment with subglottal coupling. The proposed method derives the dominant resonance frequency (DRF) of the vocal tract system at every sampling instant, using a heavily decaying window (HDW) for analysis. The DRF contour transits to lower frequencies during glottal open region, when compared to the glottal closed region. The open region, within the glottal cycles from voiced speech segment, is extracted using the HDW method. The results are compared with the open region derived from the electroglottograph (EGG) signals and speech signals. The results show that the proposed method based on DRF contour, derived from the speech signals, seems to perform better than the methods based on EGG signals.

1.
D. G.
Childers
,
A. M.
Smith
, and
G. P.
Moore
, “
Relationships between electroglottograph, speech, and vocal cord contact
,”
Fol. Phon. Logopaed.
36
(
3
),
105
118
(
1984
).
2.
D. G.
Childers
and
A. K.
Krishnamurthy
, “
A critical review of electroglottography
,”
Crit. Rev, Biomed. Eng.
12
(
2
),
131
161
(
1984
).
3.
M. R.
Thomas
and
P. A.
Naylor
, “
The SIGMA algorithm: A glottal activity detector for electroglottographic signals
,”
IEEE Trans. Audio Speech Language Processing
17
(
8
),
1557
1566
(
2009
).
4.
A.
Bouzid
and
N.
Ellouze
, “
Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima
,” in
Eighth European Conference on Speech Communication and Technology (EUROSPEECH,03)
, Geneva, Switzerland (
2003
), pp.
2837
2840
.
5.
P.
Davies
,
G.
Lindsey
,
H.
Fuller
, and
A.
Fourcin
, “
Variation of glottal open and closed phases for speakers of english
,”
Proc. Inst. Acoust.
8
(
7
),
539
546
(
1986
).
6.
M. R.
Thomas
,
J.
Gudnason
, and
P. A.
Naylor
, “
Estimation of glottal closing and opening instants in voiced speech using the YAGA algorithm
,”
IEEE Trans. Audio Speech Language Processing
20
(
1
),
82
91
(
2012
).
7.
J.
Kane
,
S.
Scherer
,
L.-P.
Morency
, and
C.
Gobl
, “
A comparative study of glottal open quotient estimation techniques
,” in
Proceedings of the International Conference on Spoken Language Processing (INTER-SPEECH, 13)
, Lyon, France (
2013
).
8.
T.
Drugman
,
B.
Bozkurt
, and
T.
Dutoit
, “
A comparative study of glottal source estimation techniques
,”
Comput. Speech Language Processing
26
(
1
),
20
34
(
2012
).
9.
J.
Walker
and
P.
Murphy
, “
A review of glottal waveform analysis
,” in
Progress in Nonlinear Speech Processing
(
Springer
,
New York
,
2007
), pp.
1
21
.
10.
D.
Wong
,
J.
Markel
, and
A.
Gray
, Jr.
, “
Least squares glottal inverse filtering from the acoustic speech waveform
,”
IEEE Trans. Acoust. Speech Signal Processing
27
(
4
),
350
355
(
1979
).
11.
E. R.
Abberton
,
D. M.
Howard
, and
A. J.
Fourcin
, “
Laryngographic assessment of normal voice: A tutorial
,”
Clin. Ling. Phonetics
3
(
3
),
281
296
(
1989
).
12.
M.
Rothenberg
and
J. J.
Mahshie
, “
Monitoring vocal fold abduction through vocal fold contact area
,”
J. Speech Language Hear. Res.
31
(
3
),
338
351
(
1988
).
13.
D. G.
Childers
and
C.
Lee
, “
Vocal quality factors: Analysis, synthesis, and perception
,”
J. Acoust. Soc. Am.
90
(
5
),
2394
2410
(
1991
).
14.
L. R.
Rabiner
and
R. W.
Schafer
,
Digital Processing of Speech Signals
(
Prentice Hall
,
Upper Saddle River, NJ
,
1978
).
15.
A.
El-Jaroudi
and
J.
Makhoul
, “
Discrete all-pole modeling
,”
IEEE Trans. Signal Processing
39
(
2
),
411
423
(
1991
).
16.
P.
Alku
and
E.
Vilkman
, “
Estimation of the glottal pulseform based on discrete all-pole modeling
,” in
International Conference on Speech and Language Processing
, Yokohama, Japan (
1994
), pp.
1619
1622
.
17.
Y.
Ting
and
D.
Childers
, “
Speech analysis using the weighted recursive least squares algorithm with a variable forgetting factor
,” in
International Conference on Acoustics, Speech, and Signal Processing (ICASSP ‘90)
, Albuquerque, NM (
1990
), pp.
389
392
.
18.
P.
Alku
, “
Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
,”
Speech Commun.
11
(
2
),
109
118
(
1992
).
19.
W. R.
Gardner
and
B. D.
Rao
, “
Noncausal all-pole modeling of voiced speech
,”
IEEE Trans. Speech Audio Processing
5
(
1
),
1
10
(
1997
).
20.
M. R.
Thomas
,
J.
Gudnason
, and
P. A.
Naylor
, “
Detection of glottal closing and opening instants using an improved dypsa framework
,” in
Proceedings 17th European Signal Processing Conference
, Glasgow, Scotland (
2009
), pp.
2191
2195
.
21.
T.
Drugman
and
T.
Dutoit
, “
Glottal closure and opening instant detection from speech signals
,” in
Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH ‘09)
, Brighton, UK (
2009
), pp.
2891
2894
.
22.
A.
Bouzid
and
N.
Ellouze
, “
Open quotient measurements based on multiscale product of speech signal wavelet transform
,”
J. Elect. Comput. Eng.
2007
,
62521
.
23.
C.
d'Alessandro
and
N.
Sturmel
, “
Glottal closure instant and voice source analysis using time-scale lines of maximum amplitude
,”
Sadhana
36
(
5
),
601
622
(
2011
).
24.
I.
Arroabarren
and
A.
Carlosena
, “
Glottal source parameterization: A comparative study
,” in
ISCA Tutorial and Research Workshop on Voice Quality: Functions, Analysis and Synthesis
(
2003
).
25.
H.
Strik
, “
Automatic parametrization of differentiated glottal flow: Comparing methods by means of synthetic flow pulses
,”
J. Acoust. Soc. Am.
103
,
2659
2669
(
1998
).
26.
P.
Alku
, “
Glottal inverse filtering analysis of human voice production-a review of estimation and parameterization methods of the glottal excitation and their applications
,”
Sadhana
36
(
5
),
623
650
(
2011
).
27.
R.
Timcke
,
H.
von Leden
, and
P.
Moore
, “
Laryngeal vibrations: Measurements of the glottic wave: Part 1. the normal vibratory cycle
,”
AMA Archiv. Otolaryngol.
68
(
1
),
1
19
(
1958
).
28.
P.
Alku
,
T.
Bäckström
, and
E.
Vilkman
, “
Normalized amplitude quotient for parametrization of the glottal flow
,”
J. Acoust. Soc. Am.
112
,
701
710
(
2002
).
29.
N.
Henrich
,
C.
d'Alessandro
, and
B.
Doval
, “
Spectral correlates of voice open quotient and glottal flow asymmetry: Theory, limits and experimental data
,” in
Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH ‘01)
, Aalborg, Denmark (
2001
), pp.
47
50
.
30.
R. D.
Francesco
and
E.
Moulines
, “
Detection of the glottal closure by jumps in the statistical properties of the signal
,” in
First European Conference on Speech Communication and Technology (EUROSPEECH, 89)
, Paris, France (
1989
), pp.
2039
2042
.
31.
E.
Moulines
and
R.
Di Francesco
, “
Detection of the glottal closure by jumps in the statistical properties of the speech signal
,”
Speech Commun.
9
(
5
),
401
418
(
1990
).
32.
B.
Yegnanarayana
and
D. N.
Gowda
, “
Spectro-temporal analysis of speech signals using zero-time windowing and group delay function
,”
Speech Commun.
55
(
6
),
782
795
(
2013
).
33.
K. S. R.
Murty
and
B.
Yegnanarayana
, “
Epoch extraction from speech signals
,”
IEEE Trans. Audio Speech Language Processing
16
(
8
),
1602
1613
(
2008
).
34.
M.
Anand Joseph
,
S.
Guruprasad
, and
B.
Yegnanarayana
, “
Extracting formants from short segments of speech using group delay functions
,” in
Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH ‘06)
, Pittsburgh, PA (
2006
), pp.
1009
1012
.
35.
R. S.
Prasad
and
B.
Yegnanarayana
, “
Acoustic segmentation of speech using zero time liftering
,” in
Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH ‘13)
, Lyon, France (
2013
), pp.
2292
2296
.
36.
D. G.
Childers
and
C.-F.
Wong
, “
Measuring and modeling vocal source-tract interaction
,”
IEEE Trans. Biomed. Eng.
41
(
7
),
663
671
(
1994
).
37.
A.
Barney
,
A.
De Stefano
, and
N.
Henrich
, “
The effect of glottal opening on the acoustic response of the vocal tract
,”
Acta Acust. Acust.
93
(
6
),
1046
1056
(
2007
).
38.
N.
Dhananjaya
and
B.
Yegnanarayana
, “
Voiced/nonvoiced detection based on robustness of voiced epochs
,”
IEEE Signal Processing Lett.
17
(
3
),
273
276
(
2010
).
39.
J.
Kominek
and
A. W.
Black
, “
The CMU-ARCTIC speech databases
,” in
Fifth ISCA Workshop on Speech Synthesis
, Pittsburgh, PA (
2004
).
40.
Department of Electrical and Electronic Engineering, Imperial College London, http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html (Last viewed July 6,
2016
).
41.
A.
Bouzid
and
N.
Ellouze
, “
Voice source parameter measurement based on multi-scale analysis of electroglottographic signal
,”
Speech Commun.
51
(
9
),
782
792
(
2009
).
42.
J.
Pérez
and
A.
Bonafonte
, “
Automatic voice-source parameterization of natural speech
,” in
Proceedings of the International Conference on Spoken Language Processing (INTERSPEECH ‘05)
, Lisboa, Lisbon (
2005
), pp.
1065
1068
.
43.
F.
Gunnar
,
The Acoustic Theory of Speech Production
(
Mouton
,
the Hague, the Netherlands
,
1960
).
You do not currently have access to this content.