This paper studies formant and harmonic contours by processing the group delay (GD) spectrograms of speech signals. The GD spectrum is the negative derivative of the phase spectrum with respect to frequency. A recent study shows that the GD spectrogram can be obtained without phase wrapping. Formant frequency contours can be observed in the display of the peaks of the instantaneous wideband equivalent GD spectrogram, derived using the modified single frequency filtering (SFF) analysis of speech signals. Harmonic frequency contours can be observed in the display of the peaks of the instantaneous narrowband equivalent GD spectrogram, derived using the same modified SFF analysis. For synthetic speech signals, the observed formant contours match the ground truth formant contours from which the signal is derived. For natural speech signals, the observed formant contours approximately match the given ground truth formant contours, mostly in the voiced regions. The results are illustrated for several randomly selected utterances from the TIMIT database. While this study helps in observing the formant contours in the display, automatic extraction of the formant frequencies needs further processing, which requires logic for eliminating spurious points without forcing the number of formants.
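The definition of the GD spectrum given above can be illustrated with a minimal sketch. It does not reproduce the modified SFF analysis described in the abstract; it uses the standard DFT identity for group delay, which avoids explicit phase computation (and hence phase unwrapping) by combining the transforms of x[n] and n·x[n]. The function name `group_delay` and the parameters `n_fft` and `eps` are hypothetical, chosen only for this illustration.

```python
import numpy as np

def group_delay(x, n_fft=None, eps=1e-12):
    """Group delay (in samples) of a finite signal x via the DFT identity
    tau(w) = (X_R*Y_R + X_I*Y_I) / |X|^2, where Y is the DFT of n*x[n].
    This equals the negative derivative of the phase of X with respect
    to frequency, computed without ever forming the phase itself."""
    x = np.asarray(x, dtype=float)
    n_fft = n_fft or len(x)
    n = np.arange(len(x))
    X = np.fft.rfft(x, n_fft)        # spectrum of x[n]
    Y = np.fft.rfft(n * x, n_fft)    # spectrum of n*x[n]
    # eps is an assumed floor that guards against division by zero
    # at spectral nulls; it is not part of the identity itself.
    power = np.maximum(np.abs(X) ** 2, eps)
    return (X.real * Y.real + X.imag * Y.imag) / power

# Sanity check: an impulse delayed by 5 samples has a flat group delay of 5.
x = np.zeros(64)
x[5] = 1.0
tau = group_delay(x)
```

For the delayed impulse, `tau` is constant across all frequency bins, which matches the interpretation of group delay as the delay of the signal energy at each frequency.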
October 2024
October 11 2024
Processing group delay spectrograms for study of formant and harmonic contours in speech signals
B. Yegnanarayana a)
International Institute of Information Technology, Hyderabad 500032, India
Vishala Pannala b)
Department of Artificial Intelligence and Data Science, Koneru Lakshmaiah Education Foundation, Hyderabad 500075, India
a) Email: [email protected]
b) Email: [email protected]
J. Acoust. Soc. Am. 156, 2422–2433 (2024)
Article history
Received: November 20 2023
Accepted: September 20 2024
Citation
B. Yegnanarayana, Vishala Pannala; Processing group delay spectrograms for study of formant and harmonic contours in speech signals. J. Acoust. Soc. Am. 1 October 2024; 156 (4): 2422–2433. https://doi.org/10.1121/10.0032364