All-pole modeling is a widely used formant estimation method, but its performance is known to deteriorate for high-pitched voices. In order to address this problem, several all-pole modeling methods robust to fundamental frequency have been proposed. This study compares five such previously known methods and introduces a technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes temporally weighted linear prediction (LP) in which the square of the prediction error is multiplied by a given parametric weighting function. The weighting downgrades the contribution of the main excitation of the vocal tract in optimizing the filter coefficients. Consequently, the resulting all-pole model is affected more by the characteristics of the vocal tract leading to less biased formant estimates. By using synthetic vowels created with a physical modeling approach, the results showed that WLP-AME yields improved formant frequencies for high-pitched sounds in comparison to the previously known methods (e.g., relative error in the first formant of the vowel [a] decreased from 11% to 3% when conventional LP was replaced with WLP-AME). Experiments conducted on natural vowels indicate that the formants detected by WLP-AME changed in a more regular manner between repetitions of different pitch than those computed by conventional LP.
Skip Nav Destination
Article navigation
August 2013
August 01 2013
Formant frequency estimation of high-pitched vowels using weighted linear predictiona)
Paavo Alku;
Paavo Alku
b)
Department of Signal Processing and Acoustics, Aalto University
, P.O. Box 13000, FI-00076 Aalto, Finland
Search for other works by this author on:
Jouni Pohjalainen;
Jouni Pohjalainen
Department of Signal Processing and Acoustics, Aalto University
, P.O. Box 13000, FI-00076 Aalto, Finland
Search for other works by this author on:
Martti Vainio;
Martti Vainio
Institute of Behavioural Sciences, SigMe Group, University of Helsinki
, P.O. Box 9, FI-00014 Helsinki, Finland
Search for other works by this author on:
Anne-Maria Laukkanen;
Anne-Maria Laukkanen
Department of Speech Communication and Voice Research, University of Tampere
, Fi-33014 Tampere, Finland
Search for other works by this author on:
Brad H. Story
Brad H. Story
Speech Acoustics Laboratory, University of Arizona
, Tucson, Arizona 85721
Search for other works by this author on:
b)
Author to whom correspondence should be addressed. Electronic mail: [email protected]
a)
Portions of this work were presented in “Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction,” Proceedings of Interspeech, Portland, OR, September 2012.
J. Acoust. Soc. Am. 134, 1295–1313 (2013)
Article history
Received:
January 02 2012
Accepted:
June 10 2013
Citation
Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne-Maria Laukkanen, Brad H. Story; Formant frequency estimation of high-pitched vowels using weighted linear prediction. J. Acoust. Soc. Am. 1 August 2013; 134 (2): 1295–1313. https://doi.org/10.1121/1.4812756
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
All we know about anechoic chambers
Michael Vorländer
Day-to-day loudness assessments of indoor soundscapes: Exploring the impact of loudness indicators, person, and situation
Siegbert Versümer, Jochen Steffens, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Accuracy and variability of acoustic measures of voicing onset
J Acoust Soc Am (January 2003)
Adaptive auditory feedback control of the production of formant trajectories in the Mandarin triphthong /iau/ and its pattern of generalization
J. Acoust. Soc. Am. (October 2010)
Quasi-closed phase forward-backward linear prediction analysis of speech for accurate formant detection and estimation
J. Acoust. Soc. Am. (September 2017)
Vocal tract area functions and formant frequencies in opera tenors’ modal and falsetto registers
J. Acoust. Soc. Am. (June 2011)
Developmental and cross-linguistic variation in the infant vowel space: The case of Canadian English and Canadian French
J. Acoust. Soc. Am. (October 2006)