The measurement of formant frequencies of vowels is among the most common measurements in speech studies, but measurements are known to be biased by the particular fundamental frequency (F0) exciting the formants. Approaches to reducing the errors were assessed in two experiments. In the first, synthetic vowels were constructed with five different first formant (F1) values and nine different F0 values; formant bandwidths, and higher formant frequencies, were constant. Input formant values were compared to manual measurements and automatic measures using the linear prediction coding-Burg algorithm, linear prediction closed-phase covariance, the weighted linear prediction-attenuated main excitation (WLP-AME) algorithm [Alku, Pohjalainen, Vainio, Laukkanen, and Story (2013). J. Acoust. Soc. Am. 134(2), 1295–1313], spectra smoothed cepstrally and by averaging repeated discrete Fourier transforms. Formants were also measured manually from pruned reassigned spectrograms (RSs) [Fulop (2011). Speech Spectrum Analysis (Springer, Berlin)]. All but WLP-AME and RS had large errors in the direction of the strongest harmonic; the smallest errors occur with WLP-AME and RS. In the second experiment, these methods were used on vowels in isolated words spoken by four speakers. Results for the natural speech show that F0 bias affects all automatic methods, including WLP-AME; only the formants measured manually from RS appeared to be accurate. In addition, RS coped better with weaker formants and glottal fry.
Skip Nav Destination
Article navigation
February 2016
February 09 2016
Comparing measurement errors for formants in synthetic and natural vowelsa)
Christine H. Shadle;
Christine H. Shadle
b)
Haskins Laboratories
, 300 George Street, New Haven, Connecticut 06511, USA
Search for other works by this author on:
Hosung Nam;
Hosung Nam
c)
Haskins Laboratories
, 300 George Street, New Haven, Connecticut 06511, USA
Search for other works by this author on:
D. H. Whalen
D. H. Whalen
d)
Haskins Laboratories
, 300 George Street, New Haven, Connecticut 06511, USA
Search for other works by this author on:
b)
Electronic mail: [email protected]
c)
Also at: Department of English Language and Literature, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul, 136-701, South Korea.
d)
Also at: Program in Speech-Language-Hearing Sciences, CUNY Graduate Center, 365 Fifth Ave., New York, NY 10016, USA.
a)
Portions of this work were presented at the ASA meetings in San Francisco, December 2013, and Providence, May 2014.
J. Acoust. Soc. Am. 139, 713–727 (2016)
Article history
Received:
August 04 2014
Accepted:
January 08 2016
Citation
Christine H. Shadle, Hosung Nam, D. H. Whalen; Comparing measurement errors for formants in synthetic and natural vowels. J. Acoust. Soc. Am. 1 February 2016; 139 (2): 713–727. https://doi.org/10.1121/1.4940665
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
All we know about anechoic chambers
Michael Vorländer
Day-to-day loudness assessments of indoor soundscapes: Exploring the impact of loudness indicators, person, and situation
Siegbert Versümer, Jochen Steffens, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Accuracy of six techniques for measuring formants in isolated words
J Acoust Soc Am (April 2014)
Determination of glottal open regions by exploiting changes in the vocal tract system characteristics
J. Acoust. Soc. Am. (July 2016)
Formants are easy to measure; resonances, not so much: Lessons from Klatt (1986)
J. Acoust. Soc. Am. (August 2022)
F0-induced formant measurement errors result in biased variabilities
J. Acoust. Soc. Am. (May 2019)
How the peak glottal area affects linear predictive coding-based formant estimates of vowels
J. Acoust. Soc. Am. (July 2019)