The concept of a formant as a peak in the pressure spectrum is assumed to apply to both voiced and unvoiced speech. A scheme has been developed that estimates the frequencies and amplitudes of five formants. The cepstrum technique is used, along with intensity, zero‐crossing, and slope‐change information, to make voiced‐unvoiced decisions and to estimate the fundamental voicing frequency. An attempt is made to detect bursts of energy (due primarily to stop consonants) in the time waveform and to analyze them with sufficient time resolution to preserve the burst characteristic. The formant‐estimating procedure is based on assumed exclusive domains in frequency space for the formants. Several smoothing procedures are used to remove discontinuities from the formant and fundamental‐frequency data, and the smoothed values are used to control a five‐pole parallel synthesizer. The synthesizer is excited with a pulse train, noise, or a mixture of the two. Examples of the natural and synthetic speech are presented.
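As an illustration of the cepstrum step named in the abstract, the following is a minimal sketch of cepstral fundamental‐frequency estimation, with a zero‐crossing rate helper of the kind that could feed a voiced‐unvoiced decision. It is not the authors' implementation: the Hann window, the 60–400 Hz search range, and the function names `real_cepstrum`, `estimate_f0`, and `zero_crossing_rate` are all assumptions made for this example, and the abstract does not say how the cepstral, intensity, zero‐crossing, and slope‐change cues are combined.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def real_cepstrum(frame):
    """Real cepstrum: inverse transform of the log magnitude spectrum."""
    n = len(frame)
    # Hann window (an assumption; the abstract does not name a window).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]
    spectrum = fft([s * w for s, w in zip(frame, window)])
    log_mag = [math.log(abs(c) + 1e-12) for c in spectrum]
    # The log magnitude of a real signal is real and even, so its inverse
    # transform equals the real part of the forward transform divided by n.
    return [c.real / n for c in fft(log_mag)]


def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Return (f0 in Hz, cepstral peak height) for one frame.

    The search range f0_min..f0_max is an assumed, illustrative choice.
    """
    cep = real_cepstrum(frame)
    q_lo = int(sample_rate / f0_max)  # shortest pitch period, in samples
    q_hi = int(sample_rate / f0_min)  # longest pitch period, in samples
    peak = max(range(q_lo, q_hi + 1), key=lambda q: cep[q])
    return sample_rate / peak, cep[peak]


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return crossings / (len(frame) - 1)


if __name__ == "__main__":
    # Synthetic pulse train: one pulse every 64 samples at 8 kHz.
    pulses = [1.0 if i % 64 == 0 else 0.0 for i in range(1024)]
    f0, strength = estimate_f0(pulses, 8000)
    print(f0, strength)
```

The test signal has a pitch period of 64 samples at an 8 kHz sampling rate, which corresponds to 125 Hz. In a fuller system the cepstral peak height and the zero‐crossing rate would be compared against thresholds to label a frame voiced or unvoiced; those thresholds are not given in the abstract and are left out here.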
August 11 2005
Scheme for Automatic Formant Analysis of Speech
William J. Strong
Department of Physics, Brigham Young University, Provo, Utah 84601
R. Byron Purves
Department of Physics, Brigham Young University, Provo, Utah 84601
J. Acoust. Soc. Am. 51, 110–111 (1972)
Citation
William J. Strong, R. Byron Purves; Scheme for Automatic Formant Analysis of Speech. J. Acoust. Soc. Am. 1 January 1972; 51 (1A_Supplement): 110–111. https://doi.org/10.1121/1.1981293