Because of their high correlation and reduced sensitivity to quantization errors, the line spectrum pair (LSP) frequency parameters have been used recently for efficient quantization of LPC information for speech coding. In the present paper, the LSP representation is used for speech recognition and a new perception‐based LSP distance measure is proposed. This distance measure exploits the following two properties of the speech perception process [D. H. Klatt, Proc. ICASSP, 1278–1281 (1982)]: (1) The formant frequencies are the most important parameters for speech perception: and (2) the formant bandwidths and spectral tilt contribute very little to speech perception. The present distance measure uses these two properties in the form of weights in a weighted Euclidean distance measure. In order to derive these weights, the LPC power spectrum P(f) is computed for each speech frame and the weight for a given LSP frequency ft is taken to be proportional to [P(ft)]c. The perception‐based LSP distance measure is studied here on a speaker‐dependent speech recognition task. The optimum value of the exponent c is found here to be 0.15. The perception‐based LSP distance measure results in a recognition score of 95.7%, while the recognition accuracy is found to be 89.9% by using the conventional Euclidean distance measure on LSP parameters and 94.1% by using the liftered cepstral distance measure [B. H. Juang et al., IEEE Trans. Acoust. Speech Signal Process. ASSP‐35, 947–954 (1987)].
Skip Nav Destination
Article navigation
November 1988
August 13 2005
A perception‐based LSP distance measure for speech recognition
K. K. Paliwal
K. K. Paliwal
Tata Institute of Fundamental Research, Homi Bhabha Road, Bombay 400005, India
Search for other works by this author on:
J. Acoust. Soc. Am. 84, S14–S15 (1988)
Citation
K. K. Paliwal; A perception‐based LSP distance measure for speech recognition. J. Acoust. Soc. Am. 1 November 1988; 84 (S1): S14–S15. https://doi.org/10.1121/1.2025867
Download citation file:
Citing articles via
All we know about anechoic chambers
Michael Vorländer
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Performance study of ray-based ocean acoustic tomography methods for estimating submesoscale variability in the upper ocean
Etienne Ollivier, Richard X. Touret, et al.
Related Content
Evaluation of cepstral lifters for articulatory codebook search
J Acoust Soc Am (August 2005)
An approach to noise cancellation for speech recognition systems
J Acoust Soc Am (August 2005)
Unsupervised speaker adaptation of spectra based on a minimum fuzzy vector quantization error criterion
J Acoust Soc Am (August 2005)
Nonlinear time‐scale modification of speech signal with varied segmental duration characteristics
J Acoust Soc Am (August 2005)
Evaluation of lifters by frequency response
J Acoust Soc Am (August 2005)