Hidden Markov models are a powerful tool for building automatic speech recognition systems for continuous speech. In addition to frame‐by‐frame (static) information, dynamic information is introduced by taking the time derivative of the static parameter vectors. The static and dynamic parameters are processed in separate codebooks, and the resulting observations are then integrated at a probabilistic level [Gupta et al., Proc. ICASSP, 697–700 (1987)]. A single‐user, connected‐word recognition system is developed for a subvocabulary of the Dutch language (appropriate for banking transactions). The phone models (used as subword units) are made more robust by applying the co‐occurrence smoothing algorithm [K. F. Lee and H. W. Hon, IEEE Trans. Acoust. Speech Signal Process. ASSP‐37(11), 1641–1648 (1989)], which enables accurate recognition even with limited training data. Results will be presented at the meeting.
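As an illustration of the feature pipeline described above, the following minimal sketch computes dynamic parameters as a time difference of the static vectors, quantizes each stream with its own codebook, and combines the two discrete observations in a single state output probability. The abstract does not give the actual formulas, so everything here is an assumption made for illustration: the central-difference approximation of the derivative, the nearest-codeword quantizer, the treatment of the two streams as independent (a product of per-stream probabilities), and the function names themselves.

```python
import math


def delta_features(frames):
    """Approximate the time derivative of static vectors.

    Assumption: a central difference over one frame on each side,
    with the first and last frames replicated at the edges.
    """
    n = len(frames)
    deltas = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]
        nxt = frames[min(t + 1, n - 1)]
        deltas.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return deltas


def quantize(vector, codebook):
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    def dist(k):
        return sum((v - c) ** 2 for v, c in zip(vector, codebook[k]))
    return min(range(len(codebook)), key=dist)


def joint_log_prob(static_idx, delta_idx, b_static, b_delta):
    """Log output probability of one HMM state for a pair of codebook indices.

    Assumption: the static and dynamic streams are treated as independent,
    so their discrete output probabilities multiply (log probabilities add).
    """
    return math.log(b_static[static_idx]) + math.log(b_delta[delta_idx])


# Toy data: three 2-dimensional static frames and hypothetical codebooks.
frames = [[0.0, 1.0], [2.0, 1.0], [4.0, 3.0]]
deltas = delta_features(frames)

static_codebook = [[0.0, 1.0], [4.0, 3.0]]
delta_codebook = [[1.0, 0.0], [2.0, 1.0]]

s_idx = quantize(frames[1], static_codebook)
d_idx = quantize(deltas[1], delta_codebook)

# Hypothetical per-state discrete output distributions, one per codebook.
log_p = joint_log_prob(s_idx, d_idx, [0.5, 0.5], [0.25, 0.75])
```

Keeping the streams in separate codebooks lets each quantizer stay small, at the cost of the independence assumption made when the probabilities are combined.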
May 1990
August 13 2005
Phone recognition in continuous speech (Dutch)
Paul van Alphen
Inst. of Phonetic Sci., Univ. of Amsterdam, Herengracht 338, 1016 CG Amsterdam, The Netherlands
J. Acoust. Soc. Am. 87, S107 (1990)
Citation
Paul van Alphen; Phone recognition in continuous speech (Dutch). J. Acoust. Soc. Am. 1 May 1990; 87 (S1): S107. https://doi.org/10.1121/1.2027812