Dynamic programming has come to be widely used and accepted in automatic speech recognition. However, two different but similar applications have often been described more in terms of their differences than their similarities. On the one hand, dynamic programming is used to find the best nonlinear dynamic time warping to align two instances of a word. On the other hand, dynamic programming may be used to find the best state sequence for a hidden Markov process. Not only are these procedures essentially equivalent, but significant generalization comes from an explicit unification. Dynamic programming may be used not only to align two instances of a word, but also to align an instance of a word with an arbitrary finite state model for the word, or even to align two arbitray models. Multiple instances of a word may contribute to a single model, and multiple passes on a finite set of training data can be used to further refine word models.
Skip Nav Destination
Article navigation
November 1982
August 12 2005
Unifying dynamic programming methods
James K. Baker
James K. Baker
DRAGON Systems, Inc., 173 Highland Street, West Newton, MA 02165
Search for other works by this author on:
James K. Baker
DRAGON Systems, Inc., 173 Highland Street, West Newton, MA 02165
J. Acoust. Soc. Am. 72, S32 (1982)
Citation
James K. Baker; Unifying dynamic programming methods. J. Acoust. Soc. Am. 1 November 1982; 72 (S1): S32. https://doi.org/10.1121/1.2019830
Download citation file:
114
Views
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Alignment classification method to facilitate automatic acoustic‐phonetic statistics collection
J. Acoust. Soc. Am. (August 2005)
A powerful post‐processing algorithm for time domain pitch trackers
J. Acoust. Soc. Am. (August 2005)
Decisions about features
J. Acoust. Soc. Am. (August 2005)
Very large vocabulary recognition (VLVR): using prosodic and spectral filters
J. Acoust. Soc. Am. (August 2005)