In recent years, the development of a feature‐based general statistical framework has been pursued for automatic speech recognition via novel designs of minimal or atomic units of speech, aiming at a parsimonious scheme to share the interword and interphone speech data and at a unified way to account for the context‐dependent behaviors in speech. The basic design philosophy has been motivated by the theory of distinctive features and by a new form of phonology which argues for use of multidimensional articulatory structures. In this paper, the most recently developed feature‐based recognizer is presented, which is capable of operating on all classes of English sounds. Detailed descriptions of the design considerations for the recognizer and of key aspects of the design process are provided. This process, which is called lexicon ‘‘compilation,’’ consists of three elements (1) establishing a feature‐specification system; (2) constructing a probabilistic and fractional temporal overlapping pattern across the features; and (3) mapping from the feature‐overlap pattern to a state‐transition graph. A standard phonetic classification task from the TIMIT database is used as a test bed to evaluate the performance of the recognizer. The experimental results provide preliminary evidence for the effectiveness of the feature‐based approach to speech recognition.
Skip Nav Destination
Article navigation
May 1994
May 01 1994
A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
Li Deng;
Li Deng
Spoken Language Systems Group, Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
Search for other works by this author on:
Don X. Sun
Don X. Sun
Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, New York 11794‐3600
Search for other works by this author on:
J. Acoust. Soc. Am. 95, 2702–2719 (1994)
Article history
Received:
February 15 1993
Accepted:
November 30 1993
Citation
Li Deng, Don X. Sun; A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features. J. Acoust. Soc. Am. 1 May 1994; 95 (5): 2702–2719. https://doi.org/10.1121/1.409839
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Vowel signatures in emotional interjections and nonlinguistic vocalizations expressing pain, disgust, and joy across languages
Maïa Ponsonnet, Christophe Coupé, et al.
The alveolar trill is perceived as jagged/rough by speakers of different languages
Aleksandra Ćwiek, Rémi Anselme, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Interword coarticulation modeling for continuous speech recognition
J Acoust Soc Am (August 2005)
Analysis on the effects of tonal coarticulation at word and nonword syllable boundaries of Mandarin based on the tone nucleus model
J Acoust Soc Am (November 2006)
Control of speech prosody in Broca's aphasia
J Acoust Soc Am (August 2005)
The effects of four variables on the intelligibility of synthesized sentences
J Acoust Soc Am (October 2003)
A coarticulation model for continuous digit recognition
J Acoust Soc Am (August 2005)