This paper describes the methods of statistical analysis used to classify letters in a feature‐based, speaker‐independent isolated letter recognition system. A hierarchical decision structure was implemented so that decisions at each node of the decision tree could be made on the basis of a small number of relevant features. For example, the 26 letters were first classified into vowel categories on the basis of first and second formant frequencies. The specific decisions, and the features used to make them, were selected by a clustering analysis of training data. At each decision node of the recognition system the test utterance was first analyzed using Fisher linear discriminant functions, with threshold weights individually set for each pairwise decision in order to minimize misclassifications. When a decision could not be made with certainty, classification was performed using a maximum likelihood procedure assuming multivariate Gaussian statistics. The sequential use of nonparametric and parametric discriminant functions produced superior classification to that obtained with either of the separate analyses. The overall system structure will be discussed in terms of practical tradeoffs between the number of features used at each decision node and the system's overall probability of error. [Work supported by NSF.]
Skip Nav Destination
,
,
,
Article navigation
November 1982
August 12 2005
Decisions about features
Scott M. Brill;
Scott M. Brill
Department of Computer Science, Carnegie‐Mellon University, Pittsburgh, PA 15213
Department of Electrical Engineering, Carnegie‐Mellon University, Pittsburgh, PA 15213
Search for other works by this author on:
Michael S. Phillips;
Michael S. Phillips
Department of Computer Science, Carnegie‐Mellon University, Pittsburgh, PA 15213
Department of Electrical Engineering, Carnegie‐Mellon University, Pittsburgh, PA 15213
Search for other works by this author on:
Moshe J. Lasry;
Moshe J. Lasry
Department of Computer Science, Carnegie‐Mellon University, Pittsburgh, PA 15213
Department of Electrical Engineering, Carnegie‐Mellon University, Pittsburgh, PA 15213
Search for other works by this author on:
Richard M. Stern
Richard M. Stern
Department of Computer Science, Carnegie‐Mellon University, Pittsburgh, PA 15213
Department of Electrical Engineering, Carnegie‐Mellon University, Pittsburgh, PA 15213
Search for other works by this author on:
Scott M. Brill
,
Michael S. Phillips
,
Moshe J. Lasry
,
Richard M. Stern
,
Department of Computer Science, Carnegie‐Mellon University, Pittsburgh, PA 15213
J. Acoust. Soc. Am. 72, S32 (1982)
Citation
Scott M. Brill, Michael S. Phillips, Moshe J. Lasry, Richard M. Stern; Decisions about features. J. Acoust. Soc. Am. 1 November 1982; 72 (S1): S32. https://doi.org/10.1121/1.2019826
Download citation file:
11
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
I can't hear you without my glasses
Tessa Bent
Related Content
Very large vocabulary recognition (VLVR): using prosodic and spectral filters
J. Acoust. Soc. Am. (August 2005)
A powerful post‐processing algorithm for time domain pitch trackers
J. Acoust. Soc. Am. (August 2005)
Alignment classification method to facilitate automatic acoustic‐phonetic statistics collection
J. Acoust. Soc. Am. (August 2005)
Unifying dynamic programming methods
J. Acoust. Soc. Am. (August 2005)