A method of quantizing the output of a vocoder‐type speech analyzer is described. This quantization effectively classifies the sound at each instant into one of a small number (typically, less than 100) of classes, but in such a way as to allow relatively accurate reconstruction of the original unquantized voltages The boundaries of the classes are chosen on the basis of the statistical distribution of the voltages, using redundancies which may exist to reduce the number of classes to a low value. Experimental results are presented, illustrating classification schemes resulting from this procedure. These schemes bear some resemblance to conventional phonetic analyses of speech sounds, but they have the advantage of being derivable by purely mechanical means, without the use of human intuition or judgment. The use of such sound classification for automatic speech recognition is discussed.
Skip Nav Destination
Article navigation
June 1961
June 01 1961
Speech Sound Classification Based on Signal Statistics Free
R. Bakis
R. Bakis
International Business Machines Corporation Research Center, Yorktown Heights, New York
Search for other works by this author on:
R. Bakis
International Business Machines Corporation Research Center, Yorktown Heights, New York
J. Acoust. Soc. Am. 33, 852 (1961)
Citation
R. Bakis; Speech Sound Classification Based on Signal Statistics. J. Acoust. Soc. Am. 1 June 1961; 33 (6_Supplement): 852. https://doi.org/10.1121/1.1936893
Download citation file:
33
Views
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Related Content
Decision Functions for Voiced‐Unvoiced‐Silence Detection
J. Acoust. Soc. Am. (June 1961)
Phasor Analysis of the Stereophonic Phenomena
J. Acoust. Soc. Am. (June 1961)
Automatic Talker Recognition Using Time‐Frequency Pattern Matching
J. Acoust. Soc. Am. (June 1961)
Measurement of Speaker Recognition
J. Acoust. Soc. Am. (June 1961)
Further Progress with Colorless Artificial Reverberation
J. Acoust. Soc. Am. (June 1961)