The automatic recognition of stop and nasal consonants is known to be a difficult recognition task. This paper presents various techniques that can be used to improve the discrimination of stop and nasal consonants. An improved spectral representation for stop consonants is proposed, which unlike other feature representations, emphasizes the mid‐to‐high regions of the spectrum. A subspace projection approach, which is used as a preprocessing step in a hidden Markov model based system, is also proposed for improved nasal discrimination. This approach finds a transformation matrix which maps the original nasal observation onto a subspace such that the ‘‘distance’’ between the nasals is maximized on the subspace. Two statistical distance measures are investigated for finding the transformation matrix, namely the divergence and the Bhattacharyya measures. Results on stop and nasal consonant recognition will be presented using the subspace approach and the improved spectral stop representation.
Skip Nav Destination
,
,
Article navigation
November 1995
November 01 1995
Techniques for improved stop and nasal consonant discrimination
Philipos C. Loizou;
Philipos C. Loizou
Dept. of Elec. Eng., Arizona State Univ., Tempe, AZ 85287‐7206
Search for other works by this author on:
Michael F. Dorman;
Michael F. Dorman
Arizona State Univ., Tempe, AZ 85287‐0102
Search for other works by this author on:
Andreas S. Spanias
Andreas S. Spanias
Arizona State Univ., Tempe, AZ 85287‐7206
Search for other works by this author on:
Philipos C. Loizou
Michael F. Dorman
Andreas S. Spanias
Dept. of Elec. Eng., Arizona State Univ., Tempe, AZ 85287‐7206
J. Acoust. Soc. Am. 98, 2892 (1995)
Citation
Philipos C. Loizou, Michael F. Dorman, Andreas S. Spanias; Techniques for improved stop and nasal consonant discrimination. J. Acoust. Soc. Am. 1 November 1995; 98 (5_Supplement): 2892. https://doi.org/10.1121/1.414308
Download citation file:
94
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
Related Content
Clicks in a Chinese nursery rhyme
J. Acoust. Soc. Am. (November 1995)
Aerodynamic evidence of preconsonantal stop lenition in Taiwanese
J. Acoust. Soc. Am. (November 1995)
English /r/ and /l/ production in American and bilingual Japanese subjects
J. Acoust. Soc. Am. (November 1995)
Improvements in the perception of American English vowels by Brazilian bilinguals
J. Acoust. Soc. Am. (November 1995)
Acoustic cues for /θ/ in American English
J. Acoust. Soc. Am. (November 1995)