This work aims to detect vowel place as part of a knowledge‐based speech recognition system. Vowel place was classified into 6 groups based on tongue advancement [Front/Back] and height [High/Mid/Low]. Experiments were performed using 300 /hVd/ utterance data from Hillenbrand [J. Acoust. Soc. Am. 97, 3099‐3111] and 6600 TIMIT vowels. Features used include fundamental frequency (F0) and formant value (F1̃F3), where formant measurements were classified into separate groups using F0 measurements. The nearest class was found using a simple Mahalanobis distance measure, and yielded a 91.5% classification rate for the /hVd/ data. The results for the TIMIT data were 64.4%, and error analysis with regard to adjacent segment manner and place was carried out to observe the effects of coarticulation, which was not observed in the /hVd/ data.
Skip Nav Destination
Article navigation
Meeting abstract. No PDF available.
May 01 2008
Vowel place detection for a knowledge‐based speech recognition system
Sukmyung Lee;
Sukmyung Lee
Yonsei University, 134 Sinchon‐dong, Seodaemun‐gu, 120‐749 Seoul, Republic of Korea, pooh390@dsp.yonsei.ac.kr
Search for other works by this author on:
Jeung‐Yoon Choi
Jeung‐Yoon Choi
Yonsei University, 134 Sinchon‐dong, Seodaemun‐gu, 120‐749 Seoul, Republic of Korea, jychoi@yonsei.ac.kr
Search for other works by this author on:
J. Acoust. Soc. Am. 123, 3330 (2008)
Citation
Sukmyung Lee, Jeung‐Yoon Choi; Vowel place detection for a knowledge‐based speech recognition system. J. Acoust. Soc. Am. 1 May 2008; 123 (5_Supplement): 3330. https://doi.org/10.1121/1.2933844
Download citation file:
7
Views