Researchers in the past have suggested several acoustic correlates of nasalization including extra pole‐zero pairs near the first formant (F1), a reduction in F1 amplitude, and an increase in F1 bandwidth. Even though these correlates have been known for a long time, considerable work is still needed to automate the extraction of acoustic parameters (APs) for nasalization. This work looked at 37 different APs which were pared down to 8 APs based on F statistic obtained by ANOVA. In preliminary experiments, an accuracy of 69.79% has been obtained for the task of discriminating between oral and nasalized vowels on the TIMIT database using a support vector Machine (SVM)‐based classifier. The classification was done on a frame basis, and a segment was declared nasalized if more than 30% of the frames were found to be nasalized. Note that all vowels adjacent to nasal consonants were assumed to be nasalized. Thus, the accuracy may actually be higher since some vowels before nasal consonants may not be nasalized. Further, these results were obtained by using a linear kernel in SVMs. We hope the results would improve when a radial basis function kernel is used. [Work supported by Honda and NSF Grant BCS0236707.]
Skip Nav Destination
,
Article navigation
November 2006
Meeting abstract. No PDF available.
November 01 2006
Automatic detection of vowel nasalization using knowledge‐based acoustic parameters
Tarun Pruthi;
Tarun Pruthi
Dept. of Elec. Eng. and Inst. of Systems Res., Univ. of Maryland, College Park, MD 20742
Search for other works by this author on:
Carol Y. Espy‐Wilson
Carol Y. Espy‐Wilson
Dept. of Elec. Eng. and Inst. of Systems Res., Univ. of Maryland, College Park, MD 20742
Search for other works by this author on:
Tarun Pruthi
Carol Y. Espy‐Wilson
Dept. of Elec. Eng. and Inst. of Systems Res., Univ. of Maryland, College Park, MD 20742
J. Acoust. Soc. Am. 120, 3377 (2006)
Citation
Tarun Pruthi, Carol Y. Espy‐Wilson; Automatic detection of vowel nasalization using knowledge‐based acoustic parameters. J. Acoust. Soc. Am. 1 November 2006; 120 (5_Supplement): 3377. https://doi.org/10.1121/1.4781609
Download citation file:
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
Related Content
Acoustic parameters for nasality based on a model of the auditory cortex
J. Acoust. Soc. Am. (May 2006)
Simulating and understanding the effects of velar coupling area on nasalized vowel spectra
J. Acoust. Soc. Am. (September 2005)
Knowledge‐based formant tracking with confidence measure using dynamic programming
J. Acoust. Soc. Am. (September 2005)
Simulation and analysis of nasalized vowels based on magnetic resonance imaging data
J. Acoust. Soc. Am. (June 2007)
The effect of articulatory placement on acoustic characteristics of nasalization.
J. Acoust. Soc. Am. (October 2008)