Acoustic cues related to the voice source, including harmonic structure and spectral tilt, were examined for relevance to prosodic boundary detection. The measurements considered here comprise five categories: duration, pitch, harmonic structure, spectral tilt, and amplitude. Distributions of the measurements and statistical analysis show that the measurements may be used to differentiate between prosodic categories. Detection experiments on the Boston University Radio Speech Corpus show equal error detection rates around 70% for accent and boundary detection, using only the acoustic measurements described, without any lexical or syntactic information. Further investigation of the detection results shows that duration and amplitude measurements, and, to a lesser degree, pitch measurements, are useful for detecting accents, while all voice source measurements except pitch measurements are useful for boundary detection.
Skip Nav Destination
October 01 2005
Finding intonational boundaries using acoustic cues related to the voice source
Jeung-Yoon Choi, Mark Hasegawa-Johnson, Jennifer Cole; Finding intonational boundaries using acoustic cues related to the voice source. J. Acoust. Soc. Am. 1 October 2005; 118 (4): 2579–2587. https://doi.org/10.1121/1.2010288
Download citation file: