This study investigates rhythmic features based on the short-time energy function of speech signals, with the aim of finding robust, speaker-independent features that indicate speaker intoxication. Data from the German Alcohol Language Corpus were analyzed; the corpus comprises read, spontaneous, and command-and-control speech uttered by 162 speakers of both genders and various age groups when sober and when intoxicated. Energy contours were compared directly (Root Mean Squared Error, statistical correlation, or the Euclidean distance in the spectral space of the contour) and by parameterization of the contour using the Discrete Cosine Transform (DCT) and the first and second moments of the lower DCT spectrum. Contours were also analyzed by Principal Components Analysis, aiming at fundamental “eigen contour” changes that might encode intoxication. Energy contours differ significantly with intoxication in terms of distance measures, the second and fourth DCT coefficients, and the first and second moments of the lower DCT spectrum. Principal Components Analysis did not yield interpretable “eigen contours” that could be used to distinguish intoxicated from sober contours.
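The contour measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The frame length, hop size, resampling length, number of "lower" DCT coefficients, and the exact definition of the spectral moments are all assumptions made for illustration, and every function name is hypothetical.

```python
import numpy as np


def short_time_energy(signal, frame_len=400, hop=160):
    """Mean squared amplitude per frame.

    Assumption: 25 ms frames with a 10 ms hop at 16 kHz; the abstract
    does not state the framing that was used.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    return np.array([
        np.mean(signal[i * hop:i * hop + frame_len] ** 2)
        for i in range(n_frames)
    ])


def resample_contour(contour, n_points=64):
    """Linearly resample a contour to a fixed length so that two contours
    can be compared point by point (assumed preprocessing step)."""
    contour = np.asarray(contour, dtype=float)
    old_x = np.arange(len(contour))
    new_x = np.linspace(0, len(contour) - 1, n_points)
    return np.interp(new_x, old_x, contour)


def dct2(x):
    """Unnormalized DCT-II: X[k] = sum_n x[n] * cos(pi/N * (n + 0.5) * k)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    return np.cos(np.pi / n * (m + 0.5) * k) @ x


def rmse(a, b):
    """Root mean squared error between two equal-length contours."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(np.sqrt(np.mean((a - b) ** 2)))


def correlation(a, b):
    """Pearson correlation between two equal-length contours."""
    return float(np.corrcoef(a, b)[0, 1])


def spectral_distance(a, b):
    """Euclidean distance between the DCT coefficients of two contours."""
    return float(np.linalg.norm(dct2(a) - dct2(b)))


def lower_spectrum_moments(contour, n_low=10):
    """First and second moments of the lowest n_low DCT coefficients.

    Assumption: moments are taken over the coefficient index k, weighted
    by the coefficient magnitude (a centroid and a spread).
    """
    mags = np.abs(dct2(contour)[:n_low])
    k = np.arange(len(mags))
    total = mags.sum()
    first = float((k * mags).sum() / total)
    second = float(((k - first) ** 2 * mags).sum() / total)
    return first, second
```

In this sketch, a sober and an intoxicated recording of the same utterance would each be passed through `short_time_energy` and `resample_contour`, after which the pair can be compared with `rmse`, `correlation`, and `spectral_distance`, or each contour can be summarized on its own with `lower_spectrum_moments`.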
May 01 2014
The influence of alcoholic intoxication on the short-time energy function of speech
Christian Heinrich, Florian Schiel; The influence of alcoholic intoxication on the short-time energy function of speech. J. Acoust. Soc. Am. 1 May 2014; 135 (5): 2942–2951. https://doi.org/10.1121/1.4870705