Emotional information in speech is commonly described in terms of prosody features such as F0, duration, and energy. In this paper, the focus is on how F0 characteristics can be used to effectively parametrize emotional quality in speech signals. Using an analysis-by-synthesis approach, F0 mean, range, and shape properties of emotional utterances are systematically modified. The results show the aspects of the F0 parameter that can be modified without causing any significant changes in the perception of emotions. To model this behavior the concept of emotional regions is introduced. Emotional regions represent the variability present in the emotional speech and provide a new procedure for studying speech cues for judgments of emotion. The method is applied to F0 but can be also used on other aspects of prosody such as duration or loudness. Statistical analysis of the factors affecting the emotional regions, and discussion of the effects of F0 modifications on the emotion and speech quality perception are also presented. The results show that F0 range is more important than F0 mean for emotion expression.
Skip Nav Destination
Article navigation
June 2008
June 01 2008
On the robustness of overall F0-only modifications to the perception of emotions in speech
Murtaza Bulut;
Murtaza Bulut
a)
Signal Analysis and Interpretation Laboratory, http://sail.usc.edu, Electrical Engineering Department,
University of Southern California
, Los Angeles, California 90089
Search for other works by this author on:
Shrikanth Narayanan
Shrikanth Narayanan
Signal Analysis and Interpretation Laboratory, http://sail.usc.edu, Electrical Engineering Department,
University of Southern California
, Los Angeles, California 90089
Search for other works by this author on:
a)
Author to whom correspondence should be addressed. Electronic mail: murtaza.bulut@philips.com
J. Acoust. Soc. Am. 123, 4547–4558 (2008)
Article history
Received:
October 26 2006
Accepted:
March 24 2008
Citation
Murtaza Bulut, Shrikanth Narayanan; On the robustness of overall F0-only modifications to the perception of emotions in speech. J. Acoust. Soc. Am. 1 June 2008; 123 (6): 4547–4558. https://doi.org/10.1121/1.2909562
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Could not validate captcha. Please try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00
Citing articles via
Related Content
F0 control in electrolarynx speech.
J Acoust Soc Am (October 2008)
Distinct relative F0 levels elicit categorical effects in F0 maximum and minimum alignment
J Acoust Soc Am (May 2004)
The role of f0 on acquisition of a phonological contrast in Korean stop system
J Acoust Soc Am (April 2016)
FM detection and F0 discrimination of complex tones
J Acoust Soc Am (May 1994)
Robust F0 estimation based on complex speech analysis
J Acoust Soc Am (November 2006)