Natural speech typically contains various phenomena that deviate from formal modes of speaking such as read speech. It is well known that these paralinguistic phenomena play an important role in conveying the emotions and the state of the speaker in speech communication. This study attempts to extract that deviation as an acoustic "vagueness," defined by temporal and dynamical acoustic features of speech. In particular, it focuses on how the vagueness changes over a certain period of speech, such as a 10-minute presentation. Two acoustic features are used: (i) the modulation spectrum and (ii) syllable speed, which may be related to speech clarity and tempo. The experiments use 70 academic presentation speeches from the Corpus of Spontaneous Japanese (CSJ). The results show significant properties in the patterns of the modulation spectrum and the syllable speed, observed as differences between the beginning and ending periods of the presentations. This result will contribute to a humanlike speech dialog system.
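As an illustration only, the two features named in the abstract could be sketched roughly as follows. The abstract does not give the authors' actual procedure, so everything here is an assumption: the 10 ms RMS energy envelope, the Hann window, the 16 Hz upper limit, the reading of syllable speed as syllables per second, and all function names are hypothetical choices made for this sketch.

```python
import cmath
import math


def frame_rms_envelope(samples, sample_rate, frame_ms=10.0):
    """Return (RMS energy per frame, envelope sampling rate in Hz)."""
    frame_len = int(sample_rate * frame_ms / 1000.0)
    n_frames = len(samples) // frame_len
    envelope = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        envelope.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return envelope, sample_rate / frame_len


def modulation_spectrum(envelope, envelope_rate, max_hz=16.0):
    """Return [(modulation frequency in Hz, magnitude)] from a direct DFT.

    Assumption: the modulation spectrum is taken as the spectrum of the
    mean-removed, Hann-windowed energy envelope, up to max_hz.
    """
    n = len(envelope)
    mean = sum(envelope) / n
    windowed = [
        (e - mean) * (0.5 - 0.5 * math.cos(2.0 * math.pi * k / (n - 1)))
        for k, e in enumerate(envelope)
    ]
    # Never go past the Nyquist bin of the envelope signal.
    max_bin = min(int(max_hz * n / envelope_rate), n // 2)
    spectrum = []
    for b in range(1, max_bin + 1):
        acc = sum(
            w * cmath.exp(-2j * math.pi * b * k / n)
            for k, w in enumerate(windowed)
        )
        spectrum.append((b * envelope_rate / n, abs(acc)))
    return spectrum


def peak_modulation_frequency(spectrum):
    """Return the modulation frequency with the largest magnitude."""
    return max(spectrum, key=lambda pair: pair[1])[0]


def begin_end_peaks(samples, sample_rate, segment_s):
    """Peak modulation frequency of the first and last segment_s seconds."""
    seg = int(segment_s * sample_rate)
    peaks = []
    for part in (samples[:seg], samples[-seg:]):
        envelope, rate = frame_rms_envelope(part, sample_rate)
        peaks.append(peak_modulation_frequency(modulation_spectrum(envelope, rate)))
    return tuple(peaks)


def syllable_rate(n_syllables, duration_s):
    """Assumed definition of syllable speed: syllables per second."""
    return n_syllables / duration_s
```

In this sketch, `begin_end_peaks` would be called on one presentation's samples with a chosen segment length, and `syllable_rate` would take syllable counts from a time-aligned transcript. Comparing a single spectral peak is a simplification; the study refers to patterns of the modulation spectrum, which a fuller analysis would compare across the whole band.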
November 2006
Meeting abstract. No PDF available.
November 01 2006
An analysis of acoustic deviation manner in spontaneous speech
Norimichi Hosogai
Kanae Okita
Takuya Aida
Shigeki Okawa
Chiba Inst. of Technol., 2‐17‐1 Tsudanuma, Narashino, Chiba 275‐0016, Japan
J. Acoust. Soc. Am. 120, 3293 (2006)
Citation
Norimichi Hosogai, Kanae Okita, Takuya Aida, Shigeki Okawa; An analysis of acoustic deviation manner in spontaneous speech. J. Acoust. Soc. Am. 1 November 2006; 120 (5_Supplement): 3293. https://doi.org/10.1121/1.4777852