Because of advancements of speech and language processing, a number of spoken dialogue systems have been constructed. However, because most of them adopt existing text‐to‐speech synthesizers to generate output speech, it is rather difficult to reflect all the linguistic information that is obtained during the reply sentence generation. To resolve this situation, a framework must correctly reflect higher‐level linguistic information, such as syntactic structure and discourse information, on the prosody of output speech: concept‐to‐speech conversion, where reply sentences are generated from information (to be transmitted) and converted into speech in a unified process. A spoken dialogue system for road guidance was constructed, and concept‐to‐speech conversion was realized in the system. The linguistic information of the generated sentence is handled in tag LISP form to retain the syntactic structures throughout the process. Moreover, a new method of sentence generation from concept was realized with this system: it handles a concept in phrase units and aggregates them to form a sentence. It is tested whether the linguistic information could be reflected properly on the prosody of output speech. Results of listening experiments verified the effectiveness of our proposed method.
Skip Nav Destination
,
,
,
Article navigation
November 2006
Meeting abstract. No PDF available.
November 01 2006
Concept‐to‐speech conversion for reply speech generation in a spoken dialogue system for road guidance and its prosodic control Free
Yuji Yagi;
Yuji Yagi
School of Eng., Univ. of Tokyo, Bldg. No. 2, 7‐3‐1, Hongo, Bunkyo‐ku, Tokyo, Japan
Search for other works by this author on:
Seiya Takada;
Seiya Takada
School of Eng., Univ. of Tokyo, Bldg. No. 2, 7‐3‐1, Hongo, Bunkyo‐ku, Tokyo, Japan
Search for other works by this author on:
Keikichi Hirose;
Keikichi Hirose
School of Eng., Univ. of Tokyo, Bldg. No. 2, 7‐3‐1, Hongo, Bunkyo‐ku, Tokyo, Japan
Search for other works by this author on:
Nobuaki Minematsu
Nobuaki Minematsu
School of Eng., Univ. of Tokyo, Bldg. No. 2, 7‐3‐1, Hongo, Bunkyo‐ku, Tokyo, Japan
Search for other works by this author on:
Yuji Yagi
Seiya Takada
Keikichi Hirose
Nobuaki Minematsu
School of Eng., Univ. of Tokyo, Bldg. No. 2, 7‐3‐1, Hongo, Bunkyo‐ku, Tokyo, Japan
J. Acoust. Soc. Am. 120, 3038 (2006)
Citation
Yuji Yagi, Seiya Takada, Keikichi Hirose, Nobuaki Minematsu; Concept‐to‐speech conversion for reply speech generation in a spoken dialogue system for road guidance and its prosodic control. J. Acoust. Soc. Am. 1 November 2006; 120 (5_Supplement): 3038. https://doi.org/10.1121/1.4787191
Download citation file:
51
Views
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Drawer-like tunable ventilated sound barrier
Yong Ge, Yi-jun Guan, et al.
Related Content
Development of nonvoice dialogue interface for robot systems
J. Acoust. Soc. Am. (November 2006)
Interaction between prosody and discourse structure in a simulated man–machine dialogue
J. Acoust. Soc. Am. (November 1997)
Realization of rhythmic dialogue on spoken dialogue system using paralinguistic information
J. Acoust. Soc. Am. (November 2006)
Modeling prosody in speech processing
J. Acoust. Soc. Am. (November 2006)
The novel Ventriloquist paradigm: Studying L2 phonetic learning in dialogue with experimental control over phonetic detail
J. Acoust. Soc. Am. (October 2016)