Speech has become increasingly important in human–computer interaction. Spoken dialog interfaces rely on automatic speech recognition, speech synthesis, language understanding, and dialog management. A main issue in dialog systems is that they typically are limited to pre‐programmed vocabularies and sets of sentences. The research reported here focuses on developing an adaptive spoken dialog interface capable of acquiring new linguistic units and their corresponding semantics during the human–computer interaction. The adaptive interface identifies unknown words and phrases in the users utterances and asks the user for the corresponding semantics. The user can provide the meaning or the semantic representation of the new linguistic units through multiple modalities, including speaking, typing, pointing, touching, or showing. The interface then stores the new linguistic units in a semantic grammar and creates new objects defining the corresponding semantic representation. This process takes place during natural interaction between user and computer and, thus, the interface does not have to be rewritten and compiled to incorporate the newly acquired language. Users can personalize the adaptive spoken interface for different domain applications, or according to their personal preferences. [Work supported by NSF.]
Skip Nav Destination
,
Article navigation
May 2002
Meeting abstract. No PDF available.
May 01 2002
Adaptive interface for spoken dialog
Sorin Dusan;
Sorin Dusan
Ctr. for Adv. Information Processing, Rutgers Univ., 96 Frelinghuysen Rd., Piscataway, NJ 08854
Search for other works by this author on:
James Flanagan
James Flanagan
Ctr. for Adv. Information Processing, Rutgers Univ., 96 Frelinghuysen Rd., Piscataway, NJ 08854
Search for other works by this author on:
Sorin Dusan
James Flanagan
Ctr. for Adv. Information Processing, Rutgers Univ., 96 Frelinghuysen Rd., Piscataway, NJ 08854
J. Acoust. Soc. Am. 111, 2481 (2002)
Citation
Sorin Dusan, James Flanagan; Adaptive interface for spoken dialog. J. Acoust. Soc. Am. 1 May 2002; 111 (5_Supplement): 2481. https://doi.org/10.1121/1.4778628
Download citation file:
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
Speed-dependent directivity patterns of road-traffic vehicles
Christian Dreier, Michael Vorländer
Related Content
Hidden Markov model‐based speech synthesis as a tool for constructing comunicative spoken dialog systems
J. Acoust. Soc. Am. (November 2006)
Modeling of spoken dialogue
J. Acoust. Soc. Am. (October 1996)
The dialogue terminal
J. Acoust. Soc. Am. (August 2005)
Acquisition and evaluation of a human-robot elderly spoken dialog corpus for developing computerized cognitive assessment systems
J. Acoust. Soc. Am. (October 2016)
Realization of rhythmic dialogue on spoken dialogue system using paralinguistic information
J. Acoust. Soc. Am. (November 2006)