Automatic speech recognition (ASR) in noisy environments requires innovative use of disparate technology to overcome the special demands caused by multiple speakers and minimal signal‐to‐noise ratios (SNRs). Use of nonairborne acoustic sensors for ASR imposes special requirements on speech engines due to the changes in spectral information caused by alternative pickup locations. Specifically, the relative power of voiced and nonvoiced components is often reversed when compared to the relative power of these components collected with conventional microphone technology. Our research entails the evaluation of various ASR sampling configurations in conjunction with different body location points for the physiological sensor. Physiology provides clues to speakers’ stress or cognition. For ASR, our goal is to build a suite of optimal sampling configurations for several strategic body locations (e.g., throat temple, thorax, etc.). The experimental design includes traditional word error rate and subjective task completion components. These experiments were conducted in environments with SNR ranges of 10, 3, 0, and −1 dB. These SNR ranges cover the optimal commercial ASR environment of 10 dB to ranges where commercial ASR systems with conventional microphone technology are completely ineffective. Pilot studies indicate good performance below 0‐dB SNR for a sensor that is throat located.
Skip Nav Destination
,
,
Article navigation
October 1999
Meeting abstract. No PDF available.
October 01 1999
Getting two birds with one phone: An acoustic sensor for both speech recognition and medical monitoring
James D. Bass;
James D. Bass
Army Res. Lab., 2800 Powder Mill Rd., Adelphi, MD 20783, [email protected]
Search for other works by this author on:
Michael V. Scanlon;
Michael V. Scanlon
Army Res. Lab., 2800 Powder Mill Rd., Adelphi, MD 20783, [email protected]
Search for other works by this author on:
Thomas K. Mills
Thomas K. Mills
Army Res. Lab., 2800 Powder Mill Rd., Adelphi, MD 20783, [email protected]
Search for other works by this author on:
James D. Bass
Michael V. Scanlon
Thomas K. Mills
Army Res. Lab., 2800 Powder Mill Rd., Adelphi, MD 20783, [email protected]
J. Acoust. Soc. Am. 106, 2180 (1999)
Citation
James D. Bass, Michael V. Scanlon, Thomas K. Mills; Getting two birds with one phone: An acoustic sensor for both speech recognition and medical monitoring. J. Acoust. Soc. Am. 1 October 1999; 106 (4_Supplement): 2180. https://doi.org/10.1121/1.427277
Download citation file:
Citing articles via
I can't hear you without my glasses
Tessa Bent
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Voice activity detection with voice/nonvoice classification using reliable fundamental frequency
J. Acoust. Soc. Am. (November 2006)
Development of nonvoice dialogue interface for robot systems
J. Acoust. Soc. Am. (November 2006)
Semi-automatic classification of bird vocalizations using spectral peak tracks
J. Acoust. Soc. Am. (November 2006)
Speech recognition meets bird song: A comparison of statistics‐based and template‐based techniques
J. Acoust. Soc. Am. (October 1999)
Frameless processing of bird songs using set membership identification
J. Acoust. Soc. Am. (October 2003)