Temporal modulation is frequently considered for improving the intelligibility of noisy speech owing to the importance of the envelope in conveying speech information. In this study, we evaluated the performance of temporal modulation on word scores obtained with speech-in-noise. We firstly divided the noisy speech into 16 contiguous subbands from 200 Hz to 6 kHz with bandwidth approximately 1.5xERB of an auditory filter. The temporal envelope was constructed for each subband from the absolute value. It was then low-pass filtered at 16 Hz and used as the instantaneous gain for the subband's noisy speech. We established a psychophysical test (Modified Rhyme Test) with different speech-in-noise SNR values of 0, −3, −6, and −9 dB. Eleven native speakers (age 29 ± 8) with normal hearing were recruited for the study. The signals were processed by MATLAB and presented over earphones to participants seated in an audiometric room. Comparing the processed and unprocessed noisy speech, the mean differences in word scores for SNRs of 0, −3, −6, and −9 dB were −0.4%, 1.5%, −10.5%, and 1.4%, respectively. Using ANOVA, we concluded that the temporal modulation-based algorithm does not produce a statistically significant improvement in speech intelligibility. [Work supported by NIOSH.]
Skip Nav Destination
Article navigation
October 2020
Meeting abstract. No PDF available.
October 01 2020
Investigation of a temporal modulation based method on the intelligibility of speech in speech-spectrum shaped noise
Rahim Soleymanpour;
Rahim Soleymanpour
Biomedical Eng., Univ. of Connecticut, 263 Farmington Ave., Farmington, CT 06030, [email protected]
Search for other works by this author on:
Insoo Kim
Insoo Kim
Dept. of Medicine, Univ. of Connecticut, Farmington, CT
Search for other works by this author on:
J. Acoust. Soc. Am. 148, 2652 (2020)
Citation
Rahim Soleymanpour, Anthony J. Brammer, Insoo Kim; Investigation of a temporal modulation based method on the intelligibility of speech in speech-spectrum shaped noise. J. Acoust. Soc. Am. 1 October 2020; 148 (4_Supplement): 2652. https://doi.org/10.1121/1.5147378
Download citation file:
45
Views
Citing articles via
Vowel signatures in emotional interjections and nonlinguistic vocalizations expressing pain, disgust, and joy across languages
Maïa Ponsonnet, Christophe Coupé, et al.
The alveolar trill is perceived as jagged/rough by speakers of different languages
Aleksandra Ćwiek, Rémi Anselme, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Effect of envelope signal-to-noise ratio on the intelligibility of speech in speech-spectrum shaped noise
J. Acoust. Soc. Am. (October 2020)
Using ideal binary masking based on signal-to-noise ratio of temporal amplitude envelope to improve the intelligibility of speech in noise
J Acoust Soc Am (October 2021)
Improving speech understanding for face-to-face communication in noise when wearing hearing protectors
J. Acoust. Soc. Am. (March 2024)
Self-administered, internet-enabled, modified rhyme test (MRT) for evaluating consonant confusion in remote subjects
J. Acoust. Soc. Am. (March 2024)
Relationships between the modified rhyme test and objective metrics of speech intelligibility.
J Acoust Soc Am (March 2010)