Several filterbank-based metrics have been proposed to predict speech intelligibility (SI). However, these metrics incorporate little knowledge of the auditory periphery. Neurogram-based metrics provide an alternative, incorporating knowledge of the physiology of hearing by using a mathematical model of the auditory nerve response. In this work, SI was assessed utilizing different filterbank-based metrics (the speech intelligibility index and the speech-based envelope power spectrum model) and neurogram-based metrics, using the biologically inspired model of the auditory nerve proposed by Zilany, Bruce, Nelson, and Carney [(2009), J. Acoust. Soc. Am. 126(5), 2390–2412] as a front-end and the neurogram similarity metric and spectro temporal modulation index as a back-end. Then, the correlations with behavioural scores were computed. Results showed that neurogram-based metrics representing the speech envelope showed higher correlations with the behavioural scores at a word level. At a per-phoneme level, it was found that phoneme transitions contribute to higher correlations between objective measures that use speech envelope information at the auditory periphery level and behavioural data. The presented framework could function as a useful tool for the validation and tuning of speech materials, as well as a benchmark for the development of speech processing algorithms.
Skip Nav Destination
Article navigation
January 2017
January 18 2017
Predicting phoneme and word recognition in noise using a computational model of the auditory periphery
Arturo Moncada-Torres;
Arturo Moncada-Torres
a)
1Department of Neurosciences, ExpORL,
KU Leuven
, Herestraat 49, Bus 721, 3000 Leuven, Belgium
Search for other works by this author on:
Astrid van Wieringen;
Astrid van Wieringen
1Department of Neurosciences, ExpORL,
KU Leuven
, Herestraat 49, Bus 721, 3000 Leuven, Belgium
Search for other works by this author on:
Ian C. Bruce;
Ian C. Bruce
2Department of Electrical and Computer Engineering,
McMaster University
, 1280 Main Street West, Hamilton, Ontario L8S 4K1, Canada
Search for other works by this author on:
Jan Wouters;
Jan Wouters
1Department of Neurosciences, ExpORL,
KU Leuven
, Herestraat 49, Bus 721, 3000 Leuven, Belgium
Search for other works by this author on:
Tom Francart
Tom Francart
1Department of Neurosciences, ExpORL,
KU Leuven
, Herestraat 49, Bus 721, 3000 Leuven, Belgium
Search for other works by this author on:
a)
Electronic mail: arturo.moncadatorres@med.kuleuven.be.
J. Acoust. Soc. Am. 141, 300–312 (2017)
Article history
Received:
June 15 2016
Accepted:
December 08 2016
Citation
Arturo Moncada-Torres, Astrid van Wieringen, Ian C. Bruce, Jan Wouters, Tom Francart; Predicting phoneme and word recognition in noise using a computational model of the auditory periphery. J. Acoust. Soc. Am. 1 January 2017; 141 (1): 300–312. https://doi.org/10.1121/1.4973569
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Pay-Per-View Access
$40.00
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Co-speech head nods are used to enhance prosodic prominence at different levels of narrow focus in French
Christopher Carignan, Núria Esteve-Gibert, et al.
Source and propagation modelling scenarios for environmental impact assessment: Model verification
Michael A. Ainslie, Robert M. Laws, et al.
Related Content
Hidden hearing loss and computational models of the auditory pathway: Predicting speech intelligibility decline
J Acoust Soc Am (October 2016)
Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement
J. Acoust. Soc. Am. (June 2020)
Physiological prediction of masking release for normal-hearing and hearing-impaired listeners
J Acoust Soc Am (May 2013)
Predicting the quality of enhanced wideband speech with a cochlear model
J Acoust Soc Am (September 2017)
A framework for computational modelling of interaural time difference discrimination of normal and hearing-impaired listeners
J. Acoust. Soc. Am. (August 2018)