Many objective measures have been reported to predict speech intelligibility in noise, most of which were designed and evaluated with English speech corpora. Given the different perceptual cues used by native listeners of different languages, examining whether there is any language effect when the same objective measure is used to predict speech intelligibility in different languages is of great interest, particularly when non-linear noise-reduction processing is involved. In the present study, an extensive evaluation is taken of objective measures for speech intelligibility prediction of noisy speech processed by noise-reduction algorithms in Chinese, Japanese, and English. Of all the objective measures tested, the short-time objective intelligibility (STOI) measure produced the most accurate results in speech intelligibility prediction for Chinese, while the normalized covariance metric (NCM) and middle-level coherence speech intelligibility index () incorporating the signal-dependent band-importance functions (BIFs) produced the most accurate results for Japanese and English, respectively. The objective measures that performed best in predicting the effect of non-linear noise-reduction processing in speech intelligibility were found to be the BIF-modified NCM measure for Chinese, the STOI measure for Japanese, and the BIF-modified measure for English. Most of the objective measures examined performed differently even under the same conditions for different languages.
Skip Nav Destination
Article navigation
December 2014
December 01 2014
Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English
Junfeng Li;
Junfeng Li
a)
Institute of Acoustics
, Chinese Academy of Sciences, Beijing, 100190, China
Search for other works by this author on:
Risheng Xia;
Risheng Xia
Institute of Acoustics
, Chinese Academy of Sciences, Beijing, 100190, China
Search for other works by this author on:
Dongwen Ying;
Dongwen Ying
Institute of Acoustics
, Chinese Academy of Sciences, Beijing, 100190, China
Search for other works by this author on:
Yonghong Yan;
Yonghong Yan
Institute of Acoustics
, Chinese Academy of Sciences, Beijing, 100190, China
Search for other works by this author on:
Masato Akagi
Masato Akagi
School of Information Science, Japan Advanced Institute of Science and Technology
, Ishikawa, 923-1292, Japan
Search for other works by this author on:
a)
Author to whom correspondence should be addressed. Electronic mail: [email protected]
J. Acoust. Soc. Am. 136, 3301–3312 (2014)
Article history
Received:
June 16 2014
Accepted:
October 23 2014
Citation
Junfeng Li, Risheng Xia, Dongwen Ying, Yonghong Yan, Masato Akagi; Investigation of objective measures for intelligibility prediction of noise-reduced speech for Chinese, Japanese, and English. J. Acoust. Soc. Am. 1 December 2014; 136 (6): 3301–3312. https://doi.org/10.1121/1.4901079
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
All we know about anechoic chambers
Michael Vorländer
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Performance study of ray-based ocean acoustic tomography methods for estimating submesoscale variability in the upper ocean
Etienne Ollivier, Richard X. Touret, et al.
Related Content
Intelligibility prediction for speech mixed with white Gaussian noise at low signal-to-noise ratios
J. Acoust. Soc. Am. (February 2021)
Analysis of acoustic features for speech intelligibility prediction models analysis of acoustic features for speech intelligibility prediction models
J Acoust Soc Am (October 2016)
Contributions of cochlea-scaled entropy and consonant-vowel boundaries to prediction of speech intelligibility in noise
J. Acoust. Soc. Am. (May 2012)
Comparison of ideal mask-based speech enhancement algorithms for speech mixed with white noise at low mixture signal-to-noise ratios
J. Acoust. Soc. Am. (December 2022)
Comparison of predictive measures of speech recognition after noise reduction processing
J. Acoust. Soc. Am. (September 2014)