This study examines the relationship between word error rate (WER) from an automatic speech recognition system and human raters' perceptual judgments of foreign-accentedness, fluency, and comprehensibility. In previous work, Franco et al. (1997) used HMM-derived scores based on posterior probabilities of phone segments, and Deville et al. (1999) used an HMM/ANN recognition approach, to show how the output of automatic speech recognition can be used to approximate perceptual judgments. Park and Culnan (2019) showed that neural network models can approximate human raters' perceptual judgments from the speech signal alone, and suggested that their model performed better on accentedness judgments than on fluency judgments. In this study, we first test whether the WERs of English sentences produced by three language groups (American, Korean, Chinese) differ significantly; if they do, we then analyze the correlations between WER and the perceptual judgments. The analysis uses the perceptual data from Park and Culnan (2019). The preliminary results will be used to identify important features for building more accurate automatic proficiency judgment models.
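For readers unfamiliar with the metric, WER is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the recognizer's output into the reference transcript, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition, not the authors' implementation; the function name `word_error_rate`, the lowercasing and whitespace tokenization, and the example sentences are all assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words.

    Tokenization by lowercasing and whitespace splitting is an assumption of
    this sketch; real evaluations usually apply more careful text normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words processed so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical sentences: one substitution ("sat" -> "sit") and one deletion ("the").
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because the denominator is the reference length, WER can exceed 1.0 when the recognizer inserts many extra words, which is worth keeping in mind when correlating it with bounded rating scales.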
Meeting abstract.
October 01 2020
The relationship between word error rate and perceptual judgment
Seongjin Park
Dept. of Linguist, Univ. of Arizona, Tucson, AZ 85721, [email protected]
John Culnan
Univ. of Arizona, Tucson, AZ
J. Acoust. Soc. Am. 148, 2763 (2020)
Citation
Seongjin Park, John Culnan; The relationship between word error rate and perceptual judgment. J. Acoust. Soc. Am. 1 October 2020; 148 (4_Supplement): 2763. https://doi.org/10.1121/1.5147687