Understanding how the human speech production system is related to the human auditory system has been a perennial subject of inquiry. To investigate the production–perception link, in this paper, a computational analysis has been performed using the articulatory movement data obtained during speech production with concurrently recorded acoustic speech signals from multiple subjects in three different languages: English, Cantonese, and Georgian. The form of articulatory gestures during speech production varies across languages, and this variation is considered to be reflected in the articulatory position and kinematics. The auditory processing of the acoustic speech signal is modeled by a parametric representation of the cochlear filterbank which allows for realizing various candidate filterbank structures by changing the parameter value. Using mathematical communication theory, it is found that the uncertainty about the articulatory gestures in each language is maximally reduced when the acoustic speech signal is represented using the output of a filterbank similar to the empirically established cochlear filterbank in the human auditory system. Possible interpretations of this finding are discussed.
Skip Nav Destination
Article navigation
June 2011
June 14 2011
Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures
Prasanta Kumar Ghosh;
Prasanta Kumar Ghosh
a)
Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California
, Los Angeles, California 90089
Search for other works by this author on:
Louis M. Goldstein;
Louis M. Goldstein
Department of Linguistics, University of Southern California
, Los Angeles, California 90089
Search for other works by this author on:
Shrikanth S. Narayanan
Shrikanth S. Narayanan
Signal Analysis and Interpretation Laboratory, Department of Electrical Engineering, University of Southern California
, Los Angeles, California 90089
Search for other works by this author on:
a)
Author to whom correspondence should be addressed. Electronic mail: prasantg@usc.edu.
J. Acoust. Soc. Am. 129, 4014–4022 (2011)
Article history
Received:
June 17 2010
Accepted:
March 13 2011
Citation
Prasanta Kumar Ghosh, Louis M. Goldstein, Shrikanth S. Narayanan; Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures. J. Acoust. Soc. Am. 1 June 2011; 129 (6): 4014–4022. https://doi.org/10.1121/1.3573987
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00
Citing articles via
Related Content
Recoverability constraints on gestural overlap in Georgian stop sequences
J Acoust Soc Am (May 2000)
The dynamic gammawarp auditory filterbank
J. Acoust. Soc. Am. (March 2018)
A low‐cost filterbank spectrometer for submm observations in radio astronomy
Rev Sci Instrum (May 1991)
An auditory filterbank design and its hardware implementation with DSP
J Acoust Soc Am (August 2005)
Behavioral and neurophysiological signatures of the modulation filterbank in an animal model
J Acoust Soc Am (March 2023)