We propose a four‐stage model to represent auditory response magnitude over time and across frequency channels. The first stage performs spectral estimation by computing power at each of the output frequency channels. Frequency is scaled in basilar membrane distance and the input is passed through a window having a duration inversely proportional to the frequency. The next stage has been described earlier [R. V. Shannon, J. Acoust. Soc. Am. Suppl. 1 65, S56 (1979)]; it portrays frequency analysis in the cochlea. This stage includes mechanisms of cochlear filtering (by means of two filter banks, one having sharp and the other having broad filters), nonlinear compression of the input power, and lateral suppression. The third stage models temporal adaptation in the auditory nerve. The final stage is a temporal integrator. Speech sounds analyzed by the model acquire several interesting characteristics: (i) The dynamic range of the input is greatly reduced (<15 dB), (ii) bandpass information (e.g., the one in formants, fricatives, etc.) is represented as a spectral edge at the low side of the bandpass region, (iii) bursts (especially high‐frequency plosives) acquire temporal sharpness, (iv) individual glottal pulses are clearly visible only at high frequencies. [Work supported by Institut National de la Recherche Scientifique, the Veterans Administration, and grants by N.I.H.]
Skip Nav Destination
Article navigation
November 1983
August 12 2005
Neural response patterns to speech sounds—A model Free
Pierre L. Divenyi;
Pierre L. Divenyi
Speech and Hearing Research Facility, V. A. Medical Center, Martinez, CA 94553
I.N.R.S.‐Telecommunications, Université de Québec, Verdun, Quebec H3E 1H6, Canada
Search for other works by this author on:
Robert V. Shannon;
Robert V. Shannon
Department of Otolaryngology, University of California, San Francisco, CA 94143
Search for other works by this author on:
Stephen R. Saunders
Stephen R. Saunders
Bell‐Northern Research, Verdun, Quebec, H3E 1H6, Canada
Search for other works by this author on:
Pierre L. Divenyi
Speech and Hearing Research Facility, V. A. Medical Center, Martinez, CA 94553
I.N.R.S.‐Telecommunications, Université de Québec, Verdun, Quebec H3E 1H6, Canada
Robert V. Shannon
Department of Otolaryngology, University of California, San Francisco, CA 94143
Stephen R. Saunders
Bell‐Northern Research, Verdun, Quebec, H3E 1H6, Canada
J. Acoust. Soc. Am. 74, S68 (1983)
Citation
Pierre L. Divenyi, Robert V. Shannon, Stephen R. Saunders; Neural response patterns to speech sounds—A model. J. Acoust. Soc. Am. 1 November 1983; 74 (S1): S68. https://doi.org/10.1121/1.2021097
Download citation file:
91
Views
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Related Content
Exploiting lawful variability in the signal: The TRACE model of speech perception
J. Acoust. Soc. Am. (August 2005)
The pursuit of invariance in speech signals
J. Acoust. Soc. Am. (August 2005)
Temporal organization of interarticulator muscle activity in American‐English monosyllables
J. Acoust. Soc. Am. (August 2005)
Temporal summation in the auditory system of the goldfish
J. Acoust. Soc. Am. (May 1981)
Suppressing the suppressor: Component interaction in the presence of low band‐pass noise
J. Acoust. Soc. Am. (August 2005)