The amplitude envelopes of rectified bandpass filtered speech have been found to provide useful cues for speech perception [K. W. Grant, L. D. Braida, and R. J. Renn, J. Acoust. Soc. Am. 95, 1065–1073 (1994)]. An analog terminal was built to yield 25 such envelopes from filters with center (carrier) frequencies from 150 to 4850 Hz. Each envelope was then subjected to another round of bandpass filtering and rectification to yield a modulation spectrum of up to nine channels with center (modulation) frequencies from half the carrier frequency to 700 Hz. The spectra were examined for cues for the identification of voicing, fundamental frequency, and consonants. Voicing was generally characterized by the concentration of formant energy at a single carrier and modulation frequency, corresponding to the formant and fundamental frequencies, respectively. The second formant of the front vowel /i/ and nasal release sometimes exhibited bimodal modulation spectra, suggesting multiple sources of modulation. Stop consonants and fricatives were characterized by elements scattered at high carrier and modulation frequencies whose occurrences might not coincide. Some consonants could be identified with elements at specific modulation frequencies: e.g., /g/ and /j/ suggested a 700‐Hz source modulating carriers whose frequencies depended on the following vowel.
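The two-stage analysis described above (bandpass filter and rectify to obtain a carrier-band envelope, then bandpass filter and rectify that envelope again to obtain modulation channels) can be sketched digitally as follows. This is only an illustrative sketch, not the analog terminal reported in the abstract: the second-order (biquad) filter design, the Q value, the 16 kHz sampling rate, the averaging of each channel to a single level, the test signal, and all function names are assumptions introduced for the example.

```python
import math

def bandpass(x, fs, f0, q):
    """Second-order bandpass (Audio EQ Cookbook form, unity gain at f0).
    The filter order and Q are assumptions; the abstract does not state them."""
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    y = []
    x1 = x2 = y1 = y2 = 0.0
    for xn in x:
        yn = b0 * xn + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y

def rectify(x):
    """Full-wave rectification."""
    return [abs(v) for v in x]

def modulation_spectrum(speech, fs, carrier_hz, mod_freqs_hz, q=4.0):
    """Stage 1: bandpass at the carrier frequency, then rectify (envelope).
    Stage 2: bandpass the envelope at each modulation frequency, rectify,
    and average the second half of the channel (skipping the start-up
    transient) to get one level per modulation channel."""
    envelope = rectify(bandpass(speech, fs, carrier_hz, q))
    half = len(envelope) // 2
    spectrum = {}
    for fm in mod_freqs_hz:
        channel = rectify(bandpass(envelope, fs, fm, q))
        steady = channel[half:]
        spectrum[fm] = sum(steady) / len(steady)
    return spectrum

# Hypothetical test signal: a 2000 Hz "formant" carrier amplitude-modulated
# at a 120 Hz "fundamental", 0.5 s long.
fs = 16000
tone = [
    (1.0 + 0.8 * math.cos(2 * math.pi * 120.0 * n / fs))
    * math.cos(2 * math.pi * 2000.0 * n / fs)
    for n in range(fs // 2)
]
spectrum = modulation_spectrum(tone, fs, 2000.0, [60.0, 120.0, 240.0, 480.0])
```

For this synthetic voiced-like signal, one would expect the energy to concentrate in the channel at the 2000 Hz carrier and the 120 Hz modulation frequency, which mirrors the abstract's description of voicing as energy at a single carrier and modulation frequency.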
November 01 1994
Spectral analysis of amplitude envelopes of bandpass filtered speech
King‐Leung Kong
Dept. of Psychol., Univ. of Hong Kong, Hong Kong
J. Acoust. Soc. Am. 96, 3350 (1994)
Citation
King‐Leung Kong; Spectral analysis of amplitude envelopes of bandpass filtered speech. J. Acoust. Soc. Am. 1 November 1994; 96 (5_Supplement): 3350. https://doi.org/10.1121/1.410634