Features (landmarks) in sound are located in time and spectrum. Two dimensional (time x spectrum) Gabor filters can be used to detect useful classes of these. We use a set of logarithmically spaced bandpass filters whose outputs are coded as spikes to perform spectral analysis. These are convolved with Gabor filters to create spectrotemporal feature maps. Using auditory-nerve like spikes to code zero-crossings retains precise timings and amplitude information, and makes convolution computation relatively straightforward. Gabor patches with “horizontal” bars (parallel to time axis) can be used to detect harmonicity, and patches with “vertical” bars (parallel to spectrum axis) can detect envelope modulations. Because the time resolution is maintained in the preprocessing, vertical bars may be close together (e.g., 5 to 10 ms apart), enabling detection of amplitude modulation due to unresolved harmonics. This is useful for both for speech voicing detection, and for animal utterances. Such filters may be localized in spectrum, allowing tracking of voicing energy. Filters with bars at other angles can detect frequency modulation. Using constellations of these features (and others, such as onsets), we can characterize and interpret sound sources.
Skip Nav Destination
Article navigation
April 2014
Meeting abstract. No PDF available.
April 01 2014
Spectrotemporal Gabor filters for feature detection
Leslie S. Smith;
Leslie S. Smith
Computing Sci. and Mathematics, Univ. of Stirling, Stirling, Scotland FK9 4LA,
United Kingdom
, [email protected]
Search for other works by this author on:
Andrew K. Abel
Andrew K. Abel
Computing Sci. and Mathematics, Univ. of Stirling, Stirling, Scotland FK9 4LA,
United Kingdom
, [email protected]
Search for other works by this author on:
J. Acoust. Soc. Am. 135, 2297 (2014)
Citation
Leslie S. Smith, Andrew K. Abel; Spectrotemporal Gabor filters for feature detection. J. Acoust. Soc. Am. 1 April 2014; 135 (4_Supplement): 2297. https://doi.org/10.1121/1.4877554
Download citation file:
51
Views
Citing articles via
Day-to-day loudness assessments of indoor soundscapes: Exploring the impact of loudness indicators, person, and situation
Siegbert Versümer, Jochen Steffens, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
All we know about anechoic chambers
Michael Vorländer
Related Content
Separable spectro-temporal Gabor filter bank features: Reducing the complexity of robust features for automatic speech recognition
J. Acoust. Soc. Am. (April 2015)
Spectrotemporal weighting of binaural cues: Effects of a diotic interferer on discrimination of dynamic interaural differences
J. Acoust. Soc. Am. (October 2016)
Reconfigurable Gabor Filter For Fingerprint Recognition Using FPGA Verilog
AIP Conference Proceedings (June 2009)
The effect of non-linear dimension reduction on Gabor filter bank feature space
J Acoust Soc Am (November 2013)
Characteristics of spectro-temporal modulation frequency selectivity in humans
J. Acoust. Soc. Am. (March 2017)