This paper describes a system for modeling, recognizing, and classifying sound textures. The described system translates contemporary approaches from video texture analysis, creating a unique approach in the realm of audio and music. The signal is first represented as a set of mode functions by way of the Empirical Mode Decomposition technique for time/frequency analysis, before expressing the dynamics of these modes as a linear dynamical system (LDS). Both linear and nonlinear techniques are utilized in order to learn the system dynamics, which leads to a successful distinction between unique classes of textures. Five classes of sounds comprised a data set, consisting of crackling fire, typewriter action, rainstorms, carbonated beverages, and crowd applause, drawing on a variety of source recordings. Based on this data set the system achieved a classification accuracy of 90%, which outperformed both a Mel-Frequency Cepstral Coefficient based LDS-modeling approach from the literature, as well as one based on a standard Gaussian Mixture Model classifier.
Skip Nav Destination
,
,
Article navigation
October 2012
October 03 2012
Sound texture recognition through dynamical systems modeling of empirical mode decomposition Available to Purchase
Doug Van Nort;
Doug Van Nort
a)
School of Architecture and Electronic Arts Department,
Rensselaer Polytechnic Institute
, 110 8th Street, Troy, New York 12180
Search for other works by this author on:
Jonas Braasch;
Jonas Braasch
School of Architecture and Electronic Arts Department,
Rensselaer Polytechnic Institute
, 110 8th Street, Troy, New York 12180
Search for other works by this author on:
Pauline Oliveros
Pauline Oliveros
School of Architecture and Electronic Arts Department,
Rensselaer Polytechnic Institute
, 110 8th Street, Troy, New York 12180
Search for other works by this author on:
Doug Van Nort
a)
Jonas Braasch
Pauline Oliveros
School of Architecture and Electronic Arts Department,
Rensselaer Polytechnic Institute
, 110 8th Street, Troy, New York 12180a)
Author to whom correspondence should be addressed; electronic mail: [email protected]
J. Acoust. Soc. Am. 132, 2734–2744 (2012)
Article history
Received:
September 27 2011
Accepted:
August 22 2012
Citation
Doug Van Nort, Jonas Braasch, Pauline Oliveros; Sound texture recognition through dynamical systems modeling of empirical mode decomposition. J. Acoust. Soc. Am. 1 October 2012; 132 (4): 2734–2744. https://doi.org/10.1121/1.4751535
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Related Content
Stochastic generation of hourly rainstorm events in Johor
AIP Conf. Proc. (February 2015)
Acoustics is physics: Its part in over a half‐century of teaching physics
J. Acoust. Soc. Am. (August 2005)
The depth-dependence of rain noise in the Philippine Sea
J. Acoust. Soc. Am. (May 2013)
Modeling directed weighted network based on event coincidence analysis and its application on spatial propagation characteristics
Chaos (June 2023)
The discrete charm of rain
Physics Today (January 2011)