This paper investigates the use of musical priors for the sparse expansion of music audio signals on an overcomplete dual-resolution dictionary formed from the union of two orthonormal bases, which can describe both the transient and the tonal components of the signal. More specifically, chord and metrical structure information is used to build a structured model that takes into account dependencies between coefficients of the decomposition, for both the tonal and the transient layers. Denoising is used as a proof-of-concept application for the proposed musical priors. Several configurations of the model are analyzed. Evaluation on monophonic and complex polyphonic excerpts of real music signals shows that the proposed approach yields results whose quality, measured by the signal-to-noise ratio, is competitive with state-of-the-art approaches and more coherent with the semantic content of the signal. A detailed analysis of the model in terms of sparsity and interpretability of the representation is also provided and shows that the model can give a relevant and legible representation of Western tonal music audio signals.
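As a rough illustration of the setting described in the abstract, the sketch below denoises a signal by sparse decomposition on the union of two orthonormal bases and reports quality as a signal-to-noise ratio. It is not the authors' method: it uses a plain l1 penalty solved by iterative soft thresholding (ISTA) with no chord or metrical priors, and it assumes a DCT-II basis as a stand-in for the tonal layer and a Dirac (identity) basis as a stand-in for the transient layer. The function names and the parameter `lam` are hypothetical and chosen only for this example.

```python
import numpy as np

def dct_basis(n):
    """Orthonormal DCT-II basis; columns are atoms (assumed tonal stand-in)."""
    k = np.arange(n)[:, None]  # frequency index
    t = np.arange(n)[None, :]  # time index
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (t + 0.5) * k / n)
    mat[0, :] /= np.sqrt(2.0)  # rescale the constant atom to unit norm
    return mat.T

def soft_threshold(x, tau):
    """Proximal operator of tau * ||x||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def ista_hybrid(y, lam, n_iter=200):
    """Minimize 0.5 * ||y - D a||^2 + lam * ||a||_1 with D = [DCT, Dirac].

    Returns the tonal and transient layer estimates (hypothetical helper).
    """
    n = y.size
    dictionary = np.hstack([dct_basis(n), np.eye(n)])  # n x 2n, overcomplete
    # For a union of two orthonormal bases, D D^T = 2 I, so the Lipschitz
    # constant of the gradient is 2 and a step of 1/2 is safe.
    step = 0.5
    coeffs = np.zeros(2 * n)
    for _ in range(n_iter):
        grad = dictionary.T @ (dictionary @ coeffs - y)
        coeffs = soft_threshold(coeffs - step * grad, step * lam)
    tonal = dictionary[:, :n] @ coeffs[:n]
    transient = dictionary[:, n:] @ coeffs[n:]
    return tonal, transient

def snr_db(reference, estimate):
    """Signal-to-noise ratio in decibels of an estimate against a reference."""
    noise = reference - estimate
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(noise ** 2))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 256
    # Toy signal: one cosine atom (tonal) plus one spike (transient).
    clean = 5.0 * dct_basis(n)[:, 10] + 3.0 * np.eye(n)[:, 100]
    noisy = clean + 0.1 * rng.standard_normal(n)
    tonal, transient = ista_hybrid(noisy, lam=0.3)
    print("input SNR (dB): ", snr_db(clean, noisy))
    print("output SNR (dB):", snr_db(clean, tonal + transient))
```

In this toy setup each coefficient is thresholded independently. A structured model of the kind the abstract describes would instead couple coefficients, for instance through group or neighborhood penalties informed by chords and meter.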
July 2013
July 11 2013
Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors
Hélène Papadopoulos a)
Laboratoire des Signaux et Systèmes, UMR 8506, CNRS-SUPELEC-Univ Paris-Sud, 91172 Gif-sur-Yvette Cedex, France
Matthieu Kowalski
Laboratoire des Signaux et Systèmes, UMR 8506, CNRS-SUPELEC-Univ Paris-Sud, 91172 Gif-sur-Yvette Cedex, France
a) Author to whom correspondence should be addressed. Electronic mail: [email protected]
J. Acoust. Soc. Am. 134, 666–685 (2013)
Article history
Received: March 04 2012
Accepted: May 08 2013
Citation
Hélène Papadopoulos, Matthieu Kowalski; Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors. J. Acoust. Soc. Am. 1 July 2013; 134 (1): 666–685. https://doi.org/10.1121/1.4807821