Monaural sound source separation is the process of separating sound sources from a single-channel mixture. In mixtures of pitched musical instruments, overlapping harmonics pose a significant challenge to source separation and reconstruction. One standard method for resolving overlapped harmonics rests on the assumption that harmonics of the same source have correlated amplitude envelopes: common amplitude modulation (CAM). Under CAM, overlapped harmonics are approximated using the amplitude envelope of the non‐overlapped harmonics of the same note. CAM therefore assumes that non‐overlapped harmonics from the same note are available and that their amplitude envelopes are similar to those of the overlapped harmonics. This is not always the case. A technique is proposed for harmonic temporal envelope estimation based on the idea of scene completion. The system learns the harmonic envelope for each instrument’s notes from the non‐overlapped harmonics of other notes played by that instrument, wherever they occur in the recording. This model is then used to reconstruct the envelopes of the overlapped harmonics. This allows reconstruction of completely overlapped notes, yet does not require predetermined instrument models. Experiments show that the proposed algorithm performs better than an existing CAM-based system when the harmonics of pitched instruments are strongly overlapped.
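The contrast between the two envelope-estimation ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the boolean `clean` mask marking frames known to be free of overlap, the least-squares gain fit, the peak normalization, and the requirement that all envelopes share one length are assumptions made for illustration only. The abstract does not say how the gain is chosen when no clean frames exist, so the sketch simply returns a zero gain in that case.

```python
import numpy as np


def fit_gain(shape, observed, clean):
    """Least-squares gain g minimizing ||g * shape - observed|| over clean frames.

    `clean` is a hypothetical boolean mask of frames assumed free of overlap.
    Returns 0.0 when no usable clean frames exist (an assumption of this sketch).
    """
    s = shape[clean]
    o = observed[clean]
    denom = float(np.dot(s, s))
    return float(np.dot(s, o)) / denom if denom > 0 else 0.0


def cam_envelope(ref_env, observed, clean):
    """CAM-style estimate: borrow the envelope shape of a non-overlapped
    harmonic of the SAME note and rescale it to the observed harmonic."""
    return fit_gain(ref_env, observed, clean) * ref_env


def learned_envelope(other_note_envs, observed, clean):
    """Scene-completion-style estimate: build a template from the same
    harmonic in OTHER notes of the same instrument (peak-normalized and
    averaged), then rescale it to the observed harmonic."""
    normed = [env / env.max() for env in other_note_envs]
    template = np.mean(normed, axis=0)
    return fit_gain(template, observed, clean) * template


# Toy usage: a decaying harmonic that is overlapped from frame 20 onward.
t = np.linspace(0.0, 1.0, 50)
shape = np.exp(-3.0 * t)
true_env = 0.5 * shape
observed = true_env.copy()
observed[20:] += 0.3                      # interference from another source
clean = np.arange(50) < 20                # frames assumed overlap-free

cam_est = cam_envelope(2.0 * shape, observed, clean)
learned_est = learned_envelope([shape, 3.0 * shape], observed, clean)
```

In this toy case both estimates rely on the same gain fit and differ only in where the envelope shape comes from: the same note for the CAM-style estimate, other notes of the instrument for the learned template. The second source remains available even when every harmonic of the target note is overlapped.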
October 2010
Meeting abstract. No PDF available.
October 18 2010
Reconstructing individual monophonic instruments from musical mixtures using scene completion.
Jinyu Han
EECS Dept., Northwestern Univ., 2133 Sheridan Rd., Rm. 3‐323, Evanston, IL 60208, [email protected]
Bryan Pardo
EECS Dept., Northwestern Univ., 2133 Sheridan Rd., Rm. 3‐323, Evanston, IL 60208, [email protected]
J. Acoust. Soc. Am. 128, 2309 (2010)
Citation
Jinyu Han, Bryan Pardo; Reconstructing individual monophonic instruments from musical mixtures using scene completion. J. Acoust. Soc. Am. 1 October 2010; 128 (4_Supplement): 2309. https://doi.org/10.1121/1.3508126