Musical instrument timbre has been intensively investigated through dissimilarity rating tasks. It is now well known that audio descriptors such as attack time and spectral centroid, among others, account well for the dimensions of the timbre spaces underlying these dissimilarity ratings. Nevertheless, it remains very difficult to reproduce these perceptual judgments from distances computed on acoustical representations such as the waveform or the spectrogram. Interestingly, biologically inspired representations based on spectrotemporal modulation spectra, such as spectrotemporal receptive fields (STRF), have been shown to be well suited to reproducing human dissimilarity ratings (Patil et al., 2012). Here, we propose a meta-analysis of seven previous studies on timbre spaces in light of these recently developed representations. We implemented a computational framework that optimizes the correlation between the perceptual results and distances obtained from a set of different acoustic representations, in particular the STRF. We observed that distances computed from spectrotemporal modulation representations provide the best correlation with the perceptual results across the seven timbre spaces. Finally, we highlighted the parts of the representations that contribute the most to the correlation, suggesting new insights into the underlying perceptual metrics. [Supported by Canada Research Chair, NSERC (RGPIN-2015-05208, RGPAS-478121-15), (RGPIN-262808-2012), and EU MSCf (Project MIM, H2020-MSCA-IF-2014, GA no. 659).]
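The abstract does not specify the distance function or the optimizer, so the following is only a minimal sketch of the general idea of tuning a distance so that it correlates with dissimilarity ratings. It assumes a weighted Euclidean distance, Pearson correlation as the objective, and synthetic stand-in data; the names `fit_weights`, `features`, and `dissim` are hypothetical and are not taken from the study.

```python
import numpy as np
from scipy.optimize import minimize


def pairwise_sq_diffs(features):
    """Squared per-feature differences for every pair i < j: (n_pairs, n_features)."""
    i, j = np.triu_indices(features.shape[0], k=1)
    return (features[i] - features[j]) ** 2


def neg_pearson(log_w, sq_diffs, target):
    """Negative Pearson correlation between weighted distances and ratings."""
    # Weights are parameterized as exp(log_w) so that they stay positive.
    dists = np.sqrt(sq_diffs @ np.exp(log_w))
    return -np.corrcoef(dists, target)[0, 1]


def fit_weights(features, dissim):
    """Assumed metric: weighted Euclidean. Returns (weights, best correlation)."""
    i, j = np.triu_indices(features.shape[0], k=1)
    target = dissim[i, j]                    # upper triangle of the rating matrix
    sq_diffs = pairwise_sq_diffs(features)
    x0 = np.zeros(features.shape[1])         # start from the unweighted distance
    bounds = [(-10.0, 10.0)] * features.shape[1]
    res = minimize(neg_pearson, x0, args=(sq_diffs, target),
                   method="L-BFGS-B", bounds=bounds)
    return np.exp(res.x), -res.fun


# Synthetic stand-in data (NOT from the study): 12 sounds, 5 feature channels.
rng = np.random.default_rng(0)
features = rng.normal(size=(12, 5))
true_w = np.array([3.0, 1.0, 0.1, 0.1, 0.1])

# Build a symmetric "perceptual" dissimilarity matrix from the hidden weights.
diff = features[:, None, :] - features[None, :, :]
dissim = np.sqrt((diff ** 2) @ true_w)

# Baseline: correlation of the unweighted Euclidean distance with the ratings.
i, j = np.triu_indices(12, k=1)
r_base = np.corrcoef(np.sqrt(pairwise_sq_diffs(features).sum(axis=1)),
                     dissim[i, j])[0, 1]

weights, r_fit = fit_weights(features, dissim)
print("baseline r:", round(float(r_base), 3), "optimized r:", round(float(r_fit), 3))
print("relative weights:", np.round(weights / weights.max(), 3))
```

In a sketch like this, the relative size of the learned weights gives a rough indication of which feature channels contribute most to the correlation, loosely mirroring the last step described in the abstract. With real data, `features` would hold a flattened acoustic representation per sound and `dissim` the averaged human ratings.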
March 2018
Meeting abstract.
March 01 2018
Human dissimilarity ratings of musical instrument timbre: A computational meta-analysis
Etienne Thoret
Schulich School of Music, McGill Univ., 555 Rue Sherbrooke Ouest, Montreal, QC H3A 1E3, Canada
Baptiste Caramiaux
CNRS LRI UMR 8623, Univ. Paris-Sud, Paris, France
Philippe Depalle
Schulich School of Music, McGill Univ., Montreal, QC, Canada
Stephen McAdams
Schulich School of Music, McGill Univ., Montreal, QC, Canada
J. Acoust. Soc. Am. 143, 1745–1746 (2018)
Citation
Etienne Thoret, Baptiste Caramiaux, Philippe Depalle, Stephen McAdams; Human dissimilarity ratings of musical instrument timbre: A computational meta-analysis. J. Acoust. Soc. Am. 1 March 2018; 143 (3_Supplement): 1745–1746. https://doi.org/10.1121/1.5035697