We determined how the perceived naturalness of music and speech signals (male and female talkers) was affected by various forms of linear filtering, some of which were intended to mimic the spectral “distortions” introduced by transducers such as microphones, loudspeakers, and earphones. The filters introduced spectral tilts and ripples of various types, variations in upper and lower cutoff frequency, and combinations of these. All of the differently filtered signals (168 conditions) were intermixed in random order within one block of trials. Levels were adjusted to give approximately equal loudness in all conditions. Listeners were required to judge the perceptual quality (naturalness) of the filtered signals on a scale from 1 to 10. For spectral ripples, perceived quality decreased with increasing ripple density up to 0.2 and with increasing ripple depth. Spectral tilts also degraded quality, and the effects were similar for positive and negative tilts. Ripples and/or tilts degraded quality more when they extended over a wide frequency range (87–6981 Hz) than when they extended over subranges. Low- and mid-frequency ranges were roughly equally important for music, but the mid-range was most important for speech. For music, the highest quality was obtained for the broadband signal (55–16 854 Hz). Increasing the lower cutoff frequency from 55 Hz resulted in a clear degradation of quality. There was also a distinct degradation as the upper cutoff frequency was decreased from 16 854 Hz. For speech, there was a marked degradation when the lower cutoff frequency was increased from 123 to 208 Hz and when the upper cutoff frequency was decreased from 10 869 Hz. Typical telephone bandwidth (313 to 3547 Hz) gave very poor quality.
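To make the two main filter shapes concrete, the following is a minimal illustrative sketch of a spectral tilt and a spectral ripple expressed as gain-versus-frequency functions. It is not the authors' implementation. The function names, the use of an octave (log2) frequency axis, the sinusoidal ripple shape, the zero-gain point at the band centre, and the default band edges (taken from the 87–6981 Hz range quoted above) are all assumptions made for illustration. The ripple density here is in ripples per octave, so it is not directly comparable to the density value quoted in the abstract.

```python
import math

# Hypothetical helpers: none of these names come from the paper.

def tilt_gain_db(freq_hz, slope_db_per_octave, f_lo=87.0, f_hi=6981.0):
    """Gain (dB) of a spectral tilt that is linear on a log2 frequency axis.

    Assumption: the tilt is confined to [f_lo, f_hi], is 0 dB at the
    geometric centre of that band, and holds its edge value outside it.
    """
    f = min(max(freq_hz, f_lo), f_hi)      # clamp to the tilted band
    centre = math.sqrt(f_lo * f_hi)        # geometric centre frequency
    return slope_db_per_octave * math.log2(f / centre)

def ripple_gain_db(freq_hz, depth_db, ripples_per_octave, f_lo=87.0, f_hi=6981.0):
    """Gain (dB) of a sinusoidal spectral ripple on a log2 frequency axis.

    Assumption: depth_db is the peak-to-valley depth, and the ripple is
    applied only inside [f_lo, f_hi] (0 dB elsewhere).
    """
    if not (f_lo <= freq_hz <= f_hi):
        return 0.0
    octaves_above_lo = math.log2(freq_hz / f_lo)
    phase = 2.0 * math.pi * ripples_per_octave * octaves_above_lo
    return 0.5 * depth_db * math.sin(phase)

def db_to_amplitude(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)

def combined_gain(freq_hz, slope_db_per_octave, depth_db, ripples_per_octave):
    """Linear amplitude factor for a tilt and a ripple applied together."""
    total_db = (tilt_gain_db(freq_hz, slope_db_per_octave)
                + ripple_gain_db(freq_hz, depth_db, ripples_per_octave))
    return db_to_amplitude(total_db)
```

Under these assumptions, a positive slope boosts high frequencies and a negative slope boosts low frequencies by the same amount, which mirrors the positive and negative tilts compared in the study. In practice such a gain function would be sampled at the bin frequencies of a signal's spectrum and multiplied in before resynthesis.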
July 2003
July 03 2003
Perceived naturalness of spectrally distorted speech and music
Brian C. J. Moore
Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
Chin-Tuan Tan
Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
J. Acoust. Soc. Am. 114, 408–419 (2003)
Article history
Received: January 30 2002
Accepted: April 07 2003
Citation
Brian C. J. Moore, Chin-Tuan Tan; Perceived naturalness of spectrally distorted speech and music. J. Acoust. Soc. Am. 1 July 2003; 114 (1): 408–419. https://doi.org/10.1121/1.1577552