An accurate pair of head-related impulse responses (HRIR) is necessary to render a realistic and immersive virtual auditory space. Since personalized HRIRs are challenging to measure, a method of using two machine-learning models to separately estimate the shape and time delays of the HRIRs based on an individual's anthropometric measurements of the head and pinnae is proposed. The investigation evaluates the perceptual accuracy of the resulting HRIRs with two subjective listening studies. First, an ABX test was conducted to determine if listeners could distinguish between auralizations made with an actual HRIR from the CIPIC database and the personalized HRIR based on the corresponding anthropometric measurements. In the second study, subjects performed localization tests in virtual reality to determine the perceptual accuracy of their personalized HRIR compared to an average HRIR. Each set of tests included one of three auralizations (noise, music, or speech) played in five source locations. Perceptual accuracy was evaluated by comparing the localization errors for the average and estimated HRIRs. The stimuli were repeated four times to investigate if subjects improved over time. The subjects were also asked to rate their listening experience to assess factors such as externalization and difficulty of localization.
Skip Nav Destination
Article navigation
October 2020
Meeting abstract. No PDF available.
October 01 2020
Investigating the perceptual accuracy of machine-learning generated personalized head-eelated impulse responses
Ming Yang Lee;
Ming Yang Lee
Mech. Eng., The Cooper Union for the Advancement of Sci. and Art, 40 Cooper Square, New York, NY 10003, [email protected]
Search for other works by this author on:
Melody Baglione
Melody Baglione
Mech. Eng., The Cooper Union for the Advancement of Sci. and Art, New York, NY
Search for other works by this author on:
J. Acoust. Soc. Am. 148, 2768 (2020)
Citation
Ming Yang Lee, Martin S. Lawless, Melody Baglione; Investigating the perceptual accuracy of machine-learning generated personalized head-eelated impulse responses. J. Acoust. Soc. Am. 1 October 2020; 148 (4_Supplement): 2768. https://doi.org/10.1121/1.5147705
Download citation file:
95
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Rapid detection of fish calls within diverse coral reef soundscapes using a convolutional neural network
Seth McCammon, Nathan Formel, et al.
Related Content
Personalizing head-related transfer functions using anthropometric measurements by combining two machine-learning models
J Acoust Soc Am (March 2019)
Extracting the frequencies of the pinna spectral notches in measured head related impulse responses
J. Acoust. Soc. Am. (July 2005)
Identification of perceptually relevant methods of inter-aural time difference estimation
J. Acoust. Soc. Am. (August 2017)
Head-related transfer function recommendation based on perceptual similarities and anthropometric features
J. Acoust. Soc. Am. (December 2020)
Identifying a perceptually relevant estimation method of the inter-aural time delay
J Acoust Soc Am (May 2017)