The spatial filtering effect brought on by sound propagation from the sound source to the outer ear is referred to as the head-related transfer function (HRTF). The personalization of HRTF is essential to enhance the personalized immersive audio experience in virtual and augmented reality. Our work aims to employ deep learning to predict the customized HRTF from anthropometric measurements. However, existing measured HRTF databases each employ a different geographic sampling, making it difficult to combine these databases into training data-hungry deep learning methods while each of them only contains dozens of subjects. Following our previous work, we use a neural field, a neural network that maps the spherical coordinates to the magnitude spectrum to represent each subject’s set of HRTFs. We constructed a generative model to learn the latent space across subjects using such a consistent representation of HRTF across datasets. In this work, by learning the mapping of the anthropometric measurements to the latent space and then reconstructing the HRTF, we further investigate the neural field representation to carry out HRTF personalization. Thanks to the grid-agnostic nature of our method, we are able to train on combined datasets and even validate the performance on grids unseen during training.
Skip Nav Destination
Article navigation
March 2023
Article Contents
March 01 2023
Grid-agnostic personalized head-related transfer function modeling with neural fields Free
You Zhang;
You Zhang
Elec. and Comput. Eng., Univ. of Rochester, 500 Wilson Blvd, Rochester, NY 14620, [email protected]
Search for other works by this author on:
Zhiyao Duan
Zhiyao Duan
Elec. and Comput. Eng., Univ. of Rochester, Rochester, NY
Search for other works by this author on:
You Zhang
Elec. and Comput. Eng., Univ. of Rochester, 500 Wilson Blvd, Rochester, NY 14620, [email protected]
Yuxiang Wang
Mark Bocko
Zhiyao Duan
Elec. and Comput. Eng., Univ. of Rochester, Rochester, NY
J. Acoust. Soc. Am. 153, A125 (2023)
Citation
You Zhang, Yuxiang Wang, Mark Bocko, Zhiyao Duan; Grid-agnostic personalized head-related transfer function modeling with neural fields. J. Acoust. Soc. Am. 1 March 2023; 153 (3_supplement): A125. https://doi.org/10.1121/10.0018387
Download citation file:
323
Views
Citing articles via
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Effects of network selection and acoustic environment on bounding-box object detection of delphinid whistles using a deep learning tool
Peter C. Sugarman, Elizabeth L. Ferguson, et al.
Introduction to the special issue on: Advances in soundscape: Emerging trends and challenges in research and practice
Francesco Aletta, Bhan Lam, et al.
Related Content
Personalizing head-related transfer functions using anthropometric measurements by combining two machine-learning models
J. Acoust. Soc. Am. (March 2019)
System for automatic personalization of head-related transfer functions based on computer vision, photo-anthropometry, and inference from a database
J. Acoust. Soc. Am. (November 2013)
Individualization of head-related transfer functions by the principle component analysis based on anthropometric measurements
J. Acoust. Soc. Am. (October 2016)
The quantification of head-related transfer function’s dependency on anthropometric features
J. Acoust. Soc. Am. (October 2021)
Personalization of head-related transfer functions in the median plane based on spectral correction with pinna angle
J. Acoust. Soc. Am. (October 2016)