When a broadband source of radiated noise transits past a fixed hydrophone, a Lloyd's mirror constructive/destructive interference pattern can be observed in the output spectrogram. By taking the spectrum of a (log) spectrum, the power cepstrum detects the periodic structure of the Lloyd's mirror fringe pattern by generating a sequence of pulses located at the fundamental quefrency and its multiples. The fundamental quefrency, which is the reciprocal of the frequency difference between adjacent destructive interference fringes, equates to the multipath delay time. An experiment is conducted where a motorboat transits past a hydrophone located about 1 m above the seafloor in very shallow water (20 m deep). The hydrophone has a frequency bandwidth of 90 kHz, and its output is sampled at 250 kHz. A cepstrogram database is compiled from multiple vessel transits, and its cepstrum-based feature vectors (along with ground-truth range data) form the input to train a convolutional neural network (CNN) so that it can predict the source range relative to the hydrophone for other (“unseen”) vessel transits. The CNN provides an accurate prediction of the instantaneous source range even when the range estimate from conventional multipath passive ranging is biased, which occurs at low grazing angles (<).
Skip Nav Destination
,
,
Article navigation
December 2019
December 31 2019
Convolutional neural network for single-sensor acoustic localization of a transiting broadband source in very shallow water Available to Purchase
Special Collection:
Acoustic Localization
Eric L. Ferguson;
Eric L. Ferguson
a)
1
Australian Centre for Field Robotics, The University of Sydney
, New South Wales, 2006, Australia
Search for other works by this author on:
Stefan B. Williams;
Stefan B. Williams
1
Australian Centre for Field Robotics, The University of Sydney
, New South Wales, 2006, Australia
Search for other works by this author on:
Craig T. Jin
Craig T. Jin
2
Computer Audio and Research Laboratory, The University of Sydney, New South Wales
, 2006, Australia
Search for other works by this author on:
Eric L. Ferguson
1,a)
Stefan B. Williams
1
Craig T. Jin
2
1
Australian Centre for Field Robotics, The University of Sydney
, New South Wales, 2006, Australia
2
Computer Audio and Research Laboratory, The University of Sydney, New South Wales
, 2006, Australia
a)
Electronic mail: [email protected]
J. Acoust. Soc. Am. 146, 4687–4698 (2019)
Article history
Received:
December 27 2018
Accepted:
June 08 2019
Citation
Eric L. Ferguson, Stefan B. Williams, Craig T. Jin; Convolutional neural network for single-sensor acoustic localization of a transiting broadband source in very shallow water. J. Acoust. Soc. Am. 1 December 2019; 146 (6): 4687–4698. https://doi.org/10.1121/1.5138594
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Related Content
Source range and depth estimation of propeller cavitation bubble collapse transients in a multipath environment
J. Acoust. Soc. Am. (October 2023)
Multitask convolutional neural network for acoustic localization of a transiting broadband source using a hydrophone array
J. Acoust. Soc. Am. (July 2021)
On first rahmonic amplitude in the analysis of synthesized aperiodic voice signals
J. Acoust. Soc. Am. (November 2006)
Experimental study of shout detection with the rahmonic structure
Proc. Mtgs. Acoust. (May 2013)
Shouted speech detection using hidden markov model with rahmonic and mel-frequency cepstrum coefficients
J. Acoust. Soc. Am. (October 2016)