Many marine mammals produce highly nonlinear frequency modulations. Determining the time-frequency support of these sounds offers various applications, which include recognition, localization, and density estimation. This study introduces a low parameterized automated spectrogram segmentation method that is based on a theoretical probabilistic framework. In the first step, the background noise in the spectrogram is fitted with a Chi-squared distribution and thresholded using a Neyman–Pearson approach. In the second step, the number of false detections in time-frequency regions is modeled as a binomial distribution, and then through a Neyman–Pearson strategy, the time-frequency bins are gathered into regions of interest. The proposed method is validated on real data of large sequences of whistles from common dolphins, collected in the Bay of Biscay (France). The proposed method is also compared with two alternative approaches: the first is smoothing and thresholding of the spectrogram; the second is thresholding of the spectrogram followed by the use of morphological operators to gather the time-frequency bins and to remove false positives. This method is shown to increase the probability of detection for the same probability of false alarms.
Automated segmentation of linear time-frequency representations of marine-mammal sounds
Also at: Grenoble Images Parole Signal Automatique Lab, Grenoble Institute of Technology and Centre National de la Recherche Scientifique, 11 rue des Mathématiques, 38402 Saint-Martin d'Hères, France.
Florian Dadouchi, Cedric Gervaise, Cornel Ioana, Julien Huillery, Jérôme I. Mars; Automated segmentation of linear time-frequency representations of marine-mammal sounds. J. Acoust. Soc. Am. 1 September 2013; 134 (3): 2546–2555. https://doi.org/10.1121/1.4816579
Download citation file: