The goal is to learn the features and the representation adapted for cetacean sound dynamics without any priors. Thus, we develop data driven model to generate voicing and click of cetaceans audio signals. We learn representation and features of stationary or nonstationary emission using neural network from raw audio. We use different types of convolutions (causal, with strides, with dilation [1]), or gradient inversion [2]. Experiments are conducted on various kind of calls of humpback whales from nips4b challenge [3] or Orca whale. We compare the topology for transient encoding on Physeters and Inia g. For each model, we detail the resulting filters and discuss on the topology. We acknowledge Region PACA and NortekMED for Roger’s Phd grant, & DGA and Région Haut de France for Ferrari’s Phd grant. [1] Oord, Dieleman, Zen, Simonyan, Vinyals, Graves et al. Wavenet : A generative model for raw audio, arXiv:1609.03499, 2016 [2] Balestriero, Roger, Glotin, Baraniuk, Semi-Supervised Learning via New Deep Network Inversion, arXiv:1711.04313, 2017 [3] Glotin, LeCun, Mallat et al. Proc. 1st wkp on Neural Information Processing for Bioacoustics NIPS4B, joint to NIPS Alberta USA, 2013 http://sabiod.org/nips4b/challenge2.html, http://sabiod.org/NIPS4B2013_book.pdf
Skip Nav Destination
Article navigation
September 2018
Meeting abstract. No PDF available.
September 01 2018
Towards the topology of autoencoder of calls versus clicks of marine mammal
Vincent Roger;
Vincent Roger
Univ. Toulon, AMU, CNRS, LIS, Toulon, France
Search for other works by this author on:
Maxence Ferrari;
Maxence Ferrari
Univ. Toulon, AMU, CNRS, LIS, Toulon, France
Search for other works by this author on:
Ricard Marxer;
Ricard Marxer
Univ. Toulon, AMU, CNRS, LIS, Toulon, France
Search for other works by this author on:
Faicel Chamroukhi;
Faicel Chamroukhi
LMNO UMR CNRS 6139, Caen, France
Search for other works by this author on:
Herve Glotin
Herve Glotin
Univ. Toulon, AMU, CNRS, LIS, USTV, Ave. Université, BP20132, La Garde 83957, France, [email protected]
Search for other works by this author on:
J. Acoust. Soc. Am. 144, 1777–1778 (2018)
Citation
Vincent Roger, Maxence Ferrari, Ricard Marxer, Faicel Chamroukhi, Herve Glotin; Towards the topology of autoencoder of calls versus clicks of marine mammal. J. Acoust. Soc. Am. 1 September 2018; 144 (3_Supplement): 1777–1778. https://doi.org/10.1121/1.5067859
Download citation file:
87
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Rapid detection of fish calls within diverse coral reef soundscapes using a convolutional neural network
Seth McCammon, Nathan Formel, et al.
Related Content
Deep learning for ethoacoustical mapping: Application to a single Cachalot long term recording on joint observatories in Vancouver Island
J Acoust Soc Am (September 2018)
Ultra high definition 3D tracking of biosonar of Amazon River dolphins and others mammals into the wild
J Acoust Soc Am (September 2018)
Joint observatories following a single male Cachalot during 12 weeks —The Yukusam story
J Acoust Soc Am (September 2018)