We propose a method for automatically detecting calls of the frog Diasporus hylaeformis (Eleutherodactylidae) in audio recordings. The method uses the loudness, timbre, and pitch of the vocalizations to identify the calls of the most prevalent individual in a recording. First, the loudness of the signal is computed to locate the sections that contain the focal individual's vocalizations. Second, the timbre of the signal is used to recognize vocalizations. Finally, two regularities that we observed in the sounds produced by this species are used to discriminate between the calls of the most prevalent individual and other calls: each individual tends to vocalize at an almost constant pitch, and different individuals use different pitches. Results show that the method is robust to background noise (including calls of other individuals of the same species), noise induced by handling the microphone, and human voice, and that it adapts well to variations in microphone level during the recording.
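The abstract does not say how loudness, timbre, or pitch are computed, so the following Python sketch only illustrates the three-stage idea under stated assumptions. Frame energy relative to the loudest frame stands in for loudness, spectral flatness for timbre, and the frequency of the spectral peak for pitch. The function names, thresholds, and the synthetic test signal are all hypothetical and are not taken from the authors' method.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Slice a 1-D signal into overlapping frames (one frame per row)."""
    n = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n)[:, None]
    return x[idx]

def detect_focal_frames(x, sr, frame_len=1024, hop=512,
                        loud_db=-20.0, flatness_max=0.3, pitch_tol=0.03):
    """Return (mask, pitch): frames attributed to the most prevalent caller.

    All thresholds are illustrative assumptions, not published values.
    """
    frames = frame_signal(x, frame_len, hop) * np.hanning(frame_len)
    spec = np.abs(np.fft.rfft(frames, axis=1)) ** 2 + 1e-12
    freqs = np.fft.rfftfreq(frame_len, 1.0 / sr)

    # Step 1 (loudness proxy): frame level relative to the loudest frame,
    # so the decision does not depend on the absolute recording level.
    level_db = 10.0 * np.log10(spec.sum(axis=1))
    loud = level_db > level_db.max() + loud_db

    # Step 2 (timbre proxy): spectral flatness, low for tonal sounds
    # and high for noise-like sounds.
    flatness = np.exp(np.log(spec).mean(axis=1)) / spec.mean(axis=1)
    tonal = flatness < flatness_max

    # Step 3 (pitch proxy): frequency of the spectral peak. The focal pitch
    # is taken as the median over candidate frames, and only frames close
    # to it are kept (constant pitch per individual, different pitches
    # across individuals).
    pitch = freqs[spec.argmax(axis=1)]
    candidates = loud & tonal
    if not candidates.any():
        return candidates, pitch
    focal_pitch = np.median(pitch[candidates])
    same_pitch = np.abs(pitch - focal_pitch) <= pitch_tol * focal_pitch
    return candidates & same_pitch, pitch

# Synthetic example: four calls at 3000 Hz (the prevalent caller), two
# calls at 3600 Hz (another individual), and weak background noise.
sr = 16000
rng = np.random.default_rng(0)
x = 0.01 * rng.standard_normal(2 * sr)
t = np.arange(2048) / sr
for start in (4096, 10240, 16384, 22528):
    x[start:start + 2048] += 1.0 * np.sin(2 * np.pi * 3000 * t)
for start in (7168, 19456):
    x[start:start + 2048] += 0.8 * np.sin(2 * np.pi * 3600 * t)

mask, pitch = detect_focal_frames(x, sr)
print("frames attributed to the focal caller:", np.flatnonzero(mask))
```

In this toy signal the 3600 Hz calls pass the loudness and timbre stages and are rejected only by the pitch stage, which mirrors the role the abstract assigns to pitch. A real recording would likely need a perceptual loudness model and a more robust pitch estimator than the spectral peak.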
October 2011
Meeting abstract. No PDF available.
October 01 2011
Automatic detection of vocalizations of the frog Diasporus hylaeformis in audio recordings
Arturo Camacho
Esc. de CC. de la Comp. e Inf., Univ. de Costa Rica, P.O. Box 2060, San José, Costa Rica, arturo.camacho@ecci.ucr.ac.cr
Adrián García-Rodríguez
Esc. de Biología, Univ. de Costa Rica
Federico Bolaños
Esc. de Biología, Univ. de Costa Rica
J. Acoust. Soc. Am. 130, 2500 (2011)
Citation
Arturo Camacho, Adrián García-Rodríguez, Federico Bolaños; Automatic detection of vocalizations of the frog Diasporus hylaeformis in audio recordings. J. Acoust. Soc. Am. 1 October 2011; 130 (4_Supplement): 2500. https://doi.org/10.1121/1.3654948