Automatically detecting animal signals in soundscape recordings benefits passive acoustic monitoring programs undertaken for research or conservation. Numerous detection algorithms exist, each typically optimized for certain situations (i.e., certain animal sound types and ambient noise conditions). Adding to this library of algorithms, this paper developed, tested, and compared three detectors for Omura's whale vocalizations (15–62 Hz; <15 s) in marine soundscape recordings that contained noise from other animals, wind, earthquakes, ships, and seismic surveys. All three detectors processed spectrographic representations of the recordings; the specific methods were spectrogram cross-correlation, entropy computation, and spectral intensity “blob” tracing. The latter two were general-purpose detectors adapted for Omura's whale vocalizations. Detector complexity and post-processing effort varied across the three detectors. Performance was assessed qualitatively using demonstrative examples and quantitatively using receiver operating characteristic (ROC) and precision-recall curves. While the spectrogram cross-correlation method dominated the quantitative assessment, the qualitative assessment showed that all three detectors offered promising performance.
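To illustrate the general idea behind an entropy-based spectrogram detector, the following is a minimal sketch and not the detector evaluated in this paper. It computes the Shannon entropy of each spectrogram time slice within the 15–62 Hz band quoted above; a tonal signal concentrates energy in a few frequency bins and so lowers the entropy relative to broadband noise. The function name `band_entropy`, the sampling rate, the FFT length, the hop size, and the synthetic test signal are all assumptions chosen for illustration.

```python
import numpy as np

def band_entropy(x, fs, nfft=256, hop=128, fmin=15.0, fmax=62.0):
    """Shannon entropy (bits) of each spectrogram slice within [fmin, fmax] Hz.

    Hypothetical helper for illustration; all parameter defaults are assumed.
    Returns the entropy per frame, the frame start times (s), and the
    number of frequency bins in the band.
    """
    window = np.hanning(nfft)
    n_frames = 1 + (len(x) - nfft) // hop
    freqs = np.fft.rfftfreq(nfft, d=1.0 / fs)
    band = (freqs >= fmin) & (freqs <= fmax)
    entropy = np.empty(n_frames)
    for i in range(n_frames):
        frame = x[i * hop : i * hop + nfft] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        p = power[band]
        p = p / p.sum()                      # treat the slice as a probability distribution
        entropy[i] = -np.sum(p * np.log2(p + 1e-12))
    frame_times = np.arange(n_frames) * hop / fs
    return entropy, frame_times, int(band.sum())

# Synthetic example (assumed values): 60 s of white noise sampled at 250 Hz,
# with a 40 Hz tone present between 20 s and 30 s.
rng = np.random.default_rng(0)
fs = 250
nfft = 256
t = np.arange(60 * fs) / fs
x = 0.5 * rng.standard_normal(t.size)
tone = (t >= 20) & (t < 30)
x[tone] += np.sin(2 * np.pi * 40.0 * t[tone])

entropy, frame_times, n_bins = band_entropy(x, fs, nfft=nfft)

# Frames lying entirely inside or entirely outside the tone interval.
frame_end = frame_times + nfft / fs
in_tone = (frame_times >= 20) & (frame_end <= 30)
in_noise = (frame_end <= 20) | (frame_times >= 30)

print("mean entropy, noise only:", entropy[in_noise].mean())
print("mean entropy, tone present:", entropy[in_tone].mean())
```

In this sketch, a detector would flag frames whose entropy falls below a threshold. Choosing that threshold, and handling noise sources that are themselves narrowband, is where the real design effort lies and is not addressed here.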

