A concern when applying machine learning techniques to bioacoustics is whether classifiers learn the categories for which they were trained, because they can also learn incidental information such as the characteristics of specific recording equipment or noise environments. This question is examined in the context of identifying delphinid species by their echolocation clicks. To reduce the ambiguity between species classification performance and other confounding factors, two species whose clicks can be readily distinguished were used in this study: Pacific white-sided and Risso's dolphins. A subset of data collected between 2006 and 2012 by autonomous acoustic recorders at seven sites in the Southern California Bight was selected. Cepstral-based features were extracted for each echolocation click, and Gaussian mixture models were used to classify groups of 100 clicks. One hundred Monte Carlo three-fold experiments were conducted to examine classification performance where fold composition was determined by acoustic encounter, recorder characteristics, or recording site. The error rate increased from 6.1% when grouped by acoustic encounter to 18.1%, 46.2%, and 33.2% for grouping by equipment, equipment category, and site, respectively. A noise compensation technique reduced the error for the four grouping schemes to 2.7%, 4.4%, 6.7%, and 11.4%, respectively, a reduction in error rate of 56%–86%.
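
The fold-composition idea can be illustrated with a minimal sketch. It is not the authors' code: the function names, the site labels, and the round-robin assignment are assumptions made for illustration. The sketch assigns whole groups (here, hypothetical recording sites) to one of three folds so that no group is split between training and test data, repeats the random assignment 100 times, and then recomputes the relative error reduction from the rates quoted above.

```python
import random


def grouped_folds(group_ids, n_folds=3, seed=None):
    """Return a fold index per sample; all samples of a group share one fold.

    group_ids holds one label per sample (e.g. encounter, recorder, or site).
    The round-robin assignment of shuffled groups is an illustrative choice.
    """
    rng = random.Random(seed)
    groups = sorted(set(group_ids))
    rng.shuffle(groups)
    fold_of_group = {g: i % n_folds for i, g in enumerate(groups)}
    return [fold_of_group[g] for g in group_ids]


def relative_reduction(before, after):
    """Relative reduction in error rate, in percent."""
    return 100.0 * (before - after) / before


# Hypothetical labels: ten groups of clicks recorded at seven sites.
site_of_sample = ["A", "A", "B", "C", "C", "D", "E", "F", "G", "G"]

# One random grouped partition per Monte Carlo trial.
partitions = [grouped_folds(site_of_sample, n_folds=3, seed=trial)
              for trial in range(100)]

# Error rates quoted in the abstract: (without, with) noise compensation.
reported = {
    "encounter": (6.1, 2.7),
    "equipment": (18.1, 4.4),
    "equipment category": (46.2, 6.7),
    "site": (33.2, 11.4),
}
reductions = {name: relative_reduction(b, a) for name, (b, a) in reported.items()}
```

Computed from the rounded rates, the reductions fall between roughly 56% and 85.5%, in line with the reported range of 56%–86%.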
