Wild beluga whistle source levels (SLs) are estimated from 52 three-dimensional (3D) localized calls using a 4-hydrophone array. The probability distribution functions of the root-mean-square (rms) SL in the time domain, and the peak, the strongest 3-dB, and 10-dB SLs from the spectrogram, were non-Gaussian. The average rms SL was 143.8 ± 6.7 dB re 1 μPa at 1 m. SL spectral metrics were, respectively, 145.8 ± 8 dB, 143.2 ± 7.1 dB, and 138.5 ± 6.9 dB re 1 μPa2·Hz−1 at 1 m.

Beluga whales (Delphinapterus leucas) have a circumpolar distribution in arctic and sub-arctic waters that ranges from 47°N to 80°N latitude, along the coasts of Alaska, Canada, Greenland, and Russia. Some beluga whale sub-populations, such as those from Eastern Hudson Bay, Ungava Bay, and St. Lawrence Estuary, are on the list of endangered species of the Committee on the Status of Endangered Wildlife in Canada.1 Therefore, these populations benefit from recovery programs that include scientific studies to determine the causes of their decline and fill knowledge gaps on their ecology.

Passive Acoustic Monitoring (PAM) is nowadays a well-established method to study cetaceans through the sounds they produce.2 PAM has been extensively used to study free-ranging beluga whales at various locations. Especially, PAM has been successfully applied to study the properties of beluga echolocation clicks3 and click-trains,4 to describe their call repertoire and characterize their properties (frequency content, inflexion points, duration).5–7 It has also been used to determine their space-time distribution8–10 and to estimate the masking effects of their sounds by anthropogenic noises.11,12

One of the basic parameters involved in PAM applications is the energy level of the emitting source. This determines, among others, the communication space of the animals, the signal-to-noise ratio (SNR) and masking, as well as the PAM detection range and probability function for density estimation. To our knowledge, the source level (SL) of wild beluga whistles has not been documented so far. In this letter, we contribute to fill this knowledge gap by estimating the SL of whistles produced by free-ranging beluga whales from the Saguenay-St. Lawrence marine park (Canada).

Section 2 presents the recording setup and methods used for localizing the whistles and for estimating their SLs. Results are then presented and discussed in Sec. 3.

The data were recorded in May 2009 at the Saguenay fjord mouth, in the St. Lawrence estuary [Fig. 1(a)]. They were acquired by a cabled coastal array connected to a data acquisition board (IOTech DaqBoard 3000/USB, Cleveland, Ohio) that simultaneously recorded the signals coming from four hydrophones (HTI 96 MIN, High Tech Inc., Gulfport, MS). The signals were digitized with 16 bit and a sampling frequency of 48 kHz, and saved as WAV files on a PC. The array was tilted according to the natural slope of the fjord and had an aperture of about 200 m. The four hydrophones were at 28, 69, 134, and 141 m depths, and were kept about 4 m above the seafloor. This setup allows the absolute localization of belugas that are close to the array. Precise hydrophone positions were calculated from acoustic localization using known sources emitted at eight different positions around the array. A more detailed description of this setup and of the calibration procedure can be found in Roy et al.3 

Fig. 1.

(Color online) (a) Position of the array at the Saguenay fjord mouth in the St. Lawrence estuary; (b) sound speed profile measured during the deployment of the array; (c) location of the 52 annotated beluga whistles and of the 4 hydrophones.

Fig. 1.

(Color online) (a) Position of the array at the Saguenay fjord mouth in the St. Lawrence estuary; (b) sound speed profile measured during the deployment of the array; (c) location of the 52 annotated beluga whistles and of the 4 hydrophones.

Close modal

We manually annotated 52 beluga whale whistles by visually scanning the spectrogram of the recorded data. These whistles were chosen because they were not mixed with other sounds, they had good SNRs, and they were recorded on all four hydrophones. Whistles were then bandpass filtered between 1.5 and 5 kHz, the frequency band where they were concentrated.

The three-dimensional (3D) location of the belugas was performed by computing the time-difference-of-arrivals (TDOAs) of the whistles on the four hydrophones. The TDOAs were estimated from a modified cross-correlation function known as cross-recurrence plot analysis (CRPA) between pairs of hydrophones.13 This CRPA method performs a similarity analysis between the waveform received by each hydrophone, and detects a series of samples that are similar on each pair of hydrophones. The TDOA is then estimated from this detected series. Such an approach appears to be more robust than the classical autocorrelation when signals have high amplitude modulations13 as it is the case with cetacean whistles. The six TDOAs obtained for each whistle were then validated and misestimated TDOAs were discarded.

For the localization we proceeded as in Roy et al.3 The space around the array was discretized according to a 3D grid with a mesh size of 10 m horizontally and 5 m vertically. Then, we calculated the theoretical TDOAs by propagating sources at the nodes of the grid to the location of the hydrophones. A sound speed of 1444 m s−1 based on measured sound speed profile was used [Fig. 1(b)] and direct straight trajectories were assumed. These were reasonable assumptions because the belugas were close to the array [Fig. 1(c)]. Finally, the source location was determined by minimizing the Euclidean distance between the measured TDOAs and the set of precalculated TDOAs.

As PAM users are usually interested in knowing the maximum detection range of PAM systems and the active communication space of cetaceans, we focused on the four following SL metrics: root-mean-square (rms) SL, spectral peak, strongest 3-dB, and 10-dB SLs. The SLs were calculated for each hydrophone as follows: SL=RL+20log10(r), with RL the corresponding received level and r the range between the sound source and each hydrophone. Transmission loss was assumed to be spherical spreading. We did not compensate for the absorption in the water and fading due to multipath propagation.

The rms SL was computed in a window that makes up 80% of the total cumulative energy of the signal.14 However, one has to be aware that this metric can sometimes be biased, as it is very dependent of the length of the averaging window that is used and of the criteria chosen to determine this latter.14 

The three other SL metrics were measured on the spectrogram, whose settings included a 1024 points running window, a 90% overlap, and 16 384 zero-padded fast Fourier transform [Fig. 2(a)]. Only the pixels of the spectrogram associated to the whistles and having a SNR > 10 dB were considered for the SL estimation. To identify these particular pixels, we first calculated the spectrogram of the noise with the same parameters as above, in a 0.1 s window following each whistle [Fig. 2(b)]. The mean level of this noise spectrogram was then computed in each frequency band to estimate the average noise level by frequency band [Fig. 2(c)]. We then subtracted this average noise level from the initial spectrogram [Fig. 2(a)] to get the SNR of each time-frequency bin of the spectrogram [Fig. 2(d)]. This SNR spectrogram was thresholded to retain pixels with a SNR > 10 dB, forming the mask of the whistle [Fig. 2(e), red]. Finally, we applied the mask on the initial spectrogram of the whistle [Fig. 2(a)] to extract the pixels of the spectrogram associated to the whistles and having a SNR > 10 dB [Fig. 2(f)].

Fig. 2.

(Color online) Steps followed to extract from the spectrogram all the pixels associated to a beluga whale whistle and having a SNR > 10 dB.

Fig. 2.

(Color online) Steps followed to extract from the spectrogram all the pixels associated to a beluga whale whistle and having a SNR > 10 dB.

Close modal

The spectral peak SL estimation follows the one proposed by Rossong and Terhune,15 First, we measured the peak SL on the loudest of the four hydrophones and at the frequency of the highest amplitude in the whistle. The peak SL was then measured at this frequency for the other three hydrophones.

The strongest 3- and 10-dB SLs from the thresholded spectrogram were extracted independently for each hydrophone. These are not classical SL metrics but we believe they are of interest because spectrogram segmentation methods are often used to detect whistling activities by PAM and these SLs provide information on the level of the useful pixels for the detection. We first measured the peak SL on the thresholded spectrogram of each hydrophone and then extracted all the pixels that were within the range [SLpeak−3 dB; SLpeak] and [SLpeak−10 dB; SLpeak] (Fig. 3).

Fig. 3.

(Color online) Example of thresholded spectrogram (a) with the strongest 3-dB SLs (b) and 10-dB SLs (c).

Fig. 3.

(Color online) Example of thresholded spectrogram (a) with the strongest 3-dB SLs (b) and 10-dB SLs (c).

Close modal

For each of the 52 annotated whistles, the 4 SL metrics were estimated for each hydrophone and their probability density functions (PDFs) were computed (Fig. 4, Table 1). The mean whistle rms SL is 143.8 (±6.7) dB re 1 μParms at 1 m, the mean spectral peak SL is 145.8 (±8) dB re 1 μPa2 Hz−1 at 1 m, and the strongest 3- and 10-dB SLs have an average of 143.2 (±7.1) dB re 1 μPa2 Hz−1 at 1 m and 138.5 (±6.9) dB re 1 μPa2 Hz−1 at 1 m, respectively.

Fig. 4.

Distribution of the estimated SLs on the four hydrophones: (a) rms SLs; (b) spectral peak SL; (c) strongest 3-dB SLs; (d) strongest 10-dB SLs.

Fig. 4.

Distribution of the estimated SLs on the four hydrophones: (a) rms SLs; (b) spectral peak SL; (c) strongest 3-dB SLs; (d) strongest 10-dB SLs.

Close modal
Table 1.

Percentiles of the rms SLs (dB re 1 μPa at 1 m), and spectral peak SLs, strongest 3-dB and 10-dB SLs (dB re 1 μPa2 Hz−1 at 1 m).

PercentilesMin10th20th30th40th50th60th70th80th90th100th
SLrms 123.5 133.3 135.4 136.6 137.9 139.3 140.8 143.2 147.5 151.0 158.2 
SLspectral peak 123.7 133.3 135.1 137.6 139.4 141.1 143.5 145.9 149.1 153.3 161.4 
SLstrongest 3 dB 120.7 131.6 134.4 135.9 137.7 139.2 140.8 142.4 146.2 150.1 161.5 
SLstrongest 10 dB 113.7 127.5 129.9 131.7 133.3 134.7 136.4 138.5 141.2 144.8 161.5 
PercentilesMin10th20th30th40th50th60th70th80th90th100th
SLrms 123.5 133.3 135.4 136.6 137.9 139.3 140.8 143.2 147.5 151.0 158.2 
SLspectral peak 123.7 133.3 135.1 137.6 139.4 141.1 143.5 145.9 149.1 153.3 161.4 
SLstrongest 3 dB 120.7 131.6 134.4 135.9 137.7 139.2 140.8 142.4 146.2 150.1 161.5 
SLstrongest 10 dB 113.7 127.5 129.9 131.7 133.3 134.7 136.4 138.5 141.2 144.8 161.5 

SL metric PDFs are non-Gaussian and have a first mode at lower SLs followed by a secondary mode or a plateau at higher SLs [Figs. 4(a)–4(d)]. These particular PDFs might result from propagation effects that modulate the whistle amplitude. For instance, the average shift between the peak-to-peak SL PDF (not shown here) and the rms SL PDF [Fig. 4(a)] was found to be of 20.6 dB, whereas, without modulation, a pure cosine signal would have produced a shift of 9 dB. Another hypothesis to explain the SL PDFs is that the SL depends on whistle type and of variations in the emission levels as for killer whales.16 Further analysis with much more whistles would be needed to confirm these hypotheses. At last, the directivity of the beluga whistles might also be involved in the dispersion of these PDFs. Such a directivity has already been observed for other species.17,18 With directional sources, off beam axis receiving angles and corresponding lower SLs are more probable than on-axis ones. This could produce the observed SL PDFs.

Overall, the observed whistle SLs fall within the range of previous estimates for other species. Minimum, maximum, median, and mean whistle SLs are similar to those of white beaked dolphins,18 killer whales,16 bottlenose, and spotted dolphins.19 

The authors would like to thank the DGA for supporting the scholarship of O.L.B.

1.
DFO
, “
Recovery Strategy for the beluga whale (Delphinapterus leucas) St. Lawrence Estuary population in Canada. Species at Risk Act Recovery Strategy Series
,” Fisheries and Oceans Canada, Ottawa (
2012
), pp.
1
87
.
2.
D. K.
Mellinger
,
K. M.
Stafford
,
S.
Moore
,
R. P.
Dziak
, and
H.
Matsumoto
, “
Fixed passive acoustic observation methods for cetaceans
,”
Oceanography
20
,
36
45
(
2007
).
3.
N.
Roy
,
Y.
Simard
, and
C.
Gervaise
, “
3D tracking of foraging belugas from their clicks: Experiment from a coastal hydrophone array
,”
Appl. Acoust.
71
,
1050
1056
(
2010
).
4.
O.
Le Bot
,
J. I.
Mars
,
C.
Gervaise
, and
Y.
Simard
, “
Rhythmic analysis for click train detection and source separation with examples on beluga whales
,”
Appl. Acoust.
95
,
37
49
(
2015
).
5.
B. L.
Sjare
and
T. G.
Smith
, “
The vocal repertoire of white whales, Delphinapterus leucas, summering in Cunningham Inlet, Northwest Territories
,”
Can. J. Zoo.
64
,
407
415
(
1986
).
6.
J.
Karlsen
,
A.
Bisther
,
C.
Lydersen
,
T.
Haug
, and
K.
Kovacs
, “
Summer vocalisations of adult male white whales (Delphinapterus leucas) in Svalbard, Norway
,”
Polar Biol.
25
,
808
817
(
2002
).
7.
E. G.
Chmelnitsky
and
S. H.
Ferguson
, “
Beluga whale, Delphinapterus leucas, vocalizations from the Churchill River, Manitoba, Canada
,”
J. Acoust. Soc. Am.
131
,
4821
4835
(
2012
).
8.
Y.
Simard
,
N.
Roy
,
S.
Giard
,
C.
Gervaise
,
M.
Conversano
, and
N.
Ménard
, “
Estimating whale density from their whistling activity: Example with St. Lawrence beluga
,”
Appl. Acoust.
71
,
1081
1086
(
2010
).
9.
M. O.
Lammers
,
M.
Castellote
,
R. J.
Small
,
S.
Atkinson
,
J.
Jenniges
,
A.
Rosinski
,
J. N.
Oswald
, and
C.
Garner
, “
Passive acoustic monitoring of Cook Inlet beluga whales (Delphinapterus leucas)
,”
J. Acoust. Soc. Am.
134
,
2497
2504
(
2013
).
10.
Y.
Simard
,
L.
Loseto
,
S.
Gautier
, and
N.
Roy
, “
Monitoring beluga habitat use and underwater noise levels in the Mackenzie Estuary: Application of passive acoustics in summers 2011 and 2012
,”
Can. Tech. Rep. Fish., Aquat. Sci.
3068
,
1
57
(
2014
).
11.
P. M.
Scheifele
,
S.
Andrew
,
R. A.
Cooper
,
M.
Darre
,
F. E.
Musiek
, and
L.
Max
, “
Indication of a Lombard vocal response in the St. Lawrence River beluga
,”
J. Acoust. Soc. Am.
117
,
1486
1492
(
2005
).
12.
C.
Gervaise
,
Y.
Simard
,
N.
Roy
,
B.
Kinda
, and
N.
Ménard
, “
Shipping noise in whale habitat: Characteristics, sources, budget, and impact on belugas in Saguenay–St. Lawrence Marine Park hub
,”
J. Acoust. Soc. Am.
132
,
76
89
(
2012
).
13.
O.
Le Bot
,
J. I.
Mars
, and
C.
Gervaise
, “
Time-difference-of-arrival estimation based on cross recurrence plots, with application to underwater acoustic signals
,” in
Recurrence Plots and Their Quantifications: Expanding Horizons
(
Springer
,
New York
,
2016
), Chap. 14.
14.
P. T.
Madsen
, “
Marine mammals and noise: Problems with root mean square sound pressure levels for transients
,”
J. Acoust. Soc. Am.
117
,
3952
3957
(
2005
).
15.
M. A.
Rossong
and
J. M.
Terhune
, “
Source levels and communication-range models for harp seal (Pagophilus groenlandicus) underwater calls in the Gulf of St. Lawrence, Canada
,”
Can. J. Zool.
87
,
609
617
(
2009
).
16.
P. J. O.
Miller
, “
Diversity in sound pressure levels and estimated active space of resident killer whale vocalizations
,”
J. Comp. Physiol. A
192
,
449
459
(
2006
).
17.
M. O.
Lammers
and
W. W. L.
Au
, “
Directionality in the whistles of Hawaiian spinner dolphins (Stenella longirostris): A signal feature to cue direction of movement?,”
Marine Mammal Sci.
19
,
249
264
(
2003
).
18.
M. H.
Rasmussen
,
M. O.
Lammers
,
K.
Beedholm
, and
L. A.
Miller
, “
Source levels and harmonic content of whistles in white-beaked dolphins (Lagenorhynchus albirostris)
,”
J. Acoust. Soc. Am.
120
,
510
517
(
2006
).
19.
A. S.
Frankel
,
D.
Zeddies
,
P.
Simard
, and
D.
Mann
, “
Whistle source levels of free-ranging bottlenose dolphins and Atlantic spotted dolphins in the Gulf of Mexico
,”
J. Acoust. Soc. Am.
135
,
1624
1631
(
2014
).