This paper presents B-CLEAN-SC, a variation of CLEAN-SC for broadband sources. In contrast to CLEAN-SC, which “deconvolves” the beamforming map for each frequency individually, B-CLEAN-SC processes frequency intervals. Instead of performing a deconvolution iteration at the location of the maximum level, B-CLEAN-SC performs it at the location of the over-frequency-averaged maximum to improve the location estimation. The method is validated and compared to standard CLEAN-SC on synthetic cases and real-world experiments for broad- and narrowband sources. It improves the source reconstruction at low and high frequencies and suppresses noise, while it only increases the need for memory and not computational effort.
1. Introduction
Conventional beamforming is a well-established tool to identify and quantify sound sources on complex objects, such as cars, trains, and aircrafts (Merino-Martínez , 2019). Naive methods estimate the sound power by virtually steering the cross spectral matrix (CSM) to different focus points to obtain an independent estimation for each focus point. The resulting beamforming map is convoluted with the array's point spread function (PSF), which limits the resolution at low frequencies by the array's aperture and at high frequencies by aliasing that results from the discrete microphone spacing. More advanced methods exist, such as gridless methods (Chardon, 2023; Goudarzi, 2023; Kujawski and Sarradj, 2022; Sarradj, 2022). However, they are computationally expensive and often only proven to work on academic examples.
There exist a variety of “deconvolution” methods that aim to reconstruct the true source distribution from the so-called dirty beamforming maps. While advanced source reconstruction methods such as DAMAS (Brooks and Humphreys, 2006; Chardon , 2021; Ehrenfried and Koop, 2007) exist, CLEAN-SC (Sijtsma, 2007) is the gold standard in industrial environments (Ahlefeldt, 2017; Ahlefeldt , 2023) because it is fast and robust.
CLEAN-SC solves the deconvolution iteratively at each individual frequency. It assumes a dominant source per iteration so that the dirty map is dominated by its PSF. It then estimates that the source is located at the location of maximum power spectral density (PSD) in the map and measures the coherence between the location and all other locations. It then subtracts the source from the CSM and dirty map. It then repeats the process to find additional sources until a stopping criterion is met. This process works well for spatially compact sources (Merino-Martínez , 2020) at medium frequencies, where the PSF shows pronounced mainlobes and low sidelobes. At low frequencies (compared to the array's aperture), the PSF of two adjacent sources will overlap and form a single blob in the dirty map. Thus, the maximum of the dirty map is no longer located at a true source position, but between multiple source positions. At these low frequencies, CLEAN-SC fails to identify the true sources and reconstructs the PSD wrongly. At high frequencies, the focus grid can often no longer resolve the mainlobe. Additionally, grating-lobes are present in the dirty map that are of the same magnitude as the mainlobe. Thus, the maximum is often positioned at a grating-lobe, which results in noisy CLEAN-SC maps at these high frequencies. The improved algorithm HR-CLEAN-SC (Sijtsma , 2017) aims to solve the low-frequency issues of CLEAN-SC, which requires an initial CLEAN-SC solution and an additional iteration to obtain a solution. The spatial resolution of HR-CLEAN-SC is approximately doubled compared to CLEAN-SC, but less so if diagonal removal (DR) is applied.
Recently, a variation of the gridless CSM-fitting method global optimization (GO) was introduced for broadband sources (Goudarzi, 2023; Malgoezar , 2017) based on the observation that sources typically have a constant location over frequency (Goudarzi , 2021). Broadband GO showed that introducing the condition of a shared location over frequency smooths out local minima in the optimization cost function, which are caused by the side- and grating-lobes of the array's PSF. While the results were superior compared to CLEAN-SC and standard GO, the computational effort makes the method currently not suitable for industry applications (Chardon, 2023; Goudarzi, 2023).
This paper introduces broadband-CLEAN-SC (B-CLEAN-SC), which aims to relax the problems of CLEAN-SC at high and low frequencies by adapting the idea of broadband GO: the processing of multiple frequencies at once, so that the sidelobes cancel out and true source positions can be identified. This is done by introducing a simple change to the CLEAN-SC algorithm: Instead of processing each frequency individually, B-CLEAN-SC processes frequency intervals at once (but still obtains smallband solutions). Here, the only difference lies in the determination of the location, from which the source power is sampled. B-CLEAN-SC averages the dirty maps over the frequency interval and uses the location of the maximum averaged source power. It then performs a standard CLEAN-SC iteration for each of the frequencies in the interval with individual source powers per frequency but at the shared location. Thus, the reconstruction at lower frequencies benefits from the resolution at higher frequencies, and the averaging of side- and grating-lobes stabilizes the process at high frequencies.
2. Methodology
This section presents the standard CLEAN-SC algorithm and the proposed B-CLEAN-SC algorithm.
2.1 Standard CLEAN-SC
FUNCTION CLEAN-SC : |
for f in f do |
while do {or another arbitrary stopping criterion is met} |
{find pos. of max. amplitude} |
{find steering vector to the corresp. loc.} |
if DR then |
{diag. matrix from steering vector} |
{iteratively find steering vector if DR} |
end if |
{calc. CSM for the identified source} |
{subtract identified source from dirty CSM} |
{subtract corresponding beamforming result from dirty map} |
{add identified source strength to CLEAN-SC output} |
end while |
end for |
return |
FUNCTION CLEAN-SC : |
for f in f do |
while do {or another arbitrary stopping criterion is met} |
{find pos. of max. amplitude} |
{find steering vector to the corresp. loc.} |
if DR then |
{diag. matrix from steering vector} |
{iteratively find steering vector if DR} |
end if |
{calc. CSM for the identified source} |
{subtract identified source from dirty CSM} |
{subtract corresponding beamforming result from dirty map} |
{add identified source strength to CLEAN-SC output} |
end while |
end for |
return |
2.2 B-CLEAN-SC
FUNCTION B-CLEAN-SC : |
while do |
{change to the CLEAN-SC algorithm} |
if DR then |
end if |
end while |
return |
FUNCTION B-CLEAN-SC : |
while do |
{change to the CLEAN-SC algorithm} |
if DR then |
end if |
end while |
return |
Note that the position is not necessarily located on the mainlobe of a dominant source for all frequencies if the sources have a strong frequency-dependent power. Especially at low frequencies, where the PSF of a dominant source may cover all other sources and dominate the estimated power at their true positions, this would lead to an overestimation of their power, and a subtraction of the mainlobe, when subtracting coherent portions of the map (Sijtsma , 2017). To relax this issue, a low gain factor α is needed, so that the number of necessary B-CLEAN-SC iterations increases. Since only the initial calculation of the dirty map is computationally expensive, the extra iterations are not performance relevant.
3. Results
This section presents three different cases. Section 3.1 presents a synthetic example that aims to clarify the behavior of CLEAN-SC and B-CLEAN-SC. Section 3.2 presents an open wind tunnel experiment with ground truth, so that the methods can be evaluated quantitatively. Last, Sec. 3.3 presents a closed wind tunnel experiment without ground truth, based on which the methods are evaluated qualitatively. Throughout this section, CLEAN-SC will be performed with DR, a maximum of iterations per frequency, where NS is the number of true sources, and a gain factor of per iteration. B-CLEAN-SC will be performed with DR, a maximum of iterations, and per iteration. To reduce the visual complexity of the results, beamforming maps are obtained only in one dimension for case 1 and two dimensions for cases 2 and 3 with steering vector formulation III (Sarradj, 2012). The results will be presented over the Helmholtz number , where D is the array's aperture, and a is the speed of sound.
3.1 Synthetic results
Case 1 is a synthetic one-dimensional (1D) example that highlights the differences between standard CLEAN-SC and B-CLEAN-SC. The array is located at , y = 0. There are three sources Si at , y = 0.5. The CSM is calculated at 256 frequencies, . The focus grid is located at , y = 0.5, . The PSD of S1 linearly increases over frequency from to . The PSD of S2 linearly decreases in the same way so that S2 dominates at low frequencies and S1 dominates at high frequencies. Additionally, S3 is a smallband source that is only present at at . For B-CLEAN-SC, the frequencies are processed in intervals of .
Figure 1 shows the results of case 1. Figures 1(e) and 1(f) show the estimated PSD, integrated from the same colored region of interest (ROI) in Figs. 1(a) and 1(d). The black lines represent noise, integrated from the area that does not correspond to any ROI indicating beamforming and deconvolution artifacts. Additionally, a magenta line shows the integration of all sources within the map, as an estimation of the overall sound power. The ground truth is depicted with dotted lines for reference.
CLEAN-SC reconstructs the dominant source S2 well down to , below which the maximum within the dirty map is estimated with a wrong level along the sidelobes and then at the edges of the focal range. For S1, the PSD reconstruction works well down to , below which CLEAN-SC gradually underestimates its power and gradually misses the correct location. The smallband source S3 is reconstructed perfectly. B-CLEAN-SC perfectly estimates the sources' locations. The PSDs are reconstructed well throughout the frequency range, except for an underestimation of S1 at . For B-CLEAN-SC, there is no noise.
3.2 Experiment with ground truth
Case 2 is a generic open wind tunnel experiment at Mach (M) = 0.06 with a streamlined monopole speaker that is moved to three different locations (Goudarzi, 2023; Goudarzi , 2021) with different spectra and known ground truth. The sources are located at . The array consists of 7 × 7 equidistantly spaced microphones with and is located at outside of the flow. The equidistant two-dimensional (2D) focus grid covers at z = 0. The sampling rate is , and .
Figure 2 shows the results for case 2. Figures 2(a) and 2(d) show that CLEAN-SC results in noisier OASPL maps than B-CLEAN-SC. When integrating the maps over x, Figs. 2(b) and 2(e) show that CLEAN-SC is able to determine the correct location down to . B-CLEAN-SC correctly determines throughout the frequency range. Strong sidelobes are reconstructed as “ghost sources” that move closer to the true source position with increasing frequency. Figures 2(c) and 2(f) show the spectrum estimation. Source S1 is estimated well by CLEAN-SC at . Source S2 is estimated well down to , below which it can no longer be separated from S3, estimated well down to . Below this frequency, the overall power was estimated well but could not be attributed to a true source position, so it was integrated as noise. Both S1 and S3 are reconstructed down a signal-to-signal ratio (SSR) of around , which was used as an iteration stopping criterion. B-CLEAN-SC shows similar results, with improved reconstructions of S2 and S3 and lower levels of noise.
. | 1a . | 2 . | 4 . | 8 . | 16b . | 32 . | 64 . | 128 . |
---|---|---|---|---|---|---|---|---|
Correct PSD (%) | 61.7 | 62.8 | 62.5 | 65.6 | 66.7 | 66.7 | 66.1 | 64.8 |
Mean error (dB) | 2.3 | 2.2 | 2.5 | 2.5 | 1.3 | 2.5 | 3.9 | 8.9 |
SNR (dB) | 14.5 | 14.4 | 14.8 | 16.4 | 17.8 | 18.4 | 22.6 | 19.1 |
. | 1a . | 2 . | 4 . | 8 . | 16b . | 32 . | 64 . | 128 . |
---|---|---|---|---|---|---|---|---|
Correct PSD (%) | 61.7 | 62.8 | 62.5 | 65.6 | 66.7 | 66.7 | 66.1 | 64.8 |
Mean error (dB) | 2.3 | 2.2 | 2.5 | 2.5 | 1.3 | 2.5 | 3.9 | 8.9 |
SNR (dB) | 14.5 | 14.4 | 14.8 | 16.4 | 17.8 | 18.4 | 22.6 | 19.1 |
Corresponds to the CLEAN-SC result in Fig. 2, all with DR, , N = 30.
Corresponds to the B-CLEAN-SC result, all with DR, , N = 30.
3.3 Wind tunnel experiment
Case 3 is a closed wind tunnel measurement of a Dornier 728 (Do728) at (Ahlefeldt, 2013). The 2D focus grid is rotated so that it covers and follows the wing. The spiral array with consists of 149 microphones and is located approximately from the wing. The signal is sampled at , and the CSM is sampled for 128 frequencies at . Since there exists no ground truth, the results will be discussed only qualitatively.
Figures 3(a) and 3(d) show the estimated OASPL . The results correspond well to the geometric features of the wing, but CLEAN-SC shows noise in the top right corner, and sources such as the flap side edge are not well localized. Figures 3(b) and 3(e) show the PSD(y, f), integrated over x and normalized per frequency. Thus, the only sources that can be confused in this depiction are an outboard slat track and the flap side edge at . The model is depicted for reference, so that the estimated sources can be attributed to its geometrical features, such as the slat tracks. Note that the x-component of the model is plotted along the frequency axis, but the color-map does not include any x-information. Within the CLEAN-SC result, one can clearly identify slat tracks in a frequency range of . Otherwise, the result mostly shows the inboard Krüger slat, the nacelle area, and the noise for . Below , the source separation fails. The B-CLEAN-SC result shows the same slat tracks as dominant sources. However, they are also reconstructed at low and high frequencies. Additionally, there is nearly no noise for . Additional sources are visible between the slat tracks, which are typically connected to slat cove tones (Goudarzi, 2022). Overall, the location of the estimated sources strongly correlates to the geometrical features of the model and is consistent over the whole frequency range.
Based on the analysis of these data (Goudarzi, 2022), ROIs are defined that cover the inner (Krüger) slat and the slat tracks (blue), the outer slat (orange), and the flap side edge (green). The ROIs are chosen so that the integrated source types are similar (Ahlefeldt , 2023; Goudarzi, 2022). Figures 3(a) and 3(d) show the (identical) ROI, and Figs. 3(c) and 3(f) show the corresponding PSD. Below , CLEAN-SC fails to reconstruct individual sources, which results in strong noise, additional to the noise . B-CLEAN-SC reconstructs the PSD throughout the frequency range with approximately less noise. For frequencies where both methods produce a source spectrum, they coincide.
4. Discussion
Case 1 showed that CLEAN-SC can predict arbitrary results at low frequencies. B-CLEAN-SC fixed this by averaging frequency intervals of dirty maps to determine source locations. This works, as the locations of side- and grating-lobes change with frequency so that they cancel out during the averaging. Additionally, the source location at low frequencies below the Rayleigh resolution limit is determined based on higher frequencies, where the source positions can still be resolved. The case showed that B-CLEAN-SC also works for sources with a frequency-dependent spectrum and smallband sources. Here, the initial source marker is not guaranteed to be located on the dominant source for all frequencies. Thus, B-CLEAN-SC is prone to “confuse” the power contribution of these sources. To relax this problem, a low iteration gain factor of was used. Additionally, using frequency intervals instead of using the whole spectrum further relaxes this issue.
Case 2 showed that B-CLEAN-SC is able to correctly determine the location and power of the sources at low frequencies in an open wind tunnel experiment, and its overall noise level was 6 dB lower compared to CLEAN-SC. The introduced metrics and Table 1 showed that B-CLEAN-SC improves with increasing frequency intervals in spatial and spectral accuracy up to an optimum at 1/4 of the total frequencies, after which spatial accuracy is traded for a deteriorating spectral estimation. One can possibly account for this by defining frequency-dependent intervals so that the intervals are large at low and high frequencies and small at medium frequencies where CLEAN-SC works well. A lower gain factor further relaxes this issue but increases the number of iterations.
Case 3 showed that for a real-world wind tunnel measurement of a Do728, B-CLEAN-SC was able to reconstruct sources throughout the frequency range, compared to CLEAN-SC, which identified sources mainly at . Since their location is roughly constant over frequency and corresponds to the geometric features (slat track, flap side edge, etc.), these identified locations are presumably correct. The B-CLEAN-SC result is less noisy compared to the CLEAN-SC result. The ROI PSD showed nearly identical results for both methods in the frequency region where CLEAN-SC correctly identified sources, which was smaller compared to B-CLEAN-SC.
5. Conclusion
This paper presented B-CLEAN-SC, a variation of CLEAN-SC specifically tailored to broadband sources. B-CLEAN-SC assumes that the location of broadband sources is constant over frequency intervals. For synthetic and experimental wind tunnel data, B-CLEAN-SC outperformed CLEAN-SC at low frequencies. For experimental real data, B-CLEAN-SC also resulted in 3 dB less broadband noise throughout the frequency range. On wind tunnel data of a Do728, both methods showed that the source location assumption is valid, improves the spatial estimation of sources, and reduces noise.
The algorithmic difference between CLEAN-SC and B-CLEAN-SC is small. B-CLEAN-SC processes multiple frequencies at once and uses one additional operation per iteration compared to CLEAN-SC. As it requires a lower gain factor, the number of iterations increase inverse proportionally to the gain factor to meet a convergence criterion that is, however, not performance relevant. The necessary memory scales linearly with the number of employed frequencies within the interval compared to standard CLEAN-SC, which, in terms of today's computational capacities, should not be an issue. This makes B-CLEAN-SC a viable method for little computational effort but improved results at low and high frequencies.
AUTHOR DECLARATIONS
Conflict of Interest
The author declares that he has no conflict of interest.
DATA AVAILABILITY
The data that support the findings of this study are available from the corresponding author upon reasonable request.