This Letter proposes a joint geoacoustic inversion method for modal group velocity dispersion and amplitudes of waveform by incorporating a Pearson correlation constraint. Numerical simulations show that this joint inversion leads to improved geoacoustic inversion performance with smaller uncertainties compared to separate inversion methods when applied to data from a single receiver. Additionally, the effective use of the Wasserstein metric from optimal transport theory is explored and compared to the more-common L2 norm misfit measure. The Letter also presents a qualitative representation of joint inversion convergence obtained through multiple independent runs of genetic algorithms. The algorithm is applied to simulated data.

Ocean acoustic modal group velocity dispersion data and recorded signal waveform data are widely used in the field of geoacoustic inversion,1,2 source localization,3 and environmental inversion.4 However, geoacoustic inversion based on modal dispersion involves travel-time information but not information on seabed attenuation needed for amplitude information.5 Many characteristics of the acoustic field can be extracted from the broadband signals and utilized in geoacoustic inversions. Furthermore, different characteristics exhibit varying sensitivities to geoacoustic parameters. Previous joint inversion studies employed the multi-stage inversion approach6 to address the issue of specific sensitivity between various parametrizations and characteristics. However, errors in parameters at different stages are also propagated, making it difficult to adequately assess the error of inversion results.7 Waveform inversions utilize relative time or time correction among multimodal arrivals,8–10 the inversion accuracy of which decreases with the increase in uncertainty error, i.e., the periodic jump phenomenon.

As the criterion principle of modal dispersion inversion is based on phase comparison, it is unable to provide an estimation of attenuation.5 In a single receiver modal context, attenuation must be estimated using mode amplitude considerations, such as modal amplitude ratios inversion6 or broadband waveform inversion. When one data feature cannot simultaneously invert all model parameters, joint inversion methods are generally used. Therefore, this Letter carries out a joint geoacoustic inversion of modal group velocity dispersion and the full waveform envelope obtained via the Hilbert transform (FWH), combining the two by incorporating a constraint based on the Pearson correlation coefficient (PCC)11 and Wasserstein metric.8 However, it should be emphasized that joint inversion of different quantities derived from the same measurements may not always be ideal unless these quantities exhibit suitable independence.

The remainder of this Letter is organized as follows: Section 2 provides a short description of previous work related to modal dispersion geoacoustic inversion and waveform inversion. We describe our proposed joint inversion approach, including the formulation of the objective function and the optimization strategy. The performance and analysis of joint geoacoustic inversion are presented in Sec. 3. Finally, conclusions are given in Sec. 4.

When propagation is described by normal mode theory, time-frequency (TF) warping transform12 maps a recorded signal y obs ( t ) after propagation over a range r into the warped signal W h y obs ( t ) using a warping function χ ( t ):
(1)
Transforming y obs ( t ) with the warping function χ ( t ) = ( t 2 + t r 2 ) 1 / 2 as shown in Eq. (1) has been successfully used to isolate normal modes in shallow-water fields.13–16 The signal time origin is t r = r / c w, with c w being the sound speed in the waveguide. Bonnel et al.2 have shown that the warping is a robust transformation, which can be applied to most low-frequency shallow-water scenarios, even in non-ideal waveguides.12 Once the mode m has been filtered, the modal dispersion curve
(2)
where v g m ( f ) and t s ( f ) represents the group speed of mode m at frequency f and source group delay, respectively, can be used as input data for geoacoustic inversion.17 If the source is broadband impulsive, the source group delay t s ( f ) equals t r, representing the earliest possible time for any energy from the signal to reach the receiver.
Compared to dispersion geoacoustic inversion, which optimizes the matching of dispersion curves within a specific frequency range, waveform geoacoustic inversion is also an optimization problem but focuses on matching the envelope structure in the time domain. Mathematically, it can be expressed as
(3)
where Φ ( m ) and m is the mismatch function between the replica signal G ( m ) and the FWH signal h obs ( t ), and model parameters, respectively. For experimental inversion, a regularization term is usually added, such as Tikhonov regularization,18 to reduce the ill-posed property of geoacoustic inversion.

In the traditional waveform inversion approach based on the Bartlett metric or L 2 norm misfit function, the periodic jump phenomenon of waveform signal is inevitable. More specifically, when the temporal shift between the synthesized waveform and the observed waveform exceeds half a period, the objective function is prone to falling into multiple local minima. This will affect inversion performance in terms of convergence and precision. Possible solutions to this challenge could include the implementation of the Wasserstein metric as the objective function. Compared to the point-by-point comparison of signals in the L 2 norm, the Wasserstein metric19,20 used as the objective function allows for global signal quality comparisons, thus helping to alleviate the problem of local minima.

Consider two equal probability measures, μ ( x ) P ( X ) and ν ( y ) P ( Y ), defined on two complete and separable metric spaces X and Y, respectively. The corresponding probability density functions f and g are defined as d μ = fdx and d ν = gdy on space n. In this study, f and g represent the amplitude of the replica and the observation, respectively. For the mapping S: μ ν, which satisfies μ [ B ] = v [ S 1 ( B ) ], for any measurable set B Y, the corresponding minimum transportation cost from μ to ν is defined as
(4)
where y = S ( x ), and M is the set of all transport plans that can redistribute f into g. The symbol c ( x , y ) indicates the cost of transporting one unit mass from location x to y. Particularly, when c ( x , y ) = | x y | 2 for p = 2, I ( f , g , c ) becomes the Wasserstein metric:
(5)
The Wasserstein metric requires the distributions f and g to be mass-conservation and non-negative.21 The recorded acoustic waveform signal does not naturally satisfy these requirements. To tackle this obstacle, we need to transform acoustic signals into probability densities by using the proper scaling function22 
(6)
where the generic constant b is related to the convexity of the objective function. The constant b can be determined by the convexity of the objective function under different values.22 

Figure 1(a) illustrates the overview of the joint inversion framework and outlines the relationships involved. This study implements warping and Hilbert transformations on T-F signals to extract dispersion curves and waveform envelope, respectively, for use as inputs in the joint inversion. The process involves dividing the full parameter space into two separate subspaces that are optimized and searched for using distinct genetic algorithms (GA). In essence, the dispersion inversion and waveform inversion are carried out independently, but are combined through a joint constraint, which takes account of the PCC between the parameter populations.

Fig. 1.

Diagram for the joint inversion. (a) The joint geoacoustic inversion frame of modal dispersion and the FWH using an alternative iteration approach. (b) The simulated geoacoustic model and sound speed profile.

Fig. 1.

Diagram for the joint inversion. (a) The joint geoacoustic inversion frame of modal dispersion and the FWH using an alternative iteration approach. (b) The simulated geoacoustic model and sound speed profile.

Close modal

For each iteration, when an algebraic update is applied to a population in the dispersion inversion dataset, the PCC joint constraint is also updated and replaces the joint constraint in the objective function of the FWH inversion dataset, and vice versa.

According to the joint inversion strategy proposed by Egbert and Kelbert,23 we define the joint objective inversion for geoacoustic inversion as
(7)
(8)
where T ( m 1 ) and H ( m 2 ) are misfits of dispersion and the envelope data, while φ J ( m 1 , m 2 ) is the joint constraint formed by PCC. λ 1 , 2 is the weighting coefficient for the term of joint constraint, representing the regularization. These two coefficients are determined and iteratively updated based on the magnitudes of the mismatch and constraint terms.11 Writing Eqs. (7) and (8) in matrix, we have
(9)
(10)
where τ m obs, h obs denote, respectively, the observed data of two quantities, and m represent geoacoustic models (including N parameters) that are iteratively determined using GA. K is the forward operator for the normal mode method. Here, we use the KRAKENC24 to calculate dispersion curves in TF domain. G is the operator that generates the replica field of the FWH signal via Hilbert transform. C t and C h are diagonal matrices, representing the estimation of the standard deviation of noisy data for dispersion curves and FWH, respectively. p i is the PCC between these two populations of parameter i in the two GAs for each evolutionary generation.
The simulated additive errors are uncorrelated, and we assume that each mode has a distinct standard deviation. Consider the common approximation of independent errors and diagonal covariance matrices C k = v k I, where v k is the variance for the k-th mode and I is the identity matrix. Considering joint inversion based on Wasserstein metric (Was-J), the objective function of FWH is given by
(11)

Therefore, Eqs. (9) and (10) establish the objective function of the joint inversion under L2 norm (L2-J), while Eqs. (9) and (11) establish it under Wasserstein metric.

The simulation is range-independent, and the environmental characteristics are based on the general features observed in the Shallow Water 2006 (SW06) experiment.25 The simulated environmental parameters are given in Fig. 1(b) and corresponding parameter settings are shown in Table 1. We now demonstrate the numerical experiment by presenting actual pulse simulations obtained by a source pulse with the frequency band 50–200 Hz, emitted at depth 22 m, and recorded 7 km away at depth 20 m, with a signal-to-noise ratio (SNR) of 6 dB. SNR = 10 log ( σ s 2 / σ b 2 ) can be expressed as a logarithmic ratio of the signal power σ s 2 to the Gaussian white noise power σ b 2 for the same bandwidth. The GA method with a population of 100 and a crossover rate of 0.5 is used to optimize J D , FWH ( m ).

Table 1.

Summary of separated inversion and joint inversion averaged over 100 independent runs of GAs. The standard deviation is in parentheses. The rightmost column gives the normalized root mean square error (NRMSE) of this inversion results.

Parametera H ( m ) c s ( m / s ) ρ s ( g / c m 3 ) c b ( m / s ) ρ b ( g / c m 3 ) α s ( d B / λ ) α b ( d B / λ ) NRMSE
Real value  25.0  1600.0  1.70  2100.0  2.50  0.150  0.600  — 
L2-D  24.5 (± 2.9)  1601.0 (± 5.8)  1.65 (± 0.11)  2260.0 (± 119.7)  2.58 (± 0.16)  —  —  — 
L2-FWH  29.7 (± 4.2)  1619.5 (± 13.6)  1.89 (± 0.15)  2091.0 (± 99.4)  2.45 (± 0.13)  0.220 (± 0.025)  0.405 (± 0.158)  7.35e-5 
L2-J  23.7 (± 2.6)  1592.5 (± 9.6)  1.59 (± 0.09)  2077.5 (± 117.4)  2.22 (± 0.14)  0.135 (± 0.024)  0.545 (± 0.102)  6.58e-5 
Was-J  25.8 (± 2.4)  1603.0 (± 6.3)  1.73 (± 0.08)  2101.0 (± 61.8)  2.43 (± 0.14)  0.160 (± 0.014)  0.557 (± 0.141)  4.35e-5 
Parametera H ( m ) c s ( m / s ) ρ s ( g / c m 3 ) c b ( m / s ) ρ b ( g / c m 3 ) α s ( d B / λ ) α b ( d B / λ ) NRMSE
Real value  25.0  1600.0  1.70  2100.0  2.50  0.150  0.600  — 
L2-D  24.5 (± 2.9)  1601.0 (± 5.8)  1.65 (± 0.11)  2260.0 (± 119.7)  2.58 (± 0.16)  —  —  — 
L2-FWH  29.7 (± 4.2)  1619.5 (± 13.6)  1.89 (± 0.15)  2091.0 (± 99.4)  2.45 (± 0.13)  0.220 (± 0.025)  0.405 (± 0.158)  7.35e-5 
L2-J  23.7 (± 2.6)  1592.5 (± 9.6)  1.59 (± 0.09)  2077.5 (± 117.4)  2.22 (± 0.14)  0.135 (± 0.024)  0.545 (± 0.102)  6.58e-5 
Was-J  25.8 (± 2.4)  1603.0 (± 6.3)  1.73 (± 0.08)  2101.0 (± 61.8)  2.43 (± 0.14)  0.160 (± 0.014)  0.557 (± 0.141)  4.35e-5 
a

Parameter settings are shown in Fig. 1(b) and the search space: 20 m ⩽ H⩽ 30 m with 0.1 m steps; 1400 m/s ⩽ c s⩽ 1700 m/s with 0.5 m/s steps; 1.4 g/cm3  ρ s⩽ 1.9 g/cm3 with 0.01 steps; 1900 m/s ⩽ c b ⩽ 2300 m/s with 0.5 m/s steps; 2.2 g/cm3 ρ b ⩽ 2.7 g/cm3 with 0.01 steps; 0.05 dB/λ ⩽ α s⩽ 0.35 dB/λ) with 0.005 steps; 0.4 dB/λ ⩽ α b ⩽ 0.8 dB/λ with 0.005 steps.

This numerical example shows the advantage of joint inversion method on reducing geoacoustic inversion uncertainty. To estimate the performance of joint inversion algorithm, we need to consider the calculation time as well as the generation of convergence iteration. For each approach, the GA was run over 100 with different starting values through Monte Carlo sampling. The average running times of the dispersion curve inversion based on L2 norm (L2-D), FWH inversion based on L2 norm (L2-FWH), L2-J, and Was-J, are 22.5, 29.3, 45.5, and 37.4 min, respectively. The runtime of joint inversions is significantly higher than that of the individual inversions. According to Fig. 2, the standard deviation of the objective function based on the Was-J is the lowest. The color interval in generations 4–8 is the iteration interval after the implementation of joint constraint (when the value of the objective function mismatch term decreases to one-third of that during the original generation). Based on the inset in Fig. 2, it is observed that the joint term constraint leads to a better value of H in the early stages of the optimization process.

Fig. 2.

The figure presents the values of four different objective functions plotted against GA iteration number, averaged over 100 independent runs of GAs. The blue and green shaded regions correspond to the mean ±1 standard deviation of objective functions for each iteration in the joint inversion approach, while the line plots illustrate that in individual inversions. For the joint inversions, the objective function value is the sum of two objective functions. The red shaded area corresponds to the 4th–8th iterative interval. In the inset, a magnified view is provided to illustrate the distribution of the sedimentary layer thickness parameter specifically during this iteration of the Was-J inversion. The black solid line is a nonparametric kernel-smoothed representation of the inset histogram.

Fig. 2.

The figure presents the values of four different objective functions plotted against GA iteration number, averaged over 100 independent runs of GAs. The blue and green shaded regions correspond to the mean ±1 standard deviation of objective functions for each iteration in the joint inversion approach, while the line plots illustrate that in individual inversions. For the joint inversions, the objective function value is the sum of two objective functions. The red shaded area corresponds to the 4th–8th iterative interval. In the inset, a magnified view is provided to illustrate the distribution of the sedimentary layer thickness parameter specifically during this iteration of the Was-J inversion. The black solid line is a nonparametric kernel-smoothed representation of the inset histogram.

Close modal

Was-J results can be summarized by parameter distributions derived from 100 optimization runs, as shown in Fig. 3, where the green color indicates the best model. The sediment depth and soundspeed, as well as basement soundspeed, are superimposed on Fig. 3(a) to facilitate comparison. For the soundspeed parameters, the inversion results compared to the true value show a variation range of ± 375 m/s in the basement layer, while in the sedimentary layer, it is only ± 20 m/s. Therefore, the distribution of the basement soundspeed is not included in Fig. 3, due to the lack of representativeness in the results obtained from different inversion methods. When compared to the results of the L2-J with high uncertainty and standard deviations, the Was-J clearly outperforms. First, it is evident that the mean values of all parameter distributions are in close proximity to the true values. The sediment parameters ( H, c s, and α s) are well estimated. Moreover, the sediment soundspeed c s and attenuation α s exhibit high peakedness and concentration in their unimodal distributions, reflecting the low uncertainty in Was-J inversion. According to Table 1, the overall NRMSE of parameters in Was-J is 4.35 × 10−5, and deviation range of other parameters, except α b, is 0.1%–2.97%, both of which are better than FWH and L2-J. On the other hand, L2-FWH method produces the highest NRMSE in most cases and performs poorly in estimating H, ρ s, and α b.

Fig. 3.

Qualitative appraisals of the model ensemble produced by 100 GAs runs, each with different crossover, mutation, and starting values through Monte Carlo sampling. (a) Distributions of sound speed and sediment thickness models. The results are color-coded according to the cost function: three groups showing the best one (green), 30% (brown, also called medium), and all (blue) models. Black vertical lines are the true value of sound speeds in the sediment and basement. The horizontal dashed line and the patch show the true value and distribution interval of sediment depth, respectively. Data fits for (b) FWH and (c) dispersion curves between observations and estimations. The gray uncertainty intervals represent the range of values encompassing the data's mean and standard deviation, with an interval containing 95% of the models. The black solid lines and the red solid lines indicate the average inversion values and the observed values, respectively. The grouping of the best and full models and the colors is consistent with (a). (d)–(i) Nonparametric kernel smooth distributions of the five parameters ( H, c s, ρ s, ρ b, α s, and α b), representing an approximation of parameter distributions. Black vertical dashed lines represent the true value of parameters.

Fig. 3.

Qualitative appraisals of the model ensemble produced by 100 GAs runs, each with different crossover, mutation, and starting values through Monte Carlo sampling. (a) Distributions of sound speed and sediment thickness models. The results are color-coded according to the cost function: three groups showing the best one (green), 30% (brown, also called medium), and all (blue) models. Black vertical lines are the true value of sound speeds in the sediment and basement. The horizontal dashed line and the patch show the true value and distribution interval of sediment depth, respectively. Data fits for (b) FWH and (c) dispersion curves between observations and estimations. The gray uncertainty intervals represent the range of values encompassing the data's mean and standard deviation, with an interval containing 95% of the models. The black solid lines and the red solid lines indicate the average inversion values and the observed values, respectively. The grouping of the best and full models and the colors is consistent with (a). (d)–(i) Nonparametric kernel smooth distributions of the five parameters ( H, c s, ρ s, ρ b, α s, and α b), representing an approximation of parameter distributions. Black vertical dashed lines represent the true value of parameters.

Close modal

Furthermore, as shown in Fig. 3(c), the uncertainty interval of dispersion curve of mode 4 is large, particularly near the Airy phase. This is attributed to the inaccuracy in extracting the fourth mode during the warping transformation and the parameter sensitivity of this mode. Finally, the parameter distribution of the sediment thickness in Was-J is a unimodal distribution, whereas in the FWH, it is a bimodal distribution (which is not shown limited by the length of the paper). This demonstrates that the joint inversion approach presented in this research successfully combines the advantages of dispersion inversion for sedimentary sound speed with the advantages of FWH for attenuation. Although it is not feasible to directly quantify the uncertainty associated with the joint inversion method involving dependent datasets, conducting a statistical analysis of the estimates obtained from multiple inversions with various initial values, crossovers, and mutations, can provide insights into the overall uncertainty of the inversion results. This analysis reveals that the overall uncertainty, considering the various sources of uncertainty, is relatively minimal for the inversion results.

This joint inversion approach is also applied to the Workshop's 97 geoacoustic inversion benchmark AT cases, with details of the environmental settings and search intervals provided in the reference.26 The results are summarized in Table 2.

Table 2.

Parameter true values and their estimates of the benchmark AT cases. Differences (diff) between true (TRUE) parameter values and the inversion results are also shown.

ID h sed ( m ) c sed ( D ) ( m / s ) c sed ( D + h sed ) ( m / s ) ρ sed ( g / c m 3 ) c hsp ( m / s ) ρ hsp ( g / c m 3 ) α sed ( d B / λ ) α hsp ( d B / λ ) E1a E2
TRUE-a  44.57  1546.91  1624.75  1.40277  1676.30  1.77546  0.12368  0.08046  —  — 
L2-J  y ̂ i  44.87  1546.0  1626.0  1.40  1675.5  1.69  0.110  0.070  0.0371  0.075 
diff  −0.30  0.91  −1.25  0.80  0.08  0.013  0.01  —  — 
Was-J  y ̂ i  44.18  1546.0  1629.5  1.41  1679.0  1.61  0.125  0.050  0.074  0.152 
diff  0.39  0.91  −4.75  −0.01  −2.7  0.16  −0.002  0.030  —  — 
TRUE-a  13.4381  1558.27  1564.62  1.77156  1758.80  1.95231  0.31423  0.19822  —  — 
L2-J  y ̂ i  13.21  1558.5  1568.5  1.74  1720.0  1.95  0.325  0.175  0.049  0.076 
diff  0.23  −0.23  −3.88  0.03  38.8  −0.01  0.023  —  — 
Was-J  y ̂ i  13.53  1561.5  1565.0  1.81  1753.5  1.99  0.325  0.195  0.035  0.048 
diff  −0.09  −3.23  −0.38  −0.03  5.3  −0.04  −0.011  0.003  —  — 
TRUE-a  30.3733  1533.93  1557.58  1.42907  1615.15  1.68715  0.08399  0.05278  —  — 
L2-J  y ̂ i  36.8  1533.0  1569.0  1.43  1619.0  1.60  0.095  0.070  0.073  0.105 
diff  −6.42  0.09  −11.4  −3.85  0.08  −0.011  −0.012  —  — 
Was-J  y ̂ i  30.5  1533.5  1559.0  1.42  1615.0  1.63  0.105  0.060  0.031  0.054 
diff  −0.13  0.43  −1.42  0.15  0.05  −0.02  —  — 
ID h sed ( m ) c sed ( D ) ( m / s ) c sed ( D + h sed ) ( m / s ) ρ sed ( g / c m 3 ) c hsp ( m / s ) ρ hsp ( g / c m 3 ) α sed ( d B / λ ) α hsp ( d B / λ ) E1a E2
TRUE-a  44.57  1546.91  1624.75  1.40277  1676.30  1.77546  0.12368  0.08046  —  — 
L2-J  y ̂ i  44.87  1546.0  1626.0  1.40  1675.5  1.69  0.110  0.070  0.0371  0.075 
diff  −0.30  0.91  −1.25  0.80  0.08  0.013  0.01  —  — 
Was-J  y ̂ i  44.18  1546.0  1629.5  1.41  1679.0  1.61  0.125  0.050  0.074  0.152 
diff  0.39  0.91  −4.75  −0.01  −2.7  0.16  −0.002  0.030  —  — 
TRUE-a  13.4381  1558.27  1564.62  1.77156  1758.80  1.95231  0.31423  0.19822  —  — 
L2-J  y ̂ i  13.21  1558.5  1568.5  1.74  1720.0  1.95  0.325  0.175  0.049  0.076 
diff  0.23  −0.23  −3.88  0.03  38.8  −0.01  0.023  —  — 
Was-J  y ̂ i  13.53  1561.5  1565.0  1.81  1753.5  1.99  0.325  0.195  0.035  0.048 
diff  −0.09  −3.23  −0.38  −0.03  5.3  −0.04  −0.011  0.003  —  — 
TRUE-a  30.3733  1533.93  1557.58  1.42907  1615.15  1.68715  0.08399  0.05278  —  — 
L2-J  y ̂ i  36.8  1533.0  1569.0  1.43  1619.0  1.60  0.095  0.070  0.073  0.105 
diff  −6.42  0.09  −11.4  −3.85  0.08  −0.011  −0.012  —  — 
Was-J  y ̂ i  30.5  1533.5  1559.0  1.42  1615.0  1.63  0.105  0.060  0.031  0.054 
diff  −0.13  0.43  −1.42  0.15  0.05  −0.02  —  — 
a

These error norms are defined by E 1 = ( 1 / P ) i = 1 P x i - x ̂ iand E 2 = ( 1 / P ) i = 1 P ( x i - x ̂ i ) 2 1 / 2, where i is the parameter index, P is the total number of unknown parameters, xi is the ith true (normalized) parameter value, and x i ̂ is the ith estimated (normalized) parameter value.

This Letter develops a joint inversion approach of modal dispersion and FWH data based on the PCC. Essentially, PCC is a joint constraint on the temporal dispersion characteristics and attenuation dispersion characteristics. By simultaneously coupling these two individual inversions, this joint approach prevents the error propagation associated with two-staged inversion approaches. In addition, this Letter provides further evidence that incorporating the Wasserstein metric to formulate the objective function demonstrates enhanced robustness in joint inversion, resulting in decreased uncertainty of certain inverted parameters, as compared to employing the L2 norm.

Specifically, we successfully implement the Wasserstein metric in joint geoacoustic inversion, by using the softscaling functions to transform acoustic signals into probability densities. The simulation results demonstrate that the joint inversion approach, especially based on the Wasserstein metric, outperforms these two existing individual inversions in terms of reducing the uncertainty of inversion results. However, this enhanced performance comes at the expense of compromising the accuracy of certain parameters other than sediment attenuation and sediment sound speed, as well as increasing the optimization time. Overall, the results obtained from the benchmark problems are in general agreement with the results of the simulated data experiments, according to Table 2.

The joint inversion method has the potential to improve the reliability and accuracy of the inversion results, but may also increase the uncertainty if the joint inversion involves dependent datasets. Therefore, future work will further seek to analyze the noise sensitivity and compare the performance and quantitative uncertainties under different SNR conditions.

This work was supported by the National Natural Science Foundation of China under Grant No. 41775027. The authors gratefully acknowledge Professor Julien Bonnel for discussion on the geoacoustic inversion using warping operators and Professor Stan Dosso for his invaluable guidance.

The authors have no conflicts to disclose.

The data that support the findings of this study are available from the corresponding author upon reasonable request.

1.
A.
Dremeau
,
F.
Le Courtois
, and
J.
Bonnel
, “
Reconstruction of dispersion curves in the frequency-wavenumber domain using compressed sensing on a random array
,”
IEEE J. Ocean. Eng.
42
(
4
),
914
922
(
2017
).
2.
J.
Bonnel
,
S. E.
Dosso
, and
C. N.
Ross
, “
Bayesian geoacoustic inversion of single hydrophone light bulb data using warping dispersion analysis
,”
J. Acoust. Soc. Am.
134
(
1
),
120
130
(
2013
).
3.
M. D.
Collins
,
A.
Turgut
,
M. J.
Buckingham
,
P.
Gerstoft
, and
M.
Siderius
, “
Selected topics of the past thirty years in ocean acoustics
,”
J. Theor. Comput. Acoust.
30
(
03
),
2240001
(
2022
).
4.
J.
Dettmer
and
S. E.
Dosso
, “
Probabilistic two-dimensional water-column and seabed inversion with self-adapting parameterizations
,”
J. Acoust. Soc. Am.
133
(
5
),
2612
2623
(
2013
).
5.
J.
Bonnel
,
A.
Thode
,
D.
Wright
, and
R.
Chapman
, “
Nonlinear time-warping made simple: A step-by-step tutorial on underwater acoustic modal separation with a single hydrophone
,”
J. Acoust. Soc. Am.
147
(
3
),
1897
1926
(
2020
).
6.
G. R.
Potty
,
J. H.
Miller
, and
J. F.
Lynch
, “
Inversion for sediment geoacoustic properties at the New England Bight
,”
J. Acoust. Soc. Am.
114
(
4
),
1874
1887
(
2003
).
7.
N. R.
Chapman
and
E. C.
Shang
, “
Review of geoacoustic inversion in underwater acoustics
,”
J. Theor. Comput. Acoust.
29
(
03
),
2130004
(
2021
).
8.
J.
Deng
,
P.
Zhu
,
W.
Kofman
,
J.
Jiang
,
Y.
Yuan
, and
A.
Herique
, “
Electromagnetic full waveform inversion based on quadratic Wasserstein metric
,”
IEEE Trans. Antennas Propag.
70
(
12
),
11934
11945
(
2022
).
9.
O.
Carriere
,
J. P.
Hermand
, and
J. V.
Candy
, “
Inversion for time-evolving sound-speed field in a shallow ocean by ensemble Kalman filtering
,”
IEEE J. Ocean. Eng.
34
(
4
),
586
602
(
2009
).
10.
W.
Zhang
,
Y. W.
Huang
,
L.
Li
, and
Y.
Song
, “
Inversion of sound speed profile based on waveform structure matching
,” in
2011 Symposium on Piezoelectricity, Acoustic Waves and Device Applications (SPAWDA),
Shenzhen, China (IEEE, Piscataway, NJ, 2011), pp.
94
97
.
11.
Y.
Liu
,
X.
Na
,
C.
Yin
,
Y.
Su
,
S.
Sun
,
B.
Zhang
,
X.
Ren
, and
V. C.
Baranwal
, “
3-D joint inversion of airborne electromagnetic and magnetic data based on local Pearson correlation constraints
,”
IEEE Trans. Geosci. Remote Sens.
60
,
5910813
(
2022
).
12.
J.
Bonnel
and
N. R.
Chapman
, “
Geoacoustic inversion in a dispersive waveguide using warping operators
,”
J. Acoust. Soc. Am.
130
(
2
),
EL101
EL107
(
2011
).
13.
J.
Bonnel
,
C.
Gervaise
,
B.
Nicolas
, and
J. I.
Mars
, “
Single-receiver geoacoustic inversion using modal reversal
,”
J. Acoust. Soc. Am.
131
(
1
),
119
128
(
2012
).
14.
J.
Bonnel
,
G.
Le Touze
,
B.
Nicolas
, and
J. I.
Mars
, “
Physics-based time-frequency representations for underwater acoustics: Power class utilization with waveguide-invariant approximation
,”
IEEE Signal Process. Mag.
30
(
6
),
120
129
(
2013
).
15.
J.
Bonnel
,
S.
Caporale
, and
A.
Thode
, “
Waveguide mode amplitude estimation using warping and phase compensation
,”
J. Acoust. Soc. Am.
141
(
3
),
2243
2255
(
2017
).
16.
T. W.
Tan
,
O. A.
Godin
,
B. G.
Katsnelson
, and
M.
Yarina
, “
Passive geoacoustic inversion in the Mid-Atlantic Bight in the presence of strong water column variability
,”
J. Acoust. Soc. Am.
147
(
6
),
EL453
EL459
(
2020
).
17.
N. R.
Chapman
, “
Inverse methods in underwater acoustics
,” in
Applied Underwater Acoustics
, edited by
T. H.
Neighbors
and
D.
Bradley
(
Elsevier
, Amsterdam, the Netherlands, 2017), Chap. 9, pp.
553
585
.
18.
Z.
Li
,
B.
Qiao
,
B.
Wen
,
Y.
Wang
, and
X.
Chen
, “
Acoustic mode measuring approach developed on generalized minimax-concave regularization and Tikhonov regularization
,”
IEEE Trans. Instrum. Meas.
71
,
6500411
(
2022
).
19.
B.
Engquist
and
B. D.
Froese
, “
Application of the Wasserstein metric to seismic signals
,”
Commun. Math. Sci.
12
(
5
),
979
988
(
2014
).
20.
Y.
Yang
,
B.
Engquist
,
J.
Sun
, and
B. F.
Hamfeldt
, “
Application of optimal transport and the quadratic Wasserstein metric to full-waveform inversion
,”
Geophysics
83
(
1
),
1JF
Z3
(
2018
).
21.
L.
Métivier
,
R.
Brossier
,
Q.
Mérigot
, and
E.
Oudet
, “
A graph space optimal transport distance as a generalization of Lp distances: Application to a seismic imaging inverse problem
,”
Inverse Probl.
35
(
8
),
085001
(
2019
).
22.
B.
Engquist
and
Y.
Yang
, “
Optimal transport based seismic inversion: Beyond cycle skipping
,”
Comm. Pure Appl. Math.
75
(
10
),
2201
2244
(
2022
).
23.
G. D.
Egbert
and
A.
Kelbert
, “
Computational recipes for electromagnetic inverse problems
,”
Geophys. J. Int.
189
(
1
),
251
267
(
2012
).
24.
M. B.
Porter
,
The KRAKEN Normal Mode Program
(
Naval Research Laboratory
,
Washington, DC
,
1992
).
25.
D.
Tang
,
J.
Moum
,
J.
Lynch
,
P.
Abbot
,
R.
Chapman
,
P.
Dahl
,
T.
Duda
,
G.
Gawarkiewicz
,
S.
Glenn
,
J.
Goff
,
H.
Graber
,
J.
Kemp
,
A.
Maffei
,
J.
Nash
, and
A.
Newhall
, “
Shallow Water '06: A joint acoustic propagation/nonlinear internal wave physics experiment
,”
Oceanography
20
(
4
),
156
167
(
2007
).
26.
A.
Tolstoy
,
N. R.
Chapman
, and
G.
Brooke
, “
Workshop '97: Benchmarking for geoacoustic inversion in shallow water
,”
J. Comput. Acoust.
06
(
01n02
),
1
28
(
1998
).