This Letter proposes a joint geoacoustic inversion method for modal group velocity dispersion and amplitudes of waveform by incorporating a Pearson correlation constraint. Numerical simulations show that this joint inversion leads to improved geoacoustic inversion performance with smaller uncertainties compared to separate inversion methods when applied to data from a single receiver. Additionally, the effective use of the Wasserstein metric from optimal transport theory is explored and compared to the more-common L2 norm misfit measure. The Letter also presents a qualitative representation of joint inversion convergence obtained through multiple independent runs of genetic algorithms. The algorithm is applied to simulated data.
1. Introduction
Ocean acoustic modal group velocity dispersion data and recorded signal waveform data are widely used in the field of geoacoustic inversion,1,2 source localization,3 and environmental inversion.4 However, geoacoustic inversion based on modal dispersion involves travel-time information but not information on seabed attenuation needed for amplitude information.5 Many characteristics of the acoustic field can be extracted from the broadband signals and utilized in geoacoustic inversions. Furthermore, different characteristics exhibit varying sensitivities to geoacoustic parameters. Previous joint inversion studies employed the multi-stage inversion approach6 to address the issue of specific sensitivity between various parametrizations and characteristics. However, errors in parameters at different stages are also propagated, making it difficult to adequately assess the error of inversion results.7 Waveform inversions utilize relative time or time correction among multimodal arrivals,8–10 the inversion accuracy of which decreases with the increase in uncertainty error, i.e., the periodic jump phenomenon.
As the criterion principle of modal dispersion inversion is based on phase comparison, it is unable to provide an estimation of attenuation.5 In a single receiver modal context, attenuation must be estimated using mode amplitude considerations, such as modal amplitude ratios inversion6 or broadband waveform inversion. When one data feature cannot simultaneously invert all model parameters, joint inversion methods are generally used. Therefore, this Letter carries out a joint geoacoustic inversion of modal group velocity dispersion and the full waveform envelope obtained via the Hilbert transform (FWH), combining the two by incorporating a constraint based on the Pearson correlation coefficient (PCC)11 and Wasserstein metric.8 However, it should be emphasized that joint inversion of different quantities derived from the same measurements may not always be ideal unless these quantities exhibit suitable independence.
The remainder of this Letter is organized as follows: Section 2 provides a short description of previous work related to modal dispersion geoacoustic inversion and waveform inversion. We describe our proposed joint inversion approach, including the formulation of the objective function and the optimization strategy. The performance and analysis of joint geoacoustic inversion are presented in Sec. 3. Finally, conclusions are given in Sec. 4.
2. Theory and joint inversion framework
2.1 Dispersion curve estimation
2.2 Acoustic waveform inversion based on Wasserstein metric
In the traditional waveform inversion approach based on the Bartlett metric or norm misfit function, the periodic jump phenomenon of waveform signal is inevitable. More specifically, when the temporal shift between the synthesized waveform and the observed waveform exceeds half a period, the objective function is prone to falling into multiple local minima. This will affect inversion performance in terms of convergence and precision. Possible solutions to this challenge could include the implementation of the Wasserstein metric as the objective function. Compared to the point-by-point comparison of signals in the norm, the Wasserstein metric19,20 used as the objective function allows for global signal quality comparisons, thus helping to alleviate the problem of local minima.
2.3 Pearson correlation coefficient and joint inversion algorithm
Figure 1(a) illustrates the overview of the joint inversion framework and outlines the relationships involved. This study implements warping and Hilbert transformations on T-F signals to extract dispersion curves and waveform envelope, respectively, for use as inputs in the joint inversion. The process involves dividing the full parameter space into two separate subspaces that are optimized and searched for using distinct genetic algorithms (GA). In essence, the dispersion inversion and waveform inversion are carried out independently, but are combined through a joint constraint, which takes account of the PCC between the parameter populations.
Diagram for the joint inversion. (a) The joint geoacoustic inversion frame of modal dispersion and the FWH using an alternative iteration approach. (b) The simulated geoacoustic model and sound speed profile.
Diagram for the joint inversion. (a) The joint geoacoustic inversion frame of modal dispersion and the FWH using an alternative iteration approach. (b) The simulated geoacoustic model and sound speed profile.
For each iteration, when an algebraic update is applied to a population in the dispersion inversion dataset, the PCC joint constraint is also updated and replaces the joint constraint in the objective function of the FWH inversion dataset, and vice versa.
3. Numerical results and discussions
3.1 Environment setup and simulated data
The simulation is range-independent, and the environmental characteristics are based on the general features observed in the Shallow Water 2006 (SW06) experiment.25 The simulated environmental parameters are given in Fig. 1(b) and corresponding parameter settings are shown in Table 1. We now demonstrate the numerical experiment by presenting actual pulse simulations obtained by a source pulse with the frequency band 50–200 Hz, emitted at depth 22 m, and recorded 7 km away at depth 20 m, with a signal-to-noise ratio (SNR) of 6 dB. can be expressed as a logarithmic ratio of the signal power to the Gaussian white noise power for the same bandwidth. The GA method with a population of 100 and a crossover rate of 0.5 is used to optimize .
Summary of separated inversion and joint inversion averaged over 100 independent runs of GAs. The standard deviation is in parentheses. The rightmost column gives the normalized root mean square error (NRMSE) of this inversion results.
Parametera . | . | . | . | . | . | . | . | NRMSE . |
---|---|---|---|---|---|---|---|---|
Real value | 25.0 | 1600.0 | 1.70 | 2100.0 | 2.50 | 0.150 | 0.600 | — |
L2-D | 24.5 (± 2.9) | 1601.0 (± 5.8) | 1.65 (± 0.11) | 2260.0 (± 119.7) | 2.58 (± 0.16) | — | — | — |
L2-FWH | 29.7 (± 4.2) | 1619.5 (± 13.6) | 1.89 (± 0.15) | 2091.0 (± 99.4) | 2.45 (± 0.13) | 0.220 (± 0.025) | 0.405 (± 0.158) | 7.35e-5 |
L2-J | 23.7 (± 2.6) | 1592.5 (± 9.6) | 1.59 (± 0.09) | 2077.5 (± 117.4) | 2.22 (± 0.14) | 0.135 (± 0.024) | 0.545 (± 0.102) | 6.58e-5 |
Was-J | 25.8 (± 2.4) | 1603.0 (± 6.3) | 1.73 (± 0.08) | 2101.0 (± 61.8) | 2.43 (± 0.14) | 0.160 (± 0.014) | 0.557 (± 0.141) | 4.35e-5 |
Parametera . | . | . | . | . | . | . | . | NRMSE . |
---|---|---|---|---|---|---|---|---|
Real value | 25.0 | 1600.0 | 1.70 | 2100.0 | 2.50 | 0.150 | 0.600 | — |
L2-D | 24.5 (± 2.9) | 1601.0 (± 5.8) | 1.65 (± 0.11) | 2260.0 (± 119.7) | 2.58 (± 0.16) | — | — | — |
L2-FWH | 29.7 (± 4.2) | 1619.5 (± 13.6) | 1.89 (± 0.15) | 2091.0 (± 99.4) | 2.45 (± 0.13) | 0.220 (± 0.025) | 0.405 (± 0.158) | 7.35e-5 |
L2-J | 23.7 (± 2.6) | 1592.5 (± 9.6) | 1.59 (± 0.09) | 2077.5 (± 117.4) | 2.22 (± 0.14) | 0.135 (± 0.024) | 0.545 (± 0.102) | 6.58e-5 |
Was-J | 25.8 (± 2.4) | 1603.0 (± 6.3) | 1.73 (± 0.08) | 2101.0 (± 61.8) | 2.43 (± 0.14) | 0.160 (± 0.014) | 0.557 (± 0.141) | 4.35e-5 |
Parameter settings are shown in Fig. 1(b) and the search space: 20 m ⩽ ⩽ 30 m with 0.1 m steps; 1400 m/s ⩽ ⩽ 1700 m/s with 0.5 m/s steps; 1.4 g/cm3 ⩽ ⩽ 1.9 g/cm3 with 0.01 steps; 1900 m/s ⩽ ⩽ 2300 m/s with 0.5 m/s steps; 2.2 g/cm3 ⩽ ⩽ 2.7 g/cm3 with 0.01 steps; 0.05 dB/λ ⩽ ⩽ 0.35 dB/λ) with 0.005 steps; 0.4 dB/λ ⩽ ⩽ 0.8 dB/λ with 0.005 steps.
3.2 Comparison with separated inversion schemes
This numerical example shows the advantage of joint inversion method on reducing geoacoustic inversion uncertainty. To estimate the performance of joint inversion algorithm, we need to consider the calculation time as well as the generation of convergence iteration. For each approach, the GA was run over 100 with different starting values through Monte Carlo sampling. The average running times of the dispersion curve inversion based on L2 norm (L2-D), FWH inversion based on L2 norm (L2-FWH), L2-J, and Was-J, are 22.5, 29.3, 45.5, and 37.4 min, respectively. The runtime of joint inversions is significantly higher than that of the individual inversions. According to Fig. 2, the standard deviation of the objective function based on the Was-J is the lowest. The color interval in generations 4–8 is the iteration interval after the implementation of joint constraint (when the value of the objective function mismatch term decreases to one-third of that during the original generation). Based on the inset in Fig. 2, it is observed that the joint term constraint leads to a better value of H in the early stages of the optimization process.
The figure presents the values of four different objective functions plotted against GA iteration number, averaged over 100 independent runs of GAs. The blue and green shaded regions correspond to the mean ±1 standard deviation of objective functions for each iteration in the joint inversion approach, while the line plots illustrate that in individual inversions. For the joint inversions, the objective function value is the sum of two objective functions. The red shaded area corresponds to the 4th–8th iterative interval. In the inset, a magnified view is provided to illustrate the distribution of the sedimentary layer thickness parameter specifically during this iteration of the Was-J inversion. The black solid line is a nonparametric kernel-smoothed representation of the inset histogram.
The figure presents the values of four different objective functions plotted against GA iteration number, averaged over 100 independent runs of GAs. The blue and green shaded regions correspond to the mean ±1 standard deviation of objective functions for each iteration in the joint inversion approach, while the line plots illustrate that in individual inversions. For the joint inversions, the objective function value is the sum of two objective functions. The red shaded area corresponds to the 4th–8th iterative interval. In the inset, a magnified view is provided to illustrate the distribution of the sedimentary layer thickness parameter specifically during this iteration of the Was-J inversion. The black solid line is a nonparametric kernel-smoothed representation of the inset histogram.
Was-J results can be summarized by parameter distributions derived from 100 optimization runs, as shown in Fig. 3, where the green color indicates the best model. The sediment depth and soundspeed, as well as basement soundspeed, are superimposed on Fig. 3(a) to facilitate comparison. For the soundspeed parameters, the inversion results compared to the true value show a variation range of ± 375 m/s in the basement layer, while in the sedimentary layer, it is only ± 20 m/s. Therefore, the distribution of the basement soundspeed is not included in Fig. 3, due to the lack of representativeness in the results obtained from different inversion methods. When compared to the results of the L2-J with high uncertainty and standard deviations, the Was-J clearly outperforms. First, it is evident that the mean values of all parameter distributions are in close proximity to the true values. The sediment parameters ( , , and ) are well estimated. Moreover, the sediment soundspeed and attenuation exhibit high peakedness and concentration in their unimodal distributions, reflecting the low uncertainty in Was-J inversion. According to Table 1, the overall NRMSE of parameters in Was-J is 4.35 × 10−5, and deviation range of other parameters, except , is 0.1%–2.97%, both of which are better than FWH and L2-J. On the other hand, L2-FWH method produces the highest NRMSE in most cases and performs poorly in estimating , , and .
Qualitative appraisals of the model ensemble produced by 100 GAs runs, each with different crossover, mutation, and starting values through Monte Carlo sampling. (a) Distributions of sound speed and sediment thickness models. The results are color-coded according to the cost function: three groups showing the best one (green), 30% (brown, also called medium), and all (blue) models. Black vertical lines are the true value of sound speeds in the sediment and basement. The horizontal dashed line and the patch show the true value and distribution interval of sediment depth, respectively. Data fits for (b) FWH and (c) dispersion curves between observations and estimations. The gray uncertainty intervals represent the range of values encompassing the data's mean and standard deviation, with an interval containing 95% of the models. The black solid lines and the red solid lines indicate the average inversion values and the observed values, respectively. The grouping of the best and full models and the colors is consistent with (a). (d)–(i) Nonparametric kernel smooth distributions of the five parameters ( , , , , , and ), representing an approximation of parameter distributions. Black vertical dashed lines represent the true value of parameters.
Qualitative appraisals of the model ensemble produced by 100 GAs runs, each with different crossover, mutation, and starting values through Monte Carlo sampling. (a) Distributions of sound speed and sediment thickness models. The results are color-coded according to the cost function: three groups showing the best one (green), 30% (brown, also called medium), and all (blue) models. Black vertical lines are the true value of sound speeds in the sediment and basement. The horizontal dashed line and the patch show the true value and distribution interval of sediment depth, respectively. Data fits for (b) FWH and (c) dispersion curves between observations and estimations. The gray uncertainty intervals represent the range of values encompassing the data's mean and standard deviation, with an interval containing 95% of the models. The black solid lines and the red solid lines indicate the average inversion values and the observed values, respectively. The grouping of the best and full models and the colors is consistent with (a). (d)–(i) Nonparametric kernel smooth distributions of the five parameters ( , , , , , and ), representing an approximation of parameter distributions. Black vertical dashed lines represent the true value of parameters.
Furthermore, as shown in Fig. 3(c), the uncertainty interval of dispersion curve of mode 4 is large, particularly near the Airy phase. This is attributed to the inaccuracy in extracting the fourth mode during the warping transformation and the parameter sensitivity of this mode. Finally, the parameter distribution of the sediment thickness in Was-J is a unimodal distribution, whereas in the FWH, it is a bimodal distribution (which is not shown limited by the length of the paper). This demonstrates that the joint inversion approach presented in this research successfully combines the advantages of dispersion inversion for sedimentary sound speed with the advantages of FWH for attenuation. Although it is not feasible to directly quantify the uncertainty associated with the joint inversion method involving dependent datasets, conducting a statistical analysis of the estimates obtained from multiple inversions with various initial values, crossovers, and mutations, can provide insights into the overall uncertainty of the inversion results. This analysis reveals that the overall uncertainty, considering the various sources of uncertainty, is relatively minimal for the inversion results.
This joint inversion approach is also applied to the Workshop's 97 geoacoustic inversion benchmark AT cases, with details of the environmental settings and search intervals provided in the reference.26 The results are summarized in Table 2.
Parameter true values and their estimates of the benchmark AT cases. Differences (diff) between true (TRUE) parameter values and the inversion results are also shown.
ID . | . | . | . | . | . | . | . | . | E1a . | E2 . | |
---|---|---|---|---|---|---|---|---|---|---|---|
TRUE-a | 44.57 | 1546.91 | 1624.75 | 1.40277 | 1676.30 | 1.77546 | 0.12368 | 0.08046 | — | — | |
L2-J | 44.87 | 1546.0 | 1626.0 | 1.40 | 1675.5 | 1.69 | 0.110 | 0.070 | 0.0371 | 0.075 | |
diff | −0.30 | 0.91 | −1.25 | 0 | 0.80 | 0.08 | 0.013 | 0.01 | — | — | |
Was-J | 44.18 | 1546.0 | 1629.5 | 1.41 | 1679.0 | 1.61 | 0.125 | 0.050 | 0.074 | 0.152 | |
diff | 0.39 | 0.91 | −4.75 | −0.01 | −2.7 | 0.16 | −0.002 | 0.030 | — | — | |
TRUE-a | 13.4381 | 1558.27 | 1564.62 | 1.77156 | 1758.80 | 1.95231 | 0.31423 | 0.19822 | — | — | |
L2-J | 13.21 | 1558.5 | 1568.5 | 1.74 | 1720.0 | 1.95 | 0.325 | 0.175 | 0.049 | 0.076 | |
diff | 0.23 | −0.23 | −3.88 | 0.03 | 38.8 | 0 | −0.01 | 0.023 | — | — | |
Was-J | 13.53 | 1561.5 | 1565.0 | 1.81 | 1753.5 | 1.99 | 0.325 | 0.195 | 0.035 | 0.048 | |
diff | −0.09 | −3.23 | −0.38 | −0.03 | 5.3 | −0.04 | −0.011 | 0.003 | — | — | |
TRUE-a | 30.3733 | 1533.93 | 1557.58 | 1.42907 | 1615.15 | 1.68715 | 0.08399 | 0.05278 | — | — | |
L2-J | 36.8 | 1533.0 | 1569.0 | 1.43 | 1619.0 | 1.60 | 0.095 | 0.070 | 0.073 | 0.105 | |
diff | −6.42 | 0.09 | −11.4 | 0 | −3.85 | 0.08 | −0.011 | −0.012 | — | — | |
Was-J | 30.5 | 1533.5 | 1559.0 | 1.42 | 1615.0 | 1.63 | 0.105 | 0.060 | 0.031 | 0.054 | |
diff | −0.13 | 0.43 | −1.42 | 0 | 0.15 | 0.05 | −0.02 | 0 | — | — |
ID . | . | . | . | . | . | . | . | . | E1a . | E2 . | |
---|---|---|---|---|---|---|---|---|---|---|---|
TRUE-a | 44.57 | 1546.91 | 1624.75 | 1.40277 | 1676.30 | 1.77546 | 0.12368 | 0.08046 | — | — | |
L2-J | 44.87 | 1546.0 | 1626.0 | 1.40 | 1675.5 | 1.69 | 0.110 | 0.070 | 0.0371 | 0.075 | |
diff | −0.30 | 0.91 | −1.25 | 0 | 0.80 | 0.08 | 0.013 | 0.01 | — | — | |
Was-J | 44.18 | 1546.0 | 1629.5 | 1.41 | 1679.0 | 1.61 | 0.125 | 0.050 | 0.074 | 0.152 | |
diff | 0.39 | 0.91 | −4.75 | −0.01 | −2.7 | 0.16 | −0.002 | 0.030 | — | — | |
TRUE-a | 13.4381 | 1558.27 | 1564.62 | 1.77156 | 1758.80 | 1.95231 | 0.31423 | 0.19822 | — | — | |
L2-J | 13.21 | 1558.5 | 1568.5 | 1.74 | 1720.0 | 1.95 | 0.325 | 0.175 | 0.049 | 0.076 | |
diff | 0.23 | −0.23 | −3.88 | 0.03 | 38.8 | 0 | −0.01 | 0.023 | — | — | |
Was-J | 13.53 | 1561.5 | 1565.0 | 1.81 | 1753.5 | 1.99 | 0.325 | 0.195 | 0.035 | 0.048 | |
diff | −0.09 | −3.23 | −0.38 | −0.03 | 5.3 | −0.04 | −0.011 | 0.003 | — | — | |
TRUE-a | 30.3733 | 1533.93 | 1557.58 | 1.42907 | 1615.15 | 1.68715 | 0.08399 | 0.05278 | — | — | |
L2-J | 36.8 | 1533.0 | 1569.0 | 1.43 | 1619.0 | 1.60 | 0.095 | 0.070 | 0.073 | 0.105 | |
diff | −6.42 | 0.09 | −11.4 | 0 | −3.85 | 0.08 | −0.011 | −0.012 | — | — | |
Was-J | 30.5 | 1533.5 | 1559.0 | 1.42 | 1615.0 | 1.63 | 0.105 | 0.060 | 0.031 | 0.054 | |
diff | −0.13 | 0.43 | −1.42 | 0 | 0.15 | 0.05 | −0.02 | 0 | — | — |
These error norms are defined by and , where i is the parameter index, P is the total number of unknown parameters, xi is the ith true (normalized) parameter value, and is the ith estimated (normalized) parameter value.
4. Conclusions
This Letter develops a joint inversion approach of modal dispersion and FWH data based on the PCC. Essentially, PCC is a joint constraint on the temporal dispersion characteristics and attenuation dispersion characteristics. By simultaneously coupling these two individual inversions, this joint approach prevents the error propagation associated with two-staged inversion approaches. In addition, this Letter provides further evidence that incorporating the Wasserstein metric to formulate the objective function demonstrates enhanced robustness in joint inversion, resulting in decreased uncertainty of certain inverted parameters, as compared to employing the L2 norm.
Specifically, we successfully implement the Wasserstein metric in joint geoacoustic inversion, by using the softscaling functions to transform acoustic signals into probability densities. The simulation results demonstrate that the joint inversion approach, especially based on the Wasserstein metric, outperforms these two existing individual inversions in terms of reducing the uncertainty of inversion results. However, this enhanced performance comes at the expense of compromising the accuracy of certain parameters other than sediment attenuation and sediment sound speed, as well as increasing the optimization time. Overall, the results obtained from the benchmark problems are in general agreement with the results of the simulated data experiments, according to Table 2.
The joint inversion method has the potential to improve the reliability and accuracy of the inversion results, but may also increase the uncertainty if the joint inversion involves dependent datasets. Therefore, future work will further seek to analyze the noise sensitivity and compare the performance and quantitative uncertainties under different SNR conditions.
Acknowledgments
This work was supported by the National Natural Science Foundation of China under Grant No. 41775027. The authors gratefully acknowledge Professor Julien Bonnel for discussion on the geoacoustic inversion using warping operators and Professor Stan Dosso for his invaluable guidance.
Author Declarations
Conflict of Interest
The authors have no conflicts to disclose.
Data Availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.