Ocean acoustic tomography (OAT) methods aim at estimating variations of sound speed profiles (SSP) based on acoustic measurements between multiple source-receiver pairs (e.g., eigenray travel times). This study investigates the estimation of range-dependent SSPs in the upper ocean over short ranges (<5 km) using the classical ray-based OAT formulation as well as iterative or adaptive OAT formulations (i.e., when the sources and receivers configuration can evolve across successive iterations of this inverse problem). A regional ocean circulation model for the DeSoto Canyon in the Gulf of Mexico is used to simulate three-dimensional sound speed variations spanning a month-long period, which exhibits significant submesoscale variability of variable intensity. OAT performance is investigated in this simulated environment in terms of (1) the selected source-receivers configuration and effective ray coverage, (2) the selected OAT estimator formulations, linearized forward model accuracy, and the parameterization of the expected SSP variability in terms of empirical orthogonal functions, and (3) the duration over which the OAT inversion is performed. Practical implications for the design of future OAT experiments for monitoring submesoscale variability in the upper ocean with moving autonomous platforms are discussed.

Knowledge of variations in sound speed is essential for accurate predictions of sound propagation in the ocean, which is needed for various acoustic remote sensing applications.1–3 Measurement and characterization of sound speed variability associated with submesoscale ocean features (with typical length scales of 100 m–10 km and time scales of hours to a few days4–6) is currently an operational challenge, as such small-scale spatiotemporal variability cannot easily be accurately sampled, and often is aliased, by common oceanographic survey techniques (e.g., oceanographic floats). To address this challenge, we numerically investigated the estimation of range-dependent SSPs in the upper ocean over short ranges (< 5 km) using both the classical ray-based ocean acoustic tomography (OAT) method7 (with a fixed source-receivers configuration and fixed referenced sound speed) as well as iterative OAT and adaptive OAT formulations (i.e., when the sources and receivers configuration can evolve through time or space, which can be realized using mobile autonomous platforms) for comparison. To do so, a regional oceanographic simulation of the DeSoto Canyon circulation in the Gulf of Mexico is used to construct three-dimensional (3D) SSP variations spanning a winter month that exhibit significant submesoscale variability. For simplicity, hereafter three-dimensional effects such as horizontal refraction effects of the propagating acoustic rays are neglected and the simulated OAT inversions are conducted only in 2D vertical ocean slices.

In practice, sound speed variability occurs on a wide range of spatial and temporal scales and is driven by numerous types of ocean dynamical processes. For example, mesoscale flows in the form of eddies and fronts with typical dimensions of 10 to 200 km, time scales from months to years, and vertical velocities of 1–10 m/day, can spread and deflect ray paths.8 Furthermore, in the upper ocean, submesoscale features, again in the form of eddies and fronts, but with characteristics of length scales of 100 m–10 km, time scales of hours to a few days,4 and vertical speed reaching up to 100 m/day, are associated with intense density gradients that can further enhance sound speed variability. Unlike persistent mesoscale ocean fronts, submesoscale circulations modify the upper ocean structure on much shorter time scales and are driven by complex and momentary instabilities.5,6 Therefore, submesoscale processes, which are especially significant in the tactical upper ocean (i.e., typically depths < 500 m), can affect underwater sound propagation in multiple ways, notably over (relatively) short propagation distances9–12 due to the relatively small wavelengths of (most) man-made underwater sounds. For example, even at very low frequencies, e.g., 10 Hz, the corresponding acoustic wavelength in water is only 150  m, which is on the smaller end of the conventional submesoscale range. Furthermore, ocean sound speed is typically estimated using empirical formulas based on local values of standard oceanographic parameters (such as temperature, salinity, and pressure)1,2 and adequately measuring it as a function of the rapidly evolving submesoscale field would require a very dense spatial and temporal sampling of these oceanographic parameters throughout the volume of interest. Such a dense sampling is especially challenging using standard floats or moorings.

To alleviate this requirement, an alternative approach consists of directly leveraging the fundamental relationship between acoustic observations (e.g., arrival times measurements) and the actual volumetric spatiotemporal variability of the ocean sound speed profiles (SSPs). OAT, as originally introduced by Munk and Wunsch in 1979,7 refers to the process of using changes in acoustic measurements collected between multiple pairs of sources and receivers as inputs of an inverse problem aimed at reconstructing SSP variations with respect to a known reference environment within an ocean volume for 3D applications, or more commonly, for selected ocean slices for 2D applications. Specifically, the classical ray-based OAT method optimally combines the average variability of each ray path between each source-receiver pairs to estimate the sound speed anomalies (based on a reference sound speed model) within a selected ocean region. Much literature has been devoted to demonstrating the theoretical and experimental feasibility of OAT in various environments.7,13–21 To date, however, most studies have focused on long-range (up to basin scales) OAT scenarios to primarily estimate mesoscale features, and very few studies have attempted to assess OAT performance to estimate the variability of sound speed driven by submesoscale processes.18,20–23

One challenge associated with the “inverse problem”-based approach of OAT methods for estimating SSPs is that the relationship between acoustic observations (typically arrival-time measurements) and actual SSPs is fundamentally non-linear. However, fluctuations in fractional SSPs are typically small compared to baseline SSP values (usually < 1 % ). Hence, classical ray-based OAT methods simplify this approach by linearizing the relationship between the estimated SSPs perturbations and the measured variations in arrival times of stable rays propagating between the source and receiver pairs (the rays being identified by standard ray-tracing methods using the baseline reference environment, which is also referred to as the reference state). This provides a simpler and numerically tractable solution to the OAT inverse problem.7,13–15 In addition, when solving the OAT problem, the reference state can be iteratively updated based on the estimated SSP anomalies of the previous (or initial) SSP values. This approach, known as iterative OAT24–27 aims at further improving the estimated SSP variability with each iteration, but requires additional computations. However, it is a very tractable approach, given the numerical efficiency of the ray-based OAT method. Alternatively, the accuracy of the OAT solution can be further improved by constraining this inverse problem through additional regularization constraints of the sound speed values at specific locations. In practice, this strategy would correspond to using independent SSP measurements obtained from available CTD measurements at specific locations within the inversion domain, as described in Sec. III. Finally, it is also possible to gradually add additional source-receiver pairs at each iteration, which in turn increases the number of available ray travel-time measurements for the OAT inversion. This latter approach gradually enhances the effective ray coverage over the area of interest, which typically improves the accuracy of the estimated OAT solution and is referred to hereafter as adaptive OAT since the effective sampling of the inverse domain can be changed at each iteration. A practical strategy guiding this approach is to add acoustic or local SSP measurements in areas of the inversion domain where the OAT solution from the previous iteration appears to locally exhibit large SSP errors. In concept, these additional source-receiver pairs (and associated ray multipaths) in the upper ocean could be synthetically obtained using a suite of mobile autonomous platforms (or profiling)18,20,28,29 equipped with a surface expression that allows for accurate positioning via a GPS link, as well as acoustic transponders and a CTD probe, as described in Sec. III.

To the best of our knowledge, an experimental data set that provides concurrent 3D SSP variations with sufficient temporal and spatial resolution to resolve submesoscale variability for testing OAT methods has not yet been available. Hereafter, we use a regional oceanographic circulation model covering the DeSoto Canyon in the Gulf of Mexico with a submesoscale permitting horizontal resolution of 0.5 km30,31 and compute the corresponding 3D SSP variations calculated according to the TEOS-10 standard formula (using the Gibbs-SeaWater Oceanographic Toolbox)32 based on the standard modeled output variables (temperature, salinity, and pressure). We then compute the simulated ray arrival times for various source-receiver configurations distributed in a selected vertical slice of this ocean model using the bellhop ray tracing program.33 The DeSoto Canyon area was selected as it includes significant density gradients resulting from the juxtaposition of relatively warm and salty waters advected from the equatorial Atlantic by the Loop Current, as well as the fresh riverine input by the Mississippi River System. Together, these density gradients contribute to the rich submesoscale field of fronts and eddies observed in all seasons.34 Furthermore, the submesoscale circulation of this area has been well documented by drifter deployments following the Deepwater Horizon disaster (e.g., Refs. 35 and 36) and several modeling studies (e.g., including Refs. 34 and 37 and references therein).

In summary, the main objective of this study is to explore the feasibility of OAT methods to estimate the variability of sound speed in the upper ocean over short ranges (< 5 km) in the presence of energetic submesoscale ocean dynamics. A regional oceanographic simulation of the DeSoto Canyon circulation in the Gulf of Mexico is used to generate the 3D SSPs variations that are then used in the OAT simulations. The theoretical foundations for the various OAT formulations are presented in a consistent framework, allowing one to compare the estimated SSP values obtained through these OAT calculations for different source-receiver configurations and types of constraints. Therefore, the results of this study could be used to assist in the planning and design of future OAT experiments for comparable scenarios.

The remainder of this article is organized as follows. In Sec. II, the environmental parameters and source-receiver configurations used for numerical ray-tracing simulations and OAT inversion schemes are presented. Section III introduces the theoretical formalism and numerical implementation of the classical ray-based OAT method, as well as the iterative OAT and adaptive OAT variations. In Sec. IV, the OAT results are presented for the various selected configurations of environmental and acoustic sensors. Finally, a summary and discussion of the main results are given in Sec. V and closing remarks in Sec. VI conclude the study.

The DeSoto Canyon [see Fig. 1(a)] is located in the northern Gulf of Mexico and exhibits highly dynamical mesoscale and submesoscale variability throughout the year due to the proximity of relatively warm and salty waters advected by the Loop Current and associated eddies into the Gulf from the tropical Atlantic and the fresh and abundant riverine input by the Mississippi River System.30,31,34 Notable SSP variability is found in the tactical upper ocean (i.e., depths < 500 m) where the submesoscale variability is greatest, especially nearby mesoscale eddies. This variability of SSP can affect underwater sound propagation in multiple ways,9–12 and its estimation using OAT will be the focus of this study.

FIG. 1.

(Color online) Gulf of Mexico oceanographic environment. (a) Bathymetry of the DeSoto Canyon region. The dashed rectangle shows the area of the simulation data set, and the short horizontal red line represents the total range extent of the ten consecutive ocean slices used in the study. (b) Predicted time averaged sound speed map (over 1 month) for all ten slices [see Eq. (2)]. (c) Example of predicted sound speed anomaly map derived from the difference between the mean sound speed map and a sound speed map chosen at an arbitrary time instance [see Eq. (13)]. (d) Mean sound speed profile (averaged across range and time) for the tenth slice (used for the OAT inversion in this study) and its corresponding standard deviation error bars.

FIG. 1.

(Color online) Gulf of Mexico oceanographic environment. (a) Bathymetry of the DeSoto Canyon region. The dashed rectangle shows the area of the simulation data set, and the short horizontal red line represents the total range extent of the ten consecutive ocean slices used in the study. (b) Predicted time averaged sound speed map (over 1 month) for all ten slices [see Eq. (2)]. (c) Example of predicted sound speed anomaly map derived from the difference between the mean sound speed map and a sound speed map chosen at an arbitrary time instance [see Eq. (13)]. (d) Mean sound speed profile (averaged across range and time) for the tenth slice (used for the OAT inversion in this study) and its corresponding standard deviation error bars.

Close modal

To this end, the ocean circulation in the DeSoto Canyon was modeled in the area shown in Fig. 1(a) (dashed box) using the Regional Ocean Modeling System (ROMS) in its Coastal and Regional Ocean Community model version (CROCO).38 CROCO is a primitive-equation, free-surface, 3D terrain-following, split-explicit model and is configured, for this study, at the submesoscale permitting horizontal resolution of 0.5 km and 231 vertical levels with greater vertical resolution near the surface to capture the large density changes in the upper water column. The simulation is nested on a multi-year run at 1 km horizontal resolution covering the whole Gulf of Mexico, which was validated and described in further detail in previous studies (e.g., Ref. 39) The selected vertical resolution, while limited for a submesoscale permitting simulation, is comparable to what is commonly used by mesoscale permitting or resolving hindcast and forecast products, such as HYCOM.40 The nested simulation covers 42 days starting on February 3, 2015. The first twelve days are discarded as a spin-up period, and the remaining winter days are used in this study. The modeled temperature, salinity and density fields are saved every 30 min and used to compute the 3D SSPs distributions using the TEOS-10 standard formula,32 creating T = 1439 time frames of sound speed environments.

To test the OAT methods, we selected a specific location in the southeast corner of the computational model, indicated by the red line in Figs. 1(a)–1(c). This location has been chosen partly because of the locally flat bathymetry that reduces potential terrain effects and partly because it is crossed by a diversity of mesoscale eddies and submesoscale features during the time window of interest. For simplicity, hereafter only the 2D sound propagation is modeled, and three-dimensional effects such as horizontal refraction effects of the propagating acoustic rays are neglected.1 Therefore, the OAT inversions are performed only in 2D vertical ocean slices located at a constant latitude (27.5017°N) along the red line marked in Figs. 1(a) and 1(b). Specifically, ten 2D vertical slices of equal horizontal length (5 km), each separated by a distance of 0.5 km (equivalent to one range resolution cell of the ocean circulation model), were arbitrarily selected. They all had (nearly) the same reference SSP averaged over one month, as shown in Figs. 1(b) and 1(d), with a profile typical of a deep-water environment with an average SOFAR (sound fixing and ranging) channel axis centered at  730 m. For all the ten vertical slices, most of the variability of the SSP, referred to as SSP anomalies in the context of the inverse problem of the OAT, is driven by submesoscale ocean characteristics and occurs in the upper ocean as expected [Fig. 1(c)]. In the remainder of this study, the first nine vertical slices provide only prior knowledge of the expected variability of SSP and are used to construct a database of SSP which, in turn, is used as a basis of empirical orthogonal functions ( EOFs ) for dimensionality reduction and regularization of the OAT problem, as described in Sec. III. The last 2D vertical slice centered at a longitude of 86.3366° W is used to perform the OAT experiments. SSP discretization at depth, represented by the depth index zi for the ith layer, is finer in the upper region with a grid size of 2 m from the surface to a depth of 100 m, followed by a grid size of 5 m from 105 to 1000 m, thus with a total of Nz = 231 points in depth for the discretized domain. The horizontal discretization of the SSP, represented by the index rj is 500 m, where j = 1,…,Nr and Nr = 11. In the remainder of this study, the 2D discretized SSPs that cover this selected 2D slice are called sound speed maps (SSM). Given the selected vertical and horizontal discretization, the SSM to be estimated by OAT is composed of J = N z × N r = 2541 independent pixels, introducing the vectorized pixel index j [ 1 , J ] representing all concatenated values of the 2D SSM.

The SSM ( z i , r j ; t k ) in this dynamic range-dependent environment is thus a function of three variables: discretized depth zi, discretized range rj, and the discrete time step tk used for ocean circulation simulations. For the sake of conciseness, the discretized subscripts will be removed later and the spatiotemporal dependence of the discretized SSM will simply be denoted as SSM ( z , r ; t ) . To visualize the corresponding variability of SSM, a range-averaged anomaly δSSM ( z ; t ) is computed for each time instant t in the 30-day simulation as the average of all Nr = 11 discrete ranges of the difference between the current SSM at time t and the monthly averaged reference SSM for this vertical slice. Figure 2(a) illustrates the temporal evolution of this range-averaged anomaly δSSM ( z ; t ) : for example, for time 200 h < t < 300  h, the enhanced variability corresponds to an eddy that crosses the upper part of the water column (z < 200 m) during this time. To assess the performance of OAT in estimating SSP anomalies driven by submesoscale variability, visual inspection selected three different 7-daytime intervals during the month-long simulation to represent time windows with increasing overall magnitude of δSSM ( z ; t ) anomalies. They are hereafter referred to as scenarios A, B, and C (see Fig. 2). Each of these three time windows was further divided into two parts: a 2-day training window (marked with subscript t) followed by a 5-day estimation window (marked with subscript e). This terminology is inspired by the development of machine learning methods for OAT.21 In this study, for each scenario A–C, the SSMs corresponding to the training window are only used to construct a EOF basis (i.e., those training SSMs constitute prior knowledge of the expected variability of the SSM in the area). Furthermore, the actual SSMs from the estimation window are used to provide a ground truth for the estimated SSMs from OAT over this estimation window, as described in Sec. III. In general, these three scenarios exemplify different types of submesoscale features with various strengths and temporal evolution. It should be expected that the OAT method performs best when the variability of the SSM between the training window and the estimate window is the least. For example, scenario A is characterized by a training period that captures the spatiotemporal variability of the SSM which is comparable to the apparent variability of the SSM (primarily within [–1, 2] m/s) during the subsequent estimation period. However, the training period in scenario B appears to be slightly less representative of the SSM anomalies to estimate, which are within [–1, 3] m/s. Finally, scenario C shows the highest variability of SSM and thus the largest mismatch of SSM between the training and estimation periods, with higher anomalies in sound velocities within [–2.5, 4.5] m/s. This mismatch between training and estimation SSMs can be further quantified by computing the reconstruction error δ c ̃ root mean square error (RMSE) as shown in the next section and defined in more detail by Eq. (17).

FIG. 2.

(Color online) (a) Temporal evolution of the range-averaged depth-dependent sound speed anomaly δ SSM ( z ; t ) for the test slice shown in Fig. 1(b). Scenarios A, B, and C are divided into training periods (respectively, At, Bt, Ct) and estimation windows (respectively, Ae, Be, Ce). (b) Example of Arrival times variations for the eigenrays corresponding to direct path and surface bounce path measurements evolution for a source and receiver located along the test slice (i.e., a horizontal range separation of 5 km) at respective depths 62 and 792 m.

FIG. 2.

(Color online) (a) Temporal evolution of the range-averaged depth-dependent sound speed anomaly δ SSM ( z ; t ) for the test slice shown in Fig. 1(b). Scenarios A, B, and C are divided into training periods (respectively, At, Bt, Ct) and estimation windows (respectively, Ae, Be, Ce). (b) Example of Arrival times variations for the eigenrays corresponding to direct path and surface bounce path measurements evolution for a source and receiver located along the test slice (i.e., a horizontal range separation of 5 km) at respective depths 62 and 792 m.

Close modal

The performance of OAT for the selected environment is investigated considering two main types of source-receiver configurations. The first setup shown in Fig. 3(a) is composed of a 20 source-element vertical line array (VLA) and a 20 receiver-element VLA with identical geometry, both spanning from 10 to 1000 m deep with an inter-element spacing of 52.1 m. This set-up represents a dense acoustic sensing geometry which provides a relatively dense ray coverage of the SSM pixels [see Fig. 10(a)] and is thus expected to yield the most accurate OAT estimates. The simulated ray arrival times for this selected source-receiver configuration are then computed using the Bellhop ray tracing program.33 For each source-receiver pair, only the first two arrival times (direct and surface bounce paths) are used for the OAT inversion hereafter [Fig. 2(b)]. Other multipath arrivals (e.g., bottom bounces) are purposefully ignored, as in practice those later arrivals carry additional uncertainty (e.g., spans the water column from 245 to 695 m and its elements are spaced 50 m apart due to the typical uncertainty of most bathymetry databases) which exceeds the accuracy requirements for OAT. Hence, for this first setup, a maximum number of I = 800 arrival time measurements is possible. The second setup used in this study is a non-ideal—for OAT purposes—sparse distributed geometry [Fig. 3(b)] composed of three shallow sources positioned at 114, 322, and 531 m and a 10 receiver element VLA positioned from 62 to 1000 m with a constant 104 m element spacing distance, yielding a potential number of I = 60 arrival time measurements. This second configuration represents a more realistic and sparse acoustic setup, which could potentially be implemented in a synthetic fashion using few profiling sources and receivers. The adaptive methods presented in Sec. III use this second setup as an initial state, but then consider the successive addition of extra VLAs composed of identical transponder geometry (i.e., where each element can act as both a source or receiver in a round-robin fashion17) to sequentially augment the total number of arrival-time measurements for OAT acoustic. Each additional VLA provides 10 new sources and receivers spanning depths from 245 to 695 m with a 50 m element spacing distance. Hereafter we will consider up to three additional VLAs placed, respectively, at ranges r = 1500, 2500, and 3500 m. These extra VLAs could be implemented in practice by using a profiling transponder and relying on a “frozen ocean” assumption during the time required to collect the acoustics measurements between all the selected source-receiver pairs by profiling accordingly along the specified VLA aperture.

FIG. 3.

(Color online) Acoustic setups. (a) The Dense acoustic sensing geometry uses 20 sources × 20 receivers configuration. The 20 sources and receivers are located along two vertical line arrays and are equally spaced from 10 to 1000 m. (b) The Distributed sparse acoustic geometry uses initially 3 sources × 10 receivers configuration. Three identical VLAs of 10 elements (source and receivers) are successively added at ranges r = 1500 m, r = 2500 m, and r = 3500 m. Each additional VLA (orange dots) used for the adaptive OAT method (see Sec. III).

FIG. 3.

(Color online) Acoustic setups. (a) The Dense acoustic sensing geometry uses 20 sources × 20 receivers configuration. The 20 sources and receivers are located along two vertical line arrays and are equally spaced from 10 to 1000 m. (b) The Distributed sparse acoustic geometry uses initially 3 sources × 10 receivers configuration. Three identical VLAs of 10 elements (source and receivers) are successively added at ranges r = 1500 m, r = 2500 m, and r = 3500 m. Each additional VLA (orange dots) used for the adaptive OAT method (see Sec. III).

Close modal

The primary focus of this study is to evaluate the performance of ray-based OAT methods to estimate submesoscale SSM variability using some acoustic sensing geometries shown in Fig. 3. However, the OAT estimates shown in Sec. IV represent only lower-bound errors for the various types of submesoscale variability present in the selected environment. In this initial performance study, a wide variety of error sources are not modeled, but they would need to be taken into account to fully quantify the uncertainty of OAT estimates, as extensively discussed in the literature.7,14 Such errors would notably be caused, among others, by transducer positioning errors and unaccounted motion (e.g., array tilt due to tides or currents), imperfect time synchronization between the various source-receiver pairs, the impact of noise and available acoustic bandwidth on the accuracy of the measured ray travel times, and variability at scales smaller than those simulated by CROCO.

The OAT problem is formulated using a 2D ray-based method to estimate the highly variable submesoscale features of the simulated SSMs. To do so, the 2D discretized SSM (Fig. 2) is vectorized into a vector c of dimensions [1 ×J] pixels (J = 2541). For each time t and pixel j of a given SSM, the corresponding sound speed value c t , j can be decomposed into the sum of three terms: (1) a constant, the depth-averaged mean sound speed value c 0 avg , (2) a depth-dependent offset Δ c 0 , j from this mean value c 0 avg , and (3) a time-dependent local sound speed anomaly δ c t , j ,7 
(1)
The first two terms in Eq. (1) c 0 , j = c 0 avg + Δ c 0 , j represents the entries for pixel j of the selected vectorized reference SSM c 0 about which the linearization procedure is performed. The local sound speed anomaly δ c t , j = c t , j c 0 , j is the quantity of interest to be estimated by OAT. The time-averaged local sound speed value c ¯ j over the averaging time T given by
(2)
Similarly, the arrival time information is given by a vector τ of i = 1 : I measurements. Each arrival-time measurement (including both direct and surface bounce paths) indexed by i, linking a specific source/receiver pair at a specific time t, can be written as the sum of a reference arrival time τ 0 , i (computed for the reference SSM c 0 ), and a time-dependent fluctuation value δ τ t , i ,
(3)
A first order expansion of Eq. (3) leads to the following relationship:7 
(4)
where l 0 i , j represents the distance traveled by ray # i while crossing the SSM pixel # j . This first-order expansion can be further expressed in a matrix formulation to represent the classic ray-based OAT problem formulation
(5)
where δ τ is the [ I × 1 ] vector of simulated arrival time fluctuations, the matrix E = l 0 i , j / ( c 0 avg ) 2 is the [ I × J ] linearized forward model, and δ c is the [ J × 1 ] vectorized map of sound speed anomalies (i.e., the SSM variability of interest) to be estimated by OAT.
Finally, for the acoustic sensing configurations shown in Fig. 3, the OAT inverse problem is underdetermined (i.e., I < J). Therefore, the OAT inversion is first regularized by introducing a EOF decomposition of the SSM c t ,
(6)
where Φ k is the basis vector kth EOF (and corresponds to the kth column of the formulation of the EOF basis matrix Φ ), α k , t is the corresponding coefficient (and corresponds to the kth entry of the vector of the EOF coefficient α t ) at time t, and c Φ ¯ is the average background SSM value of the set of SSMs used to compute the basis of the EOF. Note that in this study, the background SSM c Φ ¯ represents the SSM averaged over the training set over time and differs from the reference SSM c 0 , used for the linearization procedure in Eqs. (1)–(5) which is chosen to be the last SSM of the training set (that is, the most recent available SSM before the estimation period starts). It was found that choosing the reference SSM c 0 in this manner improved the linearization assumption and, therefore, the OAT estimates. This is hypothesized to occur in this highly dynamic environment, since the reference SSM c 0 matched the expected variability of the SSM at the start of the estimation period (details not shown for the sake of conciseness).

In general, the accuracy of the estimated SSMs from OAT depends on the selected parameterization of the simulated SSMs and, in particular, the number of time samples Nt (e.g., the number of days) used to form the aforementioned training set (which forms the EOF basis), as well as the number n Φ of EOF coefficients, which are ultimately kept to perform the expansion given by Eq. (6). Various empirical metrics can be used to select the optimal parameters Nt and n Φ . For this study, these parameters were selected to minimize, in the test window, the error of the OAT estimates obtained using the methodology presented in Secs. III B and III C. Using this approach, it was found that an adequate basis EOF could be calculated using only a 2-day training window (that is, Nt = 48 samples for each of the 10 slices shown in Fig. 1) for each scenario A–C (see Fig. 2). Furthermore, the following values for the parameter n Φ for scenarios A, B, and C of, respectively, 27, 10, and 17 were found to be adequate to represent the variability of the sound speed for the selected environment, as well as to minimize, on average, the errors of the OAT estimations. However, it should be noted that the selected values for the parameters Nt and n Φ generally depend on the environmental variability of interest. Furthermore, Fig. 4 illustrates in a simple way how changing either the parameter Nt or the parameter n Φ can influence the parameterization of the sound speed variability for this study. For example, while keeping the duration of the training window at two days, Fig. 4(a) shows the variations of the magnitude of the singular values of the EOF basis matrix Φ for the three scenarios A–C. The rapid decay of the magnitude of singular values confirms that most of the variability in SSM is well captured using less than 30 EOF coefficients for the three scenarios A–C, and thus the selected value of the parameters n Φ mentioned above is consistent with the variability exhibited by scenarios A–C. Further discussion of Fig. 4 is given in the results in Sec. IV after having defined the OAT estimation methodology and the associated error metrics in Secs. III B and III C. Finally, it can be noted that alternate parameterizations of sound speed fluctuations have been proposed in the literature, notably using a set of horizontal and vertical fixed basis functions.10,20,26 A potential benefit to this later approach may be the ability to parameterize a larger set of SSMs, which may otherwise not be accurately described using Eq. (6). An in-depth analysis of the effect of the selected parameterization of the sound speed fluctuations on the performance of the OAT methods when estimating submesoscale variability in the upper ocean requires further investigation in future studies.

FIG. 4.

(Color online) EOF decomposition and reconstruction errors. (a) Variations of the magnitude of the singular values of the EOF basis matrix Φ for the three scenarios A-C. (b) Temporal evolution of the reconstruction error δ c ̃ t  RMSE [see Eq. (17)] for the most variable scenario C for increasing the duration of the training period from 1 to 15 days (while keeping the parameter of the number of EOF coefficients parameter n Φ = 17 ). (c) and (d) Spatial variations of the reconstruction error [see Eq. (16)] for both scenarios A (c) and C (d) for the last time index of the test period for each scenario.

FIG. 4.

(Color online) EOF decomposition and reconstruction errors. (a) Variations of the magnitude of the singular values of the EOF basis matrix Φ for the three scenarios A-C. (b) Temporal evolution of the reconstruction error δ c ̃ t  RMSE [see Eq. (17)] for the most variable scenario C for increasing the duration of the training period from 1 to 15 days (while keeping the parameter of the number of EOF coefficients parameter n Φ = 17 ). (c) and (d) Spatial variations of the reconstruction error [see Eq. (16)] for both scenarios A (c) and C (d) for the last time index of the test period for each scenario.

Close modal
All OAT estimators used in this study are based on the standard least-square minimization. Let c ̂ be the estimated SSM and δ c ̂ the estimated anomaly SSM. The first class of estimators is single objective, which includes a l 2 penalty on δ c ̂ (the subscript t is omitted onward for the sake of notation conciseness),
(7)
where λ represents the Lagrange multiplier associated with the regularization constraints on the estimated SSM that are enforced by a regularization matrix . To this end, two different types of regularization matrix will be used in this study. First, the standard Tikhonov regularization (abbreviated as tik. in Sec. IV) is obtained by replacing with the identity matrix I. Second, the covariance regularization21 is obtained by replacing with the inverse of the covariance matrix Σ 1 of the EOF basis as described in Eq. (6). Specifically, Σ 1 is calculated from the L SSM matrices that makeup the SSM training set such that Σ 1 = [ 1 / ( L 1 ) . ( Y T Y ) ] 1 , where Y (of dimensions [L × J]) is formed by stacking the L vectorized sound speed anomaly maps of J pixels (that is, after removing the background vectorized SSM c Φ ¯ ) of the training set. Therefore, the covariance regularization (abbreviated as cov. in Sec. IV) penalizes each EOF coefficient differently, in contrast to the standard Tikhonov regularization.
The derivations for the least squares solution to Eq. (7) leads to the following closed form for the OAT estimator (see details in the  Appendix):
(8)
where ( T ) denotes the transpose operator. The subscript Φ indicates hereafter the projection of a given matrix X in the EOF basis space, that is, X Φ = X Φ . The specific formulation of Eq. (8) for the Tikhonov regularization (abbreviated as tik. in Sec. IV) is obtained by replacing Φ with I Φ in Eq. (8),
(9)
The covariance regularization is obtained by replacing Φ with Σ Φ 1 in Eq. (8),21 
(10)
Additionally, multi-objective estimators for the OAT inverse problem can also be defined to account for additional constraints (e.g., to account for a priori information on the SSM values at certain locations) by simply adding additional loss—or penalty—terms to the original loss function given by Eq. (7).41,42 For instance, measurements of the actual (i.e., true) sound speed values corresponding to specific pixels values of the SSM c true could be readily available at any of the acoustic sources and receivers locations also during the estimation period over which OAT is conducted, assuming, for instance, that each acoustic transponder is co-located with a conductivity temperature and depth (CTD) device. In this case, the generalized multi-objective loss-function can be formulated as
(11)
where U is a [ J × J ] diagonal matrix whose non-zero entries correspond to the specific indices of the SSM pixels whose values are constrained by the corresponding Nc pixels of the true SSM c true . For the underdetermined OAT inversion performed in this study at most Nc = I pixels can be constrained since the number of acoustic rays I is less than the total number J of SSM pixels. The least squares solution to Eq. (11) yields the so-called multiobjective OAT estimator,
(12)
Details of the derivation can be found in the  Appendix, as well as a comparison with a slightly different formulation of the multi-objective loss-function which does not include the second normalization term ( c 0 c Φ ¯ ) in the second penalty constraint in the multi-objective loss-function given by Eq. (11). This general multi-objective solution is hereafter referred to as a constrained OAT estimator. Furthermore, the specific constrained Tikhonov estimator (denoted const. tik. in the numerical results Sec. IV) is obtained by replacing Φ with I Φ , in Eq. (12). Similarly, the specific constrained covariance estimator (further denoted const. cov.) is obtained by replacing Φ with the inverse of the covariance matrix Σ Φ 1 [which is the same covariance matrix defined in Eq. (12)].

Finally, we practically implement the solutions given by Eqs. (8) or (12), the optimal values for the penalty parameter λ (or λ1, λ2 for the multi-objective case) as well as the number of coefficients n Φ for the EOF basis must be selected. In this study, for each of the three tested scenarios A–C, these parameters were selected as the optimal ones that minimize the RMSE of the estimation error made by the OAT, as defined in the Eq. (15), over the whole estimation window. Those optimal parameters were found to also minimize the error in residual ray travel times computed from ray tracing using either the a priori known SSM or the estimated SSM, thus confirming that the selected inversion parameters could also be determined in this alternative manner.43,44

1. Definitions of sound speed estimation errors

The true sound-speed anomaly δ c t , j = c t , j c 0 , j is the quantity of interest to be estimated in this study [see Eq. (1)], where c 0 , j represent the reference (average) vectorized SSM and c t , j represented the true vectorized SSM.

OAT only provides an estimate of the true SSM c ̂ t , j based on the formulations of the selected OAT estimates given by Eqs. (8) or (12). Hence, the estimated sound-speed anomaly δ c ̂ t , j is simply
(13)
Consequently, the error effectively made by the OAT method is the difference between the estimated sound speed values and true sound speed values that is c ̂ t , j c t , j = δ c ̂ t , j δ c t , j .
The true sound speed anomaly δ c t , j and the estimated sound speed anomaly δ c ̂ t , j are each a function of both the spatial index j of the vectorized SSM and the discrete time t used for the ocean circulation simulations. In order to ease the visual representations of the root mean square (RMS) errors associated with either of these sound speed anomalies, a set of error metrics are defined. The overall notation convention used in this paper is that for each defined RMS errors, the subscript index (e.g., t) denotes the physical variable which is kept as a free variable while the other physical variables (e.g., the vectorized pixel j) is averaged over its respective number of samples (e.g., here J = 2541). For instance, the spatially averaged and time-dependent RMS of the true sound speed anomaly maps δ c t , j , for a time instance t, is given by
(14)
Similarly, the spatially averaged and time-dependent RMS error value of the estimation error made by the OAT method is given by
(15)
When the OAT estimator is accurate, one would expect the error given by Eq. (15) to be lower than the initial error δ c t   RMSE [see Eq. (14)] which uses only the a priori known SSM c 0 used for the linearization procedure and which coincides—for each scenario A–C—with the last SSM of the training set just before the estimation period starts (see Fig. 2).
In addition, in a similar fashion, the difference δ c ̃ t (i.e., the SSM anomaly reconstruction) between the true SSM c t at time t and its projection on the basis of the EOF Φ [Eq. (6)] is given by, in vector notation,
(16)
Equation (16) is referred to hereafter as the reconstruction error and defines the lower bound of the achievable error for the OAT inversion. Since the estimators are based on truncated series of Φ , the magnitude of the reconstruction error is always greater than zero. Hence, similarly to Eq. (14), the spatially averaged and time-dependent RMSE value of this reconstruction error δ c ̃ t is given by
(17)
Finally, another metric adopted in Sec. IV is the average error margin Δ ϵ , which is the time-averaged error margin between the estimated δ c ̂ t  RMSE [Eq. (15)] and the reconstruction δ c ̃ t  RMSE [Eq. (17)]. This indicator quantifies the accuracy of the estimated SSM from OAT with respect to the lower bound error corresponding to the SSM reconstruction error for the selected EOF basis,
(18)

2. Definitions of arrival-time errors

The spatially averaged (over all I rays) and time-dependent RMSE of the arrival time fluctuations RMSE can be computed between the reference arrival time vector τ 0 (computed using the reference SSM c 0 ) and the true arrival time vector τ t (computed using the true c t )—as defined in Eq. (3)—such that
(19)
Additionally, for a given estimated SSM anomaly δ c ̂ [see Eq. (15)], the corresponding forward estimated arrival time fluctuation δ τ ̂ is given by [similarly to Eq. (5)]
(20)
Hence, the spatially averaged (over all I rays) and time-dependent RMSE of the forward estimated arrival time fluctuation δ τ ̂ t [see Eq. (20)] with respect to the true arrival time fluctuation δ τ t [see Eq. (19)] is given by
(21)

3. Additional error metrics for the estimated sound velocities from OAT

When written in terms of actual range-depth pixels of the SSM, instead of vectorized pixels, both the true sound speed anomaly δ c t , z , r and the estimated sound speed anomaly δ c ̂ t , z , r are each functions of three variables: discretized depth z, discretized range r, and the discrete time t used for the ocean circulation simulations. Hence, additional granularity for quantifying the spatial variability of the RMSE associated with which each OAT estimates can be obtained, for instance, by performing a spatial averaging across depth or range only instead of overall pixels as shown in Eqs. (14) or (15) or averaging over time only. For the sake of completeness, all additional error variables used in Sec. IV are individually defined below. The general notation convention used in this paper is that for each defined RMS error, the subscript indices ( t , r or z ) denote which of the physical variables is kept as a free variable while all other physical variables are being averaged over their respective total number of samples (i.e., T  = 240 , Nr = 11, or Nz = 141).

For example, the time-dependent and depth-dependent sound speed (for each time index t = [ 1 , T ] and for each depth index z = [ 1 , N z ] ) RMS of the true sound speed anomaly δ c t can be obtained simply by averaging across the discretized ranges
(22)
By further averaging the previous variable δ c t , z   RMS across all time instances t, the time-averaged and depth-dependent sound speed anomalies RMS is then obtained,
(23)
Similarly, the depth-dependent RMS estimated anomaly error δ c ̂ t , p  RMSE (for each time index t = [ 1 , T ] and for each depth index z = [ 1 , N z ] ) are given by
(24)
Finally, this variable δ c ̂ t , z   RMSE can be further averaged over time t to obtain the time-averaged and depth-dependent estimated sound velocity RMSE from OAT,
(25)

Next, we discuss the implementation of iterative OAT formulations, for which the reference state for the OAT inversion is iteratively updated based on the previously estimated solution, as well as adaptive OAT formulations, in which the sources and receivers configuration can evolve through time or space in between each successive iteration, for the environmental and sensing configurations previously discussed (Figs. 1–3).

The iterative OAT method24 aims at improving the accuracy of SSM estimates, especially when the reference state (that is, the initial vectorized SSM map c 0 ) significantly differs from the true SSM to be estimated. The procedure is inspired by the Levenberg-Marquardt algorithm,45 which approximates the solution of a least-square problem by iteratively linearizing the forward model based on the previous solution found. Several studies have investigated the iterative OAT method in the context of long-range scenarios,24–27 and have shown that the iterative OAT scheme generally improves the accuracy of the SSM. Here, we apply the same procedure to assess the accuracy of the estimated SSM in the presence of submesoscale variability over short propagation ranges. Specifically, the iterative OAT method first initializes the inverse problem variables with the reference state { τ 0 , E 0 , c 0 } associated with the iteration index n = 0, and then produces an estimated SSM c ̂ n = 1 = c ̂ based on one inversion using any of the OAT methods described in Sec. III. For each subsequent iteration n 1 , the algorithm updates this reference state (see Fig. 5) by using the estimated SSM c ̂ n obtained from the previous iteration as a reference SSM and computes an updated set of ray arrival times τ n between the selected source-receivers pairs as well as the corresponding linearized forward model E n [using Eq. (5)]. Additionally, at each iteration both the optimal values for the penalty parameter λ (or λ1, λ2 for the multi-objective case) as defined by Eq. (8) or (12), and the number of coefficients n Φ selected for the EOF basis can be updated.

FIG. 5.

(Color online) Flowchart of the iterative and adaptive OAT formulations, where n is the iteration index. (a) Iterated model OAT method. The OAT variables are updated with respect to the inverse mapping solution c ̂ , providing updates for the sound speed map, the linearized forward model and arrival times. (b) Adaptive setup OAT method. The OAT variables are updated similarly to the iterated scheme with additional source-receivers arrival times and sound speed constraints within the SSM.

FIG. 5.

(Color online) Flowchart of the iterative and adaptive OAT formulations, where n is the iteration index. (a) Iterated model OAT method. The OAT variables are updated with respect to the inverse mapping solution c ̂ , providing updates for the sound speed map, the linearized forward model and arrival times. (b) Adaptive setup OAT method. The OAT variables are updated similarly to the iterated scheme with additional source-receivers arrival times and sound speed constraints within the SSM.

Close modal

The algorithm for the adaptive OAT method is a generalization of the one for the iterative OAT method, in which the acoustic sensing setup (that is, the specific location and number of selected source-receiver pairs) is allowed to change in between successive iterations. In this study, the adaptive OAT method is implemented using the sensor geometry shown in Fig. 3(b): for the initial estimation (n = 0) the acoustic sensing setup only uses a sparse combination of three shallow sources (positioned at 114, 322, and 531 m) and a 10 receiver-element VLA (positioned from 62 to 1000 m with a constant spacing distance of 104 m of elements). For each subsequent iteration n 1 , an additional 10 element VLA, where each element can act as both a source or receiver in a round-robin fashion,17 increases the acoustic sensing geometry and therefore improves the effective ray coverage for the OAT inversion. In this study we consider up to three iterations and the VLA added after each iteration has the same geometry but is placed at a different range r = 1500, 2500, 3500 m. Figure 3(b) displays the final sensing geometry for the third (n = 3) OAT iteration which now includes all three additional VLAs.

In the most general case, these added VLAs for the adaptive method can be added at an arbitrary location across each iteration. In practice, those added VLAs, or any desired acoustic transponder configuration, could be placed in regions of the domain that exhibit high SSM anomalies or variability to help better constrain the OAT inversion. However, for this study, we arbitrarily chose to keep the added VLAs at the same locations to facilitate the quantitative comparison of the OAT estimation performance between either the 3 × 10 or 5 × 20 initial sensor configuration, as shown in Secs. IV B and IV C to keep the overall sensing configuration as consistent as possible.

Potential improvements in estimated SSM for a certain time index t as a function of the iteration index n can be quantified using the following formula for the inverse mapping error:
(26)
where the estimated SSM at the nth iteration is given by c ̂ t n and the true SSM is c t . Finally, the time-averaged inverse mapping error δ c ̂ ¯ | n , used as a metric in Sec. IV to quantify the overall improvement in estimation in the nth iteration is obtained by averaging the previous expression given by Eq. (26),
(27)

Figure 6 compares the accuracy of the various linear estimators given in Sec. III B when applied to scenarios A–C [see Fig. 2(a)] using the dense acoustic sensing setup of 20 sources and receivers [see Fig. 3(a)]. As explained at the end of Sec. III A, the estimators' hyperparameters n Φ and λ (equivalent to λ1 and λ2) are chosen such that the inverse RMSE for the fully constrained adaptive method is minimized over the estimation window. Therefore, the following values of the parameter n Φ for scenarios A, B, and C were selected, respectively, as 27, 10, and 17. Similarly, the values of the selected values for the hyperparameters λ1 and λ2, for scenarios A, B, and C were selected as [ 10 9 ; 10 8 ], [ 10 9 ; 10 13 ], and [ 10 10 ; 10 9 ]. Figures in the first column [Figs. 6(a), 6(e), and 6(i)] show the temporal evolution of the range-averaged SSM anomalies δ c t , p [see Eq. (22)] for each scenario. The figures in the second column [Figs. 6(b), 6(f), and 6(j)] show the equivalent estimations δ c ̂ t , p obtained using the constrained Tikhonov estimator [see Eq. (10)]. The third column figures [Figs. 6(c), 6(g), and 6(k)] present the depth-dependent root mean square anomaly [see Eq. (23)] (green lines) and the estimation δ c ̂ p  RMSE [see Eq. (25)] for the different estimators (tik, cov, const. tik, and const. cov) for each method (other colored lines). Finally, the fourth column figures [Figs. 6(d), 6(h), and 6(l)] detail the anomaly time evolution RMS [see Eq. (14)] (green lines) and the associated estimation δ c ̂ t  RMSE for all estimators [see Eq. (15)] (other colored lines).

FIG. 6.

(Color online) Classical OAT methods results for scenarios A, B, and C. (a), (e), and (i) Temporal evolution of the range-averaged ground truth SSM anomaly δ c t , z [see Eq. (22)]. (b), (f), and (j) Temporal evolution of the estimated const. tik. SSM anomaly δ c ̂ t , z [see Eqs. (22) and (12)]. (c), (g), and (k) Time- and range-averaged RMSE for ground truth anomalies δ c z [Eq. (23), green line] and estimated anomalies δ c ̂ z [Eq. (25), in other color lines]. (d), (h), and (l) Temporal evolutions of the RMSE for the ground truth anomalies δ c t [Eq. (14), green line] and estimated anomalies δ c ̂ t [Eq. (15), in other color lines] and for reconstruction errors δ c ̃ t  RMSE [Eq. (17), black line]. Estimators legend: The following color scheme is used for each of the four types of computed OAT estimates (as defined in Sec. III B) tik. (blue continuous), cov. (continuous red), const. tik. (blue dashed), const. cov. (red dashed).

FIG. 6.

(Color online) Classical OAT methods results for scenarios A, B, and C. (a), (e), and (i) Temporal evolution of the range-averaged ground truth SSM anomaly δ c t , z [see Eq. (22)]. (b), (f), and (j) Temporal evolution of the estimated const. tik. SSM anomaly δ c ̂ t , z [see Eqs. (22) and (12)]. (c), (g), and (k) Time- and range-averaged RMSE for ground truth anomalies δ c z [Eq. (23), green line] and estimated anomalies δ c ̂ z [Eq. (25), in other color lines]. (d), (h), and (l) Temporal evolutions of the RMSE for the ground truth anomalies δ c t [Eq. (14), green line] and estimated anomalies δ c ̂ t [Eq. (15), in other color lines] and for reconstruction errors δ c ̃ t  RMSE [Eq. (17), black line]. Estimators legend: The following color scheme is used for each of the four types of computed OAT estimates (as defined in Sec. III B) tik. (blue continuous), cov. (continuous red), const. tik. (blue dashed), const. cov. (red dashed).

Close modal

The temporal evolution of the reconstruction error δ c ̃ t  RMSE [as defined in Eq. (17)] during the test periods for scenarios A, B, and C are shown by the black dotted line in Figs. 6(d), 6(h), and 6(l). Overall, the average values of δ c ̃ t  RMSE across all estimation periods ( t = 1 : T ) for scenarios A, B, and C were found to be 0.01, 0.14, and 0.41 m/s, respectively. Those RMSE values confirm that scenarios A, B, and C are expected to have an increased level of SSM mismatch between the training and estimation periods, which leads to a higher estimation error for the OAT inverse problem. As a point of comparison, Fig. 4(b) shows the temporal evolution of the reconstruction error δ c ̃ t  RMSE [see Eq. (17)] for the most variable scenario C for increasing duration of the training set from 1 to 15 days (while keeping the parameter of the number of EOF coefficients parameter n Φ = 17 ). The displayed reconstruction error values indicate that changing the duration of the training period has little effect on the parameterization of the sound speed variability, even for the highly variable scenario C, for the first two-thirds of the test period. As expected, a gradual increase in the reconstruction error still occurs, and it can be noted that for the last third of the test period, using a longer training period (or more than 10 days) could slightly lower the reconstruction error. However, for convenience, the duration of the training period is kept constant to two days for all three scenarios A–C. Furthermore, Fig. 4 shows, as an illustration, the spatial variations of the reconstruction error [see Eq. (16)] for both scenarios A (c) and C (d) at the end of the test period (which represents the worst-case scenario), as the EOF basis obtained from the prior 2-day training period is least likely to be accurate at the end of the 5-day test period.

Based on the hyperparameters listed above, the averaged value of the OAT estimation errors increases as expected from scenarios A ( 0.3  m/s), to B (< 0.5 m/s) and C (> 0.5 m/s) proportionally to the increase in amplitude of the actual SSM anomalies across these three scenarios. The results for each scenario and the selected linear OAT estimator are summarized in Table I. It can be seen that for both the unconstrained [Eq. (8)] and constrained OAT estimators [Eq. (12)], using either the Tikhonov-based regularization or covariance-based regularization yields a nearly identical performance. Hence, in the remainder of this article, only the Tikhonov-based regularization is adopted. Results from this section confirm that most of the submesoscale features occurring in the 1 km upper-ocean channel have been correctly estimated in the ideal acoustic setup of 20 sources and 20 receivers [Fig. 3(a)], which offers a sufficiently dense coverage for the 5 km separation range, when applying the EOF decomposition of the training set as spatial filter on the sound speed anomalies to be estimated. This corroborates the suitability of the OAT methods for this environment, provided that a sufficiently dense ray coverage and prior knowledge of the expected SSM fluctuations are available.

TABLE I.

Time-averaged RMSE for classical OAT methods. Values are listed as functions of the selected scenario (first column) or the type of OAT estimator (second column). The inverse estimated δ c ̂  RMSE [obtained from the time average over the estimation period of the RMSE given by Eq. (15)] are given in the third column. The forward-predicted δ τ ̂  RMSE [obtained from the time average of the RMSE of the arrival time fluctuation calculated from Eq. (21)] are given in the fourth column. The average error margin Δ ϵ is given in the last column [see Eq. (18)].

Method δ c ̂  RMSE (m/s) δ τ ̂  RMSE (s) Δ ϵ (m/s)
Scenario A  Tikhonov  0.29  6.1 × 10−5  0.14 
  Covariance  0.33  6.1 × 10−5  0.17 
  constrained Tikhonov  0.26  6.8 × 10−5  0.16 
  Constrained covariance  0.28  8.9 × 10−5  0.11 
Scenario B  Tikhonov  0.33  2.8 × 10−4  0.33 
  Covariance  0.34  2.8 × 10−4  0.37 
  Constrained Tikhonov  0.32  3.1 × 10−4  0.20 
  constrained covariance  0.32  3.0 × 10−4  0.23 
Scenario C  Tikhonov  0.65  2.6 × 10−4  0.36 
  Covariance  0.73  2.6 × 10−4  0.61 
  constrained Tikhonov  0.56  2.9 × 10−4  0.23 
  Constrained covariance  0.61  2.9 × 10−4  0.31 
Method δ c ̂  RMSE (m/s) δ τ ̂  RMSE (s) Δ ϵ (m/s)
Scenario A  Tikhonov  0.29  6.1 × 10−5  0.14 
  Covariance  0.33  6.1 × 10−5  0.17 
  constrained Tikhonov  0.26  6.8 × 10−5  0.16 
  Constrained covariance  0.28  8.9 × 10−5  0.11 
Scenario B  Tikhonov  0.33  2.8 × 10−4  0.33 
  Covariance  0.34  2.8 × 10−4  0.37 
  Constrained Tikhonov  0.32  3.1 × 10−4  0.20 
  constrained covariance  0.32  3.0 × 10−4  0.23 
Scenario C  Tikhonov  0.65  2.6 × 10−4  0.36 
  Covariance  0.73  2.6 × 10−4  0.61 
  constrained Tikhonov  0.56  2.9 × 10−4  0.23 
  Constrained covariance  0.61  2.9 × 10−4  0.31 

The results for the iterative and adaptive OAT methods (see Fig. 5) are presented for scenarios A and C for two different initial sensor configurations with different sensor densities. The first one is a 3 × 10 sensing configuration [as shown in Fig. 3(b)] which initially offers the lowest ray-path coverage of the inversion domain. The second one is a 5 × 20 configuration, where all 20 available receivers of the dense setup studied in Sec. IV A [as shown on Fig. 3(a)] are kept but only 5 out of the 20 sources are kept, still spanning the whole depth of the inversion domain down to 1000 m. These 5 sources are at specific depths: 10, 218, 479, 740, and 1000 m using. This second configuration of 5 × 20 initially offers an improved ray path coverage of the inversion domain compared to the configuration of 3 × 10.

Each iterative or adaptive OAT method is iterated four times ( n = [ 0 , 1 , 2 , 3 ] , including the initial estimation based on the reference state c 0 for the SSM). For the adaptive OAT method, the VLAs added at each iteration are identical and located as shown in Fig. 3(b) for either the 3 × 10 or 5 × 20 initial sensor configuration. For the iterative OAT method, no additional VLAs are added and only the 5 × 20 configuration or the 3 × 10 configuration is used for all iterations.

The green curves in Figs. 7(a) and 7(f) show the spatially averaged and time-dependent values of the initial RMS SSMs δ c t [see Eq. (14)]. On the same figures, the yellow curves represent the initial estimation errors δ c ̂ t | n = 0  RMSE with respect to the tik. estimator [see Eq. (9)] for the 3 × 10 sensor geometry [see Fig. 3(b)]. The dashed lines represent the final estimation errors after the last iteration δ c ̂ t | n = 3  RMSE, for the iterative method (blue line) and the adaptive method (red line). Figures 7(b) and 7(g) show the ground-truth SSM anomaly δ c t at the end of the 5-day estimation period (t = 696) for both scenarios A and C. Figures 7(c)–7(e) and Figs. 7(h)–7(j) show the inverse mapping error maps δ c ̂ t | n , for the same time t = 696 and n [ 1 , 3 ] [Eq. (26)] for the adaptive OAT method using const. tik. estimator. Note that the final inverse mapping error maps [obtained with Eq. (26) where n = 3] correspond to Figs. 7(e) and 7(j). Overall, these 2D error maps show a gradual improvement in the accuracy of the OAT estimates after each iteration for the selected configuration.

FIG. 7.

(Color online) Iterative and adaptive methods inverse mapping results, with the initial 3 × 10 sensor geometry, for scenarios A (left) and C (right). (a) and (f) Temporal evolution of the initial RMS anomaly δ c t [green line; see Eq. (14)] and the first estimated RMS (n = 0) δ c ̂ t | n = 0  RMSE [yellow line, Eq. (8)] for scenarios A and C, respectively. The last iteration estimations δ c ̂ t | n = 3  RMSE are shown for tik. iterative [dashed blue, Eq. (9)] and const. tik. adaptive [dashed red, Eq. (12)] OAT methods. (b) and (g) Ground-truth anomaly map for the end of the 5-day estimation period δ c t (t = 969). (c)–(e) and (h)–(j) Inverse mapping error maps δ c ̂ t | n [Eq. (26)] for the same time t = 696 and for each iteration ( n [ 1 , 3 ] ), for the adaptive OAT method using the const. tik. estimator.

FIG. 7.

(Color online) Iterative and adaptive methods inverse mapping results, with the initial 3 × 10 sensor geometry, for scenarios A (left) and C (right). (a) and (f) Temporal evolution of the initial RMS anomaly δ c t [green line; see Eq. (14)] and the first estimated RMS (n = 0) δ c ̂ t | n = 0  RMSE [yellow line, Eq. (8)] for scenarios A and C, respectively. The last iteration estimations δ c ̂ t | n = 3  RMSE are shown for tik. iterative [dashed blue, Eq. (9)] and const. tik. adaptive [dashed red, Eq. (12)] OAT methods. (b) and (g) Ground-truth anomaly map for the end of the 5-day estimation period δ c t (t = 969). (c)–(e) and (h)–(j) Inverse mapping error maps δ c ̂ t | n [Eq. (26)] for the same time t = 696 and for each iteration ( n [ 1 , 3 ] ), for the adaptive OAT method using the const. tik. estimator.

Close modal

Interestingly, using the unconstrained OAT estimator [Eq. (8)] for the iterative OAT method (blue dashed lines) yielded slightly better performance than using the constrained OAT estimators [Eq. (12)] (not shown here for the sake of conciseness). On the other hand, the reverse trend was found for the adaptive OAT method (blue dashed lines): that is, the constrained OAT estimators [Eq. (12)], performed best. It is hypothesized that this may be due to a sensing configuration effect, as—by choice for practical considerations—the sound velocity constraints are only imposed where acoustic transponder elements exist using the formulation given by Eq. (12). For the case of the iterative OAT method, since the sensing configuration is fixed across iterations and remains sparse and equal to the initial 3 × 10 geometry, adding constraints only at a few discrete locations on the edges of the inversion domain leads to more estimation error. In contrast, for the adaptive OAT case, the sensing configuration evolves over iterations and spreads over the entire inversion domain as the number of iterations increases [Fig. 3(b)]. Hence, it gradually yields more distributed sets of constrained sound velocities locations across the inversion domain, which improves the accuracy of the estimated SSMs for an increasing number of iterations. Figure 8 further quantifies the influence on the types of constraints provided by the additional sensors of the three added VLAs (introduced after each iteration) on the precision of adaptive OAT estimators for scenario A or C (scenario B is omitted for simplicity). To do so, three different cases were simulated. The first case (blue line) corresponds to added sensors only providing local sound speed values (and no additional acoustic travel-time information); this would correspond in practice to the case of these sensors being only CTD probes and not actual acoustic transponders. The second case (red line) is the reverse, that is, the added sensors only provide additional acoustic travel-time information but no local sound speed values; this would correspond in practice to the case where these added sensors are only acoustic transponders [i.e., by setting the parameter λ 2 = 0 in Eq. (12)]. Finally, the third case (purple line) is the same as the fully constrained adaptive OAT case shown in Fig. 7, that is, when the added sensors provide both additional information on the acoustic travel time and the local sound speed values. As expected, Fig. 8 for scenarios A and C indicates the precision of the constrained adaptive OAT estimator [i.e., using Eq. (12)] is best when both constraints of acoustic travel times and sound speed values are used to perform OAT, especially for the highly variable scenario C. Thus, in summary, for the remainder of this study, all iterative results of the OAT method are obtained with the non-constrained Tikhonov estimator, while all adaptive results of the OAT method are obtained with the constrained Tikhonov estimator (for both the initial sparse 3 × 10 sensing configuration and the intermediate 5 × 20 sensing configuration).

FIG. 8.

(Color online) (a) Same as Fig. 7 for scenario A but displaying only results over the test period and displaying only the error of the adaptive OAT estimates [obtained using Eq. (12)] using three different types of constraints provided by the additional sensors of the three added VLAs (introduced after each iteration): (blue line), only sound speed constraints, (red line) only acoustic travel time constraints, and (purple line) both acoustic travel times and sound speed values constraints. (b) Same as (a) but for scenario C.

FIG. 8.

(Color online) (a) Same as Fig. 7 for scenario A but displaying only results over the test period and displaying only the error of the adaptive OAT estimates [obtained using Eq. (12)] using three different types of constraints provided by the additional sensors of the three added VLAs (introduced after each iteration): (blue line), only sound speed constraints, (red line) only acoustic travel time constraints, and (purple line) both acoustic travel times and sound speed values constraints. (b) Same as (a) but for scenario C.

Close modal
Figure 9 shows the reduction in the spatially averaged (over range and depth) and temporally averaged (over the 5-day estimation period) RMSE value of the estimated SSM anomaly δ c ̂ t | n [Eq. (26)] for increasing number of iterations for different scenarios and initial acoustic configuration. Figures 9(a)–9(c) and 9(d)–9(f) show the respective results for the initial 5 × 20 or 3 × 10 acoustic sensing configuration. As expected, the averaged RMSE value of the estimated anomaly δ c ̂ t | n  RMSE decreases with respect to the iteration index for both acoustic sensing configurations and in all scenarios before reaching a plateau after the 4th iteration (the blue and red curves represent the values for the iterative and adaptive OAT schemes, respectively). Note that the estimation performances for the 5 × 20 sensing configuration are overall better than the ones for the 3 × 10 sensing configuration for scenario A but less so for scenarios B and C, and that nearly identical RMSE values are obtained for the adaptive OAT method at the end of the 4th iteration for both sensing configurations. Table II summarizes the iterative and adaptive results for the 5 × 20 and 3 × 10 initial sensing configurations. The last column lists the improvement gain in the accuracy of the estimated SSM after four iterations defined as follows:
FIG. 9.

(Color online) Evolution of the spatially averaged (over range and depth) and temporally averaged (over the 5-day estimation period) RMSE value of the estimated anomaly δ c ̂ t | n [Eq. (26)] with respect to the iteration index n for the iterative (blue line) and adaptive (red line) OAT methods. (a)–(c) The first column shows results for the 5 × 20 initial acoustic sensing configuration, for scenarios A, B, and C. (d)–(f) The second column shows the results for the 3 × 10 initial acoustic sensing configuration, for scenarios A, B, and C. The const. tik. OAT estimator [Eq. (12)] is used for all adaptive results. The tik. OAT estimator [Eq. (9)] is used for all iterative results.

FIG. 9.

(Color online) Evolution of the spatially averaged (over range and depth) and temporally averaged (over the 5-day estimation period) RMSE value of the estimated anomaly δ c ̂ t | n [Eq. (26)] with respect to the iteration index n for the iterative (blue line) and adaptive (red line) OAT methods. (a)–(c) The first column shows results for the 5 × 20 initial acoustic sensing configuration, for scenarios A, B, and C. (d)–(f) The second column shows the results for the 3 × 10 initial acoustic sensing configuration, for scenarios A, B, and C. The const. tik. OAT estimator [Eq. (12)] is used for all adaptive results. The tik. OAT estimator [Eq. (9)] is used for all iterative results.

Close modal
TABLE II.

Comparison of the spatially averaged (over range and depth) and temporally averaged (over the 5-day estimation period) RMSE value of the estimated anomaly δ c ̂ t | n [Eq. (26)] for the adaptive OAT methods. Results are given for the different scenarios (first column), and the initial acoustic sensing configuration is specified (second column). The type of OAT estimator is listed in the third column: const. tik. OAT estimator [Eq. (12)] is used for all adaptive results and the tik. OAT estimator [Eq. (9)] is used for all iterative results. The results are given for both the initial iteration (n = 0) and the final iteration (n = 3). The values for the error gains between these two iterations are listed in the last column.

Setup Method Initial RMSE (m/s) Final RMSE (m/s) Error gain
Scenario A  5 × 20  tik. iterat.  0.308  0.239  +22% 
    const. tik. adapt.  0.310  0.258  +17% 
  3 × 10  tik. iterat.  0.342  0.305  +11% 
    const. tik. adapt.  0.437  0.344  +21% 
Scenario B  5 × 20  tik. iterat.  0.320  0.251  +22% 
    const. tik. adapt.  0.323  0.242  +25% 
  3 × 10  tik. iterat.  0.385  0.302  +22% 
    const. tik. adapt.  0.317  0.234  +26% 
Scenario C  5 × 20  tik. iterat.  0.598  0.481  +20% 
    const. tik. adapt.  0.534  0.413  +23% 
  3 × 10  tik. iterat.  0.604  0.497  +18% 
    const. tik. adapt.  0.545  0.396  +27% 
Setup Method Initial RMSE (m/s) Final RMSE (m/s) Error gain
Scenario A  5 × 20  tik. iterat.  0.308  0.239  +22% 
    const. tik. adapt.  0.310  0.258  +17% 
  3 × 10  tik. iterat.  0.342  0.305  +11% 
    const. tik. adapt.  0.437  0.344  +21% 
Scenario B  5 × 20  tik. iterat.  0.320  0.251  +22% 
    const. tik. adapt.  0.323  0.242  +25% 
  3 × 10  tik. iterat.  0.385  0.302  +22% 
    const. tik. adapt.  0.317  0.234  +26% 
Scenario C  5 × 20  tik. iterat.  0.598  0.481  +20% 
    const. tik. adapt.  0.534  0.413  +23% 
  3 × 10  tik. iterat.  0.604  0.497  +18% 
    const. tik. adapt.  0.545  0.396  +27% 

On average, the relative improvement gain obtained by the iterative and adaptive methods is close to 20 % . The iterative OAT method improves more than the adaptive OAT method for scenario A. On the other hand, the two methods are almost equivalent in scenario B and the adaptive OAT method is the most effective with respect to scenario C. Overall, these results indicate a dependence between environmental variability, ray coverage, and the best estimator choice, as discussed in Sec. V.

A simple comparison is conducted to investigate to what extent the results for the classical OAT method using the dense 20 × 20 sensor configuration for the acoustic setup [as shown in Fig. 3(a)] can be outperformed (or not) by applying the adaptive OAT method using an initial 3 × 10 sparse configuration of the acoustic setup [shown in Fig. 3(b)]. Here, the const. tik. OAT estimator [Eq. (12)] is used for all cases. No additional iteration is performed for the dense 20 × 20 sensor configuration as the estimation errors are already very low after the first (i.e., n = 0) inversion (see Fig. 6). The results of this configuration using the classical OAT estimation are adopted as a baseline for comparing the performance of adaptive OAT estimations.

Figure 10 compares the temporal evolution of the RMSE δ c ̂ t [Eq. (15)] of the estimated SSM obtained using either the dense 20 × 20 acoustic setup (blue line) or the initial (i.e., n = 0) sparse 3 × 10 acoustic setup (red line) or the final configuration (yellow line) for the adaptive setup [i.e., when all 3 extra VLAs have been added to the setup as shown in Fig. 3(b)]. For scenario A, a single inversion using the dense 20 × 20 setup yields the lowest estimation error. However, the trend is reversed for scenario B, and even more notably for scenario C, where the adaptive OAT estimations lead to the lowest estimation error. These results indicate that the error reduction obtained from the adaptive OAT method is more noticeable when the initial SSM anomalies [see Eq. (26)] are large, as for scenario C (see Fig. 2).

FIG. 10.

(Color online) Ray coverage maps R cov and time-averaged (over the 5-day estimation period) inverse mapping error δ c ̂ ¯ | n [see Eq. (27)] for scenarios A and C. (a), (d), and (g) Dense 20 × 20 sensing setup (using the tik. OAT estimator [Eq. (9)]). (b), (c), and (h) Initial (n = 0) sparse 3 × 10 sensing set-up. (c), (f), and (i) Final (n = 3) adaptive geometry [i.e., when all 3 extra VLAs have been added to the setup as shown in Fig. 3(b)]. The const. tik. The OAT estimator [Eq. (12)] is used for all adaptive cases in the panels of the second and third columns.

FIG. 10.

(Color online) Ray coverage maps R cov and time-averaged (over the 5-day estimation period) inverse mapping error δ c ̂ ¯ | n [see Eq. (27)] for scenarios A and C. (a), (d), and (g) Dense 20 × 20 sensing setup (using the tik. OAT estimator [Eq. (9)]). (b), (c), and (h) Initial (n = 0) sparse 3 × 10 sensing set-up. (c), (f), and (i) Final (n = 3) adaptive geometry [i.e., when all 3 extra VLAs have been added to the setup as shown in Fig. 3(b)]. The const. tik. The OAT estimator [Eq. (12)] is used for all adaptive cases in the panels of the second and third columns.

Close modal

The results for the three cases shown in Fig. 10 can be further explained by considering the ray coverage maps R cov , which show the number of eigenrays having crossed each pixel of the SSM [Fig. 11(a)–11(c)], since the effective ray coverage inherently drives the performance of the OAT inversion. Essentially, the classical dense setup (20 × 20) has fairly good ray coverage (i.e., over 100 eigenrays per pixel) over most of the inversion domain [Fig. 11(a)]. Note that because of refraction, the bottom center part of the inversion domain shows less dense ray coverage. On the other hand, the ray coverage for the distributed setup (3 × 10 initial) is initially very low (<50 eigenray per pixel) for most of the inversion domain [Fig. 11(b)], but is significantly enhanced (>100 eigenray per pixel over most of the domain) in the final configuration of the adaptive setup [i.e., when all 3 extra VLAs have been added; see Fig. 3(b)]. This improvement in ray coverage in the adaptive case explains most of the reduction in the time-averaged (over the 5-day estimation period) inverse mapping error δ c ̂ ¯ | n [see Eq. (27)] between the values obtained for the initial estimation [n = 0, Figs. 10(e) and 10(h)] and the values obtained for the final iteration [n = 3, Figs. 10(f) and 10(i)], which are now comparable to the values obtained from the dense (20 × 20) acoustic set-up [n = 0, Figs. 10(d) and 10(g)] especially in scenario C (last row).

FIG. 11.

(Color online) Temporal evolution of the RMSE δ c ̂ t [Eq. (15)] of the estimated SSM obtained using either the dense 20 × 20 acoustic setup (blue line, using the tik. OAT estimator [Eq. (9)]) or the initial (i.e., n = 0) sparse 3 × 10 acoustic setup (red line) or the final configuration (yellow line) for the adaptive setup [i.e., when all 3 extra VLAs have been added to the setup as shown in Fig. 3(b)]. The const. tik. The OAT estimator [Eq. (12)] is used for all adaptive cases. (a) Scenario A. (b) Scenario B. (c) Scenario C.

FIG. 11.

(Color online) Temporal evolution of the RMSE δ c ̂ t [Eq. (15)] of the estimated SSM obtained using either the dense 20 × 20 acoustic setup (blue line, using the tik. OAT estimator [Eq. (9)]) or the initial (i.e., n = 0) sparse 3 × 10 acoustic setup (red line) or the final configuration (yellow line) for the adaptive setup [i.e., when all 3 extra VLAs have been added to the setup as shown in Fig. 3(b)]. The const. tik. The OAT estimator [Eq. (12)] is used for all adaptive cases. (a) Scenario A. (b) Scenario B. (c) Scenario C.

Close modal

This study evaluated the performance of various OAT methods in the presence of submesoscale circulations. A summary and discussion of the main results are given below.

First, the results of this study show that since ocean submesoscale variability occurs on small spatial (100 m to a few km) and temporal (hours to days) scales, accurately estimating this variability with OAT requires both fairly dense acoustic sampling of the domain of interest (which is consistent with findings from previous OAT studies14) as well as selection of the reference point used to create the linearized forward model as close as possible to the onset of the estimation time period (which is key to provide accurate OAT estimations). Nevertheless, OAT can still be a viable alternative to conventional oceanographic sampling methods (e.g., local point measurements on moored or drifting platforms) given the high density of measurements required to capture this submesoscale variability over small spatial scales. Additionally, the EOF basis used to regularize the inversion problem should be selected to best represent the most current expected variability in the operational area of interest (e.g., a 2-day interval was used to construct the EOF basis in this study).

Second, the performance of the OAT estimation depends on the spatial extent and magnitude of the sound speed anomalies (see Fig. 6). As expected, large (in dimensions) and weak SSM anomalies are easier to reconstruct (scenario A). Intermediately, large features with relatively stronger SSM anomalies can be reconstructed with as much precision as small-sized anomalies with weak magnitude (scenario B). Last, highly variable, small, and strong SMM anomalies prove to be the most challenging to estimate (scenario C). In particular, the magnitude of the reconstruction error (that is, how accurately the SSM to be estimated can be described by the selected EOF basis for regularization purposes) is highly linked to the estimation error obtained from OAT. For the configurations tested in this study, a RMSE difference between the reconstruction error and the estimation error below 0.1 m/s appears sufficient to consider the estimated SSMs as near optimal.

Third, iterative and adaptive OAT methods offer the potential to improve the accuracy of the estimated SSMs for the studied environment in comparison to classical OAT methods (i.e., methods which consist of only a single inversion). Fundamentally, a sufficiently dense ray coverage of the ocean domain of interest is key to accurately estimate the rapidly evolving submesoscale features using any OAT estimator. Results from Figs. 10 and 11 show that the classical OAT method is sufficient if the magnitudes of the sound speed anomalies are weak (< 1 m/s) to moderate (  1 m/s) (e.g., scenario A or B) and the ray coverage is fairly dense [e.g., see Fig. 3(a)]. The iterative OAT method, on the other hand, is most beneficial in the presence of low-to-medium-sound-speed anomalies if the sensing configuration only provides a lower ray coverage (i.e., < 100 ray per pixels for the studied sensing configurations). Finally, if the submesoscale variability is especially high (e.g., for sound speed anomalies > 1 m/s), such as for scenario C in this study, it is necessary to enhance the ray coverage to achieve good OAT estimates. In this case, adaptive methods can be beneficial, especially, if the additional acoustic transponders at each iteration can be strategically added to better probe the domain areas where the highest sound speed anomalies were presumed to be based on the previous iteration results. Hence, in practice, the ray coverage could be adjusted in an adaptive manner, potentially leveraging the maneuverability of small sensing platforms to best sample the expected submesoscale variability. Additionally, the constrained OAT estimator [Eq. (12)] was found to perform best for the adaptive OAT method, since it gradually yields a more distributed sets of constrained sound velocities locations across the inversion domain for increasing number of iterations. On the other hand, the unconstrained OAT estimator [Eq. (8)] yielded better estimates for the iterative OAT method. In this case, applying constraints only at a few discrete locations on the edges of the inversion domain (i.e., where the sparse acoustic transducers are located) created more estimation errors for the chosen formulation. In conclusion, the benefit of using a set of distributed sound velocity constraints (based on local measurements) to regularize the OAT inversion problem appears to be determined by the geometry of the actual sensing configuration. Figure 12 conceptually summarizes the applicability of the various OAT methods discussed in this study for different levels of environmental variability and acoustic sensor configurations.

FIG. 12.

(Color online) Suggested applicability of the various OAT methods introduced in this study based on the expected level of environmental variability and achievable ray coverage for the tested configurations described in Figs. 1, 2, and 3.

FIG. 12.

(Color online) Suggested applicability of the various OAT methods introduced in this study based on the expected level of environmental variability and achievable ray coverage for the tested configurations described in Figs. 1, 2, and 3.

Close modal

Last, it is worth stressing that under the assumptions of this work, the estimation errors for the inverted SSMs represent only an idealized lower bound of what may be achievable by a practical OAT system with a similar acoustic sensing configuration. First and foremost, the submesoscale variability simulated by the ocean circulation model underestimates that of the real ocean due to horizontal and vertical resolution constraints. We plan to investigate this point in a future study. In addition, several sources of errors would likely increase the uncertainty of the OAT estimates in a practical situation, as extensively discussed in previous ray-based OAT literature,7,14 including—among others—transducer positioning errors and unaccounted motion (e.g., array tilt due to tides or currents); imperfect time synchronization between the various source-receiver pairs and the impact of noise and available acoustic bandwidth—as well as the influence of ocean processes such as surface gravity waves, internal waves, and spice—on the accuracy of the measured ray travel-times.46 Hence, the OAT performance is significantly driven by the available engineering resources which can ultimately enable the desired acoustic sensing configuration. However, the problem at stake in this study, namely, the estimation of the submesoscale ocean variability in the tactical upper ocean (< 1 km depth) over short ranges (< 10 km), may actually allow alleviating some of those engineering constraints. For example, for such short distances, the acoustic source sensors could operate at relatively high frequencies (> 10 kHz) with larger bandwidths, potentially providing a good signal-to-noise ratio and angle separation resolution which would ultimately improve the overall accuracy and diversity of the measured arrival times. Additionally, from an operational point of view, such compact high-frequency transducers could be more easily encapsulated in small (autonomous) profiling platforms, enabling simplified deployments and profiling of the upper ocean. These profiling platforms that would rely on rapid buoyancy change or other form or rapid mechanical actuation could provide depth-dependent sampling of the water column on small enough time scales to respect the frozen-ocean assumption between two OAT inversions. Furthermore, maintaining a surface expression (and hence GPS connection) for those autonomous profiling platforms may improve the power delivery to, and localization of, such profiling acoustic transducers, at least for the upper portion of the ocean domain of interest which typically exhibits the most dynamic submesoscale variability. In any event, sufficiently precise navigation systems should be implemented to ensure accurate positioning and tracking of the selected acoustic transponders, which is necessary for OAT applications.

This article compared the performance of different OAT methods over 5-day long estimation windows for three different scenarios exhibiting increasing submesoscale variability based on the simulated sound speed spatiotemporal variations computed using an ocean circulation model of the DeSoto Canyon in the Gulf of Mexico. Ray-based OAT were implemented for 2D upper ocean slices (depth < 1 km) and short ranges (< 5 km) for various acoustic sensing configurations that enabled various ray coverage density of the domain of interest. The results indicated that OAT methods have the potential to estimate submesoscale variability and highlighted the influence of the spatial and temporal scales of the sound speed anomalies of interest in the performance of the OAT estimation.

Additionally, the performance of iterative and adaptive OAT methods (encapsulating the classical OAT estimators) has been quantified to explore how to improve the estimation of sound speed anomalies in the upper ocean over short ranges when using a sparse or distributed sensor network, which could in practice be implemented with small sensing platforms. The results indicate that each of these OAT methods performs differently depending on the effective acoustic coverage and the sound speed variability to be estimated. Specifically, the classical OAT scheme was found to be sufficient for scenarios with high ray coverage and low sound speed variability, the iterated OAT scheme yielded better performances for lower acoustic coverage and low-to-moderate sound speed variability scenarios, and the adaptive OAT scheme was found to be most beneficial when the initial acoustic sensing configuration is sparse (thus yielding a low acoustic coverage initially), but the sound speed variability is high. These results could help guide the design of future OAT experiments that aim at validating the predictions from this numerical study and at quantifying the submesoscale variability in the upper ocean over short ranges.

The authors acknowledge the comments and suggestions made by the reviewers and the associate editor which significantly helped improve this manuscript. This work was supported by the Office of Naval Research (ONR) code 322 Ocean Acoustics Section, as well as the ONR Task Force Ocean (TFO) initiative under Grant No. N000142112558.

The authors have no conflict of interest to report.

Data supporting the findings of this study are available from the corresponding author upon reasonable request.

This appendix derives the expression for the multi-objective OAT estimator given by Eq. (12), as well as compare its performance with respect to an alternate expression obtained using a slightly different formulation of the multiobjective loss function describing the available sound speed constraints at Nc selected pixel locations. In practice, it is hypothesized that such constraints would correspond to penalizing the values of these Nc specific pixels of the estimated SSMs when they differ from a set of prescribed sound speed values c const (e.g., which could be independently measured at the selected pixels). In this case, using the notations from Sec. III, a general formulation of the multi-objective loss function is
(A1)
where U is the same diagonal matrix used to select the Nc constrained pixels of the estimated SSM [Eq. (12)].
For this study, the estimated SSM are projected on a selected EOF basis [see Eq. (6)] to better regularize the inverse problem. Hence, Eq. (A1) is actually solved in terms of the corresponding EOF coefficient vector α for the selected SSM c. In this case, the general formulation of the multi-objective loss function becomes
(A2)
Solving this is achieved by setting the following partial derivative to zero:
(A3)
This yields the following solution for the estimated set of EOF coefficients:
(A4)
or equivalently an estimated SSM given by
(A5)

In practice, to implement the solution given by Eq. (A5), one possibility is to simply set c const = c true , where c true is the true SSM to be estimated and also corresponds to the sound speed values, which can be independently measured in situ at the selected locations Nc. However, it should be noted that in this study, the reference SSM c 0 [set to be the last SSM of the training set, just before the estimation period starts (see Fig. 2) used to compute the linearized forward model E [see Eqs. (1)–(4)] actually differs from the averaged SSM c Φ ¯ of the training set used to compute EOF basis. Hence, an alternate choice for the constrained values is c const = c true ( c Φ ¯ c 0 ) as implemented in this study in Eq. (12). The second term ( c Φ ¯ c 0 ) allows us in a simple fashion to account for the evolution of the background SSM that has occurred between c Φ ¯ and c 0 due to the local ocean dynamics. In this case, Eq. (A5) becomes identical to Eq. (12) which was used as the multi-objective OAT estimator in Sec. IV of this study such that the last penalty term effectively reduces to λ 2 U Φ T U ( c true c 0 ) ] (which is referred to as the c 0 case) instead of λ 2 U Φ T U ( c true c Φ ¯ ) ] (which is referred to as the c Φ ¯ case).

Figure 13 compares the performance of these two types of constrained OAT for scenarios A and C based on the same adaptive setup algorithm used throughout this study and presented in Fig. 5(b). Overall, these results indicate the formulation for the multi-objective OAT estimators given by Eq. (12) [i.e., by setting c const = c true ( c Φ ¯ c 0 ) in the general formulation of Eq. (A5)]—corresponding to the dashed-dotted lines—performs slightly better for the selected test parameters and sensing configurations.

FIG. 13.

(Color online) Comparison of the performance of the two multi-objective OAT estimators discussed in the  Appendix for the adaptive setup algorithm. (a) Scenario A: δ c ̂ t | n  RMSE for iteration index n = 3. (b) Scenario C: δ c ̂ t | n  RMSE for iteration index n = 3. All parameters are identical to the ones shown in Fig. 9.

FIG. 13.

(Color online) Comparison of the performance of the two multi-objective OAT estimators discussed in the  Appendix for the adaptive setup algorithm. (a) Scenario A: δ c ̂ t | n  RMSE for iteration index n = 3. (b) Scenario C: δ c ̂ t | n  RMSE for iteration index n = 3. All parameters are identical to the ones shown in Fig. 9.

Close modal
1.
F. B.
Jensen
,
W. A.
Kuperman
,
M. B.
Porter
,
H.
Schmidt
, and
A.
Tolstoy
,
Computational Ocean Acoustics
(
Springer
,
Berlin
,
2011
), Vol.
2011
.
2.
H.
Medwin
,
C. S.
Clay
, and
S. M.
Flatte
, “
Fundamentals of acoustical oceanography
,”
Phys. Today
52
(
7
),
54
56
(
1999
).
3.
D. R.
Dowling
and
K. G.
Sabra
, “
Acoustic remote sensing
,”
Annu. Rev. Fluid Mech.
47
(
1
),
221
243
(
2015
).
4.
D.
Sun
,
A.
Bracco
,
R.
Barkan
,
M.
Berta
,
D.
Dauhajre
,
M. J.
Molemaker
,
J.
Choi
,
G.
Liu
,
A.
Griffa
, and
J. C.
McWilliams
, “
Diurnal cycling of submesoscale dynamics: Lagrangian implications in drifter observations and model simulations of the northern Gulf of Mexico
,”
J. Phys. Oceanogr.
50
(
6
),
1605
1623
(
2020
).
5.
J. C.
McWilliams
, “
Submesoscale currents in the ocean
,”
Proc. Math. Phys. Eng. Sci.
472
(
2189
),
20160117
(
2016
).
6.
J. R.
Taylor
and
A. F.
Thompson
, “
Submesoscale dynamics in the upper ocean
,”
Annu. Rev. Fluid Mech.
55
(
1
),
103
127
(
2023
).
7.
W.
Munk
and
C.
Wunsch
, “
Ocean acoustic tomography: A scheme for large scale monitoring
,”
Deep Sea Res., Part A
26
(
2
),
123
161
(
1979
).
8.
W. H.
Munk
, “
Horizontal deflection of acoustic paths by mesoscale eddies
,”
J. Phys. Oceanogr.
10
(
4
),
596
604
(
1980
).
9.
T. F.
Duda
,
Y.-T.
Lin
,
A. E.
Newhall
,
K. R.
Helfrich
,
J. F.
Lynch
,
W. G.
Zhang
,
P. F. J.
Lermusiaux
, and
J.
Wilkin
, “
Multiscale multiphysics data-informed modeling for three-dimensional ocean acoustic simulation and prediction
,”
J. Acoust. Soc. Am.
146
(
3
),
1996
2015
(
2019
).
10.
B. D.
Dushaw
and
H.
Sagen
, “
The role of simulated small-scale ocean variability in inverse computations for ocean acoustic tomography
,”
J. Acoust. Soc. Am.
142
(
6
),
3541
3552
(
2017
).
11.
J. A.
Colosi
and
D. L.
Rudnick
, “
Observations of upper ocean sound-speed structures in the north pacific and their effects on long-range acoustic propagation at low and mid-frequencies
,”
J. Acoust. Soc. Am.
148
(
4
),
2040
2060
(
2020
).
12.
J. A.
Colosi
and
W.
Zinicola-Lapin
, “
Sensitivity of mixed layer duct propagation to deterministic ocean features
,”
J. Acoust. Soc. Am.
149
(
3
),
1969
1978
(
2021
).
13.
D.
Behringer
,
T.
Birdsall
,
M.
Brown
,
B.
Cornuelle
,
R.
Heinmiller
,
R.
Knox
,
K.
Metzger
,
W.
Munk
,
J.
Spiesberger
,
R.
Spindel
,
D.
Webb
,
P.
Worcester
, and
C.
Wunsch
, “
A demonstration of ocean acoustic tomography
,”
Nature
299
(
5879
),
121
125
(
1982
).
14.
W.
Munk
,
P.
Worcester
, and
C.
Wunsch
,
Ocean Acoustic Tomography, Cambridge Monographs on Mechanics
(
Cambridge University Press
,
Cambridge
,
1995
).
15.
E. K.
Skarsoulis
,
G. A.
Athanassoulis
, and
U.
Send
, “
Ocean acoustic tomography based on peak arrivals
,”
J. Acoust. Soc. Am.
100
(
2
),
797
813
(
1996
).
16.
W.
Kuperman
and
P.
Roux
, “
Underwater acoustics
,” in
Springer Handbook of Acoustics
(
Springer
,
Berlin
,
2007
), pp.
149
204
.
17.
P.
Roux
,
B. D.
Cornuelle
,
W.
Kuperman
, and
W.
Hodgkiss
, “
The structure of raylike arrivals in a shallow-water waveguide
,”
J. Acoust. Soc. Am.
124
(
6
),
3430
3439
(
2008
).
18.
C.-F.
Huang
,
Y.-W.
Li
, and
N.
Taniguchi
, “
Mapping of ocean currents in shallow water using moving ship acoustic tomography
,”
J. Acoust. Soc. Am.
145
(
2
),
858
868
(
2019
).
19.
G.
Gopalakrishnan
,
B. D.
Cornuelle
,
M. R.
Mazloff
,
P. F.
Worcester
, and
M. A.
Dzieciuch
, “
State estimates and forecasts of the northern Philippine sea circulation including ocean acoustic travel times
,”
J. Atmos. Oceanic Technol.
38
(
11
),
1913
1933
(
2021
).
20.
K. L.
Gemba
,
H. J.
Vazquez
,
J.
Sarkar
,
J. D.
Tippman
,
B.
Cornuelle
,
W. S.
Hodgkiss
, and
W.
Kuperman
, “
Moving source ocean acoustic tomography with uncertainty quantification using controlled source-tow observations
,”
J. Acoust. Soc. Am.
151
(
2
),
861
880
(
2022
).
21.
J.
Jin
,
P.
Saha
,
N.
Durofchalk
,
S.
Mukhopadhyay
,
J.
Romberg
, and
K. G.
Sabra
, “
Machine learning approaches for ray-based ocean acoustic tomography
,”
J. Acoust. Soc. Am.
152
(
6
),
3768
3788
(
2022
).
22.
J.
Sarkar
,
B. D.
Cornuelle
, and
W.
Kuperman
, “
Information and linearity of time-domain complex demodulated amplitude and phase data in shallow water
,”
J. Acoust. Soc. Am.
130
(
3
),
1242
1252
(
2011
).
23.
J. S.
Kubicko
,
C. M.
Verlinden
,
J.
Sarkar
,
K. G.
Sabra
,
B. V.
Nichols
,
J. S.
Martin
, and
A. I.
Fagan
, “
Information content of ship noise on a drifting volumetric array for passive environmental sensing
,”
IEEE J. Oceanic Eng.
45
(
2
),
607
630
(
2020
).
24.
J. A.
Mercer
and
J. R.
Booker
, “
Long-range propagation of sound through oceanic mesoscale structures
,”
J. Geophys. Res.
88
(
C1
),
689
699
, https://doi.org/10.1029/JC088iC01p00689 (
1983
).
25.
J. L.
Spiesberger
, “
Gyre-scale acoustic tomography: Biases, iterated inversions, and numerical methods
,”
J. Geophys. Res.
90
(
C6
),
11869
11876
, https://doi.org/10.1029/JC090iC06p11869 (
1985
).
26.
B. D.
Cornuelle
,
P. F.
Worcester
,
J. A.
Hildebrand
,
W. S.
Hodgkiss
, Jr.
,
T. F.
Duda
,
J.
Boyd
,
B. M.
Howe
,
J. A.
Mercer
, and
R. C.
Spindel
, “
Ocean acoustic tomography at 1000-km range using wavefronts measured with a large-aperture vertical array
,”
J. Geophys. Res.
98
(
C9
),
16365
16377
, https://doi.org/10.1029/93JC01246 (
1993
).
27.
C. W.
Spofford
and
A. P.
Stokes
, “
An iterative perturbation approach for ocean acoustic tomography
,”
J. Acoust. Soc. Am.
75
(
5
),
1443
1450
(
1998
).
28.
T.
Rossby
,
D.
Dorson
, and
J.
Fontaine
, “
The RAFOS system
,”
J. Atmos. Oceanic Technol.
3
(
4
),
672
679
(
1986
).
29.
S. R.
Jayne
,
D.
Roemmich
,
N.
Zilberman
,
S. C.
Riser
,
K. S.
Johnson
,
G. C.
Johnson
, and
S. R.
Piotrowicz
, “
The ARGO Program: Present and future
,”
Oceanography
30
(
2
),
18
28
(
2017
).
30.
A.
Bracco
,
J.
Choi
,
K.
Joshi
,
H.
Luo
, and
J. C.
McWilliams
, “
Submesoscale currents in the northern Gulf of Mexico: Deep phenomena and dispersion over the continental slope
,”
Ocean Modell.
101
,
43
58
(
2016
).
31.
G.
Liu
,
A.
Bracco
, and
U.
Passow
, “
The influence of mesoscale and submesoscale circulation on sinking particles in the northern Gulf of Mexico
,”
Elementa: Sci. Anthropocene
6
,
36
(
2018
).
32.
T. J.
McDougall
and
P. M.
Barker
, “
Getting started with TEOS-10 and the Gibbs seawater (GSW) oceanographic toolbox
,”
Scor/Iapso WG
127
(
532
),
1
28
(
2011
).
33.
M.
Porter
, “
The Bellhop manual and user's guide: Preliminary draft
” (
2011
).
34.
G.
Liu
,
A.
Bracco
, and
A.
Sitar
, “
Submesoscale mixing across the mixed layer in the Gulf of Mexico
,”
Front. Mar. Sci.
8
,
615006
(
2021
).
35.
A. C.
Poje
,
T. M.
Özgökmen
,
B. L.
Lipphardt
,
B. K.
Haus
,
E. H.
Ryan
,
A. C.
Haza
,
G. A.
Jacobs
,
A. J. H. M.
Reniers
,
M. J.
Olascoaga
,
G.
Novelli
,
A.
Griffa
,
F. J.
Beron-Vera
,
S. S.
Chen
,
E.
Coelho
,
P. J.
Hogan
,
A. D.
Kirwan
,
H. S.
Huntley
, and
A. J.
Mariano
, “
Submesoscale dispersion in the vicinity of the deepwater horizon spill
,”
Proc. Natl. Acad. Sci. U.S.A.
111
(
35
),
12693
12698
(
2014
).
36.
E. A.
D'Asaro
,
A. Y.
Shcherbina
,
J. M.
Klymak
,
J.
Molemaker
,
G.
Novelli
,
C. M.
Guigand
,
A. C.
Haza
,
B. K.
Haus
,
E. H.
Ryan
,
G. A.
Jacobs
,
H. S.
Huntley
,
N. J. M.
Laxague
,
S.
Chen
,
F.
Just
,
J. C.
McWilliams
,
R.
Barkan
,
A. D.
Kirwan
, Jr.
,
A. C.
Poje
, and
T. M.
Özgökmen
, “
Ocean convergence and the dispersion of flotsam
,”
Proc. Natl. Acad. Sci. U.S.A.
115
(
6
),
1162
1167
(
2018
).
37.
R.
Barkan
,
J. C.
McWilliams
,
A. F.
Shchepetkin
,
M. J.
Molemaker
,
L.
Renault
,
A.
Bracco
, and
J.
Choi
, “
Submesoscale dynamics in the northern Gulf of Mexico. Part I: Regional and seasonal characterization and the role of river outflow
,”
J. Phys. Oceanogr.
47
(
9
),
2325
2346
(
2017
).
38.
F.
Auclair
,
L.
Bordois
,
Y.
Dossmann
,
T.
Duhaut
,
A.
Paci
,
C.
Ulses
, and
C.
Nguyen
, “
A non-hydrostatic non-Boussinesq algorithm for free-surface ocean modelling
,”
Ocean Modell.
132
,
12
29
(
2018
).
39.
G.
Liu
,
A.
Bracco
, and
D.
Sun
, “
Offshore freshwater pathways in the northern Gulf of Mexico: Impacts of modeling choices
,”
Front. Mar. Sci.
9
,
841900
(
2022
).
40.
J. A.
Cummings
and
O. M.
Smedstad
, “
Variational data assimilation for the global ocean
,” in
Data Assimilation for Atmospheric, Oceanic and Hydrologic Applications (Vol. II)
, edited by
S. K.
Park
and
L.
Xu
(
Springer
,
Berlin
,
2013
), pp.
303
343
.
41.
W. N.
van Wieringen
, “
Lecture notes on ridge regression
,” (
2015
).
42.
S.
Lipovetsky
and
W. M.
Conklin
, “
Multiobjective regression modifications for collinearity
,”
Comput. Oper. Res.
28
(
13
),
1333
1345
(
2001
).
43.
H.
Akaike
, “
A new look at the statistical model identification
,”
IEEE Trans. Automat. Control.
19
(
6
),
716
723
(
1974
).
44.
G.
Schwarz
, “
Estimating the dimension of a model
,”
Ann. Stat.
6
(
2
),
461
464
(
1978
).
45.
K.
Levenberg
, “
A method for the solution of certain non-linear problems in least squares
,”
Quart. Appl. Math.
2
(
2
),
164
168
(
1944
).
46.
J. A.
Colosi
,
Sound Propagation through the Stochastic Ocean
(
Cambridge University Press
,
Cambridge
,
2016
).