This work presents a data-driven, energy-conserving closure method for the coarse-scale evolution of the mean and covariance of turbulent systems. Spatiotemporally non-local neural networks are employed for calculating the impact of non-Gaussian effects to the low-order statistics of dynamical systems with an energy-preserving quadratic nonlinearity. This property, which characterizes the advection term of turbulent flows, is encoded via an appropriate physical constraint in the training process of the data-informed closure. This condition is essential for the stability and accuracy of the simulations as it appropriately captures the energy transfers between unstable and stable modes of the system. The numerical scheme is implemented for a variety of turbulent systems, with prominent forward and inverse energy cascades. These problems include prototypical models such as an unstable triad system and the Lorentz-96 system, as well as more complex models: The two-layer quasi-geostrophic flows and incompressible, anisotropic jets where passive inertial tracers are being advected on. Training data are obtained through high-fidelity direct numerical simulations. In all cases, the hybrid scheme displays its ability to accurately capture the energy spectrum and high-order statistics of the systems under discussion. The generalizability properties of the trained closure models in all the test cases are explored, using out-of-sample realizations of the systems. The presented method is compared with existing first-order closure schemes, where only the mean equation is evolved. This comparison showcases that correctly evolving the covariance of the system outperforms first-order schemes in accuracy, at the expense of increased computational cost.

The defining property of turbulent dynamical systems is the existence of multiple and persistent sources of instability. While from a physical perspective such systems are usually viewed as deterministic in nature, uncertainty manifests into their study via an abundance of mechanisms. Fundamental model assumptions such as model structure, constitutive laws, geometry, and initial and boundary conditions may be only approximations of the truth and, as a result, carry uncertainty into the implementation. Furthermore, spatiotemporal discretization errors push the model predictions away from the underlying mathematical model, allowing only for bounds between the two solutions. Additionally, other input parameters of the model may exhibit randomness. This effect can be either an intrinsic property of that quantity (i.e., aleatoric uncertainty) or stem from an inability to accurately measure its otherwise deterministic value, due to computational and experimental limitations. Hence, all these effects need to be accounted for in the study of unstable complex systems.

Despite the many sources of randomness, uncertainty quantification (UQ) attempts can be separated into the following categories: (i) propagation of stochasticity from input parameters, via the model, to the output values and (ii) parametrization of input-parameter uncertainty via backward propagation of the model output uncertainty. Both kinds of UQ studies have been applied to all fields where nonlinear multiscale systems are observed. UQ is used for performance prediction of integrated circuits with thousands of sub-micrometer parts.1 It is accounted for in assessing the structural integrity of large structures with variability in their material properties2,3 and in imperfect neuroscopic models in materials science.4,5 It is used in combustion to describe complex kinetic mechanisms.6 Furthermore, in the last 50 years, UQ tools have been widely adopted by researchers interested in turbulent flows. They have been used to model permeability of porous media in multiphase flows.7,8 UQ is used to account for randomness in microfluidic applications9 and thermal problems.10,11 It is used in engineering applications with shear turbulence.12 In addition, it has become a major research interest in climate studies13–15 and climate change in particular.16 

The most straightforward UQ approach is the well-known Monte Carlo (MC) method.17–19 Techniques inspired by the MC method perform numerous deterministic simulations of the system for sampled conditions and proceed to perform a posteriori statistical analysis on the numerical results.20 Yet, repeated simulations of turbulent models are still prohibitively expensive to both generate and store.21 Hence, MC techniques find limited applications usually only on low-dimensional systems. As a result, the need for efficient uncertainty propagation schemes arises for more complex and computationally expensive systems.

To reduce the computational cost of UQ, many approaches project the initial system on a low-dimensional subspace of pre-selected modes.22 The first such approaches derived reduced-order models based on an energy-based proper orthogonal decomposition (POD).23–26 A similar concept is used in deriving reduced-order models via balanced POD based on linear theory.27,28 Orthogonal decompositions are also used to derive dynamically orthogonal field equations.29,30 Finite-series representations of randomness are also used in truncated polynomial chaos (PC) expansions.9,31–33 While all these models have found success in weakly chaotic regimes, they suffer from the fact that in turbulent systems, non-energetic modes can intermittently have significant impact on energetic modes.34–37 As a result, despite the low computational cost of truncated-series methods, such approaches are antithetical to the nature of turbulence. Non-modal UQ approaches include the utilization of the fluctuation–dissipation theorem (FDT)38–44 and the modified quasilinear Gaussian closure (MQG) scheme,45 which utilizes a second-order moment framework and models the impact of non-Gaussian statistics via incorporating statistical steady-state information appropriately.

It is therefore clear that for reduced-order models to accurately predict that statistical properties of the reference turbulent systems, a physically consistent modeling of the intermittent energy transfers between energetic modes needs to be employed. Hence, tools that allow for the approximation of complicated operators are of interest. Apart from recent conventional basis expansions46–48 and parametrizations,22,49 deep neural networks have recently seen success in reduced-order modeling of turbulent systems both in a deterministic and in a statistical framework.50–58 Despite their lack of closed-form expressions for the closure terms, neural networks have reliably approximated many intricate operators in nonlinear dynamics. In addition, neural networks can seamlessly incorporate spatiotemporal non-locality in their predictions, a property that suits many reduced-order models. Utilizing temporal delays in a closure scheme is theoretically justified via Takens' embedding theorem,59 which states that under the constraint of observing a limited number of the state variables of a system, in principle, we can still obtain the attractor of the full system by using delay embedding of the observed state variables (i.e., a non-Markovian approach). Such methods have seen success in reduced-order modeling of highly nonlinear dynamical systems.57,58,60–63

This work aims to expand on the MQG approach for uncertainty quantification, presented in Refs. 45 and 64–66. The approach studies turbulent systems with a quadratic and energy-preserving nonlinearity. Similar to the approach of traditional MQG, a second-order statistical framework is employed for this study. This framework allows for a computationally cheap reduced-order model. Uncertainty is modeled under a parametric Bayesian point of view. In previous MQG approaches, the nonlinear energy transfers that result in non-Gaussian effects are modeled via a quasi-linear approach, where the closure is tuned to specific steady-state statistics. The current method replaces this assumption with spatiotemporally non-local neural networks. The scheme is constructed so that it a priori respects the energy transfers dictated by the nonlinear term. This is achieved by using steady-state statistics of flow realizations that are incorporated during the training process. These physical constraints allow for correct prediction of mode-to-mode energy transfers during simulations. The utilization of deep neural networks allows for a richer representation of nonlinear energy transfers and higher accuracy, not only for the mean but also for the second-order statistics of the flow. Past values of these features are also employed as inputs for the turbulent closures in a causal manner. We finally employ imitation learning67 to improve the stability properties of the computed closures. A related approach, aiming to machine learn second-order closures, has been presented recently;68 this effort does not encode the underlying physical constraints in the training process, and it has not been assessed in turbulent fluid flows.

The outline of the paper is as follows. Section II formulates the general nonlinear problem of interest. The appropriate physical constraint arising from the form of the system is derived. Furthermore, the data-driven parametrization as well as the objective function used during training are discussed. Next, the numerical investigation of the formulated scheme is included in Sec. III. The closure scheme is assessed in a multitude of turbulent models. In all cases, the results are presented for out-of-sample realizations of the models (i.e., not including in the training process) to showcase the generalizability of the closure scheme. The role of the physical constraint on the stability properties and accuracy of the coarse-scale equations are examined. The accuracy of the closure scheme is measured via its ability to capture the correct energy transfers between modes and thus reach the appropriate statistical equilibrium. This goal is quantified differently in each test case, but numerical comparisons include energy of the mean, total variance of the system, energy spectra, and heat flux spectra whenever they are of interest. Finally, Sec. IV includes the major conclusions drawn from the numerical investigation of Sec. III.

Expanding on the numerical investigation, the first application is the triad system, a toy problem used to study energy transfers between a large-scale variable and two fluctuating modes. This system is of particular interest due to its intrinsically unstable mean, rendering it challenging to approximate despite having only three degrees of freedom. Next, the method is applied to the Lorentz-96 system, a prototypical model for studying turbulence due to baroclinic instability in midlatitude atmospheric flows. Results are presented for both constant and time-varying excitation. In both cases, the data-informed scheme compares favorably with the results derived by Monte Carlo simulations. The scheme is then applied to a two-layer, quasi-geostrophic (QG) model. First, results are presented for the statistics of high-latitude oceanic turbulence. The scheme once again is able to capture the correct energy and heat flux spectra of the full physical model. Afterward, results on the statistics of midlatitude oceanic and high-latitude atmospheric flows are also presented. The final test case involves an inhomogeneous system, where the closure scheme is applied on unimodal and bimodal turbulent jets over which inertial passive tracers are being transported.

The proposed methodology combines a second-order statistical formulation with neural networks that are trained under appropriate physical constraints. This framework produces accurate uncertainty quantification predictions for nonlinear dynamical problems. Let u be a field describing the state of the system. The evolution equation of u has the following general form:

dudt=Au+B(u,u)+F(t)+Ẇk(t;ω)σk(t),
(1)

where A is a linear operator, F denotes a deterministic external forcing, and Ẇkσk corresponds to a stochastic forcing with white noise characteristics. The operator B is assumed to be quadratic and energy-preserving, that is,

B(u,u)·u=0.
(2)

This restrictive definition of B is valid for many important problems in fluid mechanics, retaining the physical relevance of the formulation. For example, B can be viewed as the advection term of turbulent flows, a class of problems that has historically attracted the attention of reduced-order modeling literature.

Using well-known linear algebra results,69 the linear operator A can be decomposed as

A=12(AAT)+12(A+AT)=L+D,
(3)

where L is a skew-symmetric linear operator and D is a symmetric linear operator. Throughout this work, D will also be assumed to be negative definite, which implies that it corresponds to a linear dissipative process. The quantity of interest u is analyzed using the finite-dimensional expansion

u=u¯+u=u¯+i=1NZi(t;ω)vi,
(4)

where vi forms a prescribed orthonormal basis, while Zi are zero-mean, time-dependent random functions. The symbol ω denotes the random argument, and the mean field u¯ can be interpreted as an ensemble average, E[u]=u¯, with respect to ω. Using the above representation, the original dynamical equation can be re-written as

du¯dt+dudt=[L+D]u¯+[L+D]u+B(u¯,u¯)+B(u,u¯)+B(u¯,u)+B(u,u)+F+Ẇk(t;ω)σk(t).
(5)

By taking the expectation of the above equation, we derive the dynamical equation for the average state u¯, that is,

du¯dt=[L+D]u¯+B(u¯,u¯)+i=1Nj=1NZiZj¯B(vi,vj)+F.
(6)

For a second-order statistical framework, an evolution equation for the covariance matrix Rij=ZiZj¯ is also required. To this end, first the equation for the perturbations u=uu¯ is derived:

dudt=[L+D]u+B(u¯,u)+B(u,u¯)+i=1Nj=1N[ZiZjRij]B(vi,vj)+Ẇkσk.
(7)

The projection of the evolution equation for u onto a basis function vn yields

dZndt=+i=1NZi[Avi+B(u¯,vi)+B(vi,u¯)]·vn+i=1Nj=1N[ZiZjRij]B(vi,vj)·vn+Ẇkσk·vn.
(8)

By multiplying the above equation with Zm and taking the ensemble average, we have

dZndtZm¯=+i=1NRim[Avi+B(u¯,vi)+B(vi,u¯)]·vn+i=1Nj=1NZiZjZm¯B(vi,vj)·vn+ZmẆkσk·vn,
(9)

since

RijZm¯=RijZm¯=0.
(10)

Hence, the evolution of the elements of the covariance matrix is dictated by the equation

ddtRmn=i=1NRim[Avi+B(u¯,vi)+B(vi,u¯)]·vn+i=1NRin[Avi+B(u¯,vi)+B(vi,u¯)]·vm+i=1Nj=1NZiZjZm¯B(vi,vj)·vn+i=1Nj=1NZiZjZn¯B(vi,vj)·vm+(σk·vm)(σk·vn).
(11)

The evolution equation for the covariance of the system can then be written as

dRdt=lR+Rl+Qσ+Q,
(12)

where l is a linear operator expressing dissipation and energy transfers between the mean field and the stochastic modes

lij=[(L+D)vj+B(u¯,vj)+B(vj,u¯)]·vi,
(13)

with l being its transpose. The operator (Qσ)ij=(σk·vj)(σk·vi) models effects due to the stochastic external forcing, and Q corresponds to third-order effects that express energy fluxes between different stochastic modes

Qmn=i=1Nj=1NZiZjZm¯B(vi,vj)·vn+i=1Nj=1NZiZjZn¯B(vi,vj)·vm.
(14)

In more detail, l includes the effects of the linear operators on each mode, as well as energy transfers between the mean and each mode via the energy-preserving nonlinear operator. All these effects can be studied and understood under the framework of second-order statistics. It is the mode-to-mode nonlinear energy transfers that require knowledge of higher moments for their estimation. In order to close the system for the covariance and the mean, a model for the third-order terms Q appearing in Eq. (11) is required. To this end, neural networks are utilized to parameterize them. The architecture and constraints used are presented in Secs. II A–II C.

Before describing the details of the data-driven model for Q, we note that an important feature of the presented closure scheme is the requirement to satisfy certain physical principles, which characterize turbulent systems. More specifically, modes that carry small energy or variance can still have important, dynamical effects on the large variance modes through nonlinear energy transfers. A schematic of this property can be seen in Fig. 1. The external force excites the mean, which then through the linear part of the covariance equation l transfers part of this energy to the unstable modes. Since these modes are unstable, the linear operator l can only increase them in amplitude. Hence, the only way for the system to reach a statistical steady state is due to (i) dissipation and (ii) the nonlinear terms Q transfer energy from the unstable modes to the stable ones.

FIG. 1.

Energy flow in the systems under consideration.

FIG. 1.

Energy flow in the systems under consideration.

Close modal

Therefore, it is critical to accurately represent the third-order closure terms. Using the fact that the quadratic term is energy-preserving by construction, Eq. (2) holds for u, and thus, one gets B(u,u)·u=0. Taking the expectation of this relations and expanding it using Eq. (4), one gets

i=1Nm=1Nn=1NZmZnZi¯B(vm,vn)·vi=0,
(15)

or in terms of the closure term45 

Tr[Q]=0.
(16)

This constraint is enforced during training to improve the numerical stability of the predictions. It is emphasized that correctly capturing this constraint is necessary for the purpose of correctly modeling the energy exchanges between stable and unstable modes and thus capturing the correct statistical steady of the system under discussion.

While the dynamical systems under study are Markovian and spatially local, that is, the evolution of u in a specific location and time instant depends only on the current time instant and the current neighborhood, this is not the case for the reduced-order averaged version of these equations. In particular, for the evolution of the mean and covariance equations, one typically does not have access to the full-state information required to fully describe the evolution of the system (in this case third-order moments).

As has been showcased by recently proposed data-driven schemes for dynamical systems,60,61 Takens embedding theorem can be used to enhance the accuracy of predictions.59 The theorem states that even if only a limited number of the state variables of a system are observed, in principle, one can still obtain the attractor of the full system by using delay embeddings of the observed state variables. To this end, the closure terms are parametrized with non-local in time (but still causal) representations, based on temporal convolutional networks70 (TCN) and long short-term memory networks71 (LSTM). These ML architectures have been employed successfully in previous work57 on computing turbulent closures just for the mean equation (6).

In terms of spatial information, the entire mean field and the covariance are utilized as input for the data-informed closure scheme. As a result, the closure terms are modeled in the following form:

Dn,m[Θ;ξ¯[χ(t)]]=i=1Nj=1NZiZjZm¯B(vi,vj)·vn+i=1Nj=1NZiZjZn¯B(vi,vj)·vm,
(17)

where ξ¯={u¯,R} are second-order statistics of the system, and χ(t) denotes the history of the statistics state backward from time t, that is, χ(t)={t,tτ1,,tτ2,,tτN}. The entire mean field u¯ and covariance R are used as input for the neural network. The vector Θ denotes the hyperparameters of the neural network, while the temporal neighborhood, χ(t), is selected such that if further increased, the training error does not significantly reduce any more. The number of points in time that have to be considered depends on the temporal discretization of the domain.

In terms of the training process itself, the input and output data are normalized as typically suggested.72 The loss function for this problem is chosen to be the single-step prediction mean square error superimposed with the physical constraint. This can be formulated as

L(Θ)=1TTnm||Dn,m[Θ;ξ¯]Qn,m||2dt+λTTr[Q]dt,
(18)

where λ is a weight that measures the relative importance between the data and the physical constraint, typically chosen to be 1.

The first application involves a three-dimensional dynamical system that consists of three Langevin equations coupled via quadratic and energy-preserving nonlinearities.73 This triad system acts as a simple surrogate model for barotropic instability. It can be viewed as the result of projection of the fluid equations to three modes, one corresponding to the mean flow and the other two corresponding to wave perturbations. Under these assumptions, the system has the following form:

[du1du2du3]=[γ1000γ2000γ3]D[u1u2u3]dt+[0λ12λ13λ120λ23λ13λ230]L[u1u2u3]dt+[β1u2u3β2u1u3β3u1u2]B(u,u)dt+[F1F2F3]dt+[σ1dW1σ2dW2σ3dW3],
(19)

where D and L are, respectively, the symmetric and skew-symmetric component of the linear operator, F1,F2,F3 correspond to deterministic external excitation, and dW1,dW2,dW3 are independent white-noise processes. It is also assumed that

β1+β2+β3=0
(20)

to ensure that the quadratic term is energy-preserving. Interaction of this reduced-order model with other modes of the full system is modeled via the white-noise terms and linear dissipation, a standard practice in many stochastic climate models.74–76 

The goal of this section is to examine the triad system as a prototypical model for turbulent anisotropic flows. A property of such flows is that the mean exhibits persistent instabilities along certain directions in phase space. Under the assumption that u1 corresponds to the mean-flow variable, this instability can be added to the triad system via a negative γ1 value. Linear dissipation is set as γ1=0.4 and γ2=γ3=2, following a previously presented setup.37 The skew-symmetric part is set as λ12=0.03,λ13=0.06, and λ23=0.09. The coefficients of the nonlinear term are set as β1=2 and β2=β3=1. For the first test case, constant deterministic external forcing is tested. To ensure that the perturbation variables u2, u3 are energetic (and thus remove energy from the mean), the deterministic forcing is set to F1=0,F2=1, and F3=1. Finally, the white noise amplitudes are tuned so as to ensure the system achieves a statistical steady state. For this to happen, the amplitudes are chosen as σ1=0.25 and σ2=σ3=0.79. Initial conditions are sampled from the distribution

[u1(0)u2(0)u3(0)][N(1,0.5)N(0.5,0.2)N(0.5,0.1)].
(21)

For reference data, 105 samples of this system are computed (each time using a new sample of both the white noise forcing and the initial conditions). Reference statistics of these dynamical systems are obtained from this MC simulation. All simulations are carried out with time step dt=0.01 from t =0 until t =25. The system is integrated into time using the forward Euler method. With respect to the reduced-order model, a single-layer LSTM neural network is used during training.

The statistical properties of the system as well as a comparison with the low-order statistical predictions of the proposed data-informed method are depicted in Fig. 2. Subplots (a) and (d) establish that u1 is indeed the most energetic mode of the system and that the system achieves a statistical steady state. Furthermore, as seen from the marginal on the u2u3 plane on subplot (f), the system exhibits strong anisotropic behavior in this plane. Yet, the marginals on the u1u2 and u1u3 planes are Gaussian. The reduced-order model is able to accurately predict the statistical equilibrium of the system. In addition, it adequately captures the intermittent dynamical evolution of the mean and variance of the system. The good agreement with the MC data is observed despite the fact that the pdf of the system is strongly non-Gaussian in the u3u2 space and the mean variable u1 is by construction linearly unstable.

FIG. 2.

Comparison between MC (solid line) and reduced-order (dashed-line) results for triad system with constant forcing. (a) Evolution of mean of the variables of the system. (b) Evolution of off-diagonal components of covariance matrix. (c) Evolution of real part of eigenvalues of the linear operator Lv. (d) Evolution of total system variance and variance of each mode. (e) Contour of the steady-state system pdf for u|f(u)=105. (f) Marginal pdfs of the system at steady state.

FIG. 2.

Comparison between MC (solid line) and reduced-order (dashed-line) results for triad system with constant forcing. (a) Evolution of mean of the variables of the system. (b) Evolution of off-diagonal components of covariance matrix. (c) Evolution of real part of eigenvalues of the linear operator Lv. (d) Evolution of total system variance and variance of each mode. (e) Contour of the steady-state system pdf for u|f(u)=105. (f) Marginal pdfs of the system at steady state.

Close modal

The next test case involves periodic forcing, while the other parameters of the system remain the same as before. In more detail, the deterministic part of the forcing is now set to

F1=0,F2=1+0.5sinπt2,F3=1+0.2cosπt2.
(22)

In addition, the parameters of the stochastic forcing are set to

σ1=0.251.3210sin2πt2,σ2=σ3=0.79250.79×1.32sin2πt2.
(23)

MC results for the system as well as a comparison with the results of the data-informed closure scheme (trained with the previous set of forcing parameters) are shown in Figs. 3(a) and 3(d). Again, despite the strongly non-Gaussian nature of the problem [Figs. 3(b), 3(e), and 3(f)], the closure scheme is able to accurately predict the first- and second-order statistics throughout the duration of the simulation.

FIG. 3.

Comparison between MC (solid line) and reduced-order (dashed-line) results for triad system with periodic forcing. (a) Evolution of mean of the variables of the system. (b) Evolution of off-diagonal components of covariance matrix. (c) Evolution of real part of eigenvalues of the linear operator Lv. (d) Evolution of total system variance and variance of each mode. (e) Contour of pdf at t =25 for u|f(u)=105. (f) Marginal pdfs of the system at t =25.

FIG. 3.

Comparison between MC (solid line) and reduced-order (dashed-line) results for triad system with periodic forcing. (a) Evolution of mean of the variables of the system. (b) Evolution of off-diagonal components of covariance matrix. (c) Evolution of real part of eigenvalues of the linear operator Lv. (d) Evolution of total system variance and variance of each mode. (e) Contour of pdf at t =25 for u|f(u)=105. (f) Marginal pdfs of the system at t =25.

Close modal

As a next step, a higher-dimensional problem, with a large number of positive Lyapunov exponents, is considered. Specifically, the model under discussion is governed by the equation

duidt=ui1(ui+1ui2)ui+F
(24)

for N =40 and F deterministic external forcing. This system acts as a prototypical model to study baroclinic instability in midlatitude flows. The dynamics comprise of an energy-conserving quadratic nonlinearity, a linear dissipation term, and external forcing. The forcing term is assumed to be spatially homogeneous, which as a result implies that the system is translationally invariant in space. This property implies that Fourier modes can be used for the basis expansion and that the covariance matrix can be assumed to be diagonal. One can observe that for different magnitudes of forcing, the number of unstable modes changes (from 0 up to 11). Hence, this is model is an excellent case to assess the presented method.

To illustrate numerically the UQ properties of the proposed closure scheme, the model is trained and tested on an aperiodic forcing generated by the Ornstein–Uhlenbeck process

dF=1τF(FF¯)dt+σFdW,
(25)

where F¯ is the mean value around which the process oscillates and dW models white noise. Here, we chose as modes vn the Fourier modes. Also, we do not perform any covariance reduction; that is, we model the full covariance matrix.

Training occurs with a single stochastic forcing realization. The reference statistics are derived by averaging over a distribution of initial conditions. To derive the required training data, 104 direct numerical simulations (DNS) are averaged. These statistics allow for the calculation of the time-evolution of E[u] and E[Zn2]=E{[(uE[u])·vn]2}. The neural network is then trained using the derived data and under the constraint of Eq. (16), which is equivalent to the constrain that the quadratic operator should be energy conserving. A single-layer LSTM neural network with 150 timesteps as time history is used. Testing of the closure scheme is carried out for four out-of-sample random realizations of the forcing. Only 10 time steps are used as initial input for the neural network. Results are presented in Fig. 4. For all realizations, we have excellent agreement between the spectrum predicted from the data-driven closure and the MC simulation. This is the case despite the strongly transient nature of the excitation that pushes the system away from its statistical steady state.

FIG. 4.

Comparison of ML uncertainty quantification scheme with exact statistics produced by the Monte Carlo method. Results are shown for different dynamical regimes of the aperiodic forcing parameter F generated as an Ornstein–Uhlenbeck process. The color plots present the evolution of the exact and approximated spectrum. We also present the energy of the mean and the trace of the covariance over time. At the last row, we show the steady-state spectrum (exact and approximate).

FIG. 4.

Comparison of ML uncertainty quantification scheme with exact statistics produced by the Monte Carlo method. Results are shown for different dynamical regimes of the aperiodic forcing parameter F generated as an Ornstein–Uhlenbeck process. The color plots present the evolution of the exact and approximated spectrum. We also present the energy of the mean and the trace of the covariance over time. At the last row, we show the steady-state spectrum (exact and approximate).

Close modal

The next test case is an anisotropic multiphase flow setup. It involves the advection of bubbles, which are assumed to be passive inertial tracers, over an incompressible fluid flow. The fluid flow is governed by the Navier–Stokes equations in dimensionless form:

DuDt=p+1ReΔu+νΔ2u+F,
(26)
·u=0,
(27)

where u is the velocity field of the fluid, p its pressure, Re is the Reynolds number of the flow, D/Dt is the material derivative operator, and F denotes an external forcing term. Parameter ν is a hypoviscosity coefficient aiming to remove energy from large scales and maintain the flow in a turbulent regime.

One can follow a similar process for the advection equation governing the motion of small inertial tracers. In particular for small inertia particles, their Lagrangian velocity, v, is a small perturbation of the underlying fluid velocity field77 

v=u+ε(3R21)DuDt+O(ε2),
(28)

where

ε=StR1,R=2ρfρf+2ρp.
(29)

The parameter ε represents the importance of inertial effects, while St is the bubble Stokes number, measuring the ratio between the characteristic timescales of the bubbles over that of the flow. Parameter R is a density ratio with ϱp and ϱf being the density of bubble and the fluid, respectively. By introducing ρ as the concentration of tracers at a particular point, the following transport equation can be derived for the bubble flow:

tρ+·(vρ)=ν2Δ4ρ.
(30)

The right-hand side of the transport equation represents a hyperviscosity term. The hyperviscosity parameter is tuned so as to remove energy from scales close to the resolution limit of the numerical simulations.

Utilizing the series expansion presented here for the variables of the problem:

u=u¯+kZk,u,vk,u,ρ=ρ¯+kZk,ρvk,ρ,
(31)

one can develop evolution equations for the mean and the variance for both the fluid flow and the bubble flow. For the particular problem, the data-informed closure scheme is trained on data from unimodal jet flows (details of the jet configuration have been presented previously57). The Reynolds numbers used in training are Re{650,750,850}. The reference simulations are carried out on a 256 × 256 grid with doubly periodic boundary conditions on a [0,2π]2 rectangular domain. All flows are evolved until they reach a statistical equilibrium. Since MC simulations are prohibitively expensive for this problem, time-averaging is used for the derivation of the energy spectra in the statistical equilibrium.

The derived closure is first tested on unimodal jets in the range Re[500,1000]. For the mean-field model [Eq. (6)], a coarse resolution of 32 × 32 is employed. For the covariance of both the flow field and the density, Fourier modes are utilized. Specifically, modes with wavenumber |k|48 are considered in the covariance evolution [Eq. (11)] that is complemented with the ML closures.

Figure 5 presents the space–time-averaged mean square error between the xy locally averaged DNS flow field, u*¯, and the coarse scale model, u¯,

||u*¯u¯||22=1(2π)2T02π02πt0t0+T(u*¯u¯)2dxdt.
(32)
FIG. 5.

Normalized mean square error of each two-dimensional closure model using TCN, LSTM, and their constrained versions with the energy conservation (6) for unimodal jets. Training includes data for unimodal jets with Re{650,750,850}.

FIG. 5.

Normalized mean square error of each two-dimensional closure model using TCN, LSTM, and their constrained versions with the energy conservation (6) for unimodal jets. Training includes data for unimodal jets with Re{650,750,850}.

Close modal

The numerical results of the presented scheme are compared with first-order closures where the ML correction is applied only on the mean equation (6) and the covariance is not modeled.57 For the first-order closure, both TCN and LSTM architectures were assessed. The constrained versions, cTCN and cLSTM, correspond to closures where, in addition to the L2 error of the closure terms, the physical constraint related to the energy-preserving property of the advection term is included [Eq. (18)]. As expected, the second-order closure scheme outperforms the corresponding first-order schemes, even using a coarser resolution. The inclusion of the constraint significantly improves the accuracy of the predicted statistics for all cases. Furthermore, to showcase the importance of using the constraint derived in Eq. (16), a version of the second-order closure scheme with λ = 0 is also presented. As it can be seen, numerical results are improved when including the constraint.

As an additional test case, the closure scheme is tested on bimodal jets with Re[500,1000] (details of the bimodal jet configuration have been presented previously57). The closure is trained on the same data set as before, that is, on unimodal jets with Reynolds number Re{650,750,850}. Figure 6 presents the space–time-averaged mean square error between the xy locally averaged DNS flow field. In this case, including the constraint during training not only improves numerical accuracy of the results but allows us to avoid numerical instabilities for Reynolds numbers outside the training data set.

FIG. 6.

Normalized mean square error for different closure models and their constrained versions for bimodal jets. Training utilizes data for unimodal jets with Re{650,750,850}.

FIG. 6.

Normalized mean square error for different closure models and their constrained versions for bimodal jets. Training utilizes data for unimodal jets with Re{650,750,850}.

Close modal

In Fig. 7, we compare the energy spectra for the fluid and bubble flows as obtained by DNS and the second-order closure scheme. We also include a comparison with the first-order closure obtained in previous work.57 For both the unimodal and bimodal jets, we have accurate computation of the energy spectra of the fluid flow and the advected bubble dispersion in the statistical equilibrium. Furthermore, the second-order model clearly outperforms the corresponding first-order scheme, indicating that incorporating additional physics through the second-order equation is leading to better performance and improved stability, albeit the additional computational cost.

FIG. 7.

Energy spectra for the fluid flow and bubble flow for (a) a unimodal jet with Re=800 and (b) a bimodal jet with Re=800.

FIG. 7.

Energy spectra for the fluid flow and bubble flow for (a) a unimodal jet with Re=800 and (b) a bimodal jet with Re=800.

Close modal

The last test case involves a two-layer quasi-geostrophic (QG) model. The model considered here78 consists of an advection–diffusion equation for the potential vorticity qi, in each of two immiscible layers with fractional layer thickness

δ=H1/H0,1δ=H2/H0,
(33)

respectively (where H0=H1+H2), and mean zonal velocities U1>U2. The domain under discussion is a square doubly periodic domain with rigid-lid surface boundary conditions. The governing equations for the potential vorticity (PV) of each layer become

tq1=J(ψ1,q1)βxψ1(U1U2)LP2(1δ)xψ1(U1U2)(1δ)xq1+F1
(34)

and

tq2=J(ψ2,q2)βxψ2+(U1U2)LP2δxψ2+(U1U2)δxq2+F2r2ψ2,
(35)

where the field q1 corresponds to the upper-layer PV and q2 to the bottom-layer PV, with ψi being the respective streamfunctions, and J(a,b)=xaybyaxb is the Jacobian operator. The β terms arise from the variation of the vertical projection of Coriolis frequency with latitude, and the kd2 terms result from the imposed shear. The inverse relations are

q1=2ψ1f02gH1(ψ1ψ2),q2=2ψ2+f02gH2(ψ1ψ2)+f0hbH2.
(36)

The dynamics can also be described in terms of the barotropic and baroclinic modes and their corresponding streamfunctions

qt=δq1+(1δ)q2=2ψt,ψt=δψ1+(1δ)ψ2,qc=δ(1δ)(q1q2)=(2kd2)ψc,ψc=δ(1δ)(ψ1ψ2).
(37)

The model dynamics can then be rewritten in terms of the barotropic and baroclinic modes. For periodic boundary conditions with a flat bottom, this yields

tqt=J(ψt,qt)J(ψc,qc)(1δ)r2(ψta1ψc)Ux2ψcβxψtν4qt,tqc=J(ψt,qc)J(ψc,qt)ξJ(ψc,qc)+δ(1δ)r2(ψta1ψc)βxψcUx(2ψt+λ2ψt+ξ2ψc)ν4qc,
(38)

where

λ2=kd2,ξ=12δδ(1δ),
(39)

with ξ expressing the triple interaction coefficient, and U=δ(1δ)(U1U2) is the shear intensity. Three different regimes can be distinguished for this model, designated by values of the model parameters: (i) “low latitude” or weakly supercritical (βkd2/2,r=1), (ii) “midlatitude” or moderately supercritical (βkd2/4,r=4), and (iii) “high latitude” or strongly supercritical (β0,r=16). Hence, β decreases as latitude increases, while the bottom friction coefficient r is increased with in order to keep the inverse energy cascade from reaching the resolution limit.

The results presented here are derived for parameters δ=0.2,r=9,β=10 and λ = 10; a set of parameters that corresponds to baroclinic ocean turbulence at high latitudes. A typical snapshot of the qt, qc is shown in Fig. 8. Typical energy properties of such flows are shown in Fig. 9.

FIG. 8.

Typical snapshots (vorticity fields) of the (a) barotropic and (b) baroclinic mode for baroclinic ocean turbulence at high latitudes.

FIG. 8.

Typical snapshots (vorticity fields) of the (a) barotropic and (b) baroclinic mode for baroclinic ocean turbulence at high latitudes.

Close modal
FIG. 9.

(a): Barotropic, baroclinic, and total energy with respect to the wavenumber |k|. (b) Wavenumber-averaged heat flux normalized over its maximum value; stability indicator: max|k|=kReλi(k) normalized over its maximum magnitude, where λi(k) are the vertical eigenvalues for each wavenumber; wavenumber-averaged BT/BC nonlinear energy fluxes.

FIG. 9.

(a): Barotropic, baroclinic, and total energy with respect to the wavenumber |k|. (b) Wavenumber-averaged heat flux normalized over its maximum value; stability indicator: max|k|=kReλi(k) normalized over its maximum magnitude, where λi(k) are the vertical eigenvalues for each wavenumber; wavenumber-averaged BT/BC nonlinear energy fluxes.

Close modal

For the implementation of the presented method, the potential vorticity is expanded to

qj=i=1NZi(t)vi.
(40)

Due to the periodicity of the domain, Fourier modes are used as a basis.

To test the performance of the closure scheme, we first train the model for U =1.00, and to assess its performance, we test it for different mean velocity, chosen within the interval U[0.75,1.25]. For training, validation, and testing, the flow is solved assuming doubly periodic lateral boundary conditions and a 256 × 256 discretization. For the coarse-scale simulations, a discretization 48 × 48 is utilized for the mean flow, while for the covariance, we include all Fourier modes with wavenumbers |k|16. To assess the performance of the closure scheme, the first metric used will be the energy of the system and its spectrum

Etotal=Et+Ec=k[|k|2|ψ̂k,l|2+(|k|2+λ2)|τ̂k,l|2],
(41)

where Et and Ec are the energy carried by the barotropic and baroclinic modes, respectively. Parameters k and l signify the zonal and meridional component of wavenumber, respectively. Furthermore, the heat flux and its spectrum will also be used as metrics

Hf=λU2|k|ikq̂ψ,klq̂τ,kl|k|2(|k|2+λ2).
(42)

In Fig. 10, the total mean energy and heat flux are shown for different values of U[0.75,1.25]; we present the results from the coarse resolution solver with the ML closures and compare these with the DNS. We also show the radially averaged energy and heat flux spectra and note the favorable comparison between the DNS results and the data-informed closure scheme. For a more detailed comparison, the energy and heat flux spectra for the case U =0.95 are also computed. Results are presented in Fig. 11, where the total normalized energy spectrum, the heat flux, and nonlinear energy fluxes are compared with DNS calculations. In Fig. 12, the energy components carried by the barotropic and baroclinic modes, respectively, are also compared with DNS results. In all cases, the coarse-scale simulation is able to accurately capture the equilibrium statistics of the flow.

FIG. 10.

Comparison between DNS results (dashed line) and ML results (solid line) for (a) and (b) percentage comparison of the average energy and heat flux for different shear stresses and (c)–(j) energy and heat flux radially averaged spectrum for 4 different testing cases.

FIG. 10.

Comparison between DNS results (dashed line) and ML results (solid line) for (a) and (b) percentage comparison of the average energy and heat flux for different shear stresses and (c)–(j) energy and heat flux radially averaged spectrum for 4 different testing cases.

Close modal
FIG. 11.

Comparison between DNS simulations and ML model for U =0.95. Results show total energy spectrum and heat flux. The black dashed line is the 10%maxk|Hfkl| contour of the heat flux field.

FIG. 11.

Comparison between DNS simulations and ML model for U =0.95. Results show total energy spectrum and heat flux. The black dashed line is the 10%maxk|Hfkl| contour of the heat flux field.

Close modal
FIG. 12.

Comparison of barotropic and baroclinic energy between spectral code and ML-closure scheme for U =0.95.

FIG. 12.

Comparison of barotropic and baroclinic energy between spectral code and ML-closure scheme for U =0.95.

Close modal

Furthermore, in Fig. 13 the marginal pdfs for the leading modes ψ̂(1,0),ψ̂(0,1) of the barotropic streamfunction ψ=(ψ1+ψ2)/2 are presented. Training takes place for flows with parameters (β,r)=(0.8,0.2),(1.5,0.1),(2.5,0.1). We compare the closure schemes for two cases: the midlatitude case that corresponds to kd=4,β=2,r=0.1 and the high-latitude case that corresponds to kd=4,β=1,r=0.2.

FIG. 13.

Comparison of pdf of deviations between DNS (red line), first-order closure (green line), second-order closure (blue line). The best-fitted Gaussian pdf for the DNS results (black line) is also shown with dashed line. Results are shown for (a) high latitude and (b) midlatitude atmospheric flows.

FIG. 13.

Comparison of pdf of deviations between DNS (red line), first-order closure (green line), second-order closure (blue line). The best-fitted Gaussian pdf for the DNS results (black line) is also shown with dashed line. Results are shown for (a) high latitude and (b) midlatitude atmospheric flows.

Close modal

We also include a comparison with the first-order closure scheme.57 The first-order closure results use the same resolution as the presented closure, with a TCN-based architecture and a constraint to the loss function that enforces the energy-preserving property of the quadratic nonlinearity of the system. We observe that the second-order closure captures very accurately the non-Gaussian structure of the pdf. This is not the case for the first-order closure, which typically underestimates the variance but also misses the bimodal character of the mode ψ̂(0,1). Similarly, in Fig. 14, the pdf of the top and bottom layer streamfunctions are shown. Again, the proposed second-order closure outperforms the first-order closure scheme. Discrepancies in the tails of the pdf may be due to unresolved high wavenumbers, especially since the large-scale modes seem to be well approximated by the second-order scheme. Finally, it is noted that training the second-order scheme with the same hyperparameters but setting λ = 0 yields a closure that becomes unstable as the flow evolves, highlighting the importance of the constraint in numerical stability.

FIG. 14.

Comparison of pdf of streamfunctions between DNS (red line), first-order closure (green line), and second-order closure (blue line). Results are shown for (a) high latitude and (b) midlatitude atmospheric flows.

FIG. 14.

Comparison of pdf of streamfunctions between DNS (red line), first-order closure (green line), and second-order closure (blue line). Results are shown for (a) high latitude and (b) midlatitude atmospheric flows.

Close modal

We have formulated and assessed a data-informed turbulence-closure scheme that respects the underlying conservation properties of nonlinear advection. The method employs a second-order framework for the uncertainty quantification of nonlinear and turbulent dynamical systems. We first applied our approach to prototypical problems of nonlinear dynamics, like the unstable triad system and the Lorentz-96 model. In both cases, the data-informed approach produced results in good agreement with reference MC simulations. Furthermore, the method was applied to more realistic turbulent flows, involving anisotropic multiphase flows and a two-layer quasi-geostrophic model. The obtained results demonstrated the improvement of applying the closure at the second-order level, as opposed to mean-flow closures, but also how the results are improved by encoding the energy conservation property of the nonlinear terms in the training process. In addition, we illustrated that the ML closure of the covariance equation allows for accurate modeling of highly non-trivial non-Gaussian statistics that govern the response of low wavenumbers in the QG model. Future work involves the application of these ideas to 3D turbulence and the detailed statistical modeling of spatiotemporally intermittent phenomena.

This paper has been supported through ONR-Multidisciplinary University Research Initiative (MURI) Grant No. N00014–17-1–2676 and AFOSR-MURI Grant No. FA9550–21-1–0058 and the Defense Advanced Research Projects Agency Grant No. HR00112290029.

The authors have no conflicts to disclose.

A. Charalampopoulos: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (equal); Methodology (lead); Software (lead); Writing - original draft (equal); Writing - review and editing (equal). T. Sapsis: Conceptualization (equal); Project administration (equal); Supervision (equal); Writing - original draft (equal); Writing - review and editing (equal).

The data that support the findings of this study are available from the corresponding author upon reasonable request.

1.
A.
Kaintura
,
T.
Dhaene
, and
D.
Spina
, “
Review of polynomial chaos-based methods for uncertainty quantification in modern integrated circuits
,”
Electronics
7
,
30
(
2018
).
2.
R. G.
Ghanem
and
P. D.
Spanos
, “
Spectral stochastic finite-element formulation for reliability analysis
,”
J. Eng. Mech.
117
,
2351
2372
(
1991
).
3.
R. G.
Ghanem
and
P. D.
Spanos
,
Stochastic Finite Elements: A Spectral Approach
(
Courier Corporation
,
2003
).
4.
M. A.
Katsoulakis
,
A. J.
Majda
, and
D. G.
Vlachos
, “
Coarse-grained stochastic processes for microscopic lattice systems
,”
Proc. Natl. Acad. Sci.
100
,
782
787
(
2003
).
5.
A.
Chatterjee
and
D. G.
Vlachos
, “
An overview of spatial microscopic and accelerated kinetic Monte Carlo methods
,”
J. Comput.-Aided Mater. Des.
14
,
253
308
(
2007
).
6.
B. D.
Phenix
,
J. L.
Dinaro
,
M. A.
Tatang
,
J. W.
Tester
,
J. B.
Howard
, and
G. J.
McRae
, “
Incorporation of parametric uncertainty into complex kinetic mechanisms: Application to hydrogen oxidation in supercritical water
,”
Combust. Flame
112
,
132
146
(
1998
).
7.
R.
Ghanem
and
S.
Dham
, “
Stochastic finite element analysis for multiphase flow in heterogeneous porous media
,”
Transp. Porous Media
32
,
239
262
(
1998
).
8.
R.
Ghanem
, “
Probabilistic characterization of transport in heterogeneous media
,”
Comput. Methods Appl. Mech. Eng.
158
,
199
220
(
1998
).
9.
O. P. L.
Maitre
,
O. M.
Knio
,
H.
Najm
, and
R.
Ghanem
, “
A stochastic projection method for fluid flow: Basic formulation
,”
J. Comput. Phys.
173
,
481
511
(
2001
).
10.
T. D.
Hien
and
M.
Kleiber
, “
Stochastic finite element modelling in linear transient heat transfer
,”
Comput. Methods Appl. Mech. Eng.
144
,
111
124
(
1997
).
11.
D.
Xiu
and
G. E.
Karniadakis
, “
A new stochastic approach to transient heat conduction modeling with uncertainty
,”
Int. J. Heat Mass Transfer
46
,
4681
4693
(
2003
).
12.
A.
Townsend
,
The Structure of Turbulent Shear Flow
(
Cambridge University Press
,
1980
).
13.
G. K.
Vallis
,
Atmospheric and Oceanic Fluid Dynamics
(
Cambridge University Press
,
2017
).
14.
R.
Salmon
,
Lectures on Geophysical Fluid Dynamics
(
Oxford University Press
,
1998
).
15.
A.
Majda
and
X.
Wang
,
Nonlinear Dynamics and Statistical Theories for Basic Geophysical Flows
(
Cambridge University Press
,
2006
).
16.
A. J.
Majda
and
B.
Gershgorin
, “
Quantifying uncertainty in climate change science through empirical information theory
,”
Proc. Natl. Acad. Sci.
107
,
14958
14963
(
2010
).
17.
N.
Metropolis
and
S.
Ulam
, “
The Monte Carlo method
,”
J. Am. Stat. Assoc.
44
,
335
341
(
1949
).
18.
J. M.
Hammersley
, “
Monte Carlo methods for solving multivariable problems
,”
Ann. New York Acad. Sci.
86
,
844
874
(
1960
).
19.
J.
Hammersley
,
Monte Carlo Methods
(
Springer Science & Business Media
,
2013
).
20.
W. R.
Gilks
,
S.
Richardson
, and
D.
Spiegelhalter
,
Markov Chain Monte Carlo in Practice
(
CRC Press
,
1995
).
21.
T.
Bengtsson
,
P.
Bickel
, and
B.
Li
, “
Curse-of-dimensionality revisited: Collapse of the particle filter in very large scale systems
,” in
Probability and Statistics: Essays in Honor of David A. Freedman
(
Institute of Mathematical Statistics
,
2008
), pp.
316
334
.
22.
A. J.
Majda
and
D.
Qi
, “
Strategies for reduced-order models for predicting the statistical responses and uncertainty quantification in complex turbulent dynamical systems
,”
SIAM Rev.
60
,
491
549
(
2018
).
23.
L.
Sirovich
, “
Turbulence and the dynamics of coherent structures. I. Coherent structures
,”
Q. Appl. Math.
45
,
561
571
(
1987
).
24.
L.
Sirovich
, “
Turbulence and the dynamics of coherent structures. II. Symmetries and transformations
,”
Q. Appl. Math.
45
,
573
582
(
1987
).
25.
L.
Sirovich
, “
Turbulence and the dynamics of coherent structures. III. Dynamics and scaling
,”
Q. Appl. Math.
45
,
583
590
(
1987
).
26.
P.
Holmes
,
J. L.
Lumley
,
G.
Berkooz
, and
C. W.
Rowley
,
Turbulence, Coherent Structures, Dynamical Systems and Symmetry
(
Cambridge university Press
,
2012
).
27.
S.
Lall
,
J. E.
Marsden
, and
S.
Glavaski
, “
A subspace approach to balanced truncation for model reduction of nonlinear control systems
,”
Int. J. Robust Nonlinear Control
12
,
519
(
2002
).
28.
Z.
Ma
,
C. W.
Rowley
, and
G.
Tadmor
, “
Snapshot-based balanced truncation for linear time-periodic systems
,”
IEEE Trans. Automat. Control
55
,
469
473
(
2010
).
29.
T. P.
Sapsis
and
P. F.
Lermusiaux
, “
Dynamically orthogonal field equations for continuous stochastic dynamical systems
,”
Phys. D: Nonlinear Phenom.
238
,
2347
2360
(
2009
).
30.
T. P.
Sapsis
, “
Dynamically orthogonal field equations for stochastic fluid flows and particle dynamics
,” Ph.D. thesis (
Massachusetts Institute of Technology
,
2011
).
31.
H. N.
Najm
, “
Uncertainty quantification and polynomial chaos techniques in computational fluid dynamics
,”
Annu. Rev. Fluid Mech.
41
,
35
52
(
2009
).
32.
T.
Hou
,
W.
Luo
,
B.
Rozovskii
, and
H.-M.
Zhou
, “
Wiener Chaos expansions and numerical solutions of randomly forced equations of fluid mechanics
,”
J. Comput. Phys.
216
,
687
(
2006
).
33.
M. A.
Hariri-Ardebili
and
B.
Sudret
, “
Polynomial chaos expansion for uncertainty quantification of dam engineering problems
,”
Eng. Struct.
203
,
109631
(
2020
).
34.
D. T.
Crommelin
and
A. J.
Majda
, “
Strategies for model reduction: Comparing different optimal bases
,”
J. Atmos. Sci.
61
,
2206
2217
(
2004
).
35.
M.
Branicki
and
A. J.
Majda
, “
Quantifying uncertainty for predictions with model error in non-Gaussian systems with intermittency
,”
Nonlinearity
25
,
2543
(
2012
).
36.
T.
Sapsis
,
A.
Vakakis
, and
L.
Bergman
, “
Targeted energy transfer between dynamical components due to essential nonlinearities: A stochastic perspective
,” in
Safety, Reliability, Risk and Life-Cycle Performance of Structures and Infrastructures—Proceedings of the 11th International Conference on Structural Safety and Reliability, ICOSSAR 2013
,
2013
.
37.
T. P.
Sapsis
and
A. J.
Majda
, “
Blending modified Gaussian closure and non-Gaussian reduced subspace methods for turbulent dynamical systems
,”
J. Nonlinear Sci.
23
,
1039
1071
(
2013
).
38.
A.
Gritsun
and
G.
Branstator
, “
Climate response using a three-dimensional operator based on the fluctuation–dissipation theorem
,”
J. Atmos. Sci.
64
,
2558
2575
(
2007
).
39.
R. V.
Abramov
and
A. J.
Majda
, “
Blended response algorithms for linear fluctuation-dissipation for complex nonlinear dynamical systems
,”
Nonlinearity
20
,
2793
(
2007
).
40.
A.
Gritsun
,
G.
Branstator
, and
A.
Majda
, “
Climate response of linear and quadratic functionals using the fluctuation–dissipation theorem
,”
J. Atmos. Sci.
65
,
2824
2841
(
2008
).
41.
R. V.
Abramov
and
A. J.
Majda
, “
New algorithms for low frequency climate response
,”
J. Atmos. Sci.
66
,
286
309
(
2009
).
42.
M.
Hairer
and
A. J.
Majda
, “
A simple framework to justify linear response theory
,”
Nonlinearity
23
,
909
(
2010
).
43.
A. J.
Majda
,
R.
Abramov
, and
B.
Gershgorin
, “
High skill in low-frequency climate response through fluctuation dissipation theorems despite structural instability
,”
Proc. Natl. Acad. Sci.
107
,
581
586
(
2010
).
44.
M. A.
Khodkar
and
P.
Hassanzadeh
, “
Data-driven reduced modelling of turbulent Rayleigh–Bénard convection using DMD-enhanced fluctuation–dissipation theorem
,”
J. Fluid Mech.
852
,
R3
(
2018
).
45.
T. P.
Sapsis
and
A. J.
Majda
, “
A statistically accurate modified quasilinear Gaussian closure for uncertainty quantification in turbulent dynamical systems
,”
Phys. D
252
,
34
45
(
2013
).
46.
H.
Zhang
,
J.
Harlim
, and
X.
Li
, “
Computing linear response statistics using orthogonal polynomial based estimators: An RKHS formulation
,” e-print arXiv:1912.11110 (
2019
).
47.
S. W.
Jiang
and
J.
Harlim
, “
Modeling of missing dynamical systems: Deriving parametric models using a nonparametric framework
,”
Res. Math. Sci.
7
,
16
(
2020
).
48.
C.
Schwab
and
J.
Zech
, “
Deep learning in high dimension: Neural network expression rates for generalized polynomial chaos expansions in UQ
,”
Anal. Appl.
17
,
19
55
(
2019
).
49.
A. J.
Majda
and
D.
Qi
, “
Linear and nonlinear statistical response theories with prototype applications to sensitivity analysis and statistical control of complex turbulent dynamical systems
,”
Chaos
29
,
103131
(
2019
).
50.
M.
Gamahara
and
Y.
Hattori
, “
Searching for turbulence models by artificial neural network
,”
Phys. Rev. Fluids
2
,
054604
(
2017
).
51.
A. P.
Singh
,
S.
Medida
, and
K.
Duraisamy
, “
Machine-learning-augmented predictive modeling of turbulent separated flows over airfoils
,”
AIAA J.
55
,
2215
2227
(
2017
).
52.
R.
Maulik
,
O.
San
,
A.
Rasheed
, and
P.
Vedula
, “
Subgrid modelling for two-dimensional turbulence using neural networks
,”
J. Fluid Mech.
858
,
122
144
(
2019
).
53.
R.
Maulik
,
A.
Mohan
,
B.
Lusch
,
S.
Madireddy
,
P.
Balaprakash
, and
D.
Livescu
, “
Time-series learning of latent-space dynamics for reduced-order model closure
,”
Phys. D: Nonlinear Phenom.
405
,
132368
(
2020
).
54.
X. I.
Yang
,
S.
Zafar
,
J. X.
Wang
, and
H.
Xiao
, “
Predictive large-eddy-simulation wall modeling via physics-informed neural networks
,”
Phys. Rev. Fluids
4
,
034602
(
2019
).
55.
M.
Ma
,
J.
Lu
, and
G.
Tryggvason
, “
Using statistical learning to close two-fluid multiphase flow equations for a simple bubbly system
,”
Phys. Fluids
27
,
092101
(
2015
).
56.
M.
Ma
,
J.
Lu
, and
G.
Tryggvason
, “
Using statistical learning to close two-fluid multiphase flow equations for bubbly flows in vertical channels
,”
Int. J. Multiphase Flow
85
,
336
347
(
2016
).
57.
A.-T. G.
Charalampopoulos
and
T. P.
Sapsis
, “
Machine-learning energy-preserving nonlocal closures for turbulent fluid flows and inertial tracers
,” e-print arXiv:2102.07639 (
2021
).
58.
A.-T.
Charalampopoulos
,
S. H.
Bryngelson
,
T.
Colonius
, and
T. P.
Sapsis
, “
Hybrid quadrature moment method for accurate and stable representation of non-Gaussian processes and their dynamics
,” e-print arXiv:2110.01374 (
2021
).
59.
F.
Takens
, “
Detecting strange attractors in turbulence
,” in
Dynamical Systems and Turbulence, Warwick 1980
(
Springer
,
1981
), pp.
366
381
.
60.
P. R.
Vlachas
,
W.
Byeon
,
Z. Y.
Wan
,
T. P.
Sapsis
, and
P.
Koumoutsakos
, “
Data-driven forecasting of high-dimensional chaotic systems with long-short term memory networks
,”
Proc. R. Soc. A
474
,
20170844
(
2018
).
61.
Z. Y.
Wan
,
P. R.
Vlachas
,
P.
Koumoutsakos
, and
T. P.
Sapsis
, “
Data-assisted reduced-order modeling of extreme events in complex dynamical systems
,”
PLoS One
13
,
e0197704
(
2018
).
62.
P. R.
Vlachas
,
J.
Pathak
,
B. R.
Hunt
,
T. P.
Sapsis
,
M.
Girvan
,
E.
Ott
, and
P.
Koumoutsakos
, “
Backpropagation algorithms and reservoir computing in recurrent neural networks for the forecasting of complex spatiotemporal dynamics
,”
Neural Networks
126
,
191
217
(
2020
).
63.
S. H.
Bryngelson
,
A.
Charalampopoulos
,
T. P.
Sapsis
, and
T.
Colonius
, “
A Gaussian moment method and its augmentation via LSTM recurrent neural networks for the statistics of cavitating bubble populations
,”
Int. J. Multiphase Flow
127
,
103262
(
2020
).
64.
T. P.
Sapsis
, “
Attractor local dimensionality, nonlinear energy transfers, and finite-time instabilities in unstable dynamical systems with applications to 2D fluid flows
,”
Proc. R. Soc. A
469
,
20120550
(
2013
).
65.
T. P.
Sapsis
and
A. J.
Majda
, “
Statistically accurate low-order models for uncertainty quantification in turbulent dynamical systems
,”
Proc. Natl. Acad. Sci. U. S. A.
110
,
13705
(
2013
).
66.
A. J.
Majda
, “
Statistical energy conservation principle for inhomogeneous turbulent dynamical systems
,”
Proc. Natl. Acad. Sci.
112
,
8937
8941
(
2015
).
67.
S.
Bai
,
J. Z.
Kolter
, and
V.
Koltun
, “
An empirical evaluation of generic convolutional and recurrent networks for sequence modeling
,” e-print arXiv:1803.01271 (
2018
).
68.
D.
Qi
and
J.
Harlim
, “
Machine learning-based statistical closure models for turbulent dynamical systems
,” e-print arXiv:2108.13220 (
2021
).
69.
G.
Strang
,
Introduction to Linear Algebra
(
Wellesley-Cambridge Press
,
Wellesley, MA
,
1993
), Vol.
3
.
70.
A.
van den Oord
,
S.
Dieleman
,
H.
Zen
,
K.
Simonyan
,
O.
Vinyals
,
A.
Graves
,
N.
Kalchbrenner
,
A.
Senior
, and
K.
Kavukcuoglu
, “
Wavenet: A generative model for raw audio
,” e-print arXiv:1609.03499 (
2016
).
71.
S.
Hochreiter
and
J.
Schmidhuber
, “
Long short-term memory
,”
Neural Comput.
9
,
1735
1780
(
1997
).
72.
S.
Shalev-Shwartz
and
S.
Ben-David
,
Understanding Machine Learning: From Theory to Algorithms
(
Cambridge University Press
,
2014
).
73.
A. J.
Majda
,
I.
Timofeyev
, and
E. V.
Eijnden
, “
Models for stochastic climate prediction
,”
Proc. Natl. Acad. Sci.
96
,
14687
14691
(
1999
).
74.
C. E.
Leith
, “
Climate response and fluctuation dissipation
,”
J. Atmos. Sci.
32
,
2022
2026
(
1975
).
75.
T.
Hasselman
, “
Modal coupling in lightly damped structures
,”
AIAA J.
14
,
1627
1628
(
1976
).
76.
T.
DelSole
and
B. F.
Farrell
, “
The quasi-linear equilibration of a thermally maintained, stochastically excited jet in a quasigeostrophic model
,”
J. Atmos. Sci.
53
,
1781
1797
(
1996
).
77.
G.
Haller
and
T.
Sapsis
, “
Where do inertial particles go in fluid flows?
,”
Phys. D: Nonlinear Phenom.
237
,
573
583
(
2008
).
78.
N. A.
Phillips
, “
Energy transformations and meridional circulations associated with simple baroclinic waves in a two-level, quasi-geostrophic model
,”
Tellus
6
,
274
286
(
1954
).