The wave kinetic equation (WKE) describing drift-wave (DW) turbulence is widely used in the studies of zonal flows (ZFs) emerging from DW turbulence. However, this formulation neglects the exchange of enstrophy between DWs and ZFs and also ignores effects beyond the geometrical-optics limit. We derive a modified theory that takes both of these effects into account, while still treating DW quanta (“driftons”) as particles in phase space. The drifton dynamics is described by an equation of the Wigner–Moyal type, which is commonly known in the phase-space formulation of quantum mechanics. In the geometrical-optics limit, this formulation features additional terms missing in the traditional WKE that ensure exact conservation of the total enstrophy of the system, in addition to the total energy, which is the only conserved invariant in previous theories based on the WKE. Numerical simulations are presented to illustrate the importance of these additional terms. The proposed formulation can be considered as a phase-space representation of the second-order cumulant expansion, or CE2.

The formation of zonal flows (ZFs) is a problem of fundamental interest in many contexts, including the physics of planetary atmospheres, astrophysics, and fusion science.1–7 In particular, the interaction of ZFs and drift-wave (DW) turbulence in laboratory plasmas significantly affects the transport of energy, momentum, and particles, so understanding it is critical to improving the plasma confinement. But modeling the underlying physics remains a difficult problem. The workhorse approach to describing the DW-ZF coupling is the wave kinetic equation (WKE),5,8 but it is limited to the ray approximation9 and, in fact, is oversimplified even as a geometrical-optics10 (GO) model. That leads to missing essential physics, as was recently pointed out in Ref. 11 and will be elaborated below. These issues can be fixed by using the more accurate quasilinear approach known as the second-order cumulant expansion, or CE2,12–16 whose applications to DW-ZF physics were pursued in Refs. 17–20. However, the CE2 is less intuitive than the WKE, and its robustness with respect to further approximations remains obscure. Having an approach as accurate as the CE2 and as intuitive as the WKE would be more advantageous.

Here, we propose such an approach for a DW turbulence model based on the generalized Hasegawa–Mima equation (gHME).21,22 The idea is as follows. We start by splitting the gHME into two coupled equations that describe ZFs and fluctuations, respectively, and then linearize the equation for fluctuations, like in the CE2 approach. We notice then that this linearized equation is similar to that describing a quantum particle that is governed by a generalized (non-Hermitian) Hamiltonian. By drawing on this analogy, we then formulate an exact (modulo quasilinear approximation) kinetic equation for such particle, which is akin to the so-called Wigner–Moyal equation in quantum mechanics.23–25,42,43

Compared to the CE2, the Wigner–Moyal formulation is arguably more intuitive, namely, for two reasons: (i) like the traditional WKE (hereafter denoted by tWKE), it permits viewing DW quanta (“driftons”) as particles, except that now driftons are quantumlike particles, i.e., have nonzero wavelengths; and (ii) the separation between Hamiltonian effects and dissipation remains transparent and unambiguous even beyond the GO approximation. Compared to the tWKE, the new approach is also more accurate because (i) it captures effects beyond the GO limit, and (ii) even in the GO limit, it predicts corrections to the tWKE that emerge from the newly found corrections to the drifton dispersion. (In this aspect, our paper can be understood as an expansion of the GO approximation introduced in Ref. 11.) These corrections are essential as they allow DW-ZF enstrophy exchange, which is not included in the tWKE. By deriving the GO limit from first principles, we eliminate this discrepancy and obtain a formulation that exactly conserves the total enstrophy (as opposed to the DW enstrophy conservation predicted by the tWKE) and the total energy, in precise agreement with the underlying gHME. We also illustrate the substantial difference between the GO limit of our formulation and the tWKE using numerical simulations.

The paper is organized as follows. In Sec. II, we introduce the gHME and its quasilinear approximation. In Sec. III, we derive the Wigner–Moyal formulation. In Sec. IV, we rederive the dispersion relation for the linear growth rate of ZFs. In Sec. V, we derive a corrected WKE that, in contrast to the tWKE, conserves both the total enstrophy and energy. Numerical simulations are presented to compare the new WKE with the tWKE. In Sec. VI, we summarize our main results. Auxiliary calculations are presented in Appendixes. This includes a brief introduction to the Weyl calculus that we extensively use in our paper ( Appendix A), a spectral representation of our formulation ( Appendix B), and proofs of the conservation properties of our models ( Appendix C).

Our formulation is based on the gHME21,22

tw+v·w+βxψ=Q,
(1)

which is widely used to describe the electrostatic two-dimensional (2-D) turbulent flows both in a magnetized plasma with a density gradient and in an atmospheric fluid on a rotating planet, where the role of DWs is played by Rossby waves.1,20 Both contexts will be described on the same footing, so our results are applicable to DWs and Rossby waves equally. We assume the usual geophysical coordinate system, where x = (x, y) is a 2-D coordinate, the x-axis is the ZF direction, and the y-axis is the direction of the local gradient of the plasma density or of the Coriolis parameter. (In the context of fusion plasmas, a different choice of coordinates is usually preferred in literature, where x and y are swapped.) The constant β is a measure of this gradient. The function ψ(x, t) is the electric potential or the stream function, v=ez×ψ is the fluid velocity on the x plane, and ez is a unit vector normal to this plane. The function w(x, t) is the generalized vorticity given by w(2LD2α̂)ψ, where α̂ is an operator such that α̂=1 in parts of the spectrum corresponding to DWs and α̂=0 in those corresponding to ZFs. (The symbol denotes definitions.) Also, LD is the plasma sound radius or the deformation radius. (For plasmas, one can take LD = 1 in normalized units.21 Also, the barotropic model used in geophysics is recovered in the limit LD.12–15) The term Q(x, t) describes the external forces and dissipation. Systems with Q = 0 will be called isolated.

Let us decompose the fields into their zonal-averaged and fluctuating components, denoted with bars and tildes, respectively. (For any g, its zonal average is g¯dxg/Lx, where Lx, henceforth assumed equal to one, is the system length along x axis.) In particular, w=w¯+w̃, where the two components of the generalized vorticity are18 

w¯=2ψ¯,w̃=D2ψ̃,
(2)

where D22LD2. Equations for w̃ and w¯ are obtained by taking the zonal-average and fluctuating parts of Eq. (1). This gives

(3)
tw̃+ṽ·w¯+v¯·w̃+βxψ̃+fNL=Q̃,
(3a)
tw¯+ṽ·w̃¯=Q¯,
(3b)
where fNLṽ·w̃ṽ·w̃¯ is a term nonlinear with respect to fluctuations. As discussed in Ref. 15, this term represents “eddy-eddy” interactions and is responsible for the Batchelor–Kraichnan inverse-energy cascade; however, it is inessential for the formation of ZFs. Since the main scope of this paper is to specifically study the interaction between eddies and ZFs, we ignore eddy-eddy interactions so fNL will be neglected. Hence,
(4)
tw̃+ṽ·w¯+v¯·w̃+βxψ̃=Q̃,
(4a)
tw¯+ṽ·w̃¯=Q¯.
(4b)
Equations (4) compose the well-known quasilinear model.12 In isolated systems, both sets of equations conserve the enstrophy Z and the energy E (strictly speaking, free energy), which are defined as

Z12d2xw2,E12d2xwψ.
(5)

It is convenient to rewrite Eqs. (4) in terms of the ZF velocity v¯=exU, whose only component U(y, t) is U=yψ¯. Specifically, one has ṽ·w¯=(xψ̃)(y2U),v¯·w̃=Uxw̃, and ṽ·w̃¯=y2ṽxṽy¯. We will also assume Q̃=ξ̃μdww̃ and Q¯=μzfw¯. Here, ξ̃ is some external force with zero zonal average (eventually, we will assume it to be a white noise), and the constant coefficients μdw and μzf are intended to emulate the dissipation of DWs and ZFs caused by the external environment. Then, Eqs. (4) become

(6)
tw̃+Uxw̃+[β(y2U)]xψ̃=ξ̃μdww̃,
(6a)
tU+μzfU+yṽxṽy¯=0.
(6b)

Equations (6) are the same model as the one that underlies the CE2. Although not exact, this model is useful because it captures the key aspects of ZF dynamics, such as formation and merging of zonal jets.15,20,26 Below, we use it to derive a formulation of DW-ZF interactions alternative to the CE2.

Consider a family of all reversible linear transformations of w̃(x,t) of the form d2xK(x,x,t)w̃(x,t). These transformations map w̃(x,t) into some family of image functions. Since these functions are mutually equivalent up to an isomorphism, the resulting family can be viewed as a single object, a time-dependent “state vector” |w̃. (Analogous definitions will be assumed also for |ψ̃ and |ξ̃.) The original function w̃(x,t) is then understood as a projection of |w̃, namely, as its “coordinate representation” given by w̃(x,t)=x|w̃. Here, |x are the eigenstates of the position operator x̂ normalized such that x|x̂|x=xx|x=xδ(xx). This definition of a field is similar to that used in quantum mechanics for describing probability amplitudes.27 Hence, it is convenient to describe the dynamics of |w̃ using a quantumlike formalism. This is done as follows.

In addition to the coordinate operator x̂, we introduce a momentum (wave-vector) operator p̂ such that, in the coordinate representation, p̂i. Accordingly, |w̃=p̂D2|ψ̃, where

p̂D2p̂2+LD2,p̂2p̂·p̂.
(7)

Hence, Eq. (6a) can be represented in the following form:

it|w̃=Ĥ|w̃+i|ξ̃.
(8)

The operator Ĥ is given by

Ĥβp̂xp̂D2+Ûp̂x+Ûp̂xp̂D2iμdw.
(9)

Also, ÛU(ŷ,t), and the prime above U henceforth denotes y; in particular, Ûy2U(ŷ,t).

Let us express Eq. (9) as Ĥ=ĤH+iĤA, where ĤH(Ĥ+Ĥ)/2 and ĤA(ĤĤ)/(2i) are the Hermitian and anti-Hermitian parts of Ĥ, correspondingly. Explicitly, these operators can be written as

(10)
ĤH=βp̂xp̂D2+Ûp̂x+[Û,p̂xp̂D2]+/2,
(10a)
ĤA=[Û,p̂xp̂D2]/(2i)μdw,
(10b)
where [·,·] denotes the commutator given by [Â,B̂]=ÂB̂B̂Â and [·,·]+ denotes the anti-commutator given by [Â,B̂]+=ÂB̂+B̂Â. Let us also introduce a Hermitian operator Ŵ|w̃w̃|, which, by analogy with quantum mechanics, is interpreted as the “fluctuating-vorticity density” operator. It is seen from Eq. (8) that Ŵ satisfies

itŴ=[ĤH,Ŵ]+i[ĤA,Ŵ]++iF̂,
(11)

where F̂|ξ̃w̃|+|w̃ξ̃|. In particular, taking the trace of this equation also gives an equation for the “total number of DW quanta,” NTrŴ=d2xx|Ŵ|x=d2xw̃2=w̃|w̃; namely,

Ṅ=2Tr(ĤAŴ)+TrF̂.
(12)

This indicates that ĤA determines the loss of quanta, or dissipation of DWs. [In particular, the term μdw in Eq. (10b) is responsible for DW dissipation to the external environment, whereas the term [Û,p̂xp̂D2]/(2i) destroys DW quanta while conserving the total enstrophy, as will be discussed in Sec. III E.] Also, ĤH determines the conservative dynamics of DWs and thus can be understood as the drifton Hamiltonian. (The non-Hermitian operator Ĥ will be attributed as the generalized Hamiltonian.) Notice that the distinction between dissipation and Hamiltonian effects remains unambiguous even beyond the GO approximation.

Equation (11) can be understood as a generalized von Neumann equation akin to the one that commonly emerges in quantum mechanics. A standard approach to such equation is to project it on the phase space using the Weyl transform. Hence, we proceed as follows. (Readers who are not familiar with the Weyl calculus are encouraged to read  Appendix A before continuing further.)

Let us introduce W as the Weyl symbol of Ŵ, i.e.,

W(x,p,t)d2seip·sx+s/2|Ŵ|xs/2,
(13)

which is real because Ŵ is Hermitian. In quantum mechanics, a similar construct is known as the Wigner function,28 so one can readily identify the physical meaning of W. Specifically, in the regime, when the ray approximation applies and dissipation is negligible, W/(2π)2 represents the phase-space probability density of driftons [the numerical coefficient (2π)2 comes from Eq. (A4)], while beyond the GO limit, it can be considered as a generalization of this probability density.29 Using the fact that w̃(x,t) is real, one can also cast W as

W(x,p,t)d2seip·sw̃(x+s2,t)w̃(xs2,t),
(14)

which also implies

W(x,p,t)=W(x,p,t).
(15)

One can interpret the right-hand side of Eq. (14) as the local spatial spectrum of the correlation function of w. Hence, W will be called the DW spectral function.

By applying the Weyl transform to Eq. (11), one obtains the following pseudo-differential equation:30,44

tW={{HH,W}}+[[HA,W]]+F.
(16)

Here {{·,·}} and [[·,·]] are Moyal's “sine bracket” [Eq. (A10)] and “cosine bracket” [Eq. (A12)]. The functions HH(y, p, t), HA(y, p, t), and F(x, p, t) are the Weyl symbols of ĤH,ĤA, and F̂, respectively. In particular, using Eq. (A5) and the fact that U is independent of x, one obtains

HH(y,p,t)=βpx/pD2+pxU+[[U,px/pD2]]/2,
(17)
HA(y,p,t)={{U,px/pD2}}/2μdw,
(18)

where pD2p2+LD2. By analogy with quantum mechanics, we call Eq. (16) a Wigner–Moyal equation.

Next, let us consider the zonal average of this equation

tW¯={{HH,W¯}}+[[HA,W¯]]+F¯,
(19)

where W¯=W¯(y,p,t). We adopt the ergodic assumption, namely, that the zonal average is equivalent to the ensemble average [denoted ] over realizations of the random force ξ̃ (e.g., as done in Ref. 18). To calculate F¯=F, consider integrating Eq. (8) on a time interval (t0, t). The result can be written as |w̃t=|w̃t0+|δw̃t+t0tdt|ξ̃t, where the indexes denote the times at which functions are evaluated, and |δw̃tit0tdtĤ|w̃t. We assume

ξ̃(x,t)ξ̃(x,t)=δ(tt)Ξ((y+y)/2,xx),
(20)

where Ξ is a correlation function that is homogeneous in x but not necessarily in y.15,20 Since |δw̃t can be affected by |ξ̃t only if t>t, one has |ξ̃tδw̃t|=0. Hence,

F¯(y,p)=d2seip·sx+s2|(|ξ̃tw̃t|+h.c.)|xs2=d2seip·st0tdtδ(tt)[Ξ(y,s)+Ξ(y,s)]=12d2seip·s[Ξ(y,s)+Ξ(y,s)]=d2sΞ(y,s)cos(p·s),
(21)

where ‘h.c.’ denotes the “Hermitian conjugate.” In other words, once the correlation function Ξ of ξ̃ is specified, F¯ can be readily calculated as the Fourier image of Ξ.

This concludes the calculation of the functions that determine the evolution of W¯ through Eq. (19). However, these functions generally depend on U, so an additional equation for U is needed to make the theory self-consistent. This equation is derived as follows.

Returning to Eq. (6b), we rewrite the nonlinear term as

ṽxṽy=(yψ̃)(xψ̃)=x|p̂y|ψ̃ψ̃|p̂x|x=x|p̂yp̂D2Ŵp̂D2p̂x|x=d2p(2π)2pypD2WpxpD2,
(22)

where we used Eq. (A3) in the last step. After introducing the averaged vorticity density W¯, Eq. (6b) becomes

tU+μzfU=yd2p(2π)2pypD2W¯pxpD2.
(23)

Since W¯ is independent of x and satisfies the condition (15), Eq. (23) can also be written as

tU+μzfU=yd2p(2π)21pD2pxpyW¯1pD2.
(24)

The combination of Eqs. (19) and (24) forms a closed set of equations that can be used to calculate the dynamics of W¯ and U self-consistently.

Let us slightly change the notation and summarize the above equations in the following form:

(25)
tW¯={{H,W¯}}+[[Γ,W¯]]+F¯2μdwW¯,
(25a)
tU+μzfU=yd2p(2π)21pD2pxpyW¯1pD2.
(25b)
As a reminder, W¯(y,p,t) is the zonal-averaged spectral (or Wigner) function that describes the DW turbulence, and U(y, t) is the ZF velocity. Also, F¯=F¯(y,p) is determined by the correlation function of the external noise ξ̃ (Sec. III C). We have also introduced HHH and ΓHA+μdw, or, explicitly
(26)
H(y,p,t)=βpx/pD2+pxU+[[U,px/pD2]]/2,
(26a)
Γ(y,p,t)={{U,px/pD2}}/2.
(26b)
In  Appendix B, we also present a spectral representation of these equations that can be used for a numerical implementation of the Wigner–Moyal formulation.

The function H can be understood as the Weyl symbol of the drifton Hamiltonian, whereas Γ determines the dissipation of DW quanta that is caused specifically by DW interaction with ZFs. This is explained as follows. Since Eqs. (25) are exact within the quasilinear approximation (modulo the ergodic assumption), they inherit the same conservation laws as the original quasilinear model given by Eqs. (6). Specifically, for isolated systems (F¯=0 and μdw, zf = 0), Eqs. (25) and (26) exactly conserve the total enstrophy and energy [Eq. (5)]

Z=Zdw+Zzf,E=Edw+Ezf,
(27)

rather than their DW and ZF components. (A direct proof is given in Appendix C 1.) For completeness, we present expressions for these components:

(28)
Zdw12d2xw̃2=12d2p(2π)2dyW¯,
(28a)
Zzf12dyw¯2=12dy(U)2,
(28b)
Edw12d2xw̃ψ̃=12d2p(2π)2dyW¯pD2,
(28c)
Ezf12dyw¯ψ¯=12dyU2,
(28d)
where we used Eqs. (A4) and (A14) to derive the second set of equalities. According to Eqs. (28a) and (A4), note that the DW enstrophy Zdw and the total number of DW quanta NTrŴ are the same up to a constant factor.

The conservative equations (25) and (26), which we attribute as the Wigner–Moyal formulation of DW-ZF interactions, constitute the main result of our work. This formulation can be understood as an alternative phase-space representation of the CE2 since it is derived from the same quasilinear model. However, the Wigner–Moyal formulation is arguably more intuitive than the CE2, namely, for two reasons: (i) Like in the tWKE, driftons are treated as particles, except now they are quantumlike particles, i.e., have nonzero wavelengths; hence, one is not constrained to the GO limit. (ii) Also, the separation between Hamiltonian effects and dissipation remains transparent and unambiguous even beyond the GO approximation. The Wigner–Moyal formulation elucidates the link between the WKE formalism and the CE2 and also helps make approximations rigorous by making them systematic. Below, these and other applications are discussed in further detail.

To demonstrate the convenience of the Wigner-Moyal formulation, let us apply it to rederive the rate of the linear zonostrophic instability, i.e., the growth rate of weak ZFs. Suppose a homogeneous equilibrium with zero ZF velocity and some DW spectral function W(p). [As pointed out in Sec. III C, the corresponding W(p)/(2π)2 represents the phase-space probability distribution of driftons.] Consider small perturbations to this equilibrium; namely,

U=δU(y,p,t),δU=Re(Uqeiqy+γt),W¯=W(p)+δW¯(y,p,t),δW¯=Re[W¯q(p)eiqy+γt].

Here, the constant q serves as the modulation wave number, and the constant γ is the instability rate to be found. The linearization of Eq. (25a) leads to

(t+2μdw)δW¯+{{βpx/pD2,δW¯}}={{pxδU,W}}+{{[[δU,px/pD2]]/2,W}}+[[{{δU,px/pD2}}/2,W]],
(29)

where we substituted Eqs. (26). The brackets can be calculated using Eqs. (A17). Hence, we obtain

[i(γ+2μdw)+βpx(1pD,+q21pD,q2)]W¯q=(W+qWq)[pxq22(1pD,+q2+1pD,q2)px]Uq+pxq22(W+q+Wq)(1pD,+q21pD,q2)Uq,
(30)

where we assume the notation A±qA(p±eyq/2) for any A. Solving for W¯q in terms of Uq leads to

W¯q=ipxpD,+q2pD,q2(γ+2μdw)pD,+q2pD,q2+2iβqpxpy×[W+q(1q2pD,+q2)Wq(1q2pD,q2)]Uq.

Then, Eq. (25b) yields

(γ+μzf)eiqyUq=yd2p(2π)21pD2pxpyeiqyW¯q1pD2.

Due to Eq. (A16), this can be simplified as follows:

(γ+μzf)Uq=iqd2p(2π)2pxpypD,+q2pD,q2W¯q.
(31)

After substituting the expression for W¯q, one gets

γ+μzf=d2p(2π)2qpx2py(γ+2μdw)pD,+q2pD,q2+2iβqpxpy×[Wq(1q2pD,q2)W+q(1q2pD,+q2)].
(32)

As expected, this dispersion relation coincides with that obtained using the CE2 formalism.15 Notably, the dependence of the integrand on W±q makes the expression similar to dispersion relations that emerge in quantum mechanics; for instance, cf. Ref. 31, Sec. 40.

As a side note, it is commonly thought that ZFs only grow in situ; i.e., Re γ > 0 with Im γ = 0. There have been questions over whether it is possible to have unstable zonal modes at nonzero Im γ, which implies nonzero group velocity.32 Here we show, by presenting an example, that the answer is yes. Let us consider the following steady state:

W=(2π)2N[δ(pxkx)δ(pyky)+δ(px+kx)δ(py+ky)+δ(px+kx)δ(pyky)+δ(pxkx)δ(py+ky)]/4.
(33)

After integrating, Eq. (32) can be cast as follows:

0=γ+μzfXNkx2vgykD4(1q2kD2)×n=1,1n(ky+nq/2)kD,+2nq2/kD2X2kD,+2nq4/kD4+(ky+nq/2)2/ky2,
(34)

where vgy2βkxky/kD4 is the DW group velocity in the absence of ZFs, kD,±q2kx2+(ky±q/2)2+LD2, and X(γ+2μdw)/(qvgy). Numerical solutions of Eq. (34) are presented in Fig. 1. As shown, solutions for γ are complex over some interval of ky. This example shows the existence of “traveling” unstable ZF modes.

FIG. 1.

Numerical solutions of the dispersion relation (34) for different ky with fixed kx = 1, q = 0.1, β = 1, LD = 1, μdw,zf = 0, and N=1. The solutions shown in (a)–(d) correspond to ky = 0.3, ky = 0.5, ky = 0.9, and ky = 2.0, respectively. In the interval 0.33|ky|0.82, the solutions γ can be complex-valued. With the same forcing and same fixed parameters, similar regimes can also be observed in the barotropic limit (LD) or in the case of nonzero dissipation.

FIG. 1.

Numerical solutions of the dispersion relation (34) for different ky with fixed kx = 1, q = 0.1, β = 1, LD = 1, μdw,zf = 0, and N=1. The solutions shown in (a)–(d) correspond to ky = 0.3, ky = 0.5, ky = 0.9, and ky = 2.0, respectively. In the interval 0.33|ky|0.82, the solutions γ can be complex-valued. With the same forcing and same fixed parameters, similar regimes can also be observed in the barotropic limit (LD) or in the case of nonzero dissipation.

Close modal

Additional insights on this phenomenon can be obtained by considering the limit, where μdw,zf|γ| and qky. In this case, Eq. (34) simplifies to

0=X[18σX21(X2+1)2]+O(q2),
(35)

where

σNkx28vgy2kD4(14ky2kD2).
(36)

One may consider this as the GO limit of Eq. (34). Equation (35) predicts four nontrivial solutions for X, which are given by X2=1+4σ±4[σ(σ1)]1/2. Different regimes for the solutions can be deduced. When σ ≥ 1, the solutions γ are purely real. For the parameters in Fig. 1, this regime corresponds to |ky|0.33. In the interval 0 < σ < 1 corresponding to 0.33|ky|0.82, γ is complex-valued. In the interval −1/8 ≤ σ ≤ 0, which corresponds to 0.82|ky|1.07, the solutions are purely imaginary. Finally, in the interval σ < −1/8 corresponding to |ky|1.07, two solutions γ are purely imaginary, and two other solutions are purely real. The different regimes identified by solving Eq. (35) are consistent with the observed numerical solutions of the exact dispersion relation (34). In Sec. V, we will explore the GO limit of the DW-ZF interactions in more detail.

Let us assume that the characteristic wavelengths for ZFs and DWs are λzf and λdw, respectively, and

ϵmax(λdwλzf,LDλzf)1.
(37)

Hence, the following estimates will be adopted:

yW¯λzf1W¯,pW¯λdwW¯,yHλzf1H,pHLDH,
(38)

where H denotes both H and Γ. (The latter estimate is given for the maximum of pH, which is realized at pLD1.37) This gives

nHynnW¯pyn(λdwλzf)nHW¯ϵnHW¯,
(39)
nHpynnW¯yn(LDλzf)nHW¯ϵnHW¯.
(40)

Then, using the lowest-order approximations of the Moyal products ( Appendix A), Eqs. (25) reduce to

(41)
tW¯={H,W¯}+2ΓW¯+F¯2μdwW¯,
(41a)
tU+μzfU=yd2p(2π)2pxpyW¯pD4,
(41b)
where {·,·} is the canonical Poisson bracket (A8) and
(42)
Hβpx/pD2+pxU+pxU/pD2,
(42a)
Γ{U,px/pD2}/2=pxpyU/pD4.
(42b)
One may recognize Eq. (41a) as a variation of the WKE, so we attribute Eqs. (41) and (42) as the WKE limit of the Wigner-Moyal formulation. Clearly, H acts as the drifton ray Hamiltonian while Γ acts as the corresponding dissipation rate. [The factors of two in Eq. (41a) are due to the fact that W¯ is quadratic in the DW amplitude.] In other words, ω(y,p,t)H+iΓiμdw can be viewed as the local complex frequency of DWs with a given wave vector p.

Notice that our WKE differs from the tWKE, which assumes a simpler dispersion of DWs; namely,

(43)
H=βpx/pD2+pxU,
(43a)
Γ=0.
(43b)
Although the difference is only in the high-order derivatives of U, these terms remain important for a number of reasons. For example, in the Hamiltonian H,U can be comparable to β (as is sometimes the case in geophysics2). Also, consider the following. In isolated systems, the tWKE is tW¯={H,W¯}, so it conserves the DW quanta, or, in other words, the DW enstrophy Zdw [Eq. (28a)]. At the same time, the ZF enstrophy Zzf [Eq. (28b)] generally evolves, so the total enstrophy Z=Zdw+Zzf does too. This is in contradiction with the gHME, which conserves Z, and can lead to overestimating the ZF velocity and shear generated by DW turbulence.38 In contrast to the tWKE, our formulation is free from such issues because Eqs. (41) and (42) exactly conserve both Z and E (Appendix C 2). Note that, in order to retain this conservation property, it is necessary to keep both U and U in Eqs. (42). In this sense, Eqs. (41) and (42) represent the simplest GO model that is physically meaningful in the nonlinear regime. This is in agreement with Ref. 11, where a similar conclusion was made based on comparing the linear zonostrophic instability rate predicted by the CE2. (As a note on terminology, Ref. 11 refers to the tWKE [Eqs. (41) and (43)] as the “Asymptotic WKE,” i.e., the limit obtained when one assumes the ZFs are asymptotically large scale. Also, Ref. 11 refers to Eqs. (41) and (42) as “CE2-GO.”)

The numerical results presented in Figs. 2–4 illustrate the importance of the difference between our WKE and the tWKE [subfigures (a) and (b), respectively]. As seen in Fig. 2, while our WKE model predicts ZFs with a particular λzf, the scale of ZFs predicted by tWKE is determined by nothing but the grid size that is used in simulations. This is because the tWKE predicts that the rate of the zonostrophic instability γ (Sec. IV) scales linearly with the ZF wave number q, so ZFs are produced at the largest q that is allowed (cf. Ref. 11).

FIG. 2.

The ZF velocity U(y, t) obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. Both simulations used the same parameters and initial conditions. Small initial values for W¯ and U were randomly assigned such that Eq. (15) was satisfied. The parameters used are: β = 1, LD = 1, μdw,zf = 0.1, and F¯=4πδ(|p|1). Equation (41a) was discretized in a [−39, 39] × [−2, 2] × [−4, 4] phase space using a discontinuous-Galerkin (DG) method33 on a uniformly spaced Cartesian grid with 80 × 24 × 48 cells while Eq. (41b) was discretized on a subset of this grid. Time advancement was done using an explicit third-order strong-stability-preserving the Runge-Kutta algorithm.34 The solution was expanded locally in each cell as a sum of piecewise polynomials of degree one. At cell interfaces, an upwind numerical flux was used in Eq. (41a) and a centered numerical flux was used in Eq. (41b). Higher-order spatial derivatives such as U and U were computed using the Recovery-based DG method.35 For numerical stability, small hyperviscosity was added into the simulations, e.g., as done in Ref. 19. Specifically, the terms 2ν(px2+py2)W¯+(ν/2)y2W¯ and νy4U with ν = 0.001 were added to the right-hand side of Eqs. (41a) and (41b), respectively.36 

FIG. 2.

The ZF velocity U(y, t) obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. Both simulations used the same parameters and initial conditions. Small initial values for W¯ and U were randomly assigned such that Eq. (15) was satisfied. The parameters used are: β = 1, LD = 1, μdw,zf = 0.1, and F¯=4πδ(|p|1). Equation (41a) was discretized in a [−39, 39] × [−2, 2] × [−4, 4] phase space using a discontinuous-Galerkin (DG) method33 on a uniformly spaced Cartesian grid with 80 × 24 × 48 cells while Eq. (41b) was discretized on a subset of this grid. Time advancement was done using an explicit third-order strong-stability-preserving the Runge-Kutta algorithm.34 The solution was expanded locally in each cell as a sum of piecewise polynomials of degree one. At cell interfaces, an upwind numerical flux was used in Eq. (41a) and a centered numerical flux was used in Eq. (41b). Higher-order spatial derivatives such as U and U were computed using the Recovery-based DG method.35 For numerical stability, small hyperviscosity was added into the simulations, e.g., as done in Ref. 19. Specifically, the terms 2ν(px2+py2)W¯+(ν/2)y2W¯ and νy4U with ν = 0.001 were added to the right-hand side of Eqs. (41a) and (41b), respectively.36 

Close modal
FIG. 3.

The total, DW, and ZF enstrophies obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. The yellow lines show the total enstrophy that one would get due to the external force F¯ at μdw,zf = 0. The initial conditions and simulation parameters are the same as in Fig. 2.

FIG. 3.

The total, DW, and ZF enstrophies obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. The yellow lines show the total enstrophy that one would get due to the external force F¯ at μdw,zf = 0. The initial conditions and simulation parameters are the same as in Fig. 2.

Close modal
FIG. 4.

The total, DW, and ZF energies obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. The yellow lines show the total energy that one would get due to the external force F¯ at μdw,zf = 0. The initial conditions and simulation parameters are the same as in Fig. 2.

FIG. 4.

The total, DW, and ZF energies obtained by numerically integrating the WKE (41) for H and Γ are of two types: (a) our model [Eqs. (42)]; (b) the tWKE model [Eqs. (43)]. The yellow lines show the total energy that one would get due to the external force F¯ at μdw,zf = 0. The initial conditions and simulation parameters are the same as in Fig. 2.

Close modal

Consider also the enstrophy plots in Fig. 3. To aid our discussion, we added the plots of the enstrophy Zext that the external forcing F¯ injects into the DW-ZF system; namely, Zext=(t/2)(2π)2dyd2pF¯. Within our model, the total enstrophy Z remains always smaller than Zext, which is natural, since the simulation is done for μdw,zf > 0. In contrast, the tWKE model predicts that Z can surpass Zext, which is unphysical. In addition, the values of the ZF and total enstrophies predicted by the tWKE are several times larger than those predicted by our model.

For the sake of completeness, Fig. 4 also presents the corresponding energies and the energy Eext introduced by the external force, Eext=(t/2)(2π)2dyd2pF¯/pD2. In both cases, E(t)Eext(t), which is in agreement with the fact that both models conserve the total energy of an isolated system. Still, the tWKE predicts very different results quantitatively even though the tWKE model [Eqs. (43)] is seemingly close to ours [Eqs. (42)].

The goal of this paper is to propose a new formulation of DW-ZF interactions that is more accurate than the tWKE and, simultaneously, more intuitive than the CE2. We adopt the same model [Eq. (6)] that was previously applied to derive the CE2. Then, we manipulate it using the Weyl calculus to produce a phase-space formulation of DW-ZF interactions. The resulting formulation [Eqs. (25) and (26)] is akin to a quantum kinetic theory and involves a pseudodifferential Wigner–Moyal equation. To facilitate its numerical implementation in the future, we also present an integral representation of our main equations ( Appendix B).

On one hand, this Wigner–Moyal formulation can be understood as an alternative representation to the CE2 since both models use the same assumptions. For example, we show that it leads to the same linear growth rate of weak ZFs as that obtained from the CE2 (Sec. IV). On the other hand, the Wigner–Moyal formulation is arguably more intuitive than the CE2 for two reasons: (i) it permits treating driftons as particles (i.e., as objects traveling in phase space), except now they are quantumlike particles with nonzero wavelengths; and (ii) the separation between Hamiltonian effects and dissipation remains unambiguous even beyond the GO limit.

Compared to the tWKE, the new approach is also more precise because (i) it captures effects beyond the GO limit and (ii) even in the GO limit, it predicts corrections to the tWKE that emerge from the newly found corrections to the drifton dispersion (Sec. V). These corrections are essential as they allow DW-ZF enstrophy exchange, which is not included in the tWKE. By deriving the GO limit from first principles, we eliminate this discrepancy and arrive at a model that exactly conserves the total enstrophy (as opposed to the DW enstrophy conservation predicted by the tWKE) and the total energy, in agreement with the underlying gHME. We also illustrate the substantial difference between the GO limit of our WKE model and the tWKE using numerical simulations.

This work can be expanded at least in two directions. First, the difference between the Wigner–Moyal formulation and the newly proposed WKE can be assessed quantitatively using numerical simulations. Second, the analytic methods we proposed here can be extended to other turbulence models, such as those in Refs. 39 and 40. The anticipated benefit is that more accurate equations would be derived that would respect the fundamental conservation laws that existing theories may be missing otherwise.

The authors thank J. A. Krommes for valuable discussions. This work was supported by the U.S. DOE through Contract Nos. DE-AC02-09CH11466 and DE-AC52-07NA27344, by the NNSA SSAA Program through DOE Research Grant No. DE-NA0002948, and by the U.S. DOD NDSEG Fellowship through Contract No. 32-CFR-168 a.

This Appendix summarizes our conventions for the Weyl transform. (For more information, see the excellent reviews in Refs. 10 and 41.) The Weyl symbol A(x, p) of any given operator  is defined as

A(x,p)dnseip·sx+s/2|Â|xs/2.
(A1)

We shall refer to this description of the operators as a phase-space representation since Weyl symbols are functions of the 2 n-dimensional ray phase space (x, p). Conversely, the inverse Weyl transformation is

Â=1(2π)ndnxdnpdnseip·sA(x,p)|xs/2x+s/2|.
(A2)

In particular, notice that, for any operator Â, its matrix elements in the coordinate representation, A(x,x)x|Â|x, can be expressed as

A(x,x)=1(2π)ndnpeip·(xx)A(x+x2,p),

so A(x, p) can be understood as a spectrum of A(x,x). In particular,

A(x,x)=dnp(2π)nA(x,p).
(A3)

Other properties of the Weyl transform that we use in this paper are as follows:

  • As seen from Eq. (A3), for any operator Â, its trace TrÂdxx|Â|x can be expressed as

TrÂ=1(2π)ndnxdnpA(x,p).
(A4)
  • If A(x, p) is the Weyl symbol of Â, then A*(x, p) is the Weyl symbol of Â. As a corollary, the Weyl symbol of a Hermitian operator is real.

  • For any Ĉ=ÂB̂, the corresponding Weyl symbols satisfy23,24
    C(x,p)=A(x,p)B(x,p).
    (A5)
    Here, ⋆ is the Moyal product, which is given by
    A(x,p)B(x,p)A(x,p)eiL̂/2B(x,p),
    (A6)
    and L̂ is the Janus operator, which is given by
    L̂x·pp·x={·,·}.
    (A7)
    The arrows indicate the directions in which the derivatives act, and AL̂B={A,B} is the canonical Poisson bracket; namely,
    {A,B}(xA)·(pB)(pA)·(xB).
    (A8)
  • The Moyal product is associative; i.e.,

ABC(AB)C=A(BC).
(A9)
  • The anti-symmetrized Moyal product defines the so-called Moyal bracket
    {{A,B}}i(ABBA)=2Asin(L̂/2)B.
    (A10)
    Because of the latter equality, this bracket is also called the sine bracket. To lowest order,
    {{A,B}}AL̂B={A,B}.
    (A11)
  • The symmetrized Moyal product is defined as
    [[A,B]]AB+BA=2Acos(L̂/2)B.
    (A12)
    Because of the latter equality, this bracket is also called the cosine bracket. To lowest order,
    [[A,B]]2AB.
    (A13)

    In a wave equation of the Wigner–Moyal type, such as Eq. (25a), neglecting higher-order phase-space derivatives in the sine and cosine brackets leads to a WKE. Higher-order wave effects, such as diffraction and tunneling, are lost in this limit. For this reason, it is called the GO approximation, or the ray approximation.

  • Assuming that fields vanish at infinity rapidly enough, the phase-space integral of the Moyal product of two symbols equals the integral of the regular product of these symbols:
    dnxdnpAB=dnxdnpAB.
    (A14)

    As a corollary

    (A15)
    dnxdnp{{A,B}}=0,
    (A15a)
    dnxdnp[[A,B]]=2dnxdnpAB.
    (A15b)

  • For any constant q, one has

A(p)eiq·x=A(p)ep·(q/2)eiq·x=A(p+q/2)eiq·x.
(A16)
  • As a corollary, one has

(A17)
{{A(p),eiq·x}}=1i[A(p+q2)A(pq2)]eiq·x,
(A17a)
[[A(p),eiq·x]]=[A(p+q2)+A(pq2)]eiq·x.
(A17b)

  • For any constants k and q, one can also show that

A(p)eik·xB(p)eiq·x=A(p+q/2)B(pk/2)ei(k+q)·x.
(A18)
  • Now we tabulate some Weyl transforms of various operators. (We use a two-sided arrow to show the correspondence with the Weyl transform.) First of all, the Weyl transforms of the identity, position, and momentum operators are given by
    1̂1,x̂ixi,p̂ipi.
    (A19)
    If f and g are any two functions, then
    f(x̂)f(x),g(p̂)g(p).
    (A20)
    Similarly, using Eq. (A6), we have
    f(x̂)p̂jpjf(x)+(i/2)jf(x),
    (A21)
    p̂jf(x̂)pjf(x)(i/2)jf(x).
    (A22)

    One may also notice the connection between these relations and the commutation relation between operators; i.e., [x̂j,p̂k]=x̂jp̂kp̂kx̂j=iδkj.

To facilitate numerical implementations of the Wigner–Moyal formulation in the future, we propose an integral form of Eqs. (25) using a spectral representation. (Numerical simulations of the CE2 theory were reported in Refs. 12–14.) The assumed notation for the Fourier representation of any A(y, p, t) will be

A(y,p,t)=dq2πAq(p,t)eiqy.
(B1)

We start by rewriting Eqs. (A17) as

H=βpxpD2+dq2π(pxeiqyq22[[eiqy,pxpD2]])Uq=βpxpD2+dq2π[pxq22(pxpD,q2+pxpD,+q2)]Uqeiqy,

where pD,±q2pD2(p±eyq/2). This leads to

Hq=2πδ(q)βpxpD2+px[1q22(1pD,q2+1pD,+q2)]Uq.
(B2)

Similarly,

Γ=dq2πq22{{eiqy,pxpD2}}Uq=dq2πq22i(pxpD,q2pxpD,+q2)Uq,
(B3)

so one obtains

Γq=i2(1pD,q21pD,+q2)pxq2Uq.
(B4)

Also, using Eq. (A18), we obtain

{{H,W¯}}=drds(2π)2{{Hr(p,t)eiry,W¯s(p,t)eisy}}=drds(2π)21i(Hr,+sW¯s,rHr,sW¯s,+r)ei(r+s)y,

where Ar,±sAr(p±eys/2,t) for any Ar(p, t). Also

[[Γ,W¯]]=drds(2π)2[[Γr(p,t)eiry,W¯s(p,t)eisy]]=drds(2π)2(Γr,+sW¯s,r+Γr,sW¯s,+r)ei(r+s)y.

By inserting these expressions into Eq. (25a), we obtain

tW¯q=F¯q2μdwW¯q+dr2π[(Γr,+qriHr,+qr)W¯qr,r+(Γr,+rq+iHr,+rq)W¯qr,r].
(B5)

Also, using that

yd2p(2π)21pD2pxpyW¯1pD2=yd2pdq(2π)31pD2pxpyW¯qeiqy1pD2=yd2pdq(2π)3pxpypD,+q2pD,q2W¯qeiqy=i2d2pdq(2π)3(1pD,q21pD,+q2)pxW¯qeiqy,
(B6)

one gets the following representation of Eq. (25b):

tUq+μzfUq=i2d2p(2π)2(1pD,q21pD,+q2)pxW¯q.
(B7)

Equations (B2), (B4), (B5), and (B7) constitute the spectral representation of our Wigner–Moyal formulation.

In this Appendix, we prove that in the case of isolated systems (Q = 0), the Wigner–Moyal and the WKE models conserve the total enstrophy Z and the total energy E, whose expressions are given by Eqs. (27) and (28).

1. Wigner–Moyal model

First, consider the Wigner–Moyal model [Eqs. (25) and (26)]. To show conservation of enstrophy, we obtain

dZdt=dy(yU)(ytU)+12d2p(2π)2dytW¯=12d2p(2π)2dy[2U(1pD2pxpyW¯1pD2)+{{H,W¯}}+[[Γ,W¯]]]=12d2p(2π)2dy[2U(1pD2pxpyW¯1pD2)+2ΓW¯],
(C1)

where we used Eqs. (A15). To evaluate the remaining terms, we use the Fourier representations of W¯(y,p,t) and U(y, t) as defined via Eq. (B1). Specifically, after substituting Eq. (26b) for Γ, we obtain

dZdt=12d2p(2π)2dq2πdk2πdyUqW¯k[2iq3(1pD2pxpyeiky1pD2)eiqyq2{{eiqy,pxpD2}}eiky].
(C2)

The Moyal products and the brackets can be evaluated using Eqs. (A16) and (A17). Using the notation A±qA(p±eyq/2) for any A(p), one then obtains

dZdt=12id2p(2π)2dq2πdk2πUqW¯k[2pxpyq3pD,+k2pD,k2pxq2(1pD,q21pD,+q2)]dyei(k+q)y=id2p(2π)2dq2πdkUqW¯kpxpyq3(1pD,+k2pD,k21pD,+q2pD,q2)δ(k+q)=0.
(C3)

To show conservation of energy, we obtain

dEdt=dyU(tU)+12d2p(2π)2dytW¯pD2=12d2p(2π)2dy[2Uy(1pD2pxpyW¯1pD2)+1pD2{{H,W¯}}+1pD2[[Γ,W¯]]]=12d2p(2π)2dy[2U(1pD2pxpyW¯1pD2)1pD2{{pxU+[[U,pxpD2]]/2,W¯}}1pD2[[{{U,pxpD2}}/2,W¯]]].
(C4)

Here, we used the fact that the Taylor expansion of Eq. (A10) for the Moyal bracket {{W¯,βpx/pD2}}/pD2 consists of total derivatives on y, so its integral over y is zero. The other terms can be expressed as follows. First of all,

d2p(2π)2dy[2U(1pD2pxpyW¯1pD2)1pD2{{pxU,W}}]=d2p(2π)2dq2πdk2πdyUq[2iqW¯k(1pD2pxpyeiky1pD2)eiqy1pD2{{pxeiqy,W¯k}}eiky]=d2p(2π)2dq2πdk2πUq[iW¯k2pxpyqpD,+k2pD,k2pxipD2(W¯k,qW¯k,+q)]dyei(k+q)y=d2p(2π)2dq2πdkiUqW¯k[2pxpyqpD,+k2pD,k2+px(1pD,+q21pD,q2)]δ(k+q)=0.
(C5)

Also, using Eq. (A18), we obtain

d2p(2π)2dy12pD2({{[[U,pxpD2]],W¯}}+[[{{U,pxpD2}},W¯]])=d2p(2π)2dq2πdk2πdyUqq22pD2({{[[eiqy,pxpD2]],W¯keiky}}+[[{{eiqy,pxpD2}},W¯keiky]])=d2p(2π)2dq2πdk2πdyUqpxq22pD2({{(pD,+q2+pD,q2)eiqy,W¯keiky}}1i[[(pD,+q2pD,q2)eiqy,W¯keiky]])=d2p(2π)2dq2πdk2πdyUqpxq22ipD2(W¯k,qpD,+qk2W¯k,qpD,+q+k2+W¯k,qpD,qk2W¯k,qpD,q+k2+W¯k,qpD,+qk2+W¯k,qpD,+q+k2W¯k,qpD,qk2W¯k,qpD,q+k2)ei(k+q)y=id2p(2π)2dq2πdkUqW¯kpxq2(1pD,q2pD,k21pD,+q2pD,+k2)δ(k+q)=0.
(C6)

By substituting Eqs. (C5) and (C6) into Eq. (C4), one obtains Ė=0.

2. WKE model

Now let us consider the WKE model [Eqs. (41) and (42)]. To show the conservation of enstrophy, we obtain

dZdt=dy(yU)(ytU)+12d2p(2π)2dytW¯=12d2p(2π)2dy(2pxpypD4UW¯+{H,W¯}+2ΓW¯)=12d2p(2π)2dy(2pxpypD4U+2Γ)W¯=0,
(C7)

where we used Eq. (42b) and the fact that the integral of the Poisson bracket over all phase space is zero. In contrast, the tWKE does not conserve the total enstrophy because Γ = 0 [see Eqs. (43)]. This is also understood as follows: the tWKE manifestly conserves Zdw, whereas Zzf is obviously not conserved; thus, Zdw+Zzf cannot be conserved either.

To show conservation of energy, we obtain

dEdt=dyU(tU)+12d2p(2π)2dytW¯pD2=12d2p(2π)2dy1pD2(2pxpypD2UW¯{H,W¯}2ΓW¯)=12d2p(2π)2dy1pD2(2pxpypD2UW¯{pxU+pxpD2U,W¯}+2pxpypD4UW¯),
(C8)

where the integral of {W¯,βpx/pD2}/pD2 over y is zero because it can be written as a total derivative on y. Then, we have

dEdt=12d2p(2π)2dy(2pxpypD4UW¯pxpD2UpyW¯pxpD4UpyW¯2pxpypD6UyW¯+2pxpypD6UW¯)=12d2p(2π)2dy(2pxpypD4UW¯2pxpypD4UW¯4pxpypD6UW¯+2pxpypD6UW¯+2pxpypD6UW¯)=0.
(C9)

Note that an analysis using the tWKE model leads to an expression similar to Eq. (C9) with only the first two terms in the integrand. These terms cancel out, thus showing that the tWKE model also conserves the total energy.

1.
Ö. D.
Gürcan
and
P. H.
Diamond
,
J. Phys. A: Math. Theor.
48
,
293001
(
2015
).
2.
A. R.
Vasavada
and
A. P.
Showman
,
Rep. Prog. Phys.
68
,
1935
(
2005
).
3.
A.
Johansen
,
A.
Youdin
, and
H.
Klahr
,
Astrophys. J.
697
,
1269
(
2009
).
4.
M. W.
Kunz
and
G.
Lesur
,
Mon. Not. R. Astron. Soc.
434
,
2295
(
2013
).
5.
P. H.
Diamond
,
S.-I.
Itoh
,
K.
Itoh
, and
T. S.
Hahm
,
Plasma Phys. Controlled Fusion
47
,
R35
(
2005
).
6.
A.
Fujisawa
,
Nucl. Fusion
49
,
013001
(
2009
).
7.
J. C.
Hillesheim
,
E.
Delabie
,
H.
Meyer
,
C. F.
Maggi
,
L.
Meneses
,
E.
Poli
, and
JET Contributors
,
Phys. Rev. Lett.
116
,
065002
(
2016
).
8.
R.
Trines
,
R.
Bingham
,
L. O.
Silva
,
J. T.
Mendonça
,
P. K.
Shukla
, and
W. B.
Mori
,
Phys. Rev. Lett.
94
,
165002
(
2005
).
9.

Specifically, this means that the traditional WKE has a unique set of characteristics at each phase-space location. Using the terminology that is adopted in the present paper, the ray approximation can be defined formally as a model based on the approximations (A11) and (A13). This model is applicable under the assumption that ZFs have scales larger compared to the characteristic wavelength of DWs, which is also known as the condition of geometrical optics. See below for details.

10.
E. R.
Tracy
,
A. J.
Brizard
,
A. S.
Richardson
, and
A. N.
Kaufman
,
Ray Tracing and Beyond: Phase Space Methods in Plasma Wave Theory
(
Cambridge University Press
,
New York
,
2014
).
11.
J. B.
Parker
, “
Dynamics of zonal flows: Failure of wave-kinetic theory, and new geometrical optics approximations
,”
J. Plasma Phys.
82
(
2016
).
12.
B. F.
Farrell
and
P. J.
Ioannou
,
J. Atmos. Sci.
60
,
2101
(
2003
).
13.
B. F.
Farrell
and
P. J.
Ioannou
,
J. Atmos. Sci.
64
,
3652
(
2007
).
14.
J. B.
Marston
,
E.
Conover
, and
T.
Schneider
,
J. Atmos. Sci.
65
,
1955
(
2008
).
15.
K.
Srinivasan
and
W. R.
Young
,
J. Atmos. Sci.
69
,
1633
(
2012
).
16.
F.
Ait-Chaalal
,
T.
Schneider
,
B.
Meyer
, and
J. B.
Marston
,
New J. Phys.
18
,
025019
(
2016
).
17.
B. F.
Farrell
and
P. J.
Ioannou
,
Phys. Plasmas
16
,
112903
(
2009
).
18.
J. B.
Parker
and
J. A.
Krommes
,
Phys. Plasmas
20
,
100703
(
2013
).
19.
J. B.
Parker
and
J. A.
Krommes
,
New J. Phys.
16
,
035006
(
2014
).
20.
J. B.
Parker
, Ph.D. thesis,
Princeton University
,
2014
.
21.
J. A.
Krommes
and
C.-B.
Kim
,
Phys. Rev. E
62
,
8508
(
2000
).
22.
A. I.
Smolyakov
and
P. H.
Diamond
,
Phys. Plasmas
6
,
4410
(
1999
).
23.
J. E.
Moyal
,
Math. Proc. Cambridge Philos. Soc.
45
,
99
(
1949
).
24.
H. J.
Groenewold
,
Physica
12
,
405
(
1946
).
25.

Related calculations were also proposed in Refs. 42 and 43. However, the complete Wigner-Moyal equation for DW was not introduced explicitly in those papers, and the validity of its GO limit was not explored in detail. Here, we argue that such details are, in fact, crucial.

26.
N. C.
Constantinou
,
B. F.
Farrell
, and
P. J.
Ioannou
,
J. Atmos. Sci.
71
,
1818
(
2014
).
27.

The only peculiarity of our problem in this respect is that w̃(x,t) is real rather than complex. However, that is just a matter of initial conditions. The equation governing |w̃ still has a quantumlike form.

28.
E.
Wigner
,
Phys. Rev.
40
,
749
(
1932
).
29.

Beyond the ray approximation, the probability density of driftons cannot be defined, because wave quanta cannot be assigned specific locations in phase space due to the uncertainty principle. Unlike the true probability density, the Weyl symbol of the density matrix is not positive-definite in this case; however, it does remain real.

30.

Equation (16) is a pseudo-differential equation as it contains, in general, phase-space derivatives of infinite order. For an extended discussion, see Ref. 44.

31.
E. M.
Lifshitz
and
L. P.
Pitaevskii
,
Physical Kinetics
(
Pergamon Press
,
New York
,
1981
).
32.
N. A.
Bakas
,
N. C.
Constantinou
, and
P. J.
Ioannou
,
J. Atmos. Sci.
72
,
1689
(
2015
).
33.
J.-G.
Liu
and
C.-W.
Shu
,
J. Comput. Phys.
160
,
577
(
2000
).
34.
S.
Gottlieb
,
C.-W.
Shu
, and
E.
Tadmor
,
SIAM Rev.
43
,
89
(
2001
).
35.
B.
van Leer
,
M.
Lo
, and
M.
van Raalte
, in
18th AIAA Computational Fluid Dynamics Conference
(
Miami
,
2007
).
36.

Decreasing the hyperviscosity parameter by an order of magnitude to ν = 0.0001 leads to qualitatively similar results. Quantitatively, the steady-state ZF enstrophy Zzf increases by about 25% when using the tWKE. This can be understood by how U in the tWKE model is dominated by short wavelength components, which are more strongly affected by hyperviscosity.

37.

In the barotropic limit (LD → ∞), the ordering (37) can no longer be satisfied. This means that a GO approximation of Eqs. (25) and (26) is not possible in this limit. This problem originates from the fact that the Hamiltonian is divergent at p = 0. In the unnatural case where W¯=0 for p < p0, then we can estimate pHH/p0. Then, a GO approximation is possible if (λzfp0)11.

38.

The fundamental reason for this discrepancy is that the tWKE assumes W¯ to be the true distribution probability of driftons, but W¯ is not. The true probability distribution would satisfy a strictly conservative equation. In the GO limit, it can always be found, at least numerically, as will be discussed elsewhere. The special case when tU is negligible is discussed in Ref. 11.

39.
J.
Anderson
,
H.
Nordman
,
R.
Singh
, and
J.
Weiland
,
Phys. Plasmas
9
,
4500
(
2002
).
40.
J.
Anderson
,
H.
Nordman
,
R.
Singh
, and
J.
Weiland
,
Plasma Phys. Controlled Fusion
48
,
651
(
2006
).
41.
K.
Imre
,
E.
Özizmir
,
M.
Rosenbaum
, and
P. F.
Zweifel
,
J. Math. Phys.
8
,
1097
(
1967
).
42.
J. T.
Mendonça
and
K.
Hizanidis
,
Phys. Plasmas
18
,
112306
(
2011
).
43.
J. T.
Mendonça
and
S.
Benkadda
,
Phys. Plasmas
19
,
082316
(
2012
).
44.
S. W.
McDonald
,
Phys. Rep.
158
,
337
(
1988
).