Medium-range interactions occur in a wide range of systems, including charged-particle systems with varying screening lengths. We generalize the Ewald method to charged systems described by interactions involving an arbitrary dielectric response function ϵ(𝐤). We provide an error estimate and optimize the generalization to find the break-even parameters that separate a neighbor list-only algorithm from the particle-particle particle-mesh algorithm. We examine the implications of different choices of the screening length for the computational cost of computing the dynamic structure factor. We then use our new method in molecular dynamics simulations to compute the dynamic structure factor for a model plasma system and examine the wave-dispersion properties of this system.

The interactions employed in molecular dynamics (MD) simulations are typically considered to be either short- or long-range. In d dimensions, interactions of the form 1/rn are considered short range if n>d. Short-range interactions are typically treated with a fast neighbor search within a cutoff radius rc; the interaction is truncated thereafter, and a tail correction can be added.1,2 Conversely, if nd, then the interactions are considered long range; such interactions are considerably more difficult to treat. In periodic systems, most methods for handling long-range interactions are based on the Ewald sum,3,4 which decomposes the Coulomb potential into two sums that are rapidly convergent. This decomposition can be implemented through a wide variety of algorithms, of which a naive implementation yields O(N2) scaling; a simple optimization reduces this scaling to O(N3/2).5 Faster, grid-based methods have been developed that allow the fast Fourier transform (FFT) to be used,6 resulting in O(NlogN) scaling; among these methods are the particle-mesh Ewald (PME),7 smoothed particle-mesh Ewald (SPME),8 and particle-particle particle-mesh (PPPM) methods.6,9–12 Alternatively, non-Ewald methods13 have also been developed, such as the Wolf summation method14 and fast multipole methods,15 and hybrid methods16,17 have been developed, as well.

However, this categorization of interactions as either short range or long range reflects the mathematical issue of force-sum convergence. In fact, many interactions can be considered to be medium range when their convergence is guaranteed yet either that there are a prohibitive number of particles in the neighbor radius or there are unacceptable truncation errors. For example, power-law potentials of the form 1/r2 (charge-dipole), 1/r3 (dipole-dipole), and 1/r6 (dispersion) fall into this category,18,19 as well as Sutton-Chen20 potentials relevant for metallic glasses.21 Isele-Holder et al.22 have extended the PPPM algorithm to interactions other than 1/r in their work on 1/r6 dispersion interactions. Similarly, prior work on screened Coulomb interactions includes extending the Ewald summation to Yukawa (exponentially screened) interactions.23,24

Here, we examine a particular class of medium-range interactions that form with linearly screened Coulomb potentials, where the screening is specified through an arbitrary dielectric response function (DRF) ϵ(𝐤). The Ewald sum is formulated for this screened interaction and implemented with both a direct force approach (linked-cell list (LCL)26 alone) and the PPPM method. An error analysis is carried out to examine the crossover between the direct force calculation and the PPPM implementation as a function of the range of the interaction and the particle number.

This paper is organized as follows. In Sec. II, we describe a class of interactions for screened charged systems obtained through the Fourier representation of the dielectric properties of background species. Several choices for the screening function are given in terms of the dielectric response function ϵ(𝐤), and their ranges are discussed. In Sec. III, the Ewald decomposition is developed for charged particle systems interacting through an effective interaction involving ϵ(𝐤).

Sec. IV includes error estimates and timing studies for the Yukawa dielectric response function.

Finally, we apply our results to compute the dynamic structure factor for a screened Coulomb system and discuss the implications of the choice of ϵ(𝐤) and the computational costs that result from the temperature and density dependence of the effective interaction.

Consider N point particles with charges {Qi} (e.g., ions), which can be partial, in the presence of a background charge density −enb (e.g., electrons). The electrostatic forces in such a system can be calculated from the Poisson equation

14π2Φ(𝐫)=i=1NQiδ(𝐫𝐫i(t))enb(𝐫),
(1)

where Φ(𝐫) is the total electrostatic potential, δ(𝐫) is the delta function, and {ri(t)} are the particle coordinates at time t. Once this equation is solved, the corresponding force on the ith particle is calculated to be 𝐅i=QiΦ(𝐫=𝐫i). Expression (1) can be equivalently represented in Fourier space as

k24πΦ(𝐤)=i=1NQiei𝐤𝐫ienb(𝐤).
(2)

By assuming that the background density evolves instantaneously with the point particles, which is a valid assumption for a very light species, such as electrons, and approximating the structure of the background density as a linear response to the potential fluctuations induced by the presence of the point particles, we can express the background density in terms of a response function χ(𝐤) as

nb(𝐤)=υ(k)χ(𝐤)i=1NQiei𝐤𝐫i,υ(k)=4πk2.
(3)

We can write the electrostatic potential as

Φ(𝐤)=υ(k)ϵ(𝐤)i=1NQiei𝐤𝐫i,
(4)

where the reciprocal of the DRF can be expressed in terms of the response function as

ϵ1(𝐤)=1+v(k)χ(𝐤).
(5)

Finally, the force on the ith particle can be represented in terms of the pair interactions as

𝐅i=ji𝐫iuij(𝐫i𝐫j),
(6)

where the effective interaction uij(r) is given in Fourier space as

uij(𝐤)=QiQjv(k)ϵ(𝐤).
(7)

When the background does not respond, χ(𝐤)=0 in (3). Such a completely uniform background corresponds to a dielectric response of unity in (5), and inverting (7) recovers the usual Coulomb interaction uij(r) = QiQj/r. Clearly, in the limit ϵ1, Ewald-class methods are needed. To continue further, we must make a specific choice of the DRF; we begin with the most widely used form

ϵ(k)=1+(λsk)2,
(8)

where λs is the screening length associated with the background fluctuations. The associated pair interaction is given in real space by

uij(r)=QiQjrer/λs.
(9)

This interaction decays exponentially and is therefore “short” range. However, as λs0, the number of particles in a neighbor-sphere diverges. It is therefore convenient to characterize the strength of the screening in terms of the dimensionless quantity

κ=aiλs,
(10)

where ai=(3/4πni)1/3 is the ion-sphere radius, which is a measure of the interparticle distance. The κ=0 case is the pure Coulomb (one-component plasma) limit; thus, despite the exponential decay for finite κ, (9) has a long-range interaction as one of its limits. This suggests that there may be a finite range of κ for which the interaction (9) is medium range in the sense that there is no clear, optimal choice of algorithm.

To provide a concrete example, we can examine this choice in the context of the well-known Thomas-Fermi (TF) model,27,28 for which

λTF=(T2+4EF2/9)1/24πnbe2,
(11)

where T is the background temperature in energy units, EF is the Fermi energy of the electron background in energy units, e is the elementary charge, and nb is the background number density. Let us suppose that, for example, an acceptable error is given by a choice of cutoff radius rc=5λs, which yields the factor e50.674102 in (9). Allowing physical conditions (T and nb) to vary, we can examine the number of particles in the neighbor cell. This is shown in Fig. 1, where contours show the logarithm of the number of particles in such a sphere as a function of temperature and density. Condensed systems (e.g., ionic liquids, liquid metals, and electrolytic solutions) tend to lie near the lower right region of the graph. Laser-produced plasmas originate as cold solids, in that same region, but can be rapidly heated to the very high temperatures of the upper right region. Roughly speaking, the lower right half of the diagram corresponds to 103 particles in the neighbor cell for this choice of error. In Fig. 2, we show the Γκ phase diagram. The coupling parameter Γ is given by

Γ=Z2e2aiTi,
(12)

where Z is the degree of ionization and Ti is the ionic temperature. This diagram reflects the regimes of different charged particle systems of interest such as liquid metals,29 ultracold neutral plasmas,30 dusty plasmas,31 warm dense matter,32 and conditions at the National Ignition Facility (NIF).33 We also show the regions corresponding to different liquid-like properties based on the properties of their velocity autocorrelation function.27 Note that these disparate systems cover a wide range of κ values.

FIG. 1.

The logarithm (base 10) of the number of particles in a cutoff sphere with radius determined by rc=5λs is shown. Because charged particle systems occur over very large ranges of temperature and density, the use of a simple neighbor-list algorithm may not be optimal.

FIG. 1.

The logarithm (base 10) of the number of particles in a cutoff sphere with radius determined by rc=5λs is shown. Because charged particle systems occur over very large ranges of temperature and density, the use of a simple neighbor-list algorithm may not be optimal.

Close modal
FIG. 2.

The liquid portion of the Γκ phase diagram for a Yukawa system is shown. Three regions—caged, modulated decay, and exponential decay—are shown according to the properties of the velocity autocorrelation function.27 Five different types of charged particle systems are shown: liquid metals,29 ultracold neutral plasmas,30 dusty plasmas,31 warm dense matter,32 and conditions associated with heated capsules at the National Ignition Facility (NIF).33 This diagram shows the various types of behavior these disparate systems exhibit and their span of the Γκ plane.

FIG. 2.

The liquid portion of the Γκ phase diagram for a Yukawa system is shown. Three regions—caged, modulated decay, and exponential decay—are shown according to the properties of the velocity autocorrelation function.27 Five different types of charged particle systems are shown: liquid metals,29 ultracold neutral plasmas,30 dusty plasmas,31 warm dense matter,32 and conditions associated with heated capsules at the National Ignition Facility (NIF).33 This diagram shows the various types of behavior these disparate systems exhibit and their span of the Γκ plane.

Close modal

Of course, the long-wavelength form of (8) is not unique, and interactions other than (9) are possible.34 Moreover, the method is applicable to a system of partial charges as well. For this reason, we will retain the generality of an arbitrary ϵ(𝐤) throughout the Ewald development in Sec. III. In fact, medium range interactions of different nature occur in biological and chemical systems where the linear screening approach is not applicable.35–37 

To proceed with the development of the Ewald composition of (4), we note that the central quantity is the potential-like quantity

ϕ(𝐤)=v(k)ϵ(𝐤).
(13)

As is common, we implement the Ewald decomposition by adding and subtracting a screening-charge distribution of the form ζρ(𝐫), where ζ is a constant to be determined, and ρ(𝐫) is chosen to have the Gaussian form

ρ(𝐫)=α3π3/2eα2r2,
(14)

where α is the Ewald splitting parameter. Fourier-space representation of the decomposition takes the form

ϕ(𝐤)=υ(k)ϵ(𝐤)[1ζek2/4α2]+υ(k)ϵ(𝐤)ζek2/4α2
(15)
=ϕR(𝐤)+ϕF(𝐤),
(16)

where the real-space contribution

ϕR(𝐤)=υ(k)ϵ(𝐤)[1ζek2/4α2]
(17)

can be highly short range (because there are effectively two sources of cutoff). Analogously,

ϕF(𝐤)=υ(k)ϵ(𝐤)ζek2/4α2
(18)

is short range in Fourier-space and hence can also be computed efficiently; this is the subject of Sec. IV. In the absence of screening, ϵ(k)1, and we recover the Ewald decomposition for the standard Coulomb interaction.4 

The real-space representation of ϕR(𝐫) is given by the inverse Fourier transform

ϕR(𝐫)=1(2π)3d3kυ(k)ϵ(𝐤)[1ζek2/4α2]ei𝐤𝐫
(19)
=2πr0dkksin(kr)ϵ(k)[1ζek2/4α2],
(20)

where an isotropic dielectric function ϵ(k) is assumed in the second line. We evaluate this first with the long-wavelength (k0) form of ϵ(k) given by

ϵY1(k)=k2ks2+k2
(21)

where ks is the magnitude of the inverse screening vector given by ks=1/λs, where λs is the long-wavelength screening length associated with properties of the background (i.e., classical, quantum, and multispecies). This form yields what is generically referred to as the Yukawa interaction. In what follows, we will be using ks1 in place of λs. Substituting ϵY1(k) in (20) yields

ϕRY(r)=12r[eksrerfc(αrks2α)+eksrerfc(αr+ks2α)],
(22)

with ζeks2/4α2=1 (see Appendix  A). Alternative derivations of ϕRY(r) have been given previously.23–25 

Gradient corrections to the Yukawa result can be captured by the form

ϵEGS1(k)=k2(ks2+14νk2)ks4+k2(ks2+14νk2),
(23)

where leading-order corrections to the Yukawa form appear as k4 terms. An example of this type of DRF is the exact gradient-corrected screening (EGS) potential,34 in which k4 terms arise from either electronic correlations or Heisenberg uncertainty (see Appendix  A). Interestingly, the form of the potential is qualitatively different for ν<1 and ν>1, with the latter corresponding to an oscillatory decay.34 Hence, the Ewald decomposition for the two cases will be different; in this paper, we will restrict our discussion to the ν<1 case. Substituting ϵEGS1(k) in (20) yields

ϕREGS(r)=Aϕ(r)A+ϕ+(r)
(24)
ϕ±(r)=1rek±r+ζ2rek±2/4α2[ek±rerfc(k±2α+αr)ek±rerfc(k±2ααr)],
(25)

where

A±=ks214νk±21νks2,k±2=2ν(1±1ν)ks2.
(26)

Because k<k+ (for ν<1), the optimal choice is ζ=ek2/4α2. We have given the detailed derivation in Appendix  A.

The real-space part of the Ewald decomposition of the screened Coulomb interaction between two point charges for an arbitrary DRF is, therefore, given by

uij(R)(𝐫i𝐫j)=QiQjϕR(𝐫i𝐫j).
(27)

This completes the formulation of the real-space contribution, which will be common to all Ewald-based methods. In Sec. IV, we first turn to the Fourier-space contribution, which we will treat with the PPPM algorithm for the particle-mesh contribution. Next, we provide an error analysis and a time-per-time-step comparison of the force calculations using PPPM and LCL algorithms for the Yukawa interaction. For the remainder of this paper, when we discuss LCL for a Yukawa interaction, this should be understood to refer to a force calculation using the LCL algorithm for a plain Yukawa interaction that is not subject to Ewald decomposition.

In this section, we begin with a brief explanation of the PPPM algorithm, followed by an error analysis for the computed force for the Yukawa DRF, with errors due to the truncation and discretization that are inherent to the algorithm. Then, we compare the execution times of the PPPM and LCL algorithms for the Yukawa potential to distinguish between the two algorithms in terms of computational efficiency for a certain level of error in the force calculation. The results of this section and Sec. V were obtained using the codes we implemented.38 

The particle-particle (PP) part of the PPPM algorithm refers to the real-space part of the Ewald decomposition, which can be highly short range; therefore, interactions beyond a certain cutoff radius can be neglected. If the number of particles within a cutoff sphere Nc is small compared to the total number of particles N in a simulation box, then the total number of interactions O(NNc) tends to O(N) as Nc becomes much smaller than N (NcN). Such an O(N) scaling is achieved by the LCL algorithm,6 in which the simulation box is divided into a number of cells with dimensions proportional to the cutoff radius. The particle-mesh (PM) portion of the PPPM algorithm refers to the Fourier-space part of the interaction that is computed on a grid using one of the two algorithms in Ref. 9; these algorithms differ in the manner in which forces are computed on a grid. Both versions of the PPPM algorithm employ the highly efficient FFT, which scales as O(MlogM), where M is the number of mesh points in the Fourier-space grid.

Errors are associated with computing the energy and forces in real-space and Fourier-space. Because we are interested in MD simulations in which the trajectories of a system of particles evolve primarily because of the forces acting on the particles, we restrict our error analysis to the errors in the forces. We chose to quantify the error in a force using the root-mean-square (RMS) error.9,10 The RMS errors in the real-space and Fourier-space components of the forces, which are acting on particles in a simulation box, have a common form given by

ΔFPξ=i=1N[𝐅i,Papprox.𝐅i,Pexact]2N,
(28)

where P=R,F refers to real-space or Fourier-space, 𝐅i,Papprox. is the approximate force acting on particle i, 𝐅i,Pexact is the actual force (or numerically exact force) acting on particle i, and ξ is used to distinguish between computed RMS errors and analytical estimates of RMS errors. Moreover, because the computational costs associated with real-space and Fourier-space force calculations are determined by the levels of error that are allowed, it is essential to understand how the errors vary with respect to the parameters. For this purpose, we have derived analytical estimates for the errors in the real-space and Fourier-space components of the forces for a system of randomly distributed particles. The error estimates were verified by comparing with the RMS errors computed for a system of particles placed at random positions within a simulation box. For the remainder of this paper, we use ai as the unit of distance and the elementary charge e as the unit of charge; thus, the RMS force error is in units of e2/ai2.

The error in the real-space component of the force is due to the truncation of particle interactions beyond a certain cutoff radius. For a system of N charged point particles interacting pair-wise with an arbitrary short-range interaction expressed as us(𝐫i𝐫j)=QiQjϕs(𝐫i𝐫j), the estimate of the RMS force error due to a cutoff radius rc can be derived by adapting the error analysis in Ref. 39, which considers the real-space portion of the Ewald decomposition for the Coulomb potential. With the assumption that beyond rc particles are randomly distributed, the estimate of the RMS force error due to us(r) is given by

ΔFs=QNV[Vc(rrc<)d3r|𝐟s(𝐫)|2]1/2,
(29)

where 𝐟s(𝐫)=𝐫ϕs(𝐫), Q=i=1NQi2, and V is the volume enclosing the particles.

Substituting 𝐟RY(𝐫)=𝐫ϕRY(𝐫) in (29) and making several approximations, we get the following estimate of the RMS force error for the real-space portion of the Ewald decomposition for the Yukawa potential:

ΔFRY2QNVeκ2/4α2eα2rc2rc
(30)

(see Appendix  B for details). Note that as κ0 (Coulomb limit), we recover the corresponding error estimate for the Coulomb case.39 Figs. 3(a)–3(c) show very good agreement between the error estimates and the computed RMS error in the real-space force component for a system of 5×103 unit charges with κ = 0.1, 1, and 2 and varying values of α. The choice of these κ values is consistent with the κ range of interest as shown in Fig. 2. The particles were randomly distributed in a cube of edge length L given by L=(4πN/3)1/3 in units of ai.

FIG. 3.

The RMS force error is shown for the real-space (top row) and the Fourier-space (bottom row) for three values of κ=0.1,1,2 (the columns). For different values of α, the computed RMS errors (dots) agree well with the estimates (curves) for the real-space part of the Ewald decomposition for (a) κ=0.1, (b) κ=1, and (c) κ=2, and for the Fourier-space part of the Ewald decomposition with B-splines of order p = 2–7 for (d) κ=0.1, (e) κ=1, and (f) κ=2. The computed RMS errors correspond to a system of 5×103 unit charges placed at random positions in a cube.

FIG. 3.

The RMS force error is shown for the real-space (top row) and the Fourier-space (bottom row) for three values of κ=0.1,1,2 (the columns). For different values of α, the computed RMS errors (dots) agree well with the estimates (curves) for the real-space part of the Ewald decomposition for (a) κ=0.1, (b) κ=1, and (c) κ=2, and for the Fourier-space part of the Ewald decomposition with B-splines of order p = 2–7 for (d) κ=0.1, (e) κ=1, and (f) κ=2. The computed RMS errors correspond to a system of 5×103 unit charges placed at random positions in a cube.

Close modal

In the Fourier-space part of the PPPM algorithm, the force calculation for a system of charged point particles with arbitrary spatial distribution is mapped to a real-space grid by assigning particle positions to the real-space grid points using B-splines of some order p. Calculations are then performed in Fourier-space starting with the FFT of the charge density. This is followed by computing the potential energy and forces on the Fourier-space grid following the steps described in Ref. 9. Forces on the Fourier-space grid are then transformed to real-space by an inverse FFT. Next, the forces are interpolated from the real-space grid to the actual particle positions using B-splines of the same order, in a manner similar to the earlier step of assigning particle positions to the real-space grid. FFT involves an inherent truncation in the maximum allowable length of Fourier-space vectors, which is one cause for error in the Fourier-space component of the forces. In addition, errors occur because of aliasing, which arises because calculations are being made on a grid, and because of the interpolation of particle positions to the real-space grid and of forces computed on the real-space grid to actual particle positions. Critically, the PPPM algorithm employs an optimal Green function6 that is derived by minimizing the error in the forces for a random distribution of particles. The optimal Green function can be interpreted as a weighted average of Fourier modes of the actual Green function.

For the version of the PPPM algorithm in which forces are computed on a grid,9 the optimal Green function corresponding to an arbitrary Green function is given by9 

G^𝐤𝐧=𝐦U^𝐤n+m2G^𝐤n+m𝐤𝐧𝐤n+m[𝐦U^𝐤n+m2]2|𝐤𝐧|2,
(31)

where n refers to the grid vector denoted by the triplet of grid indices (nx, ny, nz), G^𝐤𝐧 is the actual Green function evaluated at the Fourier-space vector kn corresponding to grid vector n, m refers to the triplet of grid indices (mx, my, mz) that contribute to aliasing, and U^𝐤𝐧 is the Fourier transform of the B-spline of order p. U^𝐤𝐧 is given by

U^𝐤𝐧=[sin(πnx/Mx)πnx/Mx]p[sin(πny/My)πny/My]p[sin(πnz/Mz)πnz/Mz]p,
(32)

where Mx, My, and Mz refer to the number of grid points along the respective directions in Fourier-space. The triplet of grid sizes (hx, hy, hz) are then given by (Lx/Mx, Ly/My, Lz/Mz). The estimate of the optimal RMS error in the Fourier-space force is given by9 

ΔFFQNV2/3χF,
(33)

where

χF2V2/3=𝐤0G^𝐤|𝐤|2𝐧([𝐦U^𝐤n+m2G^𝐤n+m𝐤𝐧𝐤n+m]2[𝐦U^𝐤n+m2]2|𝐤𝐧|2).
(34)

Figs. 3(d)–3(f) show very good agreement between the RMS error estimates for the Fourier-space force with order of B-splines ranging from 2 to 7 and the corresponding RMS error computed using the same system of 5×103 unit charges that was employed for the real-space error analysis. Because the simulation volume is a cube, the numbers of grid points M along each direction in Fourier-space is equal.

Eq. (33) is not of a form that explicitly reveals the influence of the parameters h, α and p on the error incurred in the Fourier-space force calculation. Therefore, (33) requires further simplification, which is possible if the arbitrary Green function decays sufficiently rapidly in Fourier-space that the alias contributions can be neglected. Following the approach in Ref. 10, (33) is approximated to

ΔFF(approx)QNVAF1/2,
(35)

where AF is given by

AF3(2π)2m=0p1Cm(p)(h2)2(p+m)2[1+2(p+m)]β(p,m),
(36)

where β(p,m) denotes an integral given by

β(p,m)=0dkG^k2k2(p+m+2),
(37)

and Cm(p) are coefficients listed in Table I of Ref. 10 (see Appendix  B for further detail about the simplification of (33)). For the Yukawa interaction, substituting G^k=4π(k2+κ2)e(k2+κ2)/4α2 results in a modified form for β(p,m) given by

β(p,m,κ/α)=(4π)2α2(p+m)+1ψ(p,m,κ/α),
(38)

where ψ(p,m,κ/α) denotes an integral of the form

ψ(p,m,κ/α)=κ/αdueu2/2u3(u2κ2/α2)p+m+3/2.
(39)

Finally, a simplified form of the RMS error estimate for the Fourier-space component of the force for the Yukawa interaction is given by

ΔFFY(approx)=2QNV(3α)1/2(BFY)1/2,
(40)

where

BFY=m=0p1Cm(p)(hα2)2(p+m)2[1+2(p+m)]ψ(p,m,κ/α).
(41)

Fig. 4 shows a good agreement between ΔFFY(approx.) and the full error estimate ΔFFY for κ = 0.1 and 1 for a system of 103 unit charges. For κ=1 as α decreases below unity, ΔFFY(approx.) deviates from ΔFFY. Further, we found that for κ>1, ΔFFY(approx.) is not a good approximation of ΔFFY for the α values of interest. Nevertheless, the simplified form of ΔFFY(approx.) is useful because it reveals that the error approximately varies as the product of the grid size and the Ewald splitting parameter (hα) raised to the power given by the order of B-spline (p). Further, finding an optimal set of parameters requires considerable effort using (33), whereas using a simplified estimator can expedite the process of finding the optimal parameters. An alternative approach for estimating the errors is described in Ref. 54.

FIG. 4.

Comparison of approximate RMS error estimates (curves) and full RMS error estimates (dots) of the Fourier-space forces for (a) κ=0.1 and (b) κ=1.0 for a system of 103 unit charges. There is a good agreement between the simplified (approximated) error estimates and the full error estimates for κ=0.1. For κ=1; the simplified estimates deviate from the full estimates as α decreases below unity.

FIG. 4.

Comparison of approximate RMS error estimates (curves) and full RMS error estimates (dots) of the Fourier-space forces for (a) κ=0.1 and (b) κ=1.0 for a system of 103 unit charges. There is a good agreement between the simplified (approximated) error estimates and the full error estimates for κ=0.1. For κ=1; the simplified estimates deviate from the full estimates as α decreases below unity.

Close modal

The total RMS error in the force computed using the PPPM algorithm is defined as

ΔFtot=ΔFR2+ΔFF2.
(42)

To compare the computational costs of the PPPM and LCL algorithms, the execution times for the two algorithms must be compared for the same RMS force errors. To this end, an estimate of the RMS force error for Yukawa interaction is needed. Substituting 𝐟Y(𝐫)=𝐫ϕY(𝐫) in (29), where ϕY(𝐫)=eκr/r, results in the following estimate of the RMS force error for Yukawa interaction:

ΔFYQNV2πκeκrc.
(43)

This equation, in turn, implies that

rcD1κlog(ΔFYQNV2πκ)
(44)

(see Appendix  B for further detail). As shown in Fig. 5, the RMS error estimate in (43) has been verified by comparison with the RMS error computed using 105 unit charges placed at random positions in a box.

FIG. 5.

RMS force errors versus the cutoff radius rc is shown. RMS force errors (dots) computed using 105 unit charges placed at random positions, with κ=1, show very good agreement with the corresponding error estimate (dashed curve) given by (43).

FIG. 5.

RMS force errors versus the cutoff radius rc is shown. RMS force errors (dots) computed using 105 unit charges placed at random positions, with κ=1, show very good agreement with the corresponding error estimate (dashed curve) given by (43).

Close modal

Based on the error analysis for the Yukawa interaction, Fig. 6 shows that for κ=0.1 and an RMS force error of 10−8 (e2/ai2), the minimum image convention rcL/2 is not satisfied below a system size of 107 unit charges. Therefore, for a κ value and particle numbers for which LCL would be less efficient, PPPM serves as an excellent alternative. As κ increasesfor approximately the same error of 10−8 (e2/ai2), the threshold above which the particle number satisfies the minimum image convention decreases; for example, for κ=0.5, the minimum image convention is satisfied beyond 105 unit charges, as shown in Fig. 6. Importantly, for some conditions where the LCL algorithm is applicable, we find the PPPM algorithm to be computationally advantageous. This observation is based on the comparison of the time per time step of PPPM and LCL algorithms for a fixed RMS force error.The timing results for a range of κ and N (with unit charges) are provided in Table I. For example, with 106 unit charges and an RMS force error of approximately 10−5 (e2/ai2), we find that the PPPM algorithm was approximately 25 and 130 times faster than the LCL algorithm for κ=0.5 and 0.3, respectively (as shown in Table I). As κ approaches unity, we find the LCL and PPPM algorithms to be comparable in performance, as summarized by the entries in Table I for κ=1. As κ increases beyond unity, we find that the PPPM algorithm loses its computational advantage over the LCL algorithm, as can be seen in the entries in Table I for κ=2. We should point out, however, that execution times are architecture dependent and our conclusions are based on that fact. All calculations used a 3.5 GHz Intel Xeon(R) processor and 32 GB RAM.

FIG. 6.

Ratio of the cutoff radius to half the edge length of simulation cube (rcDL/2) for a fixed RMS force error over a range of particle numbers. For κ = 0.1, the ratios for an error of 10−8(e2/ai2) and 10−6(e2/ai2) are given by the red line and the red dashed line, respectively. For κ = 0.5, the ratios for an error of 10−8(e2/ai2) and 10−6(e2/ai2) are given by the blue line and the blue dashed line, respectively. Also shown is the light gray region corresponding to beyond the minimum image convention (rcD>L/2) which implies that for κ = 0.1 and an error of 10−8(e2/ai2), LCL would be less efficient if the system has less than about 107 unit charges, while for κ = 0.5, LCL would be less efficient for a system with less than about 105 unit charges.

FIG. 6.

Ratio of the cutoff radius to half the edge length of simulation cube (rcDL/2) for a fixed RMS force error over a range of particle numbers. For κ = 0.1, the ratios for an error of 10−8(e2/ai2) and 10−6(e2/ai2) are given by the red line and the red dashed line, respectively. For κ = 0.5, the ratios for an error of 10−8(e2/ai2) and 10−6(e2/ai2) are given by the blue line and the blue dashed line, respectively. Also shown is the light gray region corresponding to beyond the minimum image convention (rcD>L/2) which implies that for κ = 0.1 and an error of 10−8(e2/ai2), LCL would be less efficient if the system has less than about 107 unit charges, while for κ = 0.5, LCL would be less efficient for a system with less than about 105 unit charges.

Close modal
TABLE I.

Comparison of times per time step for force calculations using the PPPM and LCL algorithms for Yukawa interaction, for a range of κ and the number of unit charges (N). The entries correspond to an error of approximately 10−5(e2/ai2). All times are in seconds. α is the Ewald splitting parameter, rc is the cutoff radius for the real-space calculation, M is the number of grid points along each direction in Fourier-space, and rcD is the cutoff radius for LCL. For all Fourier-space calculations, B-spline of order p = 6 was found to be optimal. TR and TF are the times per time step of real-space and Fourier-space calculations, respectively. TPPPM refers to the total time per time step for PPPM and is given by TPPPM=TR+TF. TLCL is the time per time step for LCL.

κNα(ai1)rc (ai)MrcD (ai)TR (s)TF (s)TPPPM (s)TLCL (s)TLCL/TPPPM
0.3 1 × 105 0.625 5.2 128 35.48 2.61 1.92 4.53 324.2 71.56 
 1 × 106 0.579 5.6 256 35.62 25.65 26.81 52.46 7138.24 136 
0.5 1 × 105 0.614 5.3 128 22.24 1.97 1.36 3.33 133.6 40.1 
 1 × 106 0.569 5.7 256 22.33 24.79 23.16 47.95 1173.05 24.5 
1 × 105 0.61 5.33 128 12.08 1.89 1.35 3.24 11.7 3.6 
 1 × 106 0.57 5.8 256 12.34 26.1 21.2 47.3 119.9 2.53 
1.5 1 × 105 0.684 4.6 128 7.99 1.28 1.22 2.5 3.07 1.23 
 1 × 106 0.635 5.2 256 8.38 18.66 26.34 45 34.76 0.77 
1 × 105 0.684 4.5 128 6.4 1.2 2.17 3.37 1.83 0.54 
 1 × 106 0.683 4.7 256 6.47 13.2 22.8 36 16.3 0.45 
κNα(ai1)rc (ai)MrcD (ai)TR (s)TF (s)TPPPM (s)TLCL (s)TLCL/TPPPM
0.3 1 × 105 0.625 5.2 128 35.48 2.61 1.92 4.53 324.2 71.56 
 1 × 106 0.579 5.6 256 35.62 25.65 26.81 52.46 7138.24 136 
0.5 1 × 105 0.614 5.3 128 22.24 1.97 1.36 3.33 133.6 40.1 
 1 × 106 0.569 5.7 256 22.33 24.79 23.16 47.95 1173.05 24.5 
1 × 105 0.61 5.33 128 12.08 1.89 1.35 3.24 11.7 3.6 
 1 × 106 0.57 5.8 256 12.34 26.1 21.2 47.3 119.9 2.53 
1.5 1 × 105 0.684 4.6 128 7.99 1.28 1.22 2.5 3.07 1.23 
 1 × 106 0.635 5.2 256 8.38 18.66 26.34 45 34.76 0.77 
1 × 105 0.684 4.5 128 6.4 1.2 2.17 3.37 1.83 0.54 
 1 × 106 0.683 4.7 256 6.47 13.2 22.8 36 16.3 0.45 

In this section, we apply our results to the computation of the dynamic structure factor S(k,ω). Our interest in S(k,ω) falls into two categories. Theoretically, S(k,ω) contains all of the many-body processes associated with thermal density fluctuations, including collective modes and transport.29,40–42 All of these processes are obviously impacted by the choice of ϵ(k), and we wish to use MD for model validation. The ionic S(k,ω) is also connected directly with neutron scattering43 and, indirectly, to X-ray Thomson Scattering (XRTS).44 In the context of warm dense matter physics, for example, recent XRTS studies have developed a modified Navier-Stokes equation (MNSE) to model the ionic S(k,ω).29 

Within the framework of the Yukawa model, we are interested in density and temperature conditions for which MD is required to validate approximate theoretical models and where LCL would be computationally less efficient than PPPM for force calculations. Consider the case Γ=2 and κ=0.1, for 104 unit charges the PPPM algorithm can have a force error of approximately 10−6 (e2/ai2) for an optimal choice of α, rc, h, and p. For the same error and particle number, LCL would be less efficient because the minimum-image convention is not satisfied. This case corresponds to an effective coupling of Γeff=Γeκ0.74, which is of an order that is relevant to systems of recent interest, such as warm dense matter.44 

The dynamic structure factor for an N-particle system is defined as

S(𝐤,ω)1Ndteiωtn(𝐤,t)n(𝐤,0).
(45)

Here, the brackets refer to the stationary time average

n(𝐤,t)n(𝐤,0)=1T0Tdτn(𝐤,t+τ)n(𝐤,τ),
(46)

where T is the length of the run. We note that for a system in equilibrium the stationary time average is equivalent to the ensemble average as T. The Fourier-transformed densities n(k,t) are obtained from the definition of the density,

n(𝐫,t)=j=1Nδ(𝐫𝐫j(t)),
(47)

to yield

n(𝐤,t)=d3rn(𝐫,t)ei𝐤𝐫=j=1Nei𝐤𝐫j(t).
(48)

Substituting (46) into (45), we obtain

S(𝐤,ω)=1NT0Tdτn(𝐤,τ)dteiωtn(𝐤,t+τ),
(49)

which, through the change of variables y=t+τ, can be written as

S(k,ω)=1NT[0Tdτn(𝐤,τ)eiωτ][dyeiωyn(𝐤,y)].
(50)

We use the usual definition of the temporal Fourier transform

n(𝐤,ω)=dyeiωyn(𝐤,y).
(51)

For numerical purposes, the infinite domain must be approximated as finite, and as we are calculating stationary processes, we can take this domain to be [0, T] without loss of generality for sufficiently large T. This yields the approximate transform

n(𝐤,ω)=0Tdτn(𝐤,τ)eiωτ.
(52)

Therefore, the final form for S(𝐤,ω) is given by

S(k,ω)=1NT|n(𝐤,ω)|2,
(53)

as n(𝐤,ω) = n*(𝐤,ω). The Fourier-transformed density n(𝐤,ω) is given by

n(𝐤,ω)=j=1N0Tdtei[𝐤𝐫j(t)ωt].
(54)

Particle positions from MD are available only at certain times in [0, T]; hence, n(𝐤,ω) is computed using

n(𝐤,ω)=j=1Nnt=0Nt1ei[𝐤𝐫j(ntΔt)ωΔt],
(55)

where Nt is the number of steps after the equilibration phase of MD, and Δt is the step size for time integration. Particle positions are stored at intervals corresponding to this step size. Long runs are preferred because the noise in S(𝐤,ω) decreases as T increases. The temporal Fourier transform is computed by FFT. S(𝐤,ω) computed from MD data tends to be noisy, and the extent of the noise increases as k increases. To determine the location of the peak of S(𝐤,ω) and its width, the noisy curves are usually smoothed by averaging over data from different MD runs that differ in their initial conditions. In Sec. V B, we present a potentially more efficient smoothing approach that employs kernel smoothing.49,50

Kernel smoothing refers to weighted averaging of noisy observations to obtain a smooth function.49,50 To smooth our data using kernel smoothing, we use a kernel regression technique50 defined by

y(x)=i=1BKσ(x,xi)y(xi)i=1BKσ(x,xi),
(56)

where y(xi) is the observation of y at xi, B is the number of observations, and Kσ(x,xi) is the smoothing kernel. This yields a weighted average to give a smooth function y(x) evaluated at x. We use the Gaussian kernel

Kσ(x,xi)=e(xxi)2/2σ2,
(57)

where σ is the kernel width. For Γ=50 and κ=1, Figs. 7 and 8 show the smooth dynamic structure factor curves (red curves) obtained by kernel smoothing of noisy dynamic structure factor curves obtained from MD simulations (grey curves) with σ = 0.01 and σ = 0.005, respectively. MD simulations of 103 unit charges were performed using the PPPM algorithm, with an RMS force error of approximately 10−4 (e2/ai2). The parameters used were α=1, rc=3, M=64, and p = 2.

FIG. 7.

S(q,Ω) obtained from PPPM-MD for a Yukawa system with Γ=50 and κ=1 is shown for a range of values of q. Each panel shows S(q,Ω) for one value of q, for q = kai = 0.39, 0.78, 1.17, 1.56, 1.95, 2.31, which are the first six multiples of 2π/L, where L is the edge length of the simulation cube containing 103 unit charges. The grey curves are the noisy S(q,Ω) computed using the particle positions in the MD simulation. The red curves are S(q,Ω) obtained by kernel-smoothing the noisy S(q,Ω) with a Gaussian kernel of width σ=0.01. The blue band surrounding the smooth S(q,Ω) curves shows the standard deviation in the smooth S(q,Ω) with respect to kernel smoothing.

FIG. 7.

S(q,Ω) obtained from PPPM-MD for a Yukawa system with Γ=50 and κ=1 is shown for a range of values of q. Each panel shows S(q,Ω) for one value of q, for q = kai = 0.39, 0.78, 1.17, 1.56, 1.95, 2.31, which are the first six multiples of 2π/L, where L is the edge length of the simulation cube containing 103 unit charges. The grey curves are the noisy S(q,Ω) computed using the particle positions in the MD simulation. The red curves are S(q,Ω) obtained by kernel-smoothing the noisy S(q,Ω) with a Gaussian kernel of width σ=0.01. The blue band surrounding the smooth S(q,Ω) curves shows the standard deviation in the smooth S(q,Ω) with respect to kernel smoothing.

Close modal
FIG. 8.

S(q,Ω) obtained from PPPM-MD for a Yukawa system with Γ=50 and κ=1 is shown for a range of values of q. Each panel shows S(q,Ω) for one value of q, for q = kai = 0.39, 0.78, 1.17, 1.56, 1.95, 2.31, which are the first six multiples of 2π/L, where L is the edge length of a simulation cube containing 103 unit charges. The grey curves are the noisy S(q,Ω) computed using the particle positions in the PPPM MD simulation. The red curves are S(q,Ω) obtained by kernel-smoothing the noisy S(q,Ω) with a Gaussian kernel of width σ=0.005. The blue band surrounding the smooth S(q,Ω) curves shows the standard deviation in the smooth S(q,Ω) with respect to kernel smoothing.

FIG. 8.

S(q,Ω) obtained from PPPM-MD for a Yukawa system with Γ=50 and κ=1 is shown for a range of values of q. Each panel shows S(q,Ω) for one value of q, for q = kai = 0.39, 0.78, 1.17, 1.56, 1.95, 2.31, which are the first six multiples of 2π/L, where L is the edge length of a simulation cube containing 103 unit charges. The grey curves are the noisy S(q,Ω) computed using the particle positions in the PPPM MD simulation. The red curves are S(q,Ω) obtained by kernel-smoothing the noisy S(q,Ω) with a Gaussian kernel of width σ=0.005. The blue band surrounding the smooth S(q,Ω) curves shows the standard deviation in the smooth S(q,Ω) with respect to kernel smoothing.

Close modal

We have defined a simple way of quantifying the smoothing error as follows:

Δy(x)=y2(x)[y(x)]2,
(58)

where y2(x) is defined as

y2(x)=i=1BKσ(x,xi)y2(xi)i=1BKσ(x,xi).
(59)

More rigorous ways of quantifying the error in smoothing can be found in Refs. 49 and 50. Figs. 7 and 8 show the error in smoothing (blue band) about the smooth S(q,Ω) curves (red curves). Because the kernel width influences the extent of smoothing and the corresponding error in the smoothing, the width must be chosen carefully such that the smoothed data are consistent with the physically expected result.

In this section, we compare dispersion relations obtained using PPPM-MD with those obtained using different theoretical models. For convenience, k and ω are expressed as dimensionless quantities q = kai and Ω=ω/ωp, where ωp=(4πniQi2/M)1/2 is the ion plasma frequency for ionic mass M and ionic number density ni.

One simple dispersion relation obtained from a static structure factor S(q)29 is given by

Ω2=q23ΓS(q).
(60)

In the random-phase approximation (RPA) for a Yukawa system, S(q) takes the approximate form

SRPA(q)=q2+κ2q2+κ2+3Γ.
(61)

More accurate theoretical forms of S(q) are discussed in Refs. 29 and 46. Fig. 9(a) shows the comparison of a dispersion relation obtained using PPPM-MD and dispersion relations obtained using different theoretical approximations. The purpose of this comparison is to show that the MD result agrees well with the theoretical dispersion relations in the low-q limit, where the theory is considered to work well.29 As q increases, the MD curve begins to deviate from the theoretical curves; this occurs because of the lack of required many-body effects in those theoretical models. A very accurate S(q) can be obtained from hypernetted chain (HNC) equations with a bridge function48 (denoted by SHNC(q)). Although particle correlations are captured in Ω(q) by SHNC(q) (solid blue curve), this model does not incorporate viscoelastic effects,46 and hence it underestimates Ω(q) compared to MD. Fig. 9(b) shows the full-width-half-maximum (FWHM) of a smooth S(q,Ω) constructed by kernel smoothing of S(q,Ω) obtained from MD data using a Gaussian kernel of widths σ=0.01 (blue dots) and σ= 0.005 (green dots). A smaller width of σ=0.005 results in a curve that approaches zero (as expected from theory) more accurately than does a curve constructed using a width of σ= 0.01.

FIG. 9.

Theoretical dispersion relations are compared with dispersion relation from dynamic structure factor of MD. In the left panel (a), dispersion relation from S(q,Ω) of MD smoothed with Gaussian kernels of widths σ=0.01 (black dots) and σ=0.005 (green dots). The green band corresponds to the full-width-half-maximum (FWHM) of smooth S(q,Ω) obtained using σ=0.005 (the band for the other case with σ=0.01 is similar to the band for σ=0.005, hence, is not shown). Also shown are Ω(q) obtained using Eq. (60) with S(q) from RPA (solid red), with S(q)SHNC(0) (dashed red) and with full SHNC(q) (solid blue) where SHNC(q) refers to S(q) computed using the hypernetted chain (HNC) equations with a bridge function. In the right panel (b), full-width-half-maximum of smooth S(q,Ω) denoted by Δ/ωp with σ=0.005 (green dots) and σ=0.01 (blue dots). Both (a) and (b) correspond to the MD results described in Figs. 7 and 8.

FIG. 9.

Theoretical dispersion relations are compared with dispersion relation from dynamic structure factor of MD. In the left panel (a), dispersion relation from S(q,Ω) of MD smoothed with Gaussian kernels of widths σ=0.01 (black dots) and σ=0.005 (green dots). The green band corresponds to the full-width-half-maximum (FWHM) of smooth S(q,Ω) obtained using σ=0.005 (the band for the other case with σ=0.01 is similar to the band for σ=0.005, hence, is not shown). Also shown are Ω(q) obtained using Eq. (60) with S(q) from RPA (solid red), with S(q)SHNC(0) (dashed red) and with full SHNC(q) (solid blue) where SHNC(q) refers to S(q) computed using the hypernetted chain (HNC) equations with a bridge function. In the right panel (b), full-width-half-maximum of smooth S(q,Ω) denoted by Δ/ωp with σ=0.005 (green dots) and σ=0.01 (blue dots). Both (a) and (b) correspond to the MD results described in Figs. 7 and 8.

Close modal

More rigorous theoretical models for computing the dispersion relation are the quasilocalized charge approximation (QLCA),45 modified Navier-Stokes (MNSE), 29 andviscoelastic-dynamic density functional theory (VE-DDFT).46 Using QLCA, the dispersion relation takes the form

Ω2=q2q2+κ2+0drr[g(r)1]K(qr,κr),
(62)

where K(x,y) is given by

K(x,y)=ey[(2+2y+23y2)(sinxx+3cosxx23sinxx3)]eyy23(sinxx1).
(63)

The dispersion relation from the Navier-Stokes equation modified to incorporate low-frequency corrections results in the dispersion relation for MNSE is given by

Ω2=q23ΓS(q)(43ωE3ωpq2η)2,
(64)

where ωE is the Einstein frequency, and η is the dimensionless viscosity given by η=η/3ωEMniai2, where η is the shear viscosity. Values of η can be found in Ref. 47. VE-DDFT incorporates viscoelastic effects that are not incorporated in QLCA and MNSE formulations. The dispersion relation calculated using VE-DDFT is given by

Ω2=q23ΓS(q)iη¯q2Ω1iΩτωp,
(65)

where τ is given by

ωpτ=3η¯Γ1γiμ+415Ec,
(66)

where γi is the adiabatic index, μ is the compressibility, Ec(Γ,κ) is the correlation energy in units of temperature, and η¯ is the normalized viscosity given by η¯=(43η+ε)/(Mniωpai2), where ε is the bulk viscosity.

S(q,Ω) curves from MD were obtained by averaging over data from 20 different PPPM-MD simulations for Γ=2 and κ=0.1 using 104 unit charges and an RMS force error of 10−6 (e2/ai2). The parameters used were α = 0.46, rc = 7.8, M = 64, and p = 6. Each MD run was 800 ωp1 long with a time step Δt=0.05ωp1 and frequency resolution Δω0.008ωp. S(q,Ω) curves shown in Fig. 10(a) were obtained by smoothing the noisy S(q,Ω) curves from MD simulations using Gaussian kernels of width σ=0.01. Figs. 10(b) and 10(c) show the corresponding dispersion relations obtained using the smoothed S(q,Ω) curves and comparisons with the theoretical dispersion relations obtained for QLCA, MNSE and VE-DDFT, with S(q) given by SHNC(q). We can see that for values of q greater than approximately 0.3, the MD result increasingly deviates from the theoretical curves, while the VE-DDFT curve deviates from the MNSE and QLCA curves for q greater than approximately 0.4. Further, the QLCA and MNSE curves also deviate from each other for q greater than approximately 0.7. Hence, the differences between the dispersion relations for the theoretical models and their differences from MD results clearly demonstrate the need for model validation with accurate and computationally efficient MD simulations, which can be performed using the PPPM algorithm as discussed in this paper.

FIG. 10.

Dynamic structure factor and dispersion relation for Γ=2 and κ=0.1. In the left panel (a), S(q,Ω) for the first six integral multiples of Δq=2π/L=0.18 by averaging over data from 20 different MD simulations using 104 unit charges in a cube and force calculation using PPPM corresponding to an RMS force error of 10−6 (e2/ai2). For this error, LCL would be less efficient since the minimum image convention is not satisfied. The 20 different MD simulations differed in their initial conditions for particle positions and velocities. Each MD run was 800 ωp1 long with a time step Δt=0.05ωp1 and frequency resolution of Δω0.008ωp. S(q,Ω) curves were obtained by smoothing S(q,Ω) from MD using Gaussian kernel smoothing with kernel width σ=0.01. In the right upper panel (b), dispersion relation from peaks of kernel smoothed S(q,Ω) (red dots), QLCA (blue dots), Murillo’s MNSE formulation (green dots), and VE-DDFT formulation (orange dots). In the right lower panel (c), for a wider range of q, we can see significant deviation of the theoretical dispersion relations from the MD dispersion relation for q greater than about 0.4. Lines connecting the dots in (b) and (c) are for guiding the eye.

FIG. 10.

Dynamic structure factor and dispersion relation for Γ=2 and κ=0.1. In the left panel (a), S(q,Ω) for the first six integral multiples of Δq=2π/L=0.18 by averaging over data from 20 different MD simulations using 104 unit charges in a cube and force calculation using PPPM corresponding to an RMS force error of 10−6 (e2/ai2). For this error, LCL would be less efficient since the minimum image convention is not satisfied. The 20 different MD simulations differed in their initial conditions for particle positions and velocities. Each MD run was 800 ωp1 long with a time step Δt=0.05ωp1 and frequency resolution of Δω0.008ωp. S(q,Ω) curves were obtained by smoothing S(q,Ω) from MD using Gaussian kernel smoothing with kernel width σ=0.01. In the right upper panel (b), dispersion relation from peaks of kernel smoothed S(q,Ω) (red dots), QLCA (blue dots), Murillo’s MNSE formulation (green dots), and VE-DDFT formulation (orange dots). In the right lower panel (c), for a wider range of q, we can see significant deviation of the theoretical dispersion relations from the MD dispersion relation for q greater than about 0.4. Lines connecting the dots in (b) and (c) are for guiding the eye.

Close modal

We have generalized the Ewald decomposition to screened Coulomb interactions with arbitrary DRFs. The resulting effective interactions span from short to long range, depending on the physics contained in the DRF. This extension allows us to identify parameters for which the decomposition offers a computational advantage over efficient neighbor-list-only algorithms (e.g., LCL-only), thereby providing an operational definition of “medium range.”

We have demonstrated an elegant way of performing the Ewald decomposition in Fourier-space and applied the decomposition to two choices of the DRF, Yukawa and EGS interactions, that span a wide range of temperatures and densities of interest. However, this Ewald decomposition is quite general and can be applied to non-Coulomb interactions, such as those that employ electron-ion pseudopotentials.51 

For the Yukawa interaction, we implemented a standard PPPM6,9 algorithm. We verified our implementation with an extensive analysis of force errors that are inherent to the PPPM algorithm. The analysis involved deriving, for a random distribution of particles, analytical error estimates that agreed very well with the computed error. The error estimates provide insight into the effects of truncation and discretization on the errors in the computed forces, and this information is needed when choosing optimal parameters for calculating forces efficiently. The error analysis was also extended to the plain Yukawa interaction, which allows for a cutoff radius because of its exponentially decaying form and, hence, can be implemented using the LCL algorithm. For the same force errors computed using the PPPM and LCL algorithms, we compared the time required for one step of a force calculation for a range of κ values and particle numbers.

We found that the PPPM algorithm offers a significant computational advantage over the LCL algorithm for a medium-range Yukawa potential, which corresponds to screening lengths exceeding the ion-sphere radius (κ<1). For screening lengths less than the ion-sphere radius (κ>1), the computational advantage of PPPM over LCL was found to diminish (see Table I). As screening lengths decreased further to values less than half of the ion-sphere radius (κ2), the LCL algorithm became more advantageous than the PPPM algorithm.

We then applied our method to the computation of the dynamic structure factor for a Yukawa system. By varying the screening length and the system size, we found cases for which the PPPM algorithm is computationally advantageous over the LCL algorithm. One such case is κ=0.1 and 104 unit charges, corresponding to an RMS force error of approximately 10−6 (e2/ai2), which precludes the use of the LCL algorithm because the minimum image convention is not satisfied. The computed dynamic structure factor tends to be noisy and, therefore, must be smoothed, which is usually accomplished by averaging over the results of MD runs that vary in their initial conditions. We have presented an alternative and potentially more efficient way of smoothing S(k,ω) data using kernel smoothing. With Gaussian smoothing kernels, the dispersion relation obtained from the smoothed dynamic structure factor curves was compared with theoretical dispersion relations obtained using the QLCA, MNSE, and VE-DDFT models, as shown in Fig. 10. The differences between the dispersion relations obtained using MD and those obtained using theoretical models, as shown in Fig. 10, highlight the need for model validation with accurate and computationally efficient MD simulations, which are possible using the PPPM algorithm for a wide range of temperatures and densities of interest.

We note that a number of techniques, such as analytical differentiation,52 interlacing,53 oversampling,54 and others, could be used to design PPPM algorithms. The particular choice of the technique has consequences for the form of the optimal Green function, the errors, and the computational costs. Any of these techniques could be applied in a straightforward way to arbitrary dielectric response functions.

We would like to thank John Verboncoeur of Michigan State University for his support and useful discussions. The work of Dharuman and Murillo was supported by the Air Force Office of Scientific Research (Grant No. FA9550-12-1-0344); Stanton and Glosli were supported under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory Contract No. DE-AC52-07NA27344.

In this section, we provide the details of Ewald decomposition for Coulomb interactions screened with Yukawa and EGS DRFs.

In general, the Ewald decomposition of a Coulomb interaction linearly screened through an isotropic response takes the form

ϕ(k)=υ(k)ϵ(k)[1ζek2/4α2]+ζυ(k)ϵ(k)ek2/4α2
(A1)
=ϕR(k)+ϕF(k),
(A2)

where υ(k)=4π/k2. The long-range component ϕF(k) can be evaluated in Fourier-space; however, the remaining short-range component ϕR(k) must be inverted to real-space as

ϕR(r)=1(2π)3d𝐤υ(k)ϵ(k)[1ζek2/4α2]ei𝐤𝐫
(A3)
=2πr0dkksin(kr)ϵ(k)[1ζek2/4α2].
(A4)

1. Yukawa dielectric function

We first consider the Yukawa dielectric function of the form

ϵY1(k)=k2ks2+k2,
(A5)

where ks=λs1 is the magnitude of the inverse screening vector associated with the background charge distribution. Using the known result

0dkksin(kr)k2+c2eb2k2=π4eb2c2[f(r)f(r)],withf(r)=ecrerfc(bc+r2b)
(A6)

and erfc(z) being the usual complimentary error function, we can evaluate the above expression as

ϕRY(r)=1reksr+ζ2reks2/4α2[eksrerfc(ks2α+αr)eksrerfc(ks2ααr)].
(A7)

Lastly, the charge fraction ζ can be tuned to make this result as short range as possible. Noting that

erfc(z)ez2πz(112z2+34z4),Re{z}1,
(A8)

the large-r behavior of (A7) is given by

ϕRY(r)(1ζeks2/4α2)eksrr+O(r2eα2r2).
(A9)

Thus, ζ=eks2/4α2 is the optimal choice for the charge fraction, which results in the final form of the real-space interaction

ϕRY(r)=1reksr+12r[eksrerfc(ks2α+αr)eksrerfc(ks2ααr)]=eksr2rerfc(αr+ks2α)+eksr2rerfc(αrks2α).
(A10)

2. EGS dielectric function

We next turn to the next-order gradient correction to the Yukawa interaction using the EGS dielectric function

ϵEGS1(k)=k2(ks2+14νk2)ks4+k2(ks2+14νk2),
(A11)

where expressions for ν and further generalizations can be found in Ref. 34; however, the analysis is applicable to any dielectric function that can be expressed in this form. To simplify the problem, this dielectric function can be written in terms of partial fractions as

ϵEGS1(k)=Ak2k2+k2A+k2k+2+k2,
(A12)

where we have introduced the coefficients and new inverse lengths

A±=ks214νk±21νks2,k±2=2ν(1±1ν)ks2.
(A13)

We have now reduced the calculation to evaluating two Yukawa-type dielectric functions to yield

ϕREGS(r)=Aϕ(r)A+ϕ+(r),
(A14)
ϕ±(r)=1rek±r+ζ2rek±2/4α2[ek±rerfc(k±2α+αr)ek±rerfc(k±2ααr)].
(A15)

Note that this result is generic even for the case in which ν>1, where the coefficients are complex. The large-r behavior for this interaction is given by

ϕREGS(r)A(1ζek2/4α2)ekrrA+(1ζek+2/4α2)ek+rr,
(A16)

and there is no longer a choice in ζ that eliminates the leading-order term. However, given that k<k+ (for ν<1), the optimal choice is ζ=ek2/4α2.

3. Optimizing the charge fraction ζ

In general, there will not be an analytic expression for the optimal ζ in terms of α and the physical parameters, and a numerical investigation will be necessary. An approximate expression can be obtained by noting that the large-r limit of ϕR(r) will be dominated by the low-k behavior of the integrand. By temporarily inserting the approximate dielectric function

ϵapp1(k)=k2k02+k2,k02=limk0(k2ϵ(k)),
(A17)

the resulting optimal choice of charge fraction becomes ζapp=eks2/4α2. For example, k0 = ks for the EGS dielectric function; this becomes increasingly valid for small ν, when the system is relatively hot and/or dense.

In the particular case of the EGS dielectric function, there is an alternative approach to accelerate the convergence of the real-space interaction. By introducing two screening clouds in the Ewald decomposition, the real- and Fourier-space interactions, respectively, become

ϕR(k)=υ(k)ϵ(k)[1ζ1ek2/4α12ζ2ek2/4α22],
(A18)
ϕF(k)=υ(k)ϵ(k)[ζ1ek2/4α12+ζ2ek2/4α22].
(A19)

The above analysis can then be repeated to obtain

ϕREGS(r)=Aϕ(r)A+ϕ+(r),
(A20)
ϕ±(r)=1rek±r+ζ12rek±2/4α12[ek±rerfc(k±2α1+α1r)ek±rerfc(k±2α1α1r)]+ζ22rek±2/4α22[ek±rerfc(k±2α2+α2r)ek±rerfc(k±2α2α2r)],
(A21)

which now has the limiting behavior

ϕREGS(r)A(1ζ1ek2/4α12ζ2ek2/4α22)ekrrA+(1ζ1ek+2/4α12ζ2ek+2/4α22)ek+rr.
(A22)

This leading-order term can hence be eliminated by solving the system of equations

1=ζ1ek2/4α12+ζ2ek2/4α22,
(A23)
1=ζ1ek+2/4α12+ζ2ek+2/4α22
(A24)

to obtain the optimal charge fractions

ζ1=ek+2/4α22ek2/4α22ek2/4α12+k+2/4α22ek2/4α22+k+2/4α12,
(A25)
ζ2=ek2/4α12ek+2/4α12ek2/4α12+k+2/4α22ek2/4α22+k+2/4α12.
(A26)

Note that this approach results in two Ewald parameters to optimize, which could prove to be advantageous.

In this section, we provide details of the derivation of the RMS error estimate for forces computed using the PPPM algorithm. We begin with the analysis for the real-space portion of the algorithm. As introduced in Section IV, for a system of point charges with an arbitrary pair-wise short-range interaction, the estimate of the RMS force error is given by (29). For the real-space part of the Ewald decomposition for Yukawa interaction, (29) takes the form

ΔFRY=QNV[Vc(rrc<)d3r|𝐟RY(𝐫)|2]1/2,
(B1)

where 𝐟RY(𝐫) is given by

𝐟RY(𝐫)=𝐫ϕRY(r)=ϕRY(r)r𝐫r
(B2)

because of the radial nature of ϕRY(r). To simplify the analysis, we make use of the symmetry in ϕRY(r) with respect to κ and express ϕRY(r) as a sum of two terms,

ϕRY(r)=VRY(r,κ)+VRY(r,κ),
(B3)

where

VRY(r,κ)=12r[eκrerfc(αrκ2α)].
(B4)

This implies that

ϕRY(r)r=VRY(r,κ)rVRY(r,κ)r,
(B5)

where

VRY(r,κ)r=12r[2απeκre(αr+κ2α)2+κeκrerfc(αr+κ2α)]+eκr2r2erfc(αr+κ2α).
(B6)

For a sufficiently large distance (r1), the complementary error function erfc(αr+κ2α) can be approximated by the first term of the asymptotic expansion39 given by

erfc(αr+κ2α)2απe(αr+κ2α)22α2(r+κ2α2).
(B7)

Using this approximation in (B6) gives

VRY(r,κ)reκr2[2απe(αr+κ2α)2r+καπe(αr+κ2α)2r(r+κ2α2)]+eκr2απe(αr+κ2α)2r2(r+κ2α2).
(B8)

For r1, the second and third terms on the right-hand side can be neglected compared to the first term, resulting in

VRY(r,κ)r(απ)eκ2/4α2eα2r2r,
(B9)

which implies that VRY(r,κ)r=VRY(r,κ)r. This, in turn, yields

ϕRY(r)r(2απ)eκ2/4α2eα2r2r.
(B10)

Substituting (B10) in (B1) gives

(ΔFRY)2Q2NVV(rrc<)d3r(4α2π)eκ2/2α2e2α2r2r2=16α2Q2NVeκ2/2α2rcdre2α2r2r.
(B11)

Using the first term of the asymptotic expansion,39 the integral on the right-hand side is approximated as

rcdre2α2r2re2α2rc24α2rc,
(B12)

resulting in the simplified expression for ΔFRY given by

ΔFRY2QNVeκ2/4α2eα2rc2rc.
(B13)

We note that the approximation, which involves truncation of the asymptotic expansion that led to the above error estimate, is not valid for κ/α as α0. Hence, this error estimate will not tend to the error estimate for the plain Yukawa interaction that is derived below.

Similar error estimates can be derived for the plain Yukawa interaction because it decays exponentially. For the plain Yukawa interaction

ϕY(r)=eκrr,
(B14)

we have

𝐟Y(𝐫)=ϕY(r)r𝐫r=eκrr2(1+κr)𝐫r.
(B15)

Using (29), the error estimate for the Yukawa interaction is given by

(ΔFY)2=4πQ2NVrce2κrr2(1+κr)2,
(B16)

and the integral in this equation can be approximated using the first term of an asymptotic expansion39 to yield

(ΔFY)22πQ2NVe2κrcκ(1rc+κ)2.
(B17)

For 1rcκ, we could further approximate (B17) to obtain

ΔFYQNV2πκeκrc.
(B18)

We now proceed to simplifying the estimate of the RMS error in the Fourier-space force computed using the PPPM algorithm. As introduced in Section IV, the RMS error estimate for the Fourier-space force is given by

ΔFFQNV2/3χF,
(B19)

where

χF2V2/3=𝐤0G^𝐤|𝐤|2𝐧([𝐦U^𝐤n+m2G^𝐤n+m𝐤𝐧𝐤n+m]2[𝐦U^𝐤n+m2]2|𝐤𝐧|2).
(B20)

If the arbitrary Green function (denoted by G^𝐤𝐧) rapidly decays in Fourier-space such that alias contributions to the Green function can be neglected (m denotes the alias indices), then (B20) can be simplified further, beginning with the approximation given by

[𝐦U^𝐤n+m2G^𝐤n+m𝐤𝐧𝐤n+m]2[U^𝐤𝐧2G^𝐤𝐧𝐤𝐧𝐤𝐧]2=G^𝐤𝐧2kn4U^𝐤𝐧4.
(B21)

This implies that

χF2V2/3𝐤0G^𝐤|𝐤|2𝐧(G^𝐤𝐧2kn4U^𝐤𝐧4[𝐦U^𝐤n+m2]2kn2),
(B22)

which can be rewritten as

χF2V2/3𝐧G^𝐤𝐧2kn2[1U^𝐤𝐧4(𝐦U^𝐤n+m2)2],
(B23)

with the assumption that the maximum length of n (denoted by 𝐧max) is sufficiently large such that G^𝐤𝐧2kn20 for 𝐧=𝐧max. For an isotropic system, following the derivation in Ref. 10, we find that

[1U^𝐤n4(𝐦U^𝐤n+m2)2]3(knzh2)2pm=0p1Cm(p)(knzh2)2m,
(B24)

where the coefficients Cm(p) are listed in Table I of Ref. 10. Therefore, (B23) is approximated to a form given by

χF2V2/33𝐧G^𝐤𝐧2kn2[(knzh2)2pm=0p1Cm(p)(knzh2)2m]
(B25)
=3(h2)2pm=0p1Cm(p)(h2)2m[𝐧G^𝐤𝐧2kn2knz2(m+p)].
(B26)

When the box dimensions are large (L1, for a cube of edge length L),

𝐧(L2π)3d3k,
(B27)

which implies that

𝐧G^𝐤𝐧2kn2knz2(m+p)(L2π)3d3kG^𝐤2k2knz2(m+p)=I.
(B28)

If the Green function is symmetric, then the integral I can be reduced further to the following 1D integral:

I=(L2π)3d3kG^𝐤2k2knz2(m+p)
(B29)
=V(2π)2[0πdϕsinϕ(cosϕ)2(m+p)][0dkG^k2k2(m+p+2)],
(B30)

where

0πdϕsinϕ(cosϕ)2(m+p)=21+2(p+m).
(B31)

The integral I can then be expressed as

I=V(2π)22[1+2(p+m)]β(m,p),
(B32)

where

β(m,p)=0dkG^k2k2(m+p+2).
(B33)

Therefore, for an arbitrary Green function,

χF2V1/3=AF3(2π)2m=0p1Cm(p)(h2)2(m+p)2[1+2(p+m)]β(m,p),
(B34)

which yields the following approximated estimate of the RMS error for an arbitrary Green function:

ΔFF=QNVAF1/2.
(B35)

The Green function for the Ewald decomposition for Yukawa interaction is given by

G^k=4πk2+κ2e(k2+κ2)/4α2.
(B36)

The integral β(m,p) for this case takes the form

βY(m,p)=0dk[4πk2+κ2]2ek2/2α2k2(m+p+2),
(B37)

which results in (38).

1.
V. K.
Shen
,
R. D.
Mountain
, and
J. R.
Errington
,
J. Phys. Chem. B
111
,
6198
(
2007
).
2.
M.
Mecke
,
J.
Winkelmann
, and
J.
Fischer
,
J. Chem. Phys.
107
,
9264
(
1997
).
3.
P.
Ewald
,
Ann. Phys.
369
,
253
(
1921
).
4.
A. Y.
Toukmaji
and
J. A.
Board
,
Comput. Phys. Commun.
95
,
73
(
1996
).
5.
J. W.
Perram
,
H. G.
Petersen
, and
S. W.
De Leeuw
,
Mol. Phys.
65
,
875
(
1988
).
6.
R. W.
Hockney
and
J. W.
Eastwood
,
Computer Simulation Using Particles
(
CRC Press
,
New York
,
1988
).
7.
T.
Darden
,
D.
York
, and
L.
Pedersen
,
J. Chem. Phys.
98
,
10089
(
1993
).
8.
U.
Essmann
,
L.
Perera
,
M. L.
Berkowitz
,
T.
Darden
,
H.
Lee
, and
L. G.
Pedersen
,
J. Chem. Phys.
103
,
8577
(
1995
).
9.
H. A.
Stern
and
K. G.
Calkins
,
J. Chem. Phys.
128
,
214106
(
2008
).
10.
M.
Deserno
and
C.
Holm
,
J. Chem. Phys.
109
,
7694
(
1998
).
11.
E. L.
Pollock
and
J.
Glosli
,
Comput. Phys. Commun.
95
,
93
(
1996
).
12.
V.
Ballenegger
,
J. J.
Cerda
,
O.
Lenz
, and
C.
Holm
,
J. Chem. Phys.
128
,
034109
(
2008
).
13.
I.
Fukuda
and
H.
Nakamura
,
Biophys. Rev.
4
,
161
(
2012
).
14.
D.
Wolf
,
P.
Keblinski
,
S. R.
Phillpot
, and
J.
Eggebrecht
,
J. Chem. Phys.
110
,
8254
(
1999
).
15.
K. N.
Kudin
and
G. E.
Scuseria
,
Chem. Phys. Lett.
283
,
61
(
1998
).
16.
H. Q.
Ding
,
N.
Karasawa
, and
W. A.
Goddard
,
Chem. Phys. Lett.
196
,
6
(
1992
).
17.
J.
Shimada
,
H.
Kaneko
, and
T.
Takada
,
Chem. Phys. Lett.
14
,
867
(
1993
).
18.
S. W.
de Leeuw
,
J. W.
Perram
, and
E. R.
Smith
,
Proc. R. Soc. London, Ser. A
373
,
27
(
1980
).
19.
T. M.
Nymand
and
P.
Linse
,
J. Chem. Phys.
112
,
6152
(
2000
).
20.
A. P.
Sutton
and
J.
Chen
,
Philos. Mag. Lett.
61
,
139
(
1990
).
21.
Y.
Qi
,
T.
Cagin
,
Y.
Kimura
, and
W. A.
Goddard
,
Phys. Rev. B
59
,
3527
(
1999
).
22.
R. E.
Isele-Holder
,
W.
Mitchell
, and
A. E.
Ismail
,
J. Chem. Phys.
137
,
174107
(
2012
).
23.
Y.
Rosenfeld
,
Mol. Phys.
88
,
1357
(
1996
).
24.
G.
Salin
and
J. M.
Caillol
,
J. Chem. Phys.
113
,
10459
(
2000
).
25.
M.
Mazars
,
Phys. Rep.
500
,
43
(
2011
).
26.
U.
Welling
and
G.
Germano
,
Comput. Phys. Commun.
182
,
611
(
2011
).
27.
L. G.
Stanton
and
M. S.
Murillo
,
Phys. Rev. E
93
,
043203
(
2016
).
28.

A slightly more accurate fit for λTF is discussed in Ref. 27.

29.
M. S.
Murillo
,
High Energy Density Phys.
4
,
49
(
2008
).
30.
T. C.
Killian
 et al,
Phys. Rep.
449
,
77
(
2007
).
31.
M. S.
Murillo
,
Phys. Plasmas
11
,
2964
(
2004
).
32.
J. J.
Fortney
and
N.
Nettelmann
,
Space Sci. Rev.
152
,
423
(
2010
).
33.
Plasma Science Committee
,
Frontiers in High Energy Density Physics: The X-Games of Contemporary Science
(
National Academies Press
,
2003
).
34.
L. G.
Stanton
and
M. S.
Murillo
,
Phys. Rev. E
91
,
033104
(
2015
).
35.
C.
Gabriel
,
S.
Gabriel
, and
E.
Corthout
,
Phys. Med. Biol.
41
,
2231
(
1996
).
36.
T. P.
Doerr
and
Y. K.
Yu
,
Am. J. Phys.
82
,
460
(
2014
).
37.
F.
Prins
,
A. J.
Goodman
, and
W. A.
Tisdale
,
Nano Lett.
14
,
6087
(
2014
).
38.

The codes are written in high performance Python using the optimizing compiler Numba which compiles Python code to machine code giving similar performance to C and Fortran. More details about Numba can be found here: http://numba.pydata.org. In the future the code will be publicly available under the name Sarkas.

39.
J.
Kolafa
and
J. W.
Perram
,
Mol. Simul.
9
,
351
(
1992
).
40.
T.
Komatsu
,
N.
Yoshii
,
S.
Miura
, and
S.
Okazaki
,
Fluid Phase Equilib.
226
,
345
(
2004
).
41.
J.
Sarnthein
,
A.
Pasquarello
, and
R.
Car
,
Science
275
,
1925
(
1997
).
42.
W.
Jin
,
P.
Vashishta
,
R. K.
Kalia
, and
J. P.
Rino
,
Phys. Rev. B
48
,
9359
(
1993
).
43.
E. G.
Brandt
and
O.
Edholm
,
Biophys. J.
96
,
1828
(
2009
).
44.
M. S.
Murillo
,
Phys. Rev. E
81
,
036403
(
2010
).
45.
S. A.
Khrapak
,
B.
Klumov
,
L.
Couëdel
, and
H. M.
Thomas
,
Phys. Plasmas
23
,
023702
(
2016
).
46.
A.
Diaw
and
M. S.
Murillo
,
Phys. Rev. E
92
,
013107
(
2015
).
47.
T.
Saigo
and
S.
Hamaguchi
,
Phys. Plasmas
9
,
1210
(
2002
).
48.
W.
Daughton
,
M. S.
Murillo
, and
L.
Thode
,
Phys. Rev. E
61
,
2129
(
2000
).
49.
J. S.
Simonoff
,
Smoothing Methods in Statistics
(
Springer
,
New York
,
1996
).
50.
T.
Hastie
,
R.
Tibshirani
, and
J.
Friedman
,
The Elements of Statistical Learning
(
Springer
,
New York
,
2009
).
51.
M. W. C.
Dharma-Wardana
,
Phys. Rev. E
86
,
036407
(
2012
).
52.
V.
Ballenegger
,
J. J.
Cerda
, and
C.
Holm
,
Comput. Phys. Commun.
182
,
1919
(
2011
).
53.
A.
Neelov
and
C.
Holm
,
J. Chem. Phys.
132
,
234103
(
2010
).
54.
F.
Nestler
,
Front. Phys.
4
,
28
(
2016
).