Since the seminal studies of Thomas and Fermi, researchers in the Density-Functional Theory (DFT) community are searching for accurate electron density functionals. Arguably, the toughest functional to approximate is the noninteracting kinetic energy, Ts[ρ], the subject of this work. The typical paradigm is to first approximate the energy functional and then take its functional derivative, , yielding a potential that can be used in orbital-free DFT or subsystem DFT simulations. Here, this paradigm is challenged by constructing the potential from the second-functional derivative via functional integration. A new nonlocal functional for Ts[ρ] is prescribed [which we dub Mi-Genova-Pavanello (MGP)] having a density independent kernel. MGP is constructed to satisfy three exact conditions: (1) a nonzero “Kinetic electron” arising from a nonzero exchange hole; (2) the second functional derivative must reduce to the inverse Lindhard function in the limit of homogenous densities; (3) the potential is derived from functional integration of the second functional derivative. Pilot calculations show that MGP is capable of reproducing accurate equilibrium volumes, bulk moduli, total energy, and electron densities for metallic (body-centered cubic, face-centered cubic) and semiconducting (crystal diamond) phases of silicon as well as of III-V semiconductors. The MGP functional is found to be numerically stable typically reaching self-consistency within 12 iterations of a truncated Newton minimization algorithm. MGP’s computational cost and memory requirements are low and comparable to the Wang-Teter nonlocal functional or any generalized gradient approximation functional.
I. INTRODUCTION
A. Orbital-free and Kohn–Sham DFT
Since the seminal work of Kohn and Sham,1 the energy functional of the electron density can be written in terms of the noninteracting kinetic energy (KE) functional, Ts[ρ], and the associated exchange-correlation (xc) functional, Exc[ρ], as follows:
The nuclear-nuclear interaction, ENN, is density independent. The electron-nuclear interaction, EeN[ρ] = ∫(r)ρ(r)dr, is linear in the density, and the other terms [for closed-shell systems and introducing the Kohn–Sham (KS) orbitals, {ϕi[ρ](r)}, including the yet unknown exchange–correlation (xc) energy density, εxc[ρ](r)] take the form
There are two different and theoretically equivalent ways to find the electron density associated with a given external potential. One prescribes the direct minimization of the density-functional theory (DFT) Lagrangian,
which is known as Orbital-Free DFT (OF-DFT). The other way (according to Kohn and Sham1) prescribes solving the following Schrödinger equation:
B. OF-DFT: The kinetic energy functional conundrum
If one decides to carry out the density search according to the minimization of the DFT Lagrangian in Eq. (5), then there is no need to invoke the concept of Kohn–Sham (KS) orbitals. Instead, the electron density, ρ(r), can be utilized as the only variational parameter. The drawback is that in addition to approximations needed for the xc functional, in OF-DFT, approximations for the kinetic energy density functional (KEDF), Ts[ρ], are also needed.
Local and semilocal (i.e., dependent on the value of the electron density and its gradient at a point in space) parametrizations of KEDFs have shown potential2–4 and displayed a more than satisfactory agreement with KS-DFT for simulations of warm dense matter.5 Recent work by the Della Sala group6 advocated for employing the Laplacian in addition to the gradient in the formulation of Ts[ρ] and the possibility of reproducing accurate kinetic energy densities.7
Nonlocal versions of Ts[ρ], such as the Wang-Govind-Carter (WGC),8 the one proposed by Perrot,9 the latest Huang-Carter (HC),10 and others,11 typically improve over local and semilocal functionals,10,12–16 and all share the following form:
where is the Thomas-Fermi kinetic energy,17,18 is the gradient correction according to von Weizsäcker,19 and is the remaining contribution which is by construction of nonlocal character. The general form of TNL is
where the function is commonly called “kernel of the nonlocal KEDF” and α and β are the suitable exponents.
Unfortunately, the current state of the art for OF-DFT is that, although the algorithms have dramatically improved over the years and are typically orders of magnitude faster than KS-DFT, their application to semiconductors and molecular systems is still problematic.10,12,13,15 For example, WGC8,15,20 and Wang-Teter (WT)21 KEDFs can reproduce remarkably accurate bulk properties for light metallic systems, but are less accurate for semiconductors and nonsimple metals. The HC KEDF10 produces generally reliable bulk properties for semiconductors. However, it underestimates the electron density in the bonding regions of crystal diamond (CD) silicon and III-V semiconductors, it is less accurate than WT for metals, and it is substantially more computationally expensive than any other KEDFs.10,22 Density decomposition schemes that combine nonlocal with other functionals have found applicability to molecules, materials, and alloys12,22,23 and thus appear to be an interesting avenue of research.
In this work, we aim at formulating a KEDF that is (1) at least as accurate as WGC and WT KEDFs for nonsimple metals such as the body-centered cubic (BCC) and face-centered cubic (FCC) phases of silicon; (2) at least as accurate as the HC KEDF in predicting bulk properties of CD silicon and III-V semiconductors, especially in regard to the self-consistent electron density in bonding regions; and (3) computationally inexpensive.
II. THEORETICAL BACKGROUND
It is known that the exact Ts functional satisfies a number of mathematical relations also known as exact conditions. It is arguable24 that the more exact conditions the functional satisfies, the closer the functional necessarily is to the exact functional. It is through the imposition of exact conditions that the Trickey group developed generalized gradient approximation (GGA) KE functionals approaching the accuracy of KS-DFT in the limit of high temperature.5 By imposing exact conditions on the asymptotic behavior of the linear response function, the Carter group formulated the HC KEDF which could be applied to systems with record inhomogenous density for OF-DFT simulations.10
Let us enumerate three exact conditions that we believe are particularly important: (1) existence of the “Kinetic electron,”25 (2) hypercorrelation, i.e., potentials are related by functional integration to their respective second functional derivatives, and (3) the Lindhard response in the Free Electron Gas (FEG) limit.
A. The Kinetic electron
The first unconstrained functional derivative of the DFT Lagrangian with respect to the density yields the Euler equation of DFT.26 Namely,
By taking the negative of the Laplacian of each term and rearranging, . After solving the associated Poisson equations for each term in the long-range limit,27 we find
where ρN is the charge density of the nuclei (ions) and ρxc is related to the exchange-correlation hole. At long ranges, ρxc is identical to the xc hole.27 Thus, at long ranges, is the kinetic energy equivalent of the xc hole. However, upon simplification of the 4π terms and integration over all space, we find that
where the integration of the negative of the xc hole is 1, the integration of the electron density is Ne, and the integration of the nuclear density is NN, and the total charge of the system is defined as NN − Ne.
We define as the Kinetic electron, as for neutral systems, it integrates to +1 in contrast to the xc hole which integrates to −1.
B. Hypercorrelation via a line integral
It is possible to recover an entire functional from the corresponding functional derivative using the following line integral:28,29
where we have chosen the linear interpolation path which we describe below.
As the potential is the functional derivative , the result of the integration is path-independent. The scaled density, ρt, can then have almost any form,28 the simplest of which is given by a linear interpolation between the vacuum and the density,
Thus, the t-integral equation reduces to
The previous integral defines the energy density in terms of the hypercorrelated potential,
Hypercorrelation is a term coined by Burke30 to emphasize the fact that the energy density is directly related to the physical (correlated) potential evaluated for an array of electron densities ranging from the vacuum to the true density.
The above equation can be considered yet another exact constraint any functional (including the kinetic energy) should satisfy.
Hypercorrelation does not only concern the energy density but also the potential. That is, the potential must be related to the second functional derivative by
It is important to notice that in the definition of the scaled second functional derivative on the rhs, ρt appears only on one of the two functional derivatives.
We should remark that due to the derivative discontinuity,31 the line integration in the above equations will cross discontinuities in the integrands when integrating the kernel to yield a potential and also when integrating the potential to obtain an energy value. These discontinuities are integrable and do not pose formal problems. Equation (17) may worry the reader because, in principle, terms proportional to the Dirac delta function in real space could appear in the second functional derivative and could lead to density linear terms in the potential and density quadratic terms in the functional. This of course is not possible in theory due to the known density scaling relations of the exact KEDF functional.32 In practice, this also does not occur because the chosen second functional derivative (vide infra) is such that these terms do not appear. Similar arguments lead to the realization that density linear terms should not appear in the KEDF.
C. FEG response
An exact condition for a KE functional is its relationship to the linear response function of the FEG. There is an established relation between the KE functional and the linear response function.33–35 Taking a functional derivative of each term in the Euler equation (9),
and the response function is given by . Unfortunately, the above relationship is only valid at self-consistency, and trying to impose it directly would lead to impractical algorithms. However, for the FEG, we have a simplified relationship, χs(r, r′) = χs(|r − r′|), resulting in a one-dimensional Fourier transform, χLind(η), the Lindhard function,36,37
with , and the Fermi wavevector is given by , with .
Thus, in the limit of a constant and periodic electron density, we must impose
where stands for the Fourier transform in q, the conjugate variable of |r − r′|. This exact condition is at the core of the original Hohenberg and Kohn functional35 which inspired the entire class of nonlocal KE functionals, well summarized in Ref. 33. The function GLind(η) is introduced here as
III. KINETIC ENERGY FUNCTIONAL BY INTEGRATION
A. Imposing the response of the free electron gas through hypercorrelation
We cast our developments in terms of Eq. (8). Particularly, in the following, we will only focus on the nonlocal term of the KEDF. We begin by espousing the idea that in the limit of modeling the Free Electron Gas (FEG), a nonlocal KEDF should recover the exact FEG linear response function. This can be imposed by hypercorrelation, Eq. (16). Namely,
where will be dependent on ρt and consequently on t. Applying a local density approximation (LDA) on the polynomial terms of Eq. (20), can be rewritten as
where, following the prescriptions of Eqs. (7), (19), and (20), the Fourier transform of GNL(|r − r′|) is taken to be
In the above, we have substituted ρ0 → tρ0, which gives the following Fermi wavevector , and the reciprocal space variable is .
Equation (23) is derived from Eq. (22) in the following two steps: First, the GNL term depending on the reciprocal space variable η is represented in terms of its conjugate real-space variable |r − r′|. In a second step, the Fermi wavevector multiplying GNL has a dependence that can be split into and, after symmetric LDA approximation, leads to the ρ(r)−1/6·ρ(r′)−1/6 dependence of the second functional derivative.
Once again, we point out that the integral above should cross discontinuities in the integrand as the density scaling parameter, t, is varied. This does not directly affect the integral above because the Lindhard function is continuous across all values of t.
Finally, the nonlocal kinetic potential can be obtained by
where the kernel in reciprocal ωT(q) is obtained from integration by parts,
The new kernel, , is given by the kernel of the WT KEDF21 plus a correction term. We should point out that this correction term derived from assuming that the inverse Lindhard function is dependent on the electron density (this allowed us to include in the line integral the average density ρ0). Thus, we expect in Eq. (27) to be the most appropriate for substantially nonuniform densities. For light (simple) metallic systems, the distribution of electron density is very close to the uniform electron gas, and thus the WT kernel is already very good for these systems. We will see in our pilot calculations that for more complex systems, such as (nonsimple) metallic phases of silicon, IV, and III-V semiconductors, the correction term in conjunction with an additional correction arising from the presence of the “Kinetic electron” remarkably improves upon the performance of WT.
We also note that in going from Eqs. (22) to (23), we have assumed that the inverse Lindhard function is a function of t multiplied by ρ0. It is arguable that symmetry properties of the second functional derivative should be imposed. In the supplementary material, we carry out a detailed analysis of symmetrization of the kernel. There, we propose an arithmetic symmetrization as well as a geometric symmetrization of the kernel. We note that in either cases, the resulting kernels yield results that are equivalent to the ones obtained with the kernel in Eq. (27) (vide infra).
B. Imposing a nonzero Kinetic electron
Equation (10) implies that the KE potential in reciprocal space is given by
where the tilde indicates the quantities which have been Fourier transformed in reciprocal space (the Fourier transform is kept one dimensional for the sake of simplicity). Unfortunately, the potential derived by Eq. (26) is zero in the low q limit (or at long ranges in real space). That is,
and with that the Kinetic electron is absent.
Thus, to impose a nonzero Kinetic electron, we decided to model it with a simple Gaussian multiplied by the square of the error function centered at q = 0 in order to remove the Coulomb singularity. This results in a new kernel, ωMGP, obtained simply by modifying the expression for the kernel in Eq. (27). Namely,
Although modeling the Kinetic electron by a simple Gaussian function is a gross approximation, in the future, we commit to explore more accurate expressions inspired by the established work of others.38,39 We are aware that the exchange hole (and thus also the Kinetic electron) is generally not spherical,40 and thus in the future, we will also consider nonspherical parametrizations. In Fig. 1, four different kernels are plotted. It is clear that the presence of the Kinetic electron imposed through Eq. (30) affects dramatically the low-q limit of the kernel. The kernel resulting from our manipulations is smooth, which will result in smooth nonlocal kinetic energy potentials.
Comparison of four kernels: ω(q) on the y-axis and q on the x-axis in atomic units. MGP(0) is the kernel from Eq. (30) without a Kinetic electron. MGP(0.2) and MGP(1.0) are the kernels with the Kinetic electron parameters a = b = 0.2 and a = b = 1.0, respectively. WT is the Wang–Teter kernel (Thomas–Fermi and von Weizsäcker kernels removed from the inverse Lindhard function). All kernels are evaluated for ρ0 = 0.03 e−/bohr3 (yielding a value of kF ∼ 1) which is a typical value for valence electrons in bulk systems.
Comparison of four kernels: ω(q) on the y-axis and q on the x-axis in atomic units. MGP(0) is the kernel from Eq. (30) without a Kinetic electron. MGP(0.2) and MGP(1.0) are the kernels with the Kinetic electron parameters a = b = 0.2 and a = b = 1.0, respectively. WT is the Wang–Teter kernel (Thomas–Fermi and von Weizsäcker kernels removed from the inverse Lindhard function). All kernels are evaluated for ρ0 = 0.03 e−/bohr3 (yielding a value of kF ∼ 1) which is a typical value for valence electrons in bulk systems.
To understand the effect of the Kinetic electron, let us consider a very narrow Gaussian (b → +∞). This results in a spatially extended real-space Kinetic electron. This is the preferred shape for materials with a finite gap, in which the dielectric screening is small. The opposite case (b → 0) is preferred for metals, as the dielectric screening is large and the real-space spatial extension of the Kinetic electron is reduced. Thus, we expect that small b values are suitable to model metallic systems and large b values are suitable to model semiconductors. These observations are consistent with the finding41–44 (also exploited by Huang and Carter10) that, for q → 0, the static response function for systems with a gap, χ → −Constant × q2. An in-depth analysis of the relation between the low-q behavior of χ and the long-range properties of the potential will be the subject of a follow-up study.
C. Introducing the MGP functional: Details of the implementation
We implemented the new functional resulting from the kernel in Eq. (30) according to the framework outlined in Eq. (26). The Mi-Genova-Pavanello (MGP) potential and energy functional are defined as
where indicates the inverse Fourier transform and (r) + (r) is the local part of the kinetic energy potential composed of the sum of Thomas–Fermi and von Weizsäcker terms. To avoid spurious dependencies from the parameter a, we choose such boundary conditions as ωMGP(q = 0) = 0.
The simplicity of the above equation results from double integration of the kernel in Eq. (23). A similar expression would be recovered for the arithmetically symmetrized kernel, while a more complex expression involving a double integration of the inverse Lindhard function would be required for the geometrically symmetrized kernel.
We mention here that the so-called LQ and HQ functionals11 (their potentials are constructed ad hoc to exactly reproduce the high-q or the low-q limits) feature a nonlocal kinetic energy that is also evaluated with a line integral similar to Eq. (32). The line integral in Ref. 11, however, is carried out only for the nonhomogeneous component of the density [i.e., ρ(r) − ρ0].
IV. COMPUTATIONAL DETAILS
The OF-DFT calculations are carried out with both modified versions of Ab initio orbiTaL-free density functionAl theory Software (ATLAS)45,46 and PRinceton Orbital-Free Electronic Structure Software (PROFESS) 3.0.47 Calculations involving the HC KEDF are performed with PROFESS 3.0, and all others are carried out in ATLAS and double checked with PROFESS 3.0 for consistency. The benchmark KS-DFT calculations are carried out with the same local pseudopotentials as the OF-DFT simulations. We employed the optimal effective local pseudopotentials (OEPP)48 and the bulk-derived local pseudopotentials (BLPS) of Huang and Carter.49 KS-DFT calculations with OEPP are performed with Quantum-ESPRESSO (QE),50 while the ones employing BLPS are carried out with ABINIT.51 In all calculations, the LDA xc functional by Perdew and Zunger52 is adopted.
Crystal diamond (CD), body-centered cubic (BCC), and face-centered cubic (FCC) silicon phases, as well as nine III-V cubic zincblende (ZB) semiconductors, are selected as benchmark systems. We compute total energy curves as a function of the lattice constant and extract the cell volume, V0, the minimum energy, E0, and the bulk modulus, B0. These were computed using the prescription of Huang and Carter,10 fitting the energy curves vs volume against Murnaghan’s equation of state53 within 0.95 V0 to 1.05 V0.
In all OF-DFT calculations, the grid spacing of 0.2 Å in ATLAS and kinetic energy cutoffs 1600 eV in PROFESS 3.0 are sufficient for well-converged total energies (1 meV/cell). In KS-DFT simulations, for bulk properties, the 20 × 20 × 20 k-point meshes and 800 eV kinetic energy cutoffs are used. To obtain smooth electron density, a denser grid (larger kinetic energy cutoffs) is adopted for both OF-DFT and KS-DFT calculations to keep the real space electron density as represented on 54 × 54 × 54 for ZB GaAs and 36 × 36 × 36 for CD silicon.
V. RESULTS AND DISCUSSION
We have benchmarked MGP against KS-DFT and compared it to WT, WGC, and HC KEDFs for metallic and semiconducting phases of Si, as well as the ZB phase of nine II-V semiconductors. In the following, we will only compare the so-called “optimal” parameters for all KEDFs, including MGP. We should remark that the optimal parameters are pseudopotential dependent. Thus, we optimize all KEDF parameters accordingly. With exception of WT, all other KEDFs have two adjustable parameters: MGP has a and b, defining the Kinetic electron term; WGC has the effective electron density, ρ*, and the averaged Fermi wavevector parameter, γ; and HC has β and λ, defining the long-range portion of the functional.
A. Bulk properties for both metallic and semiconductor phases of silicon
The optimal parameters for MGP functionals along with the optimal γ and ρ* for WGC, optimal β and λ for HC functionals, and calculated quantities for all KEDFs are collected in Table I.
Equilibrium volume (V0 in bohr3), minimum energy (E0 in eV/atom), and bulk modulus (B0 in GPa) for metallic phases (BCC, FCC) and the semiconductor phase (CD) of silicon computed by OF-DFT/BLPS and OF-DFT/OEPP using the MGP, WGC, WT, and HC KEDFs with optimal choices of their parameters. The corresponding benchmark KS-DFT/BLPS and KS-DFT/OEPP results are also given. HC KEDF values are taken from Ref. 10. WGC’s optimal parameters are taken from Refs. 22 and 15.
Systems . | Functionals . | Pseudopotentials . | Parameters . | V0 (a.u./Cell) . | E0 (eV) . | B0 (GPa) . |
---|---|---|---|---|---|---|
FCC | KS | OEPP | … | 118.2 | −107.761 | 71 |
MGP | OEPP | a = 0.5,b = 0.3 | 119.5 | −107.765 | 73 | |
WGC | OEPP | γ = 2.2, ρ* = ρ0 | 112.7 | −107.733 | 113 | |
WGC | OEPP | γ = 1.0, ρ* = 1.05ρ0 | 115.9 | −107.790 | 98 | |
KS | BLPS | … | 97.0 | −109.248 | 83 | |
MGP | BLPS | a = 0.6,b = 0.4 | 97.6 | −109.243 | 75 | |
WT | BLPS | … | 97.5 | −109.260 | 58 | |
WGC | BLPS | γ = 2.2, ρ* = ρ0 | 94.4 | −109.258 | 96 | |
BCC | KS | OEPP | … | 234.7 | −215.490 | 74 |
MGP | OEPP | a = 0.66,b = 0.38 | 235.7 | −215.492 | 89 | |
WGC | OEPP | γ = 2.2, ρ* = 1.05ρ0 | 228.9 | −215.504 | 106 | |
WGC | OEPP | γ = 1.0, ρ* = 1.05ρ0 | 230.5 | −215.521 | 101 | |
KS | BLPS | … | 197.1 | −218.556 | 98 | |
MGP | BLPS | a = 0.94,b = 0.6 | 205.4 | −218.553 | 97 | |
WT | BLPS | … | 204.3 | −218.666 | 66 | |
WGC | BLPS | γ = 2.2, ρ* = 1.05ρ0 | 196.9 | −218.553 | 106 | |
CD | KS | OEPP | … | 309.3 | −215.538 | 61 |
MGP | OEPP | a = 0.341,b = 0.45 | 310.0 | −215.534 | 54 | |
WGC | OEPP | γ = 4.2, ρ* = 1.05ρ0 | 352.2 | −215.512 | 34 | |
WGC | OEPP | γ = 5.0, ρ* = 1.05ρ0 | 332.7 | −214.974 | 44 | |
KS | BLPS | … | 266.9 | −219.258 | 98 | |
MGP | BLPS | a = 0.364,b = 0.57 | 265.6 | −219.258 | 95 | |
HC | BLPS | λ = 0.01, β = 0.65 | 269.4 | −219.248 | 97 | |
WT | BLPS | … | … | … | … | |
WGC | BLPS | γ = 4.2, ρ* = 1.05ρ0 | 287.4 | −218.838 | 64 | |
MAE | V0 (%) | E0 (meV) | B0 (GPa) | |||
MGP | 1.2 | 3 | 6 | |||
WGC | 4.8 | 125 | 25 |
Systems . | Functionals . | Pseudopotentials . | Parameters . | V0 (a.u./Cell) . | E0 (eV) . | B0 (GPa) . |
---|---|---|---|---|---|---|
FCC | KS | OEPP | … | 118.2 | −107.761 | 71 |
MGP | OEPP | a = 0.5,b = 0.3 | 119.5 | −107.765 | 73 | |
WGC | OEPP | γ = 2.2, ρ* = ρ0 | 112.7 | −107.733 | 113 | |
WGC | OEPP | γ = 1.0, ρ* = 1.05ρ0 | 115.9 | −107.790 | 98 | |
KS | BLPS | … | 97.0 | −109.248 | 83 | |
MGP | BLPS | a = 0.6,b = 0.4 | 97.6 | −109.243 | 75 | |
WT | BLPS | … | 97.5 | −109.260 | 58 | |
WGC | BLPS | γ = 2.2, ρ* = ρ0 | 94.4 | −109.258 | 96 | |
BCC | KS | OEPP | … | 234.7 | −215.490 | 74 |
MGP | OEPP | a = 0.66,b = 0.38 | 235.7 | −215.492 | 89 | |
WGC | OEPP | γ = 2.2, ρ* = 1.05ρ0 | 228.9 | −215.504 | 106 | |
WGC | OEPP | γ = 1.0, ρ* = 1.05ρ0 | 230.5 | −215.521 | 101 | |
KS | BLPS | … | 197.1 | −218.556 | 98 | |
MGP | BLPS | a = 0.94,b = 0.6 | 205.4 | −218.553 | 97 | |
WT | BLPS | … | 204.3 | −218.666 | 66 | |
WGC | BLPS | γ = 2.2, ρ* = 1.05ρ0 | 196.9 | −218.553 | 106 | |
CD | KS | OEPP | … | 309.3 | −215.538 | 61 |
MGP | OEPP | a = 0.341,b = 0.45 | 310.0 | −215.534 | 54 | |
WGC | OEPP | γ = 4.2, ρ* = 1.05ρ0 | 352.2 | −215.512 | 34 | |
WGC | OEPP | γ = 5.0, ρ* = 1.05ρ0 | 332.7 | −214.974 | 44 | |
KS | BLPS | … | 266.9 | −219.258 | 98 | |
MGP | BLPS | a = 0.364,b = 0.57 | 265.6 | −219.258 | 95 | |
HC | BLPS | λ = 0.01, β = 0.65 | 269.4 | −219.248 | 97 | |
WT | BLPS | … | … | … | … | |
WGC | BLPS | γ = 4.2, ρ* = 1.05ρ0 | 287.4 | −218.838 | 64 | |
MAE | V0 (%) | E0 (meV) | B0 (GPa) | |||
MGP | 1.2 | 3 | 6 | |||
WGC | 4.8 | 125 | 25 |
Since BCC and FCC structures of Si are metallic, we expect that WGC KEDF produces more or comparably accurate bulk properties compared to HC KEDFs. Thus, for these metallic phases, we just compare MGP results to WGC, WT, and the benchmark KS-DFT results. Comparing to KS-DFT, MGP reproduces bulk properties overall improving on WT and WGC KEDFs with both OEPP and BLPS. This is especially the case for the total MGP energies which deviate from KS-DFT by less than 5 meV/cell in all cases.
Modeling CD Si with OF-DFT has historically been a challenge which has been addressed by several nonlocal functionals with a density-dependent kernel.8,10,20,22 For example,15 WT is unable to reproduce a bound curve for CD Si. As MGP functional has a density-independent kernel, we had no expectations that CD Si would be modeled correctly. Table I, however, shows that MGP is capable of producing bound energy curves and overall bulk properties that are close to the KS-DFT benchmark compared to other KEDFs. To the best of our knowledge, MGP is the only KEDF with density-independent kernel capable of simultaneously reproducing KS-DFT equilibrium total energies, bulk moduli, and equilibrium volumes for both metallic and semiconductor phases of silicon.
We also tested a modified version of MGP only including WT and the Kinetic electron term. We found that the modified functional could not reproduce the benchmark despite efforts in optimizing the a and b parameters. To give an example, we have tried to optimize parameters a and b for the CD phase of silicon and found the best values to yield a reasonable V0, E0 = −217.427 eV, and B0 = 126 GPa. These values are severely deviated from the KS-DFT benchmark. Thus further analysis of the WT+KE model is unnecessary. This indicates that, to achieve accurate results, it is not enough to simply add the correction terms separately. Instead, the correction term derived from functional integration and the Kinetic electron must be included together for MGP to approach both metallic and semiconducting phases.
B. Benchmarks for III-V semiconductors
In this section, we present the benchmark MGP against KS-DFT for binary III-V semiconductors in the ZB phase. These are challenging systems for KEDFs. For example, it was shown10,12,15 that WGC is not appropriate and that the proper long-range behavior of the kernel needs to be included. We have developed MGP with semiconductors and finite systems in mind, and specifically, the Kinetic electron term aims at correcting the KEDF kernel in its long-range (low q) behavior.
Table II lists the performance of MGP compared to HC as well as KS-DFT. The results indicate that both MGP and HC functionals can reproduce accurate equilibrium volume and bulk moduli. The predicted equilibrium volumes with MGP and HC lie within 2% of the KS-DFT results for all considered semiconductors. For the bulk moduli, MGP and HC are within 10 GPa compared to KS-DFT. MGP total equilibrium energies are within 5 meV/cell to the KS-DFT benchmark. This improves upon HC which finds itself within 42 meV/cell. Overall, however, the quality of the calculated MGP energies is similar to HC for the considered systems.
Equilibrium volume (V0 in Bohr3), minimum energy (E0 in eV/atom), and bulk modulus (B0 in GPa) for various ZB III-V semiconductors computed within OFDFT/BLPS by the MGP functional and HC functional with optimal choices of the parameters. The benchmark KSDFT/BLPS results are also given. KS-DFT/BLPS and OF-DFT/BLPS with HC functional values are taken from Ref. 10.
. | . | . | V0 . | E0 . | B0 . |
---|---|---|---|---|---|
Systems . | Functional . | Parameters . | (a.u./Cell) . | (eV) . | (GPa) . |
GaAs | KS | 274.2 | −235.799 | 75 | |
HC | λ = 0.0130, β = 0.783 | 275.3 | −235.782 | 81 | |
MGP | a = 0.434, b = 0.524 | 275.2 | −235.801 | 75 | |
GaSb | KS | 354.2 | −209.697 | 56 | |
HC | λ = 0.0100, β = 0.720 | 355.5 | −209.739 | 58 | |
MGP | a = 0.341, b = 0.752 | 360.5 | −209.696 | 49 | |
GaP | KS | 254.0 | −243.079 | 80 | |
HC | λ = 0.0100, β = 0.791 | 255.0 | −243.057 | 87 | |
MGP | a = 0.405, b = 0.394 | 252.5 | −243.077 | 82 | |
AlAs | KS | 294.3 | −232.908 | 80 | |
HC | λ = 0.0125, β = 0.825 | 301.6 | −232.912 | 76 | |
MGP | a = 0.396, b = 0.424 | 296.2 | −232.903 | 76 | |
AlP | KS | 274.2 | −240.182 | 90 | |
HC | λ = 0.0120, β = 0.845 | 272.8 | −240.199 | 91 | |
MGP | a = 0.378, b = 0.319 | 273.0 | −240.180 | 81 | |
AlSb | KS | 382.0 | −206.606 | 60 | |
HC | λ = 0.0120, β = 0.750 | 377.9 | −206.588 | 61 | |
MGP | a = 0.359, b = 0.832 | 383.9 | −206.606 | 55 | |
InP | KS | 310.7 | −235.722 | 73 | |
HC | λ = 0.0120, β = 0.885 | 309.4 | −235.696 | 66 | |
MGP | a = 0.394, b = 0.351 | 308.2 | −235.724 | 62 | |
InAs | KS | 331.5 | −228.537 | 65 | |
HC | λ = 0.0142, β = 0.875 | 334.0 | −228.523 | 63 | |
MGP | a = 0.408, b = 0.444 | 330.1 | −228.532 | 61 | |
InSb | KS | 424.5 | −202.387 | 50 | |
HC | λ = 0.0120, β = 0.810 | 424.1 | −202.381 | 49 | |
MGP | a = 0.361, b = 0.850 | 426.9 | −202.386 | 46 | |
V0 | E0 | B0 | |||
MEA | (%) | (meV) | (GPa) | ||
HC | 0.72 | 18.9 | 3.4 | ||
MGP | 0.68 | 2.4 | 5.1 |
. | . | . | V0 . | E0 . | B0 . |
---|---|---|---|---|---|
Systems . | Functional . | Parameters . | (a.u./Cell) . | (eV) . | (GPa) . |
GaAs | KS | 274.2 | −235.799 | 75 | |
HC | λ = 0.0130, β = 0.783 | 275.3 | −235.782 | 81 | |
MGP | a = 0.434, b = 0.524 | 275.2 | −235.801 | 75 | |
GaSb | KS | 354.2 | −209.697 | 56 | |
HC | λ = 0.0100, β = 0.720 | 355.5 | −209.739 | 58 | |
MGP | a = 0.341, b = 0.752 | 360.5 | −209.696 | 49 | |
GaP | KS | 254.0 | −243.079 | 80 | |
HC | λ = 0.0100, β = 0.791 | 255.0 | −243.057 | 87 | |
MGP | a = 0.405, b = 0.394 | 252.5 | −243.077 | 82 | |
AlAs | KS | 294.3 | −232.908 | 80 | |
HC | λ = 0.0125, β = 0.825 | 301.6 | −232.912 | 76 | |
MGP | a = 0.396, b = 0.424 | 296.2 | −232.903 | 76 | |
AlP | KS | 274.2 | −240.182 | 90 | |
HC | λ = 0.0120, β = 0.845 | 272.8 | −240.199 | 91 | |
MGP | a = 0.378, b = 0.319 | 273.0 | −240.180 | 81 | |
AlSb | KS | 382.0 | −206.606 | 60 | |
HC | λ = 0.0120, β = 0.750 | 377.9 | −206.588 | 61 | |
MGP | a = 0.359, b = 0.832 | 383.9 | −206.606 | 55 | |
InP | KS | 310.7 | −235.722 | 73 | |
HC | λ = 0.0120, β = 0.885 | 309.4 | −235.696 | 66 | |
MGP | a = 0.394, b = 0.351 | 308.2 | −235.724 | 62 | |
InAs | KS | 331.5 | −228.537 | 65 | |
HC | λ = 0.0142, β = 0.875 | 334.0 | −228.523 | 63 | |
MGP | a = 0.408, b = 0.444 | 330.1 | −228.532 | 61 | |
InSb | KS | 424.5 | −202.387 | 50 | |
HC | λ = 0.0120, β = 0.810 | 424.1 | −202.381 | 49 | |
MGP | a = 0.361, b = 0.850 | 426.9 | −202.386 | 46 | |
V0 | E0 | B0 | |||
MEA | (%) | (meV) | (GPa) | ||
HC | 0.72 | 18.9 | 3.4 | ||
MGP | 0.68 | 2.4 | 5.1 |
C. Analysis of the electron density
We have also evaluated MGP by comparing the ground state electron densities produced by OF-DFT with MGP and HC KEDFs, as well as KS-DFT for CD Si and ZB GaAs. Once again, we employ optimal parameters for the energy functionals. Figures 2 and 3 display the electron density distribution along the [111] direction for CD silicon and ZB GaAs, respectively. Inspection of the figures reveals that the MGP electron density reproduces KS-DFT quite well. While in both cases HC tends to underestimate the density in the bonding region, MGP is much closer to KS-DFT. Conversely, in the space between non-covalently bonded atoms, HC slightly underestimates the KS-DFT density, while MGP slightly overestimates it.
The electron density of CD silicon along the [111] direction. Purple line: KS-DFT. Green line: OF-DFT with the HC functional. Blue line: OF-DFT with the MGP functional. Vertical axis represents the electron density in 1/bohr3; horizontal axis represents the grid points, and the position of the silicon atoms is indicated with a “Si” label in the graph.
The electron density of CD silicon along the [111] direction. Purple line: KS-DFT. Green line: OF-DFT with the HC functional. Blue line: OF-DFT with the MGP functional. Vertical axis represents the electron density in 1/bohr3; horizontal axis represents the grid points, and the position of the silicon atoms is indicated with a “Si” label in the graph.
The electron density of ZB GaAs along the [111] direction. Purple line: KS-DFT. Green line: OF-DFT with the HC functional. Blue line: OF-DFT with the MGP functional. Electron densities are in atomic units; horizontal axis represents the grid points, and the position of the silicon atoms is indicated with a “Si” label in the graph.
The electron density of ZB GaAs along the [111] direction. Purple line: KS-DFT. Green line: OF-DFT with the HC functional. Blue line: OF-DFT with the MGP functional. Electron densities are in atomic units; horizontal axis represents the grid points, and the position of the silicon atoms is indicated with a “Si” label in the graph.
We have tested several functionals for their performance in reproducing KS-DFT electron densities for these systems and found that WT does a remarkably good job. In this respect, because MGP is based on WT, it inherits some of its good traits.
D. Computational efficiency
To evaluate the computational efficiency of MGP, we compare the computational cost for optimizing the electron density in CD Si (2 atoms), 2 × 2 × 2 (16 atoms), 4 × 4 × 4 (128 atoms), and 8 × 8 × 8 (1024 atoms) supercells with WGC, WT, and MGP KEDFs, respectively. All calculations are performed on a single thread with PROFESS employing the BLPS using a large kinetic energy cutoff (4000 eV). For all KEDFs, we employ optimal parameters (WGC: γ = 4.2, ρ* = ρ0; WT: α = β = 5/6; MGP: a = 0.364 and b = 0.570). Since the computational cost of HC KEDF is much higher than that of other functionals (about 1000 times higher than MGP for CD Si),10,22 we exclude it in the comparison. In principle, the only difference in the computational cost between MGP and WT KEDFs is the initial kernel building step in reciprocal space.
As shown in Table III, the wall time of the computing MGP kernel is about 200 times larger than a single call for the evaluation of the potential. Fortunately, the kernel is evaluated only once at the beginning of the simulation. We recall that the computational cost in the initial MGP kernel is strictly linear scaling with system size instead of quasilinear [i.e., N ln(N) scaling with N being the number of real-space grid points] in the evaluation of the potential. As a result, for small systems, the total wall time for optimizing electron density with MGP KEDFs is always between WT and WGC’s wall times. Increasing the number of grid points (or the size of the system), the wall time should approach WT (although this should be system dependent).
Total computational wall time, number of truncated Newton (TN) optimization steps, total number of calls for evaluation of potentials, wall time per call in evaluation of the total potential, and wall time per call in evaluation of kinetic potential with the WGC, MGP, and WT KEDFs. The wall time for evaluation of the MGP kernel is also listed.
. | KEDFs . | 2 atoms . | 16 atoms . | 128 atoms . | 1024 atoms . |
---|---|---|---|---|---|
Total wall time (s) | WGC | 8.7 | 87.1 | 830.4 | 6264.2 |
MGP | 8.9 | 76.4 | 648.4 | 5486.4 | |
WT | 5.0 | 51.5 | 456.8 | 4342.3 | |
Number of optimized steps | WGC | 10 | 10 | 11 | 12 |
MGP | 8 | 8 | 10 | 11 | |
WT | 7 | 8 | 9 | 11 | |
Number of calls for evaluation potentials | WGC | 248 | 253 | 280 | 282 |
MGP | 191 | 212 | 224 | 228 | |
WT | 173 | 207 | 219 | 236 | |
Wall time of the total potential (s/call) | WGC | 0.03 | 0.32 | 2.81 | 21.27 |
MGP | 0.01 | 0.16 | 1.42 | 12.64 | |
WT | 0.02 | 0.23 | 1.97 | 17.3 | |
Wall time of the kinetic potential (s/call) | WGC | 0.03 | 0.27 | 2.37 | 17.88 |
MGP | 0.01 | 0.12 | 1.02 | 9.00 | |
WT | 0.02 | 0.19 | 1.55 | 13.71 | |
Wall time of the evaluation kernel(s) | MGP | 4.9 | 37.8 | 299.9 | 2383.3 |
. | KEDFs . | 2 atoms . | 16 atoms . | 128 atoms . | 1024 atoms . |
---|---|---|---|---|---|
Total wall time (s) | WGC | 8.7 | 87.1 | 830.4 | 6264.2 |
MGP | 8.9 | 76.4 | 648.4 | 5486.4 | |
WT | 5.0 | 51.5 | 456.8 | 4342.3 | |
Number of optimized steps | WGC | 10 | 10 | 11 | 12 |
MGP | 8 | 8 | 10 | 11 | |
WT | 7 | 8 | 9 | 11 | |
Number of calls for evaluation potentials | WGC | 248 | 253 | 280 | 282 |
MGP | 191 | 212 | 224 | 228 | |
WT | 173 | 207 | 219 | 236 | |
Wall time of the total potential (s/call) | WGC | 0.03 | 0.32 | 2.81 | 21.27 |
MGP | 0.01 | 0.16 | 1.42 | 12.64 | |
WT | 0.02 | 0.23 | 1.97 | 17.3 | |
Wall time of the kinetic potential (s/call) | WGC | 0.03 | 0.27 | 2.37 | 17.88 |
MGP | 0.01 | 0.12 | 1.02 | 9.00 | |
WT | 0.02 | 0.19 | 1.55 | 13.71 | |
Wall time of the evaluation kernel(s) | MGP | 4.9 | 37.8 | 299.9 | 2383.3 |
Among WGC, MGP, and WT KEDFs, WGC is the most expensive KEDF in terms of total wall time. Additionally, we find that MGP KEDFs converge in all cases considered with 12 iterations of a truncated Newton (TN) minimization. By contrast, WGC KEDF is numerically unstable in some cases (this is a well-known limitation of this functional10,22).
In summary, the computational efficiency of MGP closely resembles WT’s, and both MGP and WT are more computationally efficient than WGC.
VI. CONCLUSIONS
We have formulated and implemented MGP, a new nonlocal kinetic energy functional with a density independent kernel. In MGP, the inverse response function of the FEG is functional-integrated to yield a new kernel. In a second step, the kernel is augmented by a “Kinetic electron” which is opposite to the exchange hole.
Our pilot calculations show that MGP improves dramatically over currently available nonlocal functionals with density independent kernels and performs even better than the best available functionals (featuring density dependent kernels) for both metallic and semiconducting phases of Si as well as the ZB phases of nine common III-V semiconductors. Although the results presented here are quite encouraging, we should take them with a grain of salt. In this work, we have only considered bulk systems, and the lingering question is, can MGP deliver similar quality results for finite systems or systems with vacancies? Initial tests of MGP applied to finite systems (isolated clusters) are quite encouraging and will be the subject of a follow-up study.
In light of recent debates criticizing the use of the total electronic energy as the only descriptor relevant for optimizing a density functional,54 we have also inspected the ability of MGP to reproduce electron density distributions. We find that MGP performs quite remarkably in this respect. For example, MGP’s densities are much closer to KS-DFT in bonding regions for CD silicon and ZB GaAs compared to the current state-of-the-art HC functional. Interestingly, we find that although for semiconductors and nonsimple metals, WT is unable to reproduce observables related to the energy; it delivers quite accurate electron densities. As MGP is derived from WT, this is probably the reason why MGP delivers accurate densities.
Finally, the benchmarks for computational cost indicate that our new KEDF is almost as computationally efficient as WT which is of similar scaling as GGA functionals.
Our analysis shows that there is room for improvement, particularly for two aspects of the MGP functional. First, the Kinetic electron can be better parametrized following prescriptions that have been very successful in formulating exchange functionals in real space.38,55 Second, there exist formulations of “jellium with gap” models for the Lindhard function56 which have been recently (and successfully) applied to the formulation of GGA KEDF.7 These can be extended to the line-integral formulation of MGP. Third, a density-dependent kernel version of MGP can be constructed to satisfy additional exact conditions, such as density and coordinate scaling relations.32
SUPPLEMENTARY MATERIAL
See supplementary material for an analysis of different ways of symmetrizing the kernel with respect to the spatial coordinates and the respective electron density dependence.
ACKNOWLEDGMENTS
This material is based upon work supported by the National Science Foundation under Grant No. CHE-1553993. We thank Emily Carter and her group members for insightful discussions.
REFERENCES
The name “kinetic electron” is chosen here to stress that it represents a charge density opposite to the “exchange hole.” It is a particle/hole relationship, in which the electron plays the role of the opposite of the hole.