Ab initio and semiempirical electronic structure methods are usually implemented in separate software packages or use entirely different code paths. As a result, it can be time-consuming to transfer an established ab initio electronic structure scheme to a semiempirical Hamiltonian. We present an approach to unify ab initio and semiempirical electronic structure code paths based on a separation of the wavefunction ansatz and the needed matrix representations of operators. With this separation, the Hamiltonian can refer to either an ab initio or semiempirical treatment of the resulting integrals. We built a semiempirical integral library and interfaced it to the GPU-accelerated electronic structure code TeraChem. Equivalency between ab initio and semiempirical tight-binding Hamiltonian terms is assigned according to their dependence on the one-electron density matrix. The new library provides semiempirical equivalents of the Hamiltonian matrix and gradient intermediates, corresponding to those provided by the ab initio integral library. This enables the straightforward combination of semiempirical Hamiltonians with the full pre-existing ground and excited state functionality of the ab initio electronic structure code. We demonstrate the capability of this approach by combining the extended tight-binding method GFN1-xTB with both spin-restricted ensemble-referenced Kohn–Sham and complete active space methods. We also present a highly efficient GPU implementation of the semiempirical Mulliken-approximated Fock exchange. The additional computational cost for this term becomes negligible even on consumer-grade GPUs, enabling Mulliken-approximated exchange in tight-binding methods for essentially no additional cost.

Numerous methodological developments as well as the increased availability of high-performance computing resources have contributed to computational chemistry becoming an integral discipline among the modern chemical sciences.1–3 A large number of different ab initio as well as semiempirical electronic structure codes are now in widespread use. A combination of these two electronic structure realms typically takes place in multi-level simulation workflows, e.g., to explore the chemical4,5 and conformer space6,7 or in free energy calculations.8,9 While some quantum chemistry programs are “general purpose”, many software packages have a certain application-specific focus with respect to the implemented methods, and a semiempirical equivalent to an ab initio method is generally not available in the same software package. Although many quantum chemistry software packages include functionality for both ab initio and semiempirical Hamiltonians,10–18 these packages typically provide support for semiempirical methods using separate code paths with restricted functionality. Hence, if an electronic structure method is intended to be used in combination with a different Hamiltonian, such as a semiempirical Hamiltonian, a reimplementation of the respective electronic structure routine is typically required. A unified implementation would have numerous advantages: (a) it reduces the implementation effort to develop semiempirical variants of ab initio methods; (b) multi-level electronic structure workflows become naturally accessible since the identical code path guarantees the same functionality for ab initio and semiempirical Hamiltonians; and (c) due to their lower cost, semiempirical Hamiltonians can be used to efficiently test new electronic structure implementations (e.g., when analytic and numerical gradients are compared).

In this work, we present the implementation of a novel semiempirical integral library into the GPU-accelerated electronic structure code TeraChem. We will explain in detail how the modular setup of TeraChem and the new SQMBox library enable the straightforward extension to semiempirical Hamiltonians. The benefits of this approach are demonstrated by combining TeraChem’s spin-restricted ensemble-referenced Kohn–Sham (REKS) method with the semiempirical tight-binding Hamiltonian GFN1-xTB. For complete compatibility with the ab initio code, we add Mulliken-approximated19 Fock exchange to this semiempirical library. The inclusion of this term was previously reported to increase the computational cost of semiempirical methods by almost an order of magnitude.20 We show how to efficiently compute semiempirical exchange by utilizing an earlier presented refactorization,21 reuse of stored intermediates, and GPU acceleration. We also generalize the exchange algorithm to encompass non-identical and non-symmetric density matrices and present formulas for the efficient evaluation of these terms in the supplementary material.

We first outline the key implementation aspects of electronic structure methods in TeraChem, followed by a recap of the extended tight-binding methods GFN1-xTB and GFN2-xTB. We then describe in more detail how we develop the Mulliken approximation-based semiempirical integrals by decomposing the individual xTB Hamiltonian elements according to their order with respect to the density matrix. Finally, we demonstrate the performance of our GPU-accelerated Mulliken-approximated Fock exchange implementation and show exploratory results for the GFN1-xTB Hamiltonian in combination with REKS and complete active space (CAS) electronic structure methods, demonstrating that gradients and nonadiabatic coupling elements are readily available through the newly established setup.

In a seminal paper by Almlöf and co-workers, the superiority of an integral-direct scheme for the electron repulsion integrals (ERIs) in self-consistent field (SCF) procedures was demonstrated.22 Here, the fourth-order tensors (μν|κλ) are recomputed whenever required and contracted immediately with the density matrix. This is in contrast to “conventional” integral handling, where the integrals are precomputed and stored. Due to its lower memory requirements, the integral-direct scheme is applicable to large molecular systems. Thus, it has become the accepted standard for the computation of ERIs and is implemented in the majority of ab initio electronic structure codes. However, most programs, specifically in post-SCF calculations, further benefit from efficient integral approximations that are performed in a somewhat semi-direct scheme. Popular variants include the resolution-of-the-identity (RI)23,24 and Cholesky decomposition (CD)25 methods. Both schemes factorize the four-index tensor as follows:

(1)

Here, μ, ν, κ, and λ denote the atomic orbitals, while P is an auxiliary index corresponding to the auxiliary or Cholesky basis. The necessary contractions with the atomic orbital (AO) density matrix (or orbital coefficients24,26) are then carried out on these three-index tensors. This typically reduces the formal scaling of the calculation. For optimum performance, a lower dimensional intermediate (i.e., a second- or third-order tensor) can be precomputed and stored. Such integral handling schemes have found particular use in post-SCF calculations, where they reduce the formal scaling of ERI calculations on the molecular orbital (MO) basis. Consequently, the integral handling in many quantum chemistry codes depends, at least in part, on the electronic structure calculation to be performed.

TeraChem, a GPU-accelerated ab initio electronic structure code, approaches this problem differently.27,28 In TeraChem, ERIs are always evaluated in a fully AO integral-direct manner in both SCF and post-SCF methods. This approach is rooted in the availability of a very efficient AO integral library (called “IntBox,” see Fig. 1). Its efficiency stems from a highly optimized GPU code that exploits primitive pair screening as well as the architecture specifics of common consumer-grade GPUs. As a consequence, every electronic structure method in TeraChem is formulated in terms of the basic quantities (i.e., Fock matrix intermediates such as Coulomb and exchange operators) that are provided by the IntBox library. Implemented electronic structure methods directly “inherit” excellent performance by leveraging this integral library. Effectively, this approach creates a clean separation of the electronic Schrödinger equation. While the wavefunction ansatz is encoded in the electronic structure routines inside the main TeraChem program, the Hamiltonian represented in the Gaussian-type orbital (GTO) basis is evaluated in the IntBox library.

FIG. 1.

Schematic workflow for an arbitrary electronic structure method in TeraChem. The main code passes density matrices D (or transition and difference densities δD or Z) to the IntWrapper, which passes them to the ab initio integral library (IntBox) or the semiempirical library (SQMBox), depending on the run mode. These compute the essential electronic structure quantities, which are passed back to the electronic structure routine. The latter are the components that are familiar from building the Fock matrix, i.e., for ab initio, these are the kinetic energy matrix T, nuclear-electron Coulomb matrix Vne, electron–electron Coulomb matrix J, and electron–electron exchange matrix K. In the semiempirical run mode, these are replaced by their semiempirical equivalents (HEHT, Ṽne, J̃, and K̃) provided by SQMBox.

FIG. 1.

Schematic workflow for an arbitrary electronic structure method in TeraChem. The main code passes density matrices D (or transition and difference densities δD or Z) to the IntWrapper, which passes them to the ab initio integral library (IntBox) or the semiempirical library (SQMBox), depending on the run mode. These compute the essential electronic structure quantities, which are passed back to the electronic structure routine. The latter are the components that are familiar from building the Fock matrix, i.e., for ab initio, these are the kinetic energy matrix T, nuclear-electron Coulomb matrix Vne, electron–electron Coulomb matrix J, and electron–electron exchange matrix K. In the semiempirical run mode, these are replaced by their semiempirical equivalents (HEHT, Ṽne, J̃, and K̃) provided by SQMBox.

Close modal

In general, the energy of a single-determinant electronic wavefunction expanded in an orbital basis can be formulated in terms of energy contributions that depend on the one-particle density matrix D,

(2)

Here, σ, τ = α, β denote the electron spin and μ, ν, κ, and λ denote the contracted GTOs. The prefactors cK and cXC are method-specific scaling factors of the Fock exchange and the density-functional exchange-correlation energy. For the ab initio methods implemented in TeraChem, the IntBox library provides the Fock matrix terms given in brackets in the last line, as well as the nuclear gradients for the corresponding energy terms given in the first line (the XC term is handled separately in the TeraChem code and is not part of the IntBox library due to the grid dependence and nonlinear potential in the density). The same intermediates found in the last line of Eq. (2) can also be used to compute multi-determinant wavefunctions. The most naive way would be to sum up the respective terms for individual determinant contributions, but in practice, it is often convenient to generate the two-electron integral tensor in the molecular orbital (MO) basis from transformed J and K intermediates.29 

Originally, the electronic structure code made direct calls to the ab initio integral library. To achieve a more modular workflow in which the electronic structure procedure is truly agnostic to the explicit form of the underlying Hamiltonian, we have now set-up a wrapper routine (“IntWrapper”), which communicates with the electronic structure code and the respective integral library at the backend. Schematically, this workflow is shown in Fig. 1. The electronic structure code passes a transition or particle density matrix to IntWrapper and obtains a basic quantity (e.g., a J matrix) in return. Depending on the run mode, the received basic quantity has been computed at the backend by the ab initio (IntBox) or the semiempirical (SQMBox) integral library. In this way, unmodified electronic structure routines can carry out ab initio and semiempirical electronic structure methods using the same code path. Once the respective quantities [cf. the last line in Eq. (2) and gradients] for a semiempirical Hamiltonian are available and coded up, the entire functionality of the existing electronic structure code becomes available. This is the approach we follow in this work, and the details are given in Sec. II B.

We developed a semiempirical integral library (SQMBox), written in C/C++ and CUDA, and interfaced with the quantum chemistry software TeraChem. It is interfaced to the main code via a C interface, i.e., only standard C variable types and pointers are passed between the main program and the library. These design decisions ensure that SQMBox can be easily interfaced to other quantum chemistry codes. We take the semiempirical tight-binding Hamiltonians GFN1-xTB (originally GFN-xTB) and GFN2-xTB as blueprints to develop the library. In contrast to other popular semiempirical methods (see Ref. 30 for an overview), these tight-binding methods are formulated in terms of GTOs, allowing us to leverage existing routines (the overlap matrix and its gradient) from TeraChem. The parameters for these Hamiltonians are currently hard-coded in the SQMBox library, but user-specified parameters could be read from an external parameter file. The starting point for SQMBox development is the extended tight-binding energy expressions previously given,31,32 which we briefly review here. The energy expression for either of these Hamiltonians can be formulated in a concise way as follows:

(3)

Here, n = 1, 2 refer to the GFN1-xTB and GFN2-xTB Hamiltonians, respectively. The first term in Eq. (3) refers to a purely classical short-range repulsion function. The dispersion (Edisp) is evaluated by means of the D3 model33,34 in GFN1-xTB and by means of a self-consistent D4 model35,36 in GFN2-xTB. The covalent bond formation is mainly driven by the third term, which is the extended Hückel-type (EHT) energy. In GFN1- and GFN2-xTB, EEHT carries most of the element-specific parameters. The last term describes the electrostatic and exchange-correlation effects in both xTB methods. Most of the terms in Eq. (3) are similar in form for GFN1-xTB and GFN2-xTB. However, major differences are found in EES+XC. In GFN2-xTB, this takes the following form:

(4)

Here, the shell l of an atom denotes the summed quantity (here the electron population) of the atomic orbitals κ of a specific principal quantum number and angular momentum. The Δql are the shell charges and are given as the differences between the actual and reference electron population of the shell,

(5)

The first term in Eq. (4) refers to the isotropic electrostatic interaction energy, often denoted as Eγ, which results from the second-order tight-binding expansion in terms of density fluctuations. The sum goes over all shells in the system, which are distinguished by their atom index as well as their principal and angular momentum quantum numbers. The second term in Eq. (4) originates from the third-order term in the TB expansion. In GFN2-XTB, this is restricted to same-shell contributions. In GFN1-xTB, this term is evaluated in terms of atomic partitioning. The last five terms are absent in GFN1-xTB and refer to the multipole electrostatic terms and XC terms, which are considered up to second order in GFN2-xTB. All these terms are based on cumulative atomic multipole moments,37 which are obtained analogously to Eq. (5) using atom-centered multipole moment integrals instead of overlap integrals. We refer the interested reader to the original publication,31 and for the purpose of this paper, we simply mention that the dependence on the density matrix is the same in the isotropic and anisotropic terms. Hence, we restrict our discussion to the isotropic Mulliken-approximated integral terms.

To obtain a semiempirical integral library, the individual energy terms in the xTB methods are grouped according to their dependence on the electron density matrix D. Next, these terms are juxtaposed to the respective ab initio terms (see Fig. 2). The energy terms with no dependence on D already in the tight-binding formulation are the repulsion and D3 dispersion. These are entirely classical and do not depend on the electronic density matrix. The energy terms with a linear dependence on D contribute to the electronic energy, but do not require a self-consistent solution. In the tight-binding energy expression, this directly applies to the extended Hückel-type energy, which can be handled equivalently to the kinetic energy in ab initio theories (see the top entry in Fig. 2).

FIG. 2.

Juxtaposition of the energy/Kohn–Sham–Fock matrix elements occurring in ab initio and semiempirical extended tight-binding theory (based on GFN2-xTB Hamiltonian terms without multipole contributions). The XC energy and nuclear repulsion energy are also shown along with their tight-binding equivalents.

FIG. 2.

Juxtaposition of the energy/Kohn–Sham–Fock matrix elements occurring in ab initio and semiempirical extended tight-binding theory (based on GFN2-xTB Hamiltonian terms without multipole contributions). The XC energy and nuclear repulsion energy are also shown along with their tight-binding equivalents.

Close modal

The second- and third-order tight-binding terms, however, can be decomposed into terms that are of the same or lower order in the density matrix. We will first look at the second-order term: by inserting Eq. (5) into the first term on the right-hand side (rhs) of Eq. (4) and collecting all terms with no dependence on D, we obtain the reference density Coulomb interaction, which is purely classical and solely depending on ql,0. This plain ql,0 electrostatic term is treated like the nuclear repulsion energy in ab initio theories and denoted as Ẽnn in the following (see ρ0ρ0 interaction in Fig. 2). Furthermore, we obtain a term where the density matrix-dependent electronic charges ql are interacting with the reference charges ql,0. This term can be treated equivalently to the nuclear-electron interaction integrals in ab initio methods (denoted as Ẽnein the following). In case of GFN2-xTB, a multipole component (arising from EESqμ and EESqΘ due to ql,0-dipole/quadrupole interactions) contributes to this term as well.

The electrostatic interactions between ql correspond to a Mulliken-approximated interelectronic Coulomb interaction (J),

(6)

In our implementation, the Mataga–Nishimoto–Ohno–Klopman-type38–40 Coulomb interaction terms are generalized to allow for range-separation as used in the context of exchange (see below),

(7)

RAB is the interatomic distance and ηll’ is the average of the effective chemical hardness of the two shells involved. Note that GFN1- and GFN2-xTB both use a shell-wise isotropic electrostatic treatment but use different averaging schemes for the hardness (harmonic and arithmetic, respectively). In Eq. (7), the cFR and cLR are the full- and long-range scaling coefficients and ω is a range-separation parameter analogous to range-separated hybrid density functional approximations. To the best of our knowledge, this is the first suggestion to introduce range separation in the context of empirical xTB treatment of the Coulomb interaction. The second term in Eq. (7) is motivated by the fact that the ω parameter is of inverse distance (bohr−1) units. Hence, the second term in Eq. (7) can be regarded as a modified harmonic weighting and may be a better approximation than the error function scaling proposed in Ref. 21, as it still gives the maximum magnitude for the Coulomb interaction at zero distance.

The ql,0-independent multipole electrostatic and the XC contributions, which occur in GFN2-xTB, are quadratic in D and also contribute to the interelectronic electrostatic energy ẼJ in our implementation. Decomposing the third-order tight binding terms leads to energy terms that range from zeroth to third order in the electron density. The decomposed terms read as

(8)
(9)
(10)
(11)

Equation (8) is of zeroth order in the electron density and just an atomic (shell-wise), geometry-invariant constant. We add this contribution to the classical energy Ẽnn to be in agreement with the energies obtained with the original xTB implementation.41 The first order term in Eq. (9) is added to the semiempirical equivalent of nuclear–electron interaction (Ẽne), and likewise, the second order term in Eq. (10) is added to ẼJ. The last term, which is the third order in the electron density has no equivalent in ab initio wavefunction theory. We handle this term like the XC energy (with corresponding XC potential and kernel) in first principles Kohn–Sham DFT codes.

We mention at this point that Nishimoto has previously applied a similar decomposition procedure when deriving the TD-DFTB3 equations and also drew similar connections between the tight-binding and ab initio Hamiltonians.42 

It should be emphasized that the semiempirical integral library presented here has its origin in semiempirical tight-binding methods. Hence, integrals are calculated in the nonorthogonal atomic orbital basis set, just like in the ab initio library. That means that the zero-differential overlap (ZDO) approximation43 is not employed, and all Fock matrix quantities are used in the generalized eigenvalue problem,

(12)

All quantities and results presented in this paper refer to this non-ZDO treatment. However, the semiempirical library could be used in combination with the ZDO approximation by setting the overlap matrix to a unit matrix.

Meanwhile, the previously presented xTB Hamiltonians did not include any Fock exchange, the semiempirical integral library presented here does include an implementation of Mulliken-approximated Fock exchange. The matrix to be computed is as follows:

(13)

Our library is set up to use the same hardness parameters in the Mataga–Nishimoto–Ohno–Klopman-type term for Coulomb (J) and exchange (K) [see Eq. (7) for the definition of γ]. In principle, this provides a self-interaction error free treatment within the monopole approximation if Coulomb and exchange enter the Hamiltonian with identical scaling and range-separation parameters. As written in Eq. (13), the evaluation of the Mulliken-approximated Fock exchange would scale as O(N4). Initial implementations of long-range Fock exchange corrected DFTB were found to be considerably slower (by about a factor of 10).20 This computational complexity is one of the reasons why this term is absent in most DFTB methods. However, it is often added when excited states are of interest.21,44–46 Employing the well-known neglect of diatomic differential overlap (NDDO) approximation would bring down the scaling of this term to N2, but that needs to be accompanied with the ZDO of the overlap matrix in Eq. (12), which is incompatible with most DFTB and GFNn-xTB Hamiltonians. Fortunately, the computational complexity of the Mulliken-approximated exchange can be significantly lowered, leading to a form with O(N3) scaling,

(14)

Here, the entire build for exchange contribution to the Fock matrix can be formulated in terms of cubically-scaling matrix–matrix multiplications and quadratically-scaling element-wise products (denoted by °). This allows efficient evaluation of computing architectures like graphics processing units (GPUs). This rearrangement was pointed out earlier by Humeniuk and Mitrić,21 but no GPU implementation has been reported up to this point. We implemented the expression in Eq. (14) efficiently on the CPU as well as on the GPU (single and double precision), making use of existing BLAS and cuBLAS libraries. By storing intermediates, a total of six matrix–matrix multiplications need to be performed. If D is symmetric, i.e., in ground state SCF calculations, Eq. (14) can be further reduced to only four matrix–matrix multiplications, greatly reducing the computational effort. The computational efficiency of evaluating this term on the GPU will be demonstrated in Sec. IV.

The gradient expressions corresponding to the energy/potential terms listed in Fig. 2 and generalized to non-identical and non-symmetric density matrices X, Y, and Z are given as

(15)
(16)
(17)
(18)

Here, αC is the Cartesian direction of nucleus C. The latter equation for the derivative of the density functional can be expressed as a derivative of an exchange-correlation kernel, which is evaluated for the reference density (Y in this case) and contracted with two other (transition) density matrices X and Z. The same routine can then be used for both ground and excited state cases (for the ground state, X = Y = Z = D). Essentially all electronic terms in the semiempirical Hamiltonian make use of the overlap matrix. For optimum efficiency in the gradient calculation, we want to avoid multiple evaluations of overlap gradients. Therefore, we split each of the gradient terms into two components: an overlap or basis function component and a Hamiltonian or Hellmann–Feynman-type component,

(19)

Here, we use the chain rule to differentiate the overlap matrix elements first and then factor out the nuclear derivative of the overlap matrix element. The second term on the right-hand side is the Hellmann–Feynman-type term and includes all partial derivatives with constant overlap. For the first term, this leads to a derivative contribution of the form

(20)

For each energy contribution to the electronic energy, we form the corresponding intermediate and store it in a matrix (denoted WS here). In our implementation, these are added together and then added to the energy-weighted density matrix, which appears in the derivative expression of the orbital orthonormality constraint in both ab initio and semiempirical tight-binding calculations. This ensures that the derivative of the overlap matrix is computed only once per gradient evaluation.

From the above-mentioned description, it should be clear that the semiempirical integral library never computes any overlap integrals or their derivatives but rather obtains overlap integrals on input and returns energy/potential terms and corresponding WS matrices or gradients as output. Given that the evaluation of overlap integrals and their gradients is standard in electronic structure codes, these are then evaluated by the integral library of the electronic structure program, in our case IntBox/TeraChem. This has a practical implication: up to this point, our semiempirical integral library setup is agnostic to the type of basis function employed, as long as they are centered on the individual atoms. In TeraChem, we employ Gaussian-type orbitals (GTOs), but also Slater-type orbitals (STOs) or less standard functions like polynomial “ramp” functions47 could, in principle, be employed.

1. Multipole gradient terms in GFN2-xTB-type Hamiltonians

The GFN2-xTB Hamiltonian contains energy terms that depend on the electric dipole moment and quadrupole moment integrals. For variational ground state solutions, the computation of nuclear derivatives has been described in the Supplementary Material of Ref. 31. In our general semiempirical integral library, different and non-symmetric density matrices (X and Y) may enter the corresponding energy expression. Just like in the overlap case, we collect the corresponding intermediates for the dipole and quadrupole moment integrals,

(21)
(22)

Here, Mβ and Mβγ denote the Cartesian dipole and quadrupole moment integrals, with components along the β and γ axes, respectively. The extra index A indicates that these are evaluated with the origin at atom A (with μA; see Ref. 31 for details). For this reason, the individual W matrices for multipole integrals are non-symmetric with respect to exchanging the indices μν. The evaluation of the multipole gradients in this fashion is certainly non-standard in electronic structure packages, and a GTO-based routine has been added to the TeraChem integral library.

To demonstrate the advantages of our approach, we combine the GFN1-xTB Hamiltonian with the two-state interaction two-state-averaged spin-restricted ensemble-referenced Kohn–Sham method with two active electrons and orbitals, denoted SI2-SA2-REKS(2,2) in the following.48–50 Here, the ground and first excited states are obtained as solutions from the state interaction Hamiltonian,

(23)

Here, the EREKS(2,2) state is the energy of the perfectly spin-paired configuration, and EOSS is the energy of the open-shell singlet configuration. The coupling parameter ∆rs is related to the Lagrange multiplier for the active orbital pair r and s as well as to their fractional occupation number difference. The orbitals used in SI2-SA2-REKS(2,2) are obtained by minimizing the SA2-REKS(2,2) energy expression,

(24)

Here, the sum goes over all possible Slater determinants that can be generated in the (2,2) active space, and cL are their weights in the ensemble. We note that GFN1- and GFN2-xTB have been specifically parameterized for the purpose of obtaining molecular geometries and describing noncovalent interactions, hence we do not expect these parameters to be ideal for excited state properties. Nevertheless, we refrain from modifying the semiempirical parameters since our goal here is to demonstrate how easily the semiempirical Hamiltonians can be used with ab initio electronic structure methods. In the future, it will be important to optimize the semiempirical parameters.

Adding Fock exchange is crucial to improve the description of excited states in both density functional and density functional tight-binding (DFTB) Hamiltonians. In the latter, it is typically added in a range-separated form approaching 100% Fock exchange at long distances.20,21 We augment the GFN1-xTB′ Hamiltonian (i.e., GFN1-xTB without third-order term) with long-range Mulliken-approximated Fock exchange such that the energy becomes

(25)

Here, the damped and range-separated Coulomb terms are from Eq. (7). We will use ω = 0.4 a.u. (as well as cFR = 0 and cLR = 1) in the following (see Sec. III). Following Ref. 21 (but at variance with Ref. 20), we add the exchange directly to the Hamiltonian (as opposed to using the difference density). This is motivated by the fact that, though the GFN1-xTB Hamiltonian has been fitted empirically, it is formally based on the density fluctuation expansion of a functional without Fock exchange. It is noted for completeness that implementing a difference density formulation is possible by decomposing the exchange contributions into reference and fluctuation contributions, similar to what we did for the second- and third-order tight binding terms (see above). The corresponding reference contributions would then also be added to Ẽnn and Ẽne, respectively.

We noticed that the long-range correction alone is not sufficient for a reasonable description of the electronic states near intersections (see Sec. IV). This is in agreement with findings for SI2-SA2-REKS(2,2) in combination with long-range corrected DFTB (LC-DFTB).51 In that study, the authors resorted to spin-polarized DFTB Hamiltonians with corresponding parameters. Therein, an atomic on-site correction proportional to the spin density matrix is added, which leads to an improved description of the branching plane near minimum energy conical intersections (MECIs). In this work, we suggest a parameter-economic scheme that is inspired by spin-polarized DFTB but in line with our setup. We use the ab initio integral library in TeraChem to compute the atomic on-site ab initioK contributions and add them to the Hamiltonian. The corresponding energy then becomes

(26)

For the purpose of this paper, the scaling factor cK = 0.05 is chosen based on benchmarks for vertical excitation energies (see Sec. III). But in future efforts, which should involve full reparameterization, this scaling factor can be treated it as an element-specific parameter as well. A schematic of the Fock exchange contributions added to the GFN1-xTB Hamiltonian is given in Fig. 3.

FIG. 3.

Exchange contributions to the semiempirical Fock matrix in the LC-GFN1-xTB′(Kat) Hamiltonian. While, the long-range Mulliken-approximated Fock exchange is added globally, the on-site ab initio contributions only affect the Fock matrix elements belonging to a specific atom. This treatment resembles the spin density handling in spin-polarized DFTB.67 

FIG. 3.

Exchange contributions to the semiempirical Fock matrix in the LC-GFN1-xTB′(Kat) Hamiltonian. While, the long-range Mulliken-approximated Fock exchange is added globally, the on-site ab initio contributions only affect the Fock matrix elements belonging to a specific atom. This treatment resembles the spin density handling in spin-polarized DFTB.67 

Close modal

We combine this Hamiltonian with the SI2-SA2-REKS(2,2) approach, which is described in detail elsewhere.49,50 Using this method, we compute the ground and first excited electronic states, their nuclear gradients, and nonadiabatic coupling vectors using the existing implementation in TeraChem50 but with SQMBox semiempirical integrals (cf. Fig. 1). Finally, we also show that we can use the GFN1-xTB-type Hamiltonians in combination with other excited state wavefunction approaches, such as CASCI, CASSCF, RPA, TDA, and hh-TDA, demonstrating the strength of our approach while revealing limitations of the current GFN1-xTB parametrization for excited states.

We added the semiempirical integral library presented in this work to the current development version of the GPU-accelerated electronic structure program TeraChem.27,28 All calculations presented in this work were carried out with this development version. We implemented the GFN1-xTB32 and GFN2-xTB31 Hamiltonian expressions into our semiempirical integral library, following the decomposition strategy described earlier. For now, we have implemented GFN2-xTB without the self-consistent D4 dispersion model35,36 but rather in combination with the D3(BJ)33,34 London dispersion correction and corresponding Axilrod–Teller–Muto correction, which had already been used in the original parametrization process (damping parameters: a1 = 0.52, a2 = 5.00, s8 = 2.70, s9 = 5.00, all other parameters remain unchanged).31 We will add support for self-consistent D4 dispersion at a later stage and note here that, due to the exponential dependence on the density matrix, this term will lead to contributions to the XC energy/potential equivalent in the semiempirical integral library presented here.

We tested our implementation first by comparing energies and gradients computed for ground state SCF solutions with the GFN1-xTB and GFN2-xTB Hamiltonians against the standalone xtb code.41 The implementation of Mulliken-approximated Fock exchange to SQMBox and the compatibility of excited state workflows were tested by comparison of analytic and numerical gradients. Among the considered electronic structure routines were configuration interaction singles (CIS), random-phase approximation (RPA), complete active space configuration interaction (CASCI) and self-consistent field (CASSCF), and spin-restricted ensemble-referenced Kohn–Sham (REKS) theory in combination with GFN1-xTB and GFN2-xTB Hamiltonians, with the restriction that third-order terms [see Eqs. (8)(11)] of the tight binding expansion have been neglected up to this point and will be added for post-SCF/response-type schemes in the future. Detailed derivations and expressions for the individual Fock matrix quantities and gradients are given in the supplementary material.

In the computational performance assessment of the Mulliken-approximated Fock exchange, we performed GFN1-xTB SCF calculations augmented with 100% of the semiempirical Mulliken-approximated full-range Fock exchange (cFR = 1 and cLR = 0). Fullerene systems were taken from Ref. 52, and larger systems were constructed as assemblies of those. The considered geometries are given in the supplementary material and range from C100 to a dimer of C240@C720. The reported timings in Fig. 4 are the timings for a single K build, averaged over the different SCF cycles. An incremental Fock build in terms of different density matrices was used in the SCF.53 This allowed us to design an efficient mixed precision GPU-based scheme that is detailed in Sec. IV A, while the numerical precision assessment is shown in the supplementary material (Table S1 and Fig. S2). We note that the third-order term of the GFN1-xTB Hamiltonian has a nonlinear potential in the density matrix, hence, its mean-field potential is built using the full density matrix, similar to the XC potential in Kohn–Sham DFT calculations.54 

FIG. 4.

Average time for the computation of the Mulliken-approximated exchange matrix K̃ in the framework of GFN1-xTB for differently sized fullerene systems. The calculations were performed with the TeraChem program on a 3.33 GHz Intel Xeon CPU X5680 with a single GeForce GTX 970 card. The Intel® math kernel library (MKL) acceleration with four cores (green solid) in the dgemm calls was used in the CPU code. The time for the density matrix construction is shown for comparison (black thin line) and is dominated by the diagonalization of the Fock matrix (using the Intel® MKL dsyevd routine on four CPU cores). The double precision and single precision GPU variants both use the respective cuBLAS routines in the matrix–matrix multiplications [see Eq. (14) and the supplementary material for implementation details]. In the hybrid CPU/mixed precision GPU scheme (blue dotted line), the evaluation was performed on the GPU, if nAO > 300 + 100 × nCPU, with nCPU = 4 in our case and only the very first K̃ build of the incremental SCF procedure was performed in double precision, while the rest was performed in the mixed precision scheme (see text and Eq. (64) in the supplementary material).

FIG. 4.

Average time for the computation of the Mulliken-approximated exchange matrix K̃ in the framework of GFN1-xTB for differently sized fullerene systems. The calculations were performed with the TeraChem program on a 3.33 GHz Intel Xeon CPU X5680 with a single GeForce GTX 970 card. The Intel® math kernel library (MKL) acceleration with four cores (green solid) in the dgemm calls was used in the CPU code. The time for the density matrix construction is shown for comparison (black thin line) and is dominated by the diagonalization of the Fock matrix (using the Intel® MKL dsyevd routine on four CPU cores). The double precision and single precision GPU variants both use the respective cuBLAS routines in the matrix–matrix multiplications [see Eq. (14) and the supplementary material for implementation details]. In the hybrid CPU/mixed precision GPU scheme (blue dotted line), the evaluation was performed on the GPU, if nAO > 300 + 100 × nCPU, with nCPU = 4 in our case and only the very first K̃ build of the incremental SCF procedure was performed in double precision, while the rest was performed in the mixed precision scheme (see text and Eq. (64) in the supplementary material).

Close modal

To demonstrate the strength of the TeraChem/SQMBox setup in enabling new method combinations, we combine the SI2-SA2-REKS(2,2) approach with the LC-GFN1-xTB′(Kat) Hamiltonian described in Sec. II E. The individual Fock exchange components described in Sec. II are added to the GFN1-xTB Hamiltonian, and we scanned for suitable range-separation and atomic exchange scaling factors ω and cK, respectively. We used the lowest vertical excitation benchmark sets collected in Ref. 55 (systems originally taken from Refs. 56–58) and tested possible parameter combinations ω = 0.1, 0.2, 0.3, 0.4, and 0.5 and cK = 0.005, 0.01, and 0.05, respectively. We selected ω = 0.4 and cK = 0.05 in the following. We then optimized minimum energy conical intersections (MECIs) with this method using a well-established benchmark set.59 It is emphasized that the standard TeraChem architecture (run conical) has been used, which employs DL-FIND (using hybrid delocalized internal coordinates).60 In other words, no further modifications of the code were made after setting up and interfacing with the semiempirical integral library. All structures were converged to an S1/S0 gap of at most 1.5 meV, except for the ethylene (ethylidene) and methyliminium (methylamine) structures, for which the gap could not be reduced below about 0.2 eV (see Fig. 5 for structures and supplementary material for explicit energy values).

FIG. 5.

Minimum energy conical intersection geometries (structures in green) computed with SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat). All structures were optimized such that the S1/S0 gap was smaller than 0.1 mHartree [0.05 Hartree for methyliminium (methylamine) and ethylene (ethylidene)]. The multi-reference singles-doubles configuration interaction (MRCISD) structures (in gray/blue/silver color) and energy differences are taken from Ref. 59 using the 6-31+G** basis set (6-31G** for stilbene and HBI). The relative energies of the MECIs are given with respect to the S0 energy at the Franck–Condon point.

FIG. 5.

Minimum energy conical intersection geometries (structures in green) computed with SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat). All structures were optimized such that the S1/S0 gap was smaller than 0.1 mHartree [0.05 Hartree for methyliminium (methylamine) and ethylene (ethylidene)]. The multi-reference singles-doubles configuration interaction (MRCISD) structures (in gray/blue/silver color) and energy differences are taken from Ref. 59 using the 6-31+G** basis set (6-31G** for stilbene and HBI). The relative energies of the MECIs are given with respect to the S0 energy at the Franck–Condon point.

Close modal

Figure 6 shows the S1/S0 potential energy surfaces around the twisted-pyramidal conical intersection geometry of ethylene with LC-GFN1-xTB′(Kat)/SI2-SA2-REKS(2,2), demonstrating that the topography is properly conical around the MECI. Here, potential energy scans are performed close to the MECI by sampling ±0.002 Å displacements along the branching plane vectors g and h. These are determined as the normalized difference gradient of the two states (g) and as the normalized nonadiabatic coupling vector between the two states (h) after Gram–Schmidt-type orthogonalization against g. To complement this, the GFN1-xTB′ Hamiltonian with 100% Mulliken-approximated Fock exchange [termed GFN1-xTB′(100) in the following] is combined with the floating-occupation molecular orbital complete-active space configuration interaction and the two state-averaged complete active space self-consistent field method, with both methods using a (2,2) active space [termed FOMO-CASCI(2,2) and SA2-CASSCF(2,2), respectively, in the following]. For FOMO-CASCI(2,2), a constant fractional occupation of 0.5 for the active spin-MOs is used.

FIG. 6.

Potential energy surface plot around the twisted-pyramidal MECI of ethylene computed with SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat). The surface is computed on a grid obtained by displacing the MECI geometry along the branching plane vectors g and h. These are computed from the gradient difference (g) and the nonadiabatic coupling (h) vectors between the two states. Both vectors are visualized in the insets.

FIG. 6.

Potential energy surface plot around the twisted-pyramidal MECI of ethylene computed with SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat). The surface is computed on a grid obtained by displacing the MECI geometry along the branching plane vectors g and h. These are computed from the gradient difference (g) and the nonadiabatic coupling (h) vectors between the two states. Both vectors are visualized in the insets.

Close modal

A potential energy surface plot of ethylene using a rigid scan along the torsion and pyramidalization angles is shown to demonstrate the importance of adding Fock exchange (Fig. 7). Here, we used the SI2-SA2-REKS(2,2) approach with the LC-GFN1-xTB′(Kat) and GFN1-xTB′ Hamiltonians.

FIG. 7.

Potential energy surfaces of the S0 ground (red) and S1 excited (blue) states of ethylene computed at the (a) SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat) and the (c) SI2-SA2-REKS(2,2)/GFN1-xTB′ level of theory. The contour plots of the S1 PES from (a) and (c) are given in (b) and (d), respectively. All energies correspond to a rigid scan over the pyramidalization and torsion angles (visualized in the inset). All energies are given relative to the S0 energy at the Franck–Condon point. The point of smallest S0/S1 separation is marked by a red point. For comparison, the estimated location of the S1 minimum (at 5.46 eV relative to the S0 minimum energy) obtained from SA3-XMS-CAS(2,2)PT2 (from Ref. 55) is marked with a light-yellow X.

FIG. 7.

Potential energy surfaces of the S0 ground (red) and S1 excited (blue) states of ethylene computed at the (a) SI2-SA2-REKS(2,2)/LC-GFN1-xTB′(Kat) and the (c) SI2-SA2-REKS(2,2)/GFN1-xTB′ level of theory. The contour plots of the S1 PES from (a) and (c) are given in (b) and (d), respectively. All energies correspond to a rigid scan over the pyramidalization and torsion angles (visualized in the inset). All energies are given relative to the S0 energy at the Franck–Condon point. The point of smallest S0/S1 separation is marked by a red point. For comparison, the estimated location of the S1 minimum (at 5.46 eV relative to the S0 minimum energy) obtained from SA3-XMS-CAS(2,2)PT2 (from Ref. 55) is marked with a light-yellow X.

Close modal

To identify some characteristics of the GFN1-xTB′ Hamiltonians in the context of excited state calculations, we additionally computed the lowest four vertical excitations of ethylene with the time-dependent density functional theory (TD-DFT) or random-phase approximation (RPA),61 its Tamm–Dancoff approximated (TDA) variant,62 as well as the recently presented variant of the holehole Tamm–Dancoff approximation (hh-TDA).55 These were compared to the excited state obtained from SI2-SA2-REKS(2,2). We employ both the Fock exchange-inclusive LC-GFN1-xTB′(Kat) and the exchange-free GFN1-xTB′ Hamiltonians in this context (see Table I).

TABLE I.

Lowest vertical excitation energies of ethylene computed with different electronic structure methods in combination with the LC-GFN1-xTB′(Kat) and the GFN1-xTB′ Hamiltonians (see Secs. II E for details). We consider here the SI2-SA2-REKS(2,2), the time-dependent linear response or random-phase approximation (RPA), its Tamm–Dancoff approximation (TDA), as well as the recently presented hole–hole TDA method (hh-TDA). We use the ethylene minimum geometry from Ref. 58 without re-optimization. Excitation energies are given in eV, and the oscillator strengths are given in parentheses. The state with non-vanishing oscillator strength corresponds to the ππ* state.

LC-GFN1-xTB′(Kat)GFN1-xTB′
ExcitationSI2-SA2-REKS(2,2)RPATDAhh-TDASI2-SA2-REKS(2,2)RPATDAhh-TDA
S0S1 6.6 (0.5) 6.0 (0.0) 6.0 (0.0) 5.1 (0.0) 5.4 (0.4) 6.4 (0.0) 6.4 (0.0) 5.4 (0.4) 
S0S2 ⋯ 6.2 (0.3) 6.3 (0.0) 5.6 (0.0) ⋯ 7.3 (0.0) 7.3 (0.0) 8.0 (0.0) 
S0S3 ⋯ 6.3 (0.0) 6.8 (0.4) 6.3 (0.4) ⋯ 7.6 (0.3) 8.4 (0.5) 8.3 (0.0) 
S0S4 ⋯ 8.0 (0.0) 8.0 (0.0) 6.8 (0.0) ⋯ 8.5 (0.0) 8.5 (0.0) 9.9 (0.0) 
LC-GFN1-xTB′(Kat)GFN1-xTB′
ExcitationSI2-SA2-REKS(2,2)RPATDAhh-TDASI2-SA2-REKS(2,2)RPATDAhh-TDA
S0S1 6.6 (0.5) 6.0 (0.0) 6.0 (0.0) 5.1 (0.0) 5.4 (0.4) 6.4 (0.0) 6.4 (0.0) 5.4 (0.4) 
S0S2 ⋯ 6.2 (0.3) 6.3 (0.0) 5.6 (0.0) ⋯ 7.3 (0.0) 7.3 (0.0) 8.0 (0.0) 
S0S3 ⋯ 6.3 (0.0) 6.8 (0.4) 6.3 (0.4) ⋯ 7.6 (0.3) 8.4 (0.5) 8.3 (0.0) 
S0S4 ⋯ 8.0 (0.0) 8.0 (0.0) 6.8 (0.0) ⋯ 8.5 (0.0) 8.5 (0.0) 9.9 (0.0) 

We first demonstrate the performance for the evaluation of the semiempirical Mulliken-approximated Fock exchange in our implementation. In this context, we show how single-precision arithmetic on GPUs can efficiently be exploited while retaining sufficient numerical precision in the total energies. This is followed by presenting results for calculations that were enabled by the newly presented modular setup: we combine the GFN1-xTB Hamiltonian with the multiconfigurational electronic structure method REKS and showcase this for the optimization of minimum energy crossing point geometries on an established benchmark set. The straightforward combination with CAS-type methods is then shown for the description of the conical intersection geometry of ethylene.

For full generalizability to ab initio codes, a semiempirical Mulliken-approximated Fock exchange has been added to the semiempirical integral library. Here, we assess the computational performance of our implementation in the computation of semiempirical K matrices. In initial implementations of long-range Fock exchange corrected DFTB, this became one of the computational bottlenecks, making LC-DFTB calculations slower by about a factor of 10 compared to DFTB.20 In Fig. 4, we plot the average computational time for the formation of the semiempirical Mulliken-approximated Fock exchange matrix as a function of basis set size. For this purpose, we use the otherwise unmodified GFN1-xTB Hamiltonian with its corresponding basis set but include the Mulliken-approximated Fock exchange (100%), as described in Eq. (14). Fullerenes and assemblies of these ranging from 100 to 1920 atoms are used (see supplementary material for details). We report the time required to form the semiempirical K matrix, averaged over all SCF iterations. Even though we use a highly efficient implementation, which only requires four matrix–matrix multiplications [i.e., O(N3) steps], the computational expense becomes considerable for several thousand basis functions on the CPU (almost 80 s for close to 8000 basis functions using 4 CPU cores). Evaluation on the GPU in double precision significantly reduces the computational cost by at least a factor of two. We mention that we used a GeForce GTX 970 card, i.e., a consumer grade GPU from 2014. Such cards are known to perform dramatically better if they work in single-precision. This is confirmed here, and we find that the single-precision implementation allows us to form the semiempirical K matrix in not even 4 s for the largest system considered with almost 8000 basis functions. We use incremental Fock builds here, so a difference density matrix is used in Eq. (14) for all but the first SCF cycle.53 In this way, only the very first formation of the semiempirical K matrix needs to be performed in double precision, while lower precision is sufficient for the remaining SCF iterations. We use lower precision with a hybrid CPU/mixed precision GPU scheme based on the following decomposition of the overlap matrix:

(27)

Here, we decompose the overlap matrix into a diagonal part (unit matrix I) and an off-diagonal part S̄. In our mixed precision scheme, terms that depend on S̄ are evaluated in single precision. Terms that are proportional to the diagonal part only are equivalent to a quadratically scaling ZDO-type evaluation of K. These are always evaluated on the CPU in double precision [see Eq. (64) in the supplementary material for details]. The latter does not add any significant computational cost to the overall computation time. Based on our numerical findings, we furthermore suggest to evaluate K entirely in double precision on the CPU if nAO < 300 + 100 × nCPU, where the latter is the number of available CPU cores. For larger systems, we switch between double and mixed precision on the GPU, depending on the type of density matrix passed in. If the density matrix is a full particle density matrix, the code runs in double precision, while the mixed precision is used if a difference or transition density matrix is used. This is tested by forming the trace, i.e., the semiempirical K matrix is formed in double precision, if

(28)

where we keep the threshold parameter ϵthr close to zero. This identifies if a full or a difference/transition density is passed in. The extra computational cost of forming the trace, which scales with nAO × nAO, is negligible compared to the overall cubic formation of the Mulliken-approximated K matrix. This dynamic switching in combination with the mixed precision evaluation of K, allows us to exploit the extremely fast evaluation of Mulliken-approximated Fock exchange with consumer-grade GPUs. In combination with an incremental Fock matrix SCF procedure,53 we note only minor numerical differences in the final single point energies due to the formation of the Mulliken-approximated exchange matrix in single precision, which becomes reduced by an order of magnitude by each using double precision in the first iteration and by resorting to the mixed precision scheme (see Table S1 in the supplementary material). With this implementation, the rate-determining computational step in exchange-augmented xTB-type calculations remains the diagonalization of the Hamiltonian matrix (see Fig. 4). With the new algorithm, there is no longer any reason to avoid Mulliken-approximated Fock exchange.

The availability of semiempirical Hamiltonian elements via the semiempirical integral library enables new method combinations, as demonstrated in the following sections. The LC-GFN1-xTB′(Kat) Hamiltonian is combined with the REKS method and applied in the optimization of minimum energy conical intersections. We also show that the GFN1-xTB′ Hamiltonian with a 100% Mulliken-approximated Fock exchange can be straightforwardly combined with the CASCI and CASSCF methods. These are then used, along with the aforementioned REKS scheme, in the description of the minimum energy conical intersection region. Not only energies and gradients, but also the analytic derivative coupling implementations in TeraChem are now accessible with semiempirical Hamiltonians through the established framework. In the context of ethylene, we point out peculiarities related to the purpose-specific parametrization of the GFN1-xTB Hamiltonian, using additional electronic structure methods for further reference in the computation of the ππ* excited state energy. Finally, we will point out the importance of Fock exchange (on-site and long-range) in the description of the excited state potential energy surface with REKS.

1. Minimum energy conical intersection geometry optimizations

We combine the two-state interaction two-state-averaged spin-restricted ensemble-averaged Kohn–Sham DFT method [SI2-SA2-REKS(2,2)]48 with the semiempirical LC-GFN1-xTB′(Kat) Hamiltonian (see Sec. II E). In Fig. 5, S1/S0 MECI geometries that are optimized at this level of theory are shown (green) and overlayed with the reference geometries computed at the multi-reference configuration interaction singles and doubles (MRCISD) level of theory (taken from Ref. 59). We find that the optimized structures agree well with the reference structures, suggesting that this method combination could have potential in the explorations and preoptimizations of MECIs in photochemical studies. We note, however, that the relative energies for the MECIs are underestimated for all systems but ketene. This is likely due to the parametrization of the GFN1-xTB Hamiltonian, which has been parameterized (without Fock exchange) for a different purpose, i.e., minimum geometries, nuclear Hessians, and noncovalent interactions.32 At this point, it is emphasized that the MECI optimization procedure not only makes use of adiabatic nuclear gradients but also of nonadiabatic coupling vectors between both states. Due to the established setup (see Fig. 1), gradients and nonadiabatic coupling vectors for this new method combination become available simply by providing the aforementioned semiempirical integral components (Fock matrix equivalents and gradients) in place of the ab initio components. This way, elaborate re-derivation and implementation becomes unnecessary, and the existing implementation in TeraChem can be used.49,50

2. Full functionality with xTB-based REKS, CASCI, and CASSCF through SQMBox: The potential energy surfaces of ethylene near the twisted-pyramidal minimum energy conical intersection as example

To highlight this further, the S0 and S1 potential energy surfaces around the twisted-pyramidal MECI of ethylene, which were computed with LC-GFN1-xTB′(Kat)/SI2-SA2-REKS(2,2), are shown in Fig. 8. Here, we used the optimized MECI geometry and computed the S1 and S0 surfaces on a grid along the branching plane vectors g and h, which are obtained from the gradient difference and nonadiabatic coupling vectors between both states, respectively. We observe a peaked conical intersection along these vectors. The branching plane vectors are visualized in the insets and indicate contributions from the pyramidalization, torsion, and also from bond C–C bond stretching modes, which are in good agreement with the branching plane vectors from previously reported MRCISD and SI2-SA2-REKS(2,2)/ωPBEh calculations.51,59 Consequently, the presented framework provided us with easy access to gradients and nonadiabatic coupling, enabling photochemical studies in the near future.

In Fig. 8, we also plot the corresponding potential energy surfaces computed with FOMO-CASCI(2,2) and SA2-CASSCF(2,2) based on the GFN1-xTB′(100) Hamiltonian (see Sec. III). We find again that we retrieve the full functionality of the available implementations in TeraChem, which initially had been formulated only in terms of ab initio theories.63–65 Thanks to the cleanly separated Hamiltonian/wavefunction setup and the SQMBox library, these novel method combinations become accessible with semiempirical Hamiltonians, like the GFN1-xTB Hamiltonian used here.

We have already noted before that we are using a purpose-specifically fitted Hamiltonian (original purpose: ground state geometries, noncovalent interactions, and nuclear Hessians) in an off-purpose setting simply to demonstrate our implementation. Unsurprisingly, we find that the GFN1-xTB parametrization is not optimum for excited state purposes. Using the linear-response time-dependent density functional theory (TD-DFT) implementation,61 its Tamm–Dancoff approximated variant62 and the recently added hole–hole Tamm–Dancoff approximation (hh-TDA)55 implementation in TeraChem along with the semiempirical library, we can compute the corresponding excited states based on these methods in combination with the semiempirical tight-binding Hamiltonian. We compute the lowest four excited states of ethylene in combination with the LC-GFN1-xTB′(Kat) Hamiltonian and compare it to the first excited state from SI2-SA2-REKS(2,2). The results are listed in Table I. We also did the same comparison for the exchange-free GFN1-xTB′ Hamiltonian. We find that the visible ππ* state of ethylene, which is essentially a HOMO–LUMO transition, is the lowest excited state in SI2-SA2-REKS(2,2) but in no other method. The only exception is hh-TDA/GFN1-xTB′, where the excitations exactly correspond to the corresponding orbital energy differences since the Hamiltonian does not include exchange (see Ref. 55 for details). This indicates that the GFN1-xTB′ Hamiltonian erroneously places other electronic states in the minimal valence basis set (which cannot describe Rydberg states) to be lower than the ππ* state. The lowest excited state in all other schemes is in fact a σ(C–H)π* state. In the TDA scheme, the dark σ(C–C)π* state also lies lower (i.e., it is S2) than the ππ* state, which is different in RPA, where the ππ* state lies lower. It is known that in the electrostatic monopole approximation, which is a consequence of the Mulliken-approximated two-electron integrals, the two-electron (ia|jb) terms vanish for i = σ and a = π* due to symmetry and, hence, such states are identical in RPA and TDA.66 This is different for the ππ* states, which are strongly blue-shifted, predominantly due to the diagonal (ππ*|ππ*) term in the TDA/RPA A matrix. RPA somewhat reduces this shift, leading to lower excitation energies. This finding along with the strong blue-shift of the ππ* state in hh-TDA, if the coupling is activated due to the addition of exchange [LC-GFN1-xTB′(Kat)], leads to the conclusion that the Coulomb interactions in the GFN1-xTB′ Hamiltonian are somewhat overestimated or imbalanced with respect to the orbital energy differences, which are expected to have strong contributions from the extended Hückel term. Meanwhile, the parameters in the xTB-type Hamiltonian should offer enough flexibility to, in principle, provide a well-working Hamiltonian also for excited states, its original parametrization is specifically fitted (“overfitted”) to the original purpose of providing good ground state geometries and noncovalent interactions. The SI2-SA2-REKS(2,2) scheme still works well regardless, since the (2,2) active space prevents any other orbitals from contributing to the excited state. We conclude that a full reparameterization of the Hamiltonian is necessary to obtain a robust and accurate method for excited states. In this context, the presented framework allows the straightforward application of a broader range of excited state methods, just as we did now in this assessment for ethylene, and can now be used to check a re-parametrized Hamiltonian for consistency.

As a final point, we want to highlight the importance of the Fock exchange in Hamiltonians to be used for photochemical studies. It is known that vertical excitations are generally better described for approximate density functional and tight-binding models when nonlocal Fock exchange is included. This is also why we chose to augment the Hamiltonian with the Fock exchange since the GFN1-xTB Hamiltonian does not contain the Fock exchange or any spin-density dependent terms. To demonstrate the implications for photochemical applications beyond vertical excitations, which are long-term targets of the TeraChem/SQMBox framework, we plot the potential energy surfaces of the two lowest states of ethylene along the torsional and pyramidalization coordinates in Fig. 7. Here, the surfaces are computed within the SI2-SA2-REKS(2,2) framework in combination with the exchange-free GFN1-xTB′ and the exchange-inclusive LC-GFN1-xTB′(Kat) Hamiltonian (see Sec. II E). In Figs. 7(c) and 7(d), we see the direct consequence of employing a Hamiltonian, which is free from the exchange or spin-density components. Here, the S0/S1 intersection coincides with the S1 minimum geometry and is found at a purely twisted and non-pyramidalized geometry. High-level reference calculations of complete active space second-order perturbation theory type indicate that both the S1 minimum and the MECI geometry should be significantly pyramidalized (pyramidalization angles >70°).55,67 This failure is immediately cured by the inclusion of Fock exchange as in the LC-GFN1-xTB′(Kat) Hamiltonian [Figs. 7(a) and 7(b)]. The MECI geometry is then found at 60 degrees of pyramidalization, i.e., much closer to the aforementioned reference, and the overall shape of the S1 potential energy surface is significantly improved (cf. Fig. 1 in Ref. 68). We furthermore find that the potential energy surface at small pyramidalization angles and 90° torsion requires the addition of a short-range correction, which we introduce in the form of an atomic ab initio K contribution (see supplementary material for the plots with the LC-GFN1-xTB′ Hamiltonian). Hence, we conclude that a tight-binding approach with long-range exchange correction alone is not likely to be sufficient for photochemical studies.

FIG. 8.

Potential energy surface plots around the twisted-pyramidal MECI of ethylene computed with (a) GFN1-xTB′(100)/FOMO-CASCI(2,2) and (b) GFN1-xTB′(100)/SA2-CASSCF(2,2). GFN1-xTB′(100) is the second-order GFN1-xTB Hamiltonian with 100% of Mulliken-approximated Fock exchange. The surfaces are computed on a grid obtained by displacing the respective MECI geometry along the corresponding branching plane vectors g and h. For each method, these are computed from the gradient difference (g) and the nonadiabatic coupling (h) vectors between the two states. These vectors are visualized in the insets.

FIG. 8.

Potential energy surface plots around the twisted-pyramidal MECI of ethylene computed with (a) GFN1-xTB′(100)/FOMO-CASCI(2,2) and (b) GFN1-xTB′(100)/SA2-CASSCF(2,2). GFN1-xTB′(100) is the second-order GFN1-xTB Hamiltonian with 100% of Mulliken-approximated Fock exchange. The surfaces are computed on a grid obtained by displacing the respective MECI geometry along the corresponding branching plane vectors g and h. For each method, these are computed from the gradient difference (g) and the nonadiabatic coupling (h) vectors between the two states. These vectors are visualized in the insets.

Close modal

We have presented a modular framework that facilitates the joint development of ab initio and semiempirical electronic structure workflows. In our approach, we group the semiempirical Hamiltonian terms by their orders in the one-particle density matrix and link them to the ab initio Hamiltonian quantities with an equivalent dependency on the density matrix (and nuclear coordinates). The semiempirical integral library then provides the Mulliken-approximated semiempirical Hamiltonian matrix equivalents that resemble the ab initio/Kohn–Sham Hamiltonian matrix quantities. In an existing ab initio electronic structure program, the semiempirical integral library is then called instead of the ab initio integral library, which enables the straightforward extension of electronic structure methods to semiempirical Hamiltonians. This removes any extra effort to re-derive and re-implement electronic structure methods for semiempirical Hamiltonians once an ab initio framework already exists. We showcase this by combining the recently presented extended tight-binding Hamiltonian GFN1-xTB (augmented with Fock exchange) with the state-interaction state-averaged spin-restricted ensemble-averaged Kohn–Sham (SI-SA-REKS) method. We apply this combined scheme to the optimization of conical intersections for a benchmark set from the literature and to the computation of the potential energy surfaces around the twisted-pyramidal S1/S0 minimum energy conical intersection of ethylene. The latter was also carried out with Fock exchange-augmented GFN1-xTB in combination with wave-function-based complete active space methods (state-averaged CASSCF and floating-occupation molecular orbital CASCI). This demonstrates that our framework gives full access to the existing functionality of the ab initio code, including gradients and nonadiabatic coupling vectors for the aforementioned method combinations. The derivation and implementation of these quantities, which can represent a considerable effort,49,50,63–65 is entirely avoided by leveraging the existing implementation in TeraChem with the modular integral library setup. Currently, the library is a proof-of-concept implemented within the TeraChem package and supports both GFN-xTB-type Hamiltonians. Current efforts aim at generalizing the workflow to set up and re-parametrize such semiempirical Hamiltonians, whereupon we hope to make the library more widely available. All Hamiltonian matrix and derivative expressions are thoroughly documented in the supplementary material.

We envisage the presented framework to enable faster method and workflow developments in which the ab initio and semiempirical electronic structure developments are no longer decoupled. Multi-level procedures, in which semiempirical electronic structure methods are used for exploratory calculations while the corresponding ab initio variant is then used for refinement, are naturally facilitated by this framework. Furthermore, testing in the course of implementing new electronic structure methods becomes simplified by resorting to the semiempirical integral library for this purpose: a standard consistency check in electronic structure developments is the comparison of analytic to numerical gradients, with the latter typically requiring a considerable amount of computation time. Given that the semiempirical integral library offers one-to-one correspondence to the terms from the ab initio library but is significantly faster (3–4 orders of magnitude), future testing and AO-direct electronic structure code developments can be accelerated as well by our framework.

See the supplementary materialPDF file for detailed equations, derivations, and algorithms. Further benchmarking data for mixed precision Fock matrix construction schemes. ZIP file for molecular geometries used for benchmarking and testing.

This work was supported by the Office of Naval Research (Grant Nos. N00014-21-1-2151 and N00014-18-1-2659). C.B. thanks the German National Academy of Sciences Leopoldina for a Leopoldina Fellowship (Project No. LPDS 2018–09). We thank Jimmy K. Yu for providing a template script for plotting the ethylene potential energy surfaces.

T.J.M. is a co-founder of PetaChem, LLC.

Christoph Bannwarth: Conceptualization (equal); Funding acquisition (equal); Investigation (equal); Methodology (equal); Writing – original draft (equal); Writing – review & editing (equal). Todd J. Martínez: Conceptualization (equal); Funding acquisition (equal); Investigation (equal); Methodology (equal); Resources (equal); Supervision (equal); Validation (equal); Writing – review & editing (equal).

The data that support the findings of this study are available from the corresponding author upon reasonable request.

1.
K. N.
Houk
and
F.
Liu
, “
Holy grails for computational organic chemistry and biochemistry
,”
Acc. Chem. Res.
50
(
3
),
539
543
(
2017
).
2.
S.
Grimme
and
P. R.
Schreiner
, “
Computational chemistry: The fate of current methods and future challenges
,”
Angew. Chem., Int. Ed.
57
(
16
),
4170
4176
(
2018
).
3.
F.
Neese
,
M.
Atanasov
,
G.
Bistoni
,
D.
Maganas
, and
S.
Ye
, “
Chemistry and quantum mechanics in 2019: Give us insight and numbers
,”
J. Am. Chem. Soc.
141
(
7
),
2814
2824
(
2019
).
4.
S.
Grimme
, “
Exploration of chemical compound, conformer, and reaction space with meta-dynamics simulations based on tight-binding quantum chemical calculations
,”
J. Chem. Theory Comput.
15
(
5
),
2847
2862
(
2019
).
5.
H. L.
Woodcock
,
M.
Hodošček
, and
B. R.
Brooks
, “
Exploring SCC-DFTB paths for mapping QM/MM reaction mechanisms
,”
J. Phys. Chem. A
111
(
26
),
5720
5728
(
2007
).
6.
P.
Pracht
,
F.
Bohle
, and
S.
Grimme
, “
Automated exploration of the low-energy chemical space with fast quantum chemical methods
,”
Phys. Chem. Chem. Phys.
22
,
7169
7192
(
2020
).
7.
S.
Grimme
,
C.
Bannwarth
,
S.
Dohm
,
A.
Hansen
,
J.
Pisarek
,
P.
Pracht
,
J.
Seibert
, and
F.
Neese
, “
Fully automated quantum-chemistry-based computation of spin–spin-coupled nuclear magnetic resonance spectra
,”
Angew. Chem., Int. Ed.
56
(
46
),
14763
14769
(
2017
).
8.
S.
Grimme
, “
Supramolecular binding thermodynamics by dispersion-corrected density functional theory
,”
Chem. Eur. J.
18
(
32
),
9955
9964
(
2012
).
9.
P. S.
Hudson
,
H. L.
Woodcock
, and
S.
Boresch
, “
Use of nonequilibrium work methods to compute free energy differences between molecular mechanical and quantum mechanical representations of molecular systems
,”
J. Phys. Chem. Lett.
6
(
23
),
4850
4856
(
2015
).
10.
M. J.
Frisch
,
G. W.
Trucks
,
H. B.
Schlegel
,
G. E.
Scuseria
,
M. A.
Robb
,
J. R.
Cheeseman
,
G.
Scalmani
,
V.
Barone
,
B.
Mennucci
,
G. A.
Petersson
,
H.
Nakatsuji
,
M.
Caricato
,
X.
Li
,
H. P.
Hratchian
,
A. F.
Izmaylov
,
J.
Bloino
,
G.
Zheng
,
J. L.
Sonnenberg
,
M.
Hada
,
M.
Ehara
,
K.
Toyota
,
R.
Fukuda
,
J.
Hasegawa
,
M.
Ishida
,
T.
Nakajima
,
Y.
Honda
,
O.
Kitao
,
H.
Nakai
,
T.
Vreven
,
J. A.
Montgomery
, Jr.
,
J. E.
Peralta
,
F.
Ogliaro
,
M.
Bearpark
,
J. J.
Heyd
,
E.
Brothers
,
K. N.
Kudin
,
V. N.
Staroverov
,
R.
Kobayashi
,
J.
Normand
,
K.
Raghavachari
,
A.
Rendell
,
J. C.
Burant
,
S. S.
Iyengar
,
J.
Tomasi
,
M.
Cossi
,
N.
Rega
,
J. M.
Millam
,
M.
Klene
,
J. E.
Knox
,
J. B.
Cross
,
V.
Bakken
,
C.
Adamo
,
J.
Jaramillo
,
R.
Gomperts
,
R. E.
Stratmann
,
O.
Yazyev
,
A. J.
Austin
,
R.
Cammi
,
C.
Pomelli
,
J. W.
Ochterski
,
R. L.
Martin
,
K.
Morokuma
,
V. G.
Zakrzewski
,
G. A.
Voth
,
P.
Salvador
,
J. J.
Dannenberg
,
S.
Dapprich
,
A. D.
Daniels
,
Ö.
Farkas
,
J. B.
Foresman
,
J. V.
Ortiz
,
J.
Cioslowski
, and
D. J.
Fox
, Gaussian 09,
Gaussian, Inc.
,
Wallingford, CT
,
2009
.
11.
F.
Neese
, “
The ORCA program system
,”
Wiley Interdiscip. Rev.: Comput. Mol. Sci.
2
(
1
),
73
78
(
2012
).
12.
ADF 2019.3, SCM, Theoretical Chemistry, Vrije Universiteit, Amsterdam, The Netherlands, http://www.scm.com, Optionally, you may add the following list of authors and contributors,
E. J.
Baerends
,
T.
Ziegler
,
A. J.
Atkins
,
J.
Autschbach
,
O.
Baseggio
,
D.
Bashford
,
A.
Bérces
,
F. M.
Bickelhaupt
,
C.
Bo
,
P. M.
Boerrigter
,
L.
Cavallo
,
C.
Daul
,
D. P.
Chong
,
D. V.
Chulhai
,
L.
Deng
,
R. M.
Dickson
,
J. M.
Dieterich
,
D. E.
Ellis
,
M.
van Faassen
,
L.
Fan
,
T. H.
Fischer
,
A.
Förster
,
C.
Fonseca Guerra
,
M.
Franchini
,
A.
Ghysels
,
A.
Giammona
,
S. J. A.
van Gisbergen
,
A.
Goez
,
A. W.
Götz
,
J. A.
Groeneveld
,
O. V.
Gritsenko
,
M.
Grüning
,
S.
Gusarov
,
F. E.
Harris
,
P.
van den Hoek
,
Z.
Hu
,
C. R.
Jacob
,
H.
Jacobsen
,
L.
Jensen
,
L.
Joubert
,
J. W.
Kaminski
,
G.
van Kessel
,
C.
König
,
F.
Kootstra
,
A.
Kovalenko
,
M. V.
Krykunov
,
E.
van Lenthe
,
D. A.
McCormack
,
A.
Michalak
,
M.
Mitoraj
,
S. M.
Morton
,
J.
Neugebauer
,
V. P.
Nicu
,
L.
Noodleman
,
V. P.
Osinga
,
S.
Patchkovskii
,
M.
Pavanello
,
C. A.
Peeples
,
P. H. T.
Philipsen
,
D.
Post
,
C. C.
Pye
,
H.
Ramanantoanina
,
P.
Ramos
,
W.
Ravenek
,
J. I.
Rodríguez
,
P.
Ros
,
R.
Rüger
,
P. R. T.
Schipper
,
D.
Schlüns
,
H.
van Schoot
,
G.
Schreckenbach
,
J. S.
Seldenthuis
,
M.
Seth
,
J. G.
Snijders
,
M.
Solà
,
M.
Stener
,
M.
Swart
,
D.
Swerhone
,
V.
Tognetti
,
G.
te Velde
,
P.
Vernooijs
,
L.
Versluis
,
L.
Visscher
,
O.
Visser
,
F.
Wang
,
T. A.
Wesolowski
,
E. M.
van Wezenbeek
,
G.
Wiesenekker
,
S. K.
Wolff
,
T. K.
Woo
, and
A. L.
Yakovlev
.
13.
G.
te Velde
,
F. M.
Bickelhaupt
,
E. J.
Baerends
,
C.
Fonseca Guerra
,
S. J. A.
van Gisbergen
,
J. G.
Snijders
, and
T.
Ziegler
, “
Chemistry with ADF
,”
J. Comput. Chem.
22
(
9
),
931
967
(
2001
).
14.
Y.
Shao
,
Z.
Gan
,
E.
Epifanovsky
,
A. T. B.
Gilbert
,
M.
Wormit
,
J.
Kussmann
,
A. W.
Lange
,
A.
Behn
,
J.
Deng
,
X.
Feng
,
D.
Ghosh
,
M.
Goldey
,
P. R.
Horn
,
L. D.
Jacobson
,
I.
Kaliman
,
R. Z.
Khaliullin
,
T.
Kuś
,
A.
Landau
,
J.
Liu
,
E. I.
Proynov
,
Y. M.
Rhee
,
R. M.
Richard
,
M. A.
Rohrdanz
,
R. P.
Steele
,
E. J.
Sundstrom
,
H. L.
Woodcock
,
P. M.
Zimmerman
,
D.
Zuev
,
B.
Albrecht
,
E.
Alguire
,
B.
Austin
,
G. J. O.
Beran
,
Y. A.
Bernard
,
E.
Berquist
,
K.
Brandhorst
,
K. B.
Bravaya
,
S. T.
Brown
,
D.
Casanova
,
C.-M.
Chang
,
Y.
Chen
,
S. H.
Chien
,
K. D.
Closser
,
D. L.
Crittenden
,
M.
Diedenhofen
,
R. A.
DiStasio
,
H.
Do
,
A. D.
Dutoi
,
R. G.
Edgar
,
S.
Fatehi
,
L.
Fusti-Molnar
,
A.
Ghysels
,
A.
Golubeva-Zadorozhnaya
,
J.
Gomes
,
M. W. D.
Hanson-Heine
,
P. H. P.
Harbach
,
A. W.
Hauser
,
E. G.
Hohenstein
,
Z. C.
Holden
,
T.-C.
Jagau
,
H.
Ji
,
B.
Kaduk
,
K.
Khistyaev
,
J.
Kim
,
J.
Kim
,
R. A.
King
,
P.
Klunzinger
,
D.
Kosenkov
,
T.
Kowalczyk
,
C. M.
Krauter
,
K. U.
Lao
,
A. D.
Laurent
,
K. V.
Lawler
,
S. V.
Levchenko
,
C. Y.
Lin
,
F.
Liu
,
E.
Livshits
,
R. C.
Lochan
,
A.
Luenser
,
P.
Manohar
,
S. F.
Manzer
,
S.-P.
Mao
,
N.
Mardirossian
,
A. V.
Marenich
,
S. A.
Maurer
,
N. J.
Mayhall
,
E.
Neuscamman
,
C. M.
Oana
,
R.
Olivares-Amaya
,
D. P.
O’Neill
,
J. A.
Parkhill
,
T. M.
Perrine
,
R.
Peverati
,
A.
Prociuk
,
D. R.
Rehn
,
E.
Rosta
,
N. J.
Russ
,
S. M.
Sharada
,
S.
Sharma
,
D. W.
Small
,
A.
Sodt
,
T.
Stein
,
D.
Stück
,
Y.-C.
Su
,
A. J. W.
Thom
,
T.
Tsuchimochi
,
V.
Vanovschi
,
L.
Vogt
,
O.
Vydrov
,
T.
Wang
,
M. A.
Watson
,
J.
Wenzel
,
A.
White
,
C. F.
Williams
,
J.
Yang
,
S.
Yeganeh
,
S. R.
Yost
,
Z.-Q.
You
,
I. Y.
Zhang
,
X.
Zhang
,
Y.
Zhao
,
B. R.
Brooks
,
G. K. L.
Chan
,
D. M.
Chipman
,
C. J.
Cramer
,
W. A.
Goddard
,
M. S.
Gordon
,
W. J.
Hehre
,
A.
Klamt
,
H. F.
Schaefer
,
M. W.
Schmidt
,
C. D.
Sherrill
,
D. G.
Truhlar
,
A.
Warshel
,
X.
Xu
,
A.
Aspuru-Guzik
,
R.
Baer
,
A. T.
Bell
,
N. A.
Besley
,
J.-D.
Chai
,
A.
Dreuw
,
B. D.
Dunietz
,
T. R.
Furlani
,
S. R.
Gwaltney
,
C.-P.
Hsu
,
Y.
Jung
,
J.
Kong
,
D. S.
Lambrecht
,
W.
Liang
,
C.
Ochsenfeld
,
V. A.
Rassolov
,
L. V.
Slipchenko
,
J. E.
Subotnik
,
T.
Van Voorhis
,
J. M.
Herbert
,
A. I.
Krylov
,
P. M. W.
Gill
, and
M.
Head-Gordon
, “
Advances in molecular quantum chemistry contained in the Q-Chem 4 program package
,”
Mol. Phys.
113
(
2
),
184
215
(
2015
).
15.
J.
Hutter
,
M.
Iannuzzi
,
F.
Schiffmann
, and
J.
VandeVondele
, “
cp2k: atomistic simulations of condensed matter systems
,”
Wiley Interdiscip. Rev.: Comput. Mol. Sci.
4
(
1
),
15
25
(
2014
).
16.
S. G.
Balasubramani
,
G. P.
Chen
,
S.
Coriani
,
M.
Diedenhofen
,
M. S.
Frank
,
Y. J.
Franzke
,
F.
Furche
,
R.
Grotjahn
,
M. E.
Harding
,
C.
Hättig
,
A.
Hellweg
,
B.
Helmich-Paris
,
C.
Holzer
,
U.
Huniar
,
M.
Kaupp
,
A.
Marefat Khah
,
S.
Karbalaei Khani
,
T.
Müller
,
F.
Mack
,
B. D.
Nguyen
,
S. M.
Parker
,
E.
Perlt
,
D.
Rappoport
,
K.
Reiter
,
S.
Roy
,
M.
Rückert
,
G.
Schmitz
,
M.
Sierka
,
E.
Tapavicza
,
D. P.
Tew
,
C.
van Wüllen
,
V. K.
Voora
,
F.
Weigend
,
A.
Wodyński
, and
J. M.
Yu
, “
TURBOMOLE: Modular program suite for ab initio quantum-chemical and condensed-matter simulations
,”
J. Chem. Phys.
152
(
18
),
184107
(
2020
).
17.
Q.
Sun
,
X.
Zhang
,
S.
Banerjee
,
P.
Bao
,
M.
Barbry
,
N. S.
Blunt
,
N. A.
Bogdanov
,
G. H.
Booth
,
J.
Chen
,
Z.-H.
Cui
,
J. J.
Eriksen
,
Y.
Gao
,
S.
Guo
,
J.
Hermann
,
M. R.
Hermes
,
K.
Koh
,
P.
Koval
,
S.
Lehtola
,
Z.
Li
,
J.
Liu
,
N.
Mardirossian
,
J. D.
McClain
,
M.
Motta
,
B.
Mussard
,
H. Q.
Pham
,
A.
Pulkin
,
W.
Purwanto
,
P. J.
Robinson
,
E.
Ronca
,
E. R.
Sayfutyarova
,
M.
Scheurer
,
H. F.
Schurkus
,
J. E. T.
Smith
,
C.
Sun
,
S.-N.
Sun
,
S.
Upadhyay
,
L. K.
Wagner
,
X.
Wang
,
A.
White
,
J. D.
Whitfield
,
M. J.
Williamson
,
S.
Wouters
,
J.
Yang
,
J. M.
Yu
,
T.
Zhu
,
T. C.
Berkelbach
,
S.
Sharma
,
A. Y.
Sokolov
, and
G. K.-L.
Chan
, “
Recent developments in the PySCF program package
,”
J. Chem. Phys.
153
(
2
),
024109
(
2020
).
18.
E.
Aprà
,
E. J.
Bylaska
,
W. A.
de Jong
,
N.
Govind
,
K.
Kowalski
,
T. P.
Straatsma
,
M.
Valiev
,
H. J. J.
van Dam
,
Y.
Alexeev
,
J.
Anchell
,
V.
Anisimov
,
F. W.
Aquino
,
R.
Atta-Fynn
,
J.
Autschbach
,
N. P.
Bauman
,
J. C.
Becca
,
D. E.
Bernholdt
,
K.
Bhaskaran-Nair
,
S.
Bogatko
,
P.
Borowski
,
J.
Boschen
,
J.
Brabec
,
A.
Bruner
,
E.
Cauët
,
Y.
Chen
,
G. N.
Chuev
,
C. J.
Cramer
,
J.
Daily
,
M. J. O.
Deegan
,
T. H.
Dunning
,
M.
Dupuis
,
K. G.
Dyall
,
G. I.
Fann
,
S. A.
Fischer
,
A.
Fonari
,
H.
Früchtl
,
L.
Gagliardi
,
J.
Garza
,
N.
Gawande
,
S.
Ghosh
,
K.
Glaesemann
,
A. W.
Götz
,
J.
Hammond
,
V.
Helms
,
E. D.
Hermes
,
K.
Hirao
,
S.
Hirata
,
M.
Jacquelin
,
L.
Jensen
,
B. G.
Johnson
,
H.
Jónsson
,
R. A.
Kendall
,
M.
Klemm
,
R.
Kobayashi
,
V.
Konkov
,
S.
Krishnamoorthy
,
M.
Krishnan
,
Z.
Lin
,
R. D.
Lins
,
R. J.
Littlefield
,
A. J.
Logsdail
,
K.
Lopata
,
W.
Ma
,
A. V.
Marenich
,
J.
Martin del Campo
,
D.
Mejia-Rodriguez
,
J. E.
Moore
,
J. M.
Mullin
,
T.
Nakajima
,
D. R.
Nascimento
,
J. A.
Nichols
,
P. J.
Nichols
,
J.
Nieplocha
,
A.
Otero-de-la-Roza
,
B.
Palmer
,
A.
Panyala
,
T.
Pirojsirikul
,
B.
Peng
,
R.
Peverati
,
J.
Pittner
,
L.
Pollack
,
R. M.
Richard
,
P.
Sadayappan
,
G. C.
Schatz
,
W. A.
Shelton
,
D. W.
Silverstein
,
D. M. A.
Smith
,
T. A.
Soares
,
D.
Song
,
M.
Swart
,
H. L.
Taylor
,
G. S.
Thomas
,
V.
Tipparaju
,
D. G.
Truhlar
,
K.
Tsemekhman
,
T.
Van Voorhis
,
Á.
Vázquez-Mayagoitia
,
P.
Verma
,
O.
Villa
,
A.
Vishnu
,
K. D.
Vogiatzis
,
D.
Wang
,
J. H.
Weare
,
M. J.
Williamson
,
T. L.
Windus
,
K.
Woliński
,
A. T.
Wong
,
Q.
Wu
,
C.
Yang
,
Q.
Yu
,
M.
Zacharias
,
Z.
Zhang
,
Y.
Zhao
, and
R. J.
Harrison
, “
NWChem: Past, present, and future
,”
J. Chem. Phys.
152
(
18
),
184102
(
2020
).
19.
W.
Kutzelnigg
, “
Der quantenchemische ausdruck für die bindungsenergie eines beliebigen moleküls in der MO‐LCAO‐näherung und seine physikalische interpretation
,” in
Einführung in die Theoretische Chemie, Part II: Die Chemische Bindung
(
Wiley-VCH
,
2001
), p.
67
.
20.
V.
Lutsker
,
B.
Aradi
, and
T. A.
Niehaus
, “
Implementation and benchmark of a long-range corrected functional in the density functional based tight-binding method
,”
J. Chem. Phys.
143
(
18
),
184107
(
2015
).
21.
A.
Humeniuk
and
R.
Mitrić
, “
Long-range correction for tight-binding TD-DFT
,”
J. Chem. Phys.
143
(
13
),
134120
(
2015
).
22.
J.
Almlöf
,
K.
Faegri
, Jr.
, and
K.
Korsell
, “
Principles for a direct SCF approach to LICAO–MO ab-initio calculations
,”
J. Comput. Chem.
3
(
3
),
385
399
(
1982
).
23.
O.
Vahtras
,
J.
Almlöf
, and
M. W.
Feyereisen
, “
Integral approximations for LCAO-SCF calculations
,”
Chem. Phys. Lett.
213
(
5–6
),
514
518
(
1993
).
24.
F.
Weigend
, “
A fully direct RI-HF algorithm: Implementation, optimised auxiliary basis sets, demonstration of accuracy and efficiency
,”
Phys. Chem. Chem. Phys.
4
(
18
),
4285
4291
(
2002
).
25.
N. H. F.
Beebe
and
J.
Linderberg
, “
Simplifications in the generation and transformation of two-electron integrals in molecular calculations
,”
Int. J. Quantum Chem.
12
(
4
),
683
705
(
1977
).
26.
F.
Weigend
and
M.
Häser
, “
RI-MP2: First derivatives and global consistency
,”
Theor. Chem. Acc.
97
(
1
),
331
340
(
1997
).
27.
I. S.
Ufimtsev
and
T. J.
Martinez
, “
Quantum chemistry on graphical processing units. 2. Direct self-consistent-field implementation
,”
J. Chem. Theory Comput.
5
(
4
),
1004
1015
(
2009
).
28.
I. S.
Ufimtsev
and
T. J.
Martínez
, “
Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation
,”
J. Chem. Theory Comput.
4
(
2
),
222
231
(
2008
).
29.
S.
Seritan
,
C.
Bannwarth
,
B. S.
Fales
,
E. G.
Hohenstein
,
S. I. L.
Kokkila-Schumacher
,
N.
Luehr
,
J. W.
Snyder
,
C.
Song
,
A. V.
Titov
,
I. S.
Ufimtsev
, and
T. J.
Martínez
, “
TeraChem: Accelerating electronic structure and ab initio molecular dynamics with graphical processing units
,”
J. Chem. Phys.
152
(
22
),
224110
(
2020
).
30.
A. S.
Christensen
,
T.
Kubař
,
Q.
Cui
, and
M.
Elstner
, “
Semiempirical quantum mechanical methods for noncovalent interactions for chemical and biochemical applications
,”
Chem. Rev.
116
(
9
),
5301
5337
(
2016
).
31.
C.
Bannwarth
,
S.
Ehlert
, and
S.
Grimme
, “
GFN2-xTB—An accurate and broadly parametrized self-consistent tight-binding quantum chemical method with multipole electrostatics and density-dependent dispersion contributions
,”
J. Chem. Theory Comput.
15
(
3
),
1652
1671
(
2019
).
32.
S.
Grimme
,
C.
Bannwarth
, and
P.
Shushkov
, “
A robust and accurate tight-binding quantum chemical method for structures, vibrational frequencies, and noncovalent interactions of large molecular systems parametrized for all spd-block elements (Z = 1–86)
,”
J. Chem. Theory Comput.
13
(
5
),
1989
2009
(
2017
).
33.
S.
Grimme
,
J.
Antony
,
S.
Ehrlich
, and
H.
Krieg
, “
A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu
,”
J. Chem. Phys.
132
(
15
),
154104
(
2010
).
34.
S.
Grimme
,
S.
Ehrlich
, and
L.
Goerigk
, “
Effect of the damping function in dispersion corrected density functional theory
,”
J. Comput. Chem.
32
(
7
),
1456
1465
(
2011
).
35.
E.
Caldeweyher
,
C.
Bannwarth
, and
S.
Grimme
, “
Extension of the D3 dispersion coefficient model
,”
J. Chem. Phys.
147
(
3
),
034112
(
2017
).
36.
E.
Caldeweyher
,
S.
Ehlert
,
A.
Hansen
,
H.
Neugebauer
,
S.
Spicher
,
C.
Bannwarth
, and
S.
Grimme
, “
A generally applicable atomic-charge dependent London dispersion correction
,”
J. Chem. Phys.
150
(
15
),
154122
(
2019
).
37.
A. M.
Köster
,
M.
Leboeuf
, and
D. R.
Salahub
, “
Molecular electrostatic potentials from density functional theory
,” in
Theoretical and Computational Chemistry
, edited by
J. S.
Murray
and
K.
Sen
(
Elsevier
,
1996
), Vol. 3, pp
105
142
.
38.
N.
Mataga
and
K.
Nishimoto
, “
Electronic structure and spectra of nitrogen heterocycles
,”
Z. Phys. Chem.
13
(
3–4
),
140
157
(
1957
).
39.
K.
Ohno
, “
Some remarks on the Pariser-Parr-Pople method
,”
Theor. Chim. Acta
2
(
3
),
219
227
(
1964
).
40.
G.
Klopman
, “
A semiempirical treatment of molecular structures. II. Molecular terms and application to diatomic molecules
,”
J. Am. Chem. Soc.
86
(
21
),
4550
4557
(
1964
).
41.
See https://github.com/grimme-lab/xtb for xtb code version 6.2.
42.
Y.
Nishimoto
, “
Time-dependent density-functional tight-binding method with the third-order expansion of electron density
,”
J. Chem. Phys.
143
(
9
),
094108
(
2015
).
43.
G.
Klopman
and
R. C.
Evans
, “
The neglect-of-differential-overlap methods of molecular orbital theory
,” in
Semiempirical Methods of Electronic Structure Calculation: Part A: Techniques
, edited by
G. A.
Segal
(
Springer US
,
Boston, MA
,
1977
), pp
29
67
.
44.
B. M.
Bold
,
M.
Sokolov
,
S.
Maity
,
M.
Wanko
,
P. M.
Dohmen
,
J. J.
Kranz
,
U.
Kleinekathöfer
,
S.
Höfener
, and
M.
Elstner
, “
Benchmark and performance of long-range corrected time-dependent density functional tight binding (LC-TD-DFTB) on rhodopsins and light-harvesting complexes
,”
Phys. Chem. Chem. Phys.
22
,
10500
10518
(
2020
).
45.
L.
Stojanović
,
S. G.
Aziz
,
R. H.
Hilal
,
F.
Plasser
,
T. A.
Niehaus
, and
M.
Barbatti
, “
Nonadiabatic dynamics of cycloparaphenylenes with TD-DFTB surface hopping
,”
J. Chem. Theory Comput.
13
(
12
),
5846
5860
(
2017
).
46.
Y.
Nishimoto
, “
Time-dependent long-range-corrected density-functional tight-binding method combined with the polarizable continuum model
,”
J. Phys. Chem. A
123
(
26
),
5649
5659
(
2019
).
47.
L. K.
McKemmish
,
A. T. B.
Gilbert
, and
P. M. W.
Gill
, “
Mixed Ramp–Gaussian basis sets
,”
J. Chem. Theory Comput.
10
(
10
),
4369
4376
(
2014
).
48.
M.
Filatov
, “
Spin-restricted ensemble-referenced Kohn–Sham method: Basic principles and application to strongly correlated ground and excited states of molecules
,”
Wiley Interdiscip. Rev.: Comput. Mol. Sci.
5
(
1
),
146
167
(
2015
).
49.
M.
Filatov
,
F.
Liu
, and
T. J.
Martínez
, “
Analytical derivatives of the individual state energies in ensemble density functional theory method. I. General formalism
,”
J. Chem. Phys.
147
(
3
),
034113
(
2017
).
50.
F.
Liu
,
M.
Filatov
, and
T. J.
Martínez
, “
Analytical derivatives of the individual state energies in ensemble density functional theory. II. Implementation on graphical processing units (GPUs)
,”
J. Chem. Phys.
154
,
104108
(
2021
).
51.
I. S.
Lee
,
M.
Filatov
, and
S. K.
Min
, “
Formulation and implementation of the spin-restricted ensemble-referenced Kohn–Sham method in the context of the density functional tight binding approach
,”
J. Chem. Theory Comput.
15
(
5
),
3021
3032
(
2019
).
53.
M.
Häser
and
R.
Ahlrichs
, “
Improvements on the direct SCF method
,”
J. Comput. Chem.
10
(
1
),
104
111
(
1989
).
54.
R. M.
Parrish
,
F.
Liu
, and
T. J.
Martínez
, “
Communication: A difference density picture for the self-consistent field ansatz
,”
J. Chem. Phys.
144
(
13
),
131101
(
2016
).
55.
C.
Bannwarth
,
J. K.
Yu
,
E. G.
Hohenstein
, and
T. J.
Martínez
, “
Hole–hole Tamm–Dancoff-approximated density functional theory: A highly efficient electronic structure method incorporating dynamic and static correlation
,”
J. Chem. Phys.
153
(
2
),
024110
(
2020
).
56.
S.
Grimme
and
C.
Bannwarth
, “
Ultra-fast computation of electronic spectra for large systems by tight-binding based simplified Tamm-Dancoff approximation (sTDA-xTB)
,”
J. Chem. Phys.
145
,
054103
(
2016
).
57.
T.
Risthaus
,
A.
Hansen
, and
S.
Grimme
, “
Excited states using the simplified Tamm-Dancoff-approach for range-separated hybrid density functionals: Development and application
,”
Phys. Chem. Chem. Phys.
16
,
14408
14419
(
2014
).
58.
M.
Schreiber
,
M. R.
Silva-Junior
,
S. P. A.
Sauer
, and
W.
Thiel
, “
Benchmarks for electronically excited states: CASPT2, CC2, CCSD, and CC3
,”
J. Chem. Phys.
128
,
134110
(
2008
).
59.
A.
Nikiforov
,
J. A.
Gamez
,
W.
Thiel
,
M.
Huix-Rotllant
, and
M.
Filatov
, “
Assessment of approximate computational models for conical intersections and branching plane vectors in organic molecules
,”
J. Chem. Phys.
141
,
124122
(
2014
).
60.
J.
Kästner
,
J. M.
Carr
,
T. W.
Keal
,
W.
Thiel
,
A.
Wander
, and
P.
Sherwood
, “
DL-FIND: An open-source geometry optimizer for atomistic simulations
,”
J. Phys. Chem. A
113
,
11856
11865
(
2009
).
61.
M. E.
Casida
, in
Time-Dependent Density Functional Response Theory for Molecules in: Recent Advances in Density Functional Methods
, edited by
D. P.
Chong
(
World Scientific
,
Singapore
,
1995
), Vol. 1.
62.
S.
Hirata
and
M.
Head-Gordon
, “
Time-dependent density functional theory within the Tamm–Dancoff approximation
,”
Chem. Phys. Lett.
314
(
3–4
),
291
299
(
1999
).
63.
E. G.
Hohenstein
, “
Analytic formulation of derivative coupling vectors for complete active space configuration interaction wavefunctions with floating occupation molecular orbitals
,”
J. Chem. Phys.
145
(
17
),
174110
(
2016
).
64.
E. G.
Hohenstein
,
M. E. F.
Bouduban
,
C.
Song
,
N.
Luehr
,
I. S.
Ufimtsev
, and
T. J.
Martínez
, “
Analytic first derivatives of floating occupation molecular orbital-complete active space configuration interaction on graphical processing units
,”
J. Chem. Phys.
143
(
1
),
014111
(
2015
).
65.
J. W.
Snyder
,
E. G.
Hohenstein
,
N.
Luehr
, and
T. J.
Martínez
, “
An atomic orbital-based formulation of analytical gradients and nonadiabatic coupling vector elements for the state-averaged complete active space self-consistent field method on graphical processing units
,”
J. Chem. Phys.
143
(
15
),
154107
(
2015
).
66.
C.
Bannwarth
and
S.
Grimme
, “
A simplified time-dependent density functional theory approach for electronic ultraviolet and circular dichroism spectra of very large molecules
,”
Comput. Theor. Chem.
1040–1041
,
45
53
(
2014
).
67.
C.
Köhler
,
G.
Seifert
, and
T.
Frauenheim
, “
Density functional based calculations for Fen (n ⩽ 32)
,”
Chem. Phys.
309
(
1
),
23
31
(
2005
).
68.
B. G.
Levine
,
C.
Ko
,
J.
Quenneville
, and
T. J.
MartÍnez
, “
Conical intersections and double excitations in time-dependent density functional theory
,”
Mol. Phys.
104
,
1039
1051
(
2006
).

Supplementary Material