The frozen natural orbital (FNO) coupled-cluster method increases the speed of coupled-cluster (CC) calculations by an order of magnitude with no consequential error along a potential energy surface. This method allows the virtual space of a correlated calculation to be reduced by about half, significantly reducing the time spent performing the coupled-cluster (CC) calculation. This paper reports the derivation and implementation of analytical gradients for FNO-CC, including all orbital relaxation for both noncanonical and semicanonical perturbed orbitals. These derivatives introduce several new orbital relaxation contributions to the CC density matrices. and are applied to a test set of equilibrium structures, verifying that these methods are capable of reproducing geometries and vibrational frequencies accurately, as well as energies. Several decomposition pathways of nitroethane are investigated using and with 60% of the FNO virtual orbitals in a cc-pVTZ basis, and find differences on the order of with reordering of the transition state energies when compared to B3LYP .
I. INTRODUCTION
Prediction of structures of equilibria and transition states is among the most important tasks for computational chemistry. Coupled-cluster theory has shown itself to be well-suited for determining equilibrium structures, especially in the coupled-cluster singles, doubles, and perturbative triples form.1–3 A significant drawback of the coupled-cluster approach is the high computational scaling with respect to the size of the system. For , for example, the most expensive step is where is the number of occupied orbitals and is the number of unoccupied orbitals and is the total number of basis functions. It is well-known, however, that standard basis sets are not optimal; one can reduce the size of the virtual (unoccupied) space without adversely affecting the numerical results. In particular, in large basis sets there are combinations of virtual orbitals that do not contribute significantly to the coupled-cluster (CC) energy. One can, therefore, reduce the computational cost by identifying and removing these irrelevant functions from the basis set. There is a long history of trying to generate such spaces for configuration interaction4–11 and many-body perturbation theory12–14 (MBPT) and, more recently for coupled-cluster theory.15–22 Perhaps the most powerful method of doing so is to use frozen natural orbitals (FNOs).18,20,23–28 These orbitals use information from an approximate one-particle reduced density matrix to choose the best subset of one-particle orbitals within which to perform a correlated calculation. When using FNOs based on the MBPT(2) density matrix,29,30 this truncated orbital set has been shown to be surprisingly effective at truncating larger basis sets, allowing of a modified unoccupied orbital set to be removed without significant changes to ground state CC energies and density matrices.18–20 The FNO procedure could be combined with further reductions in the underlying contracted Gaussian basis31–33 for further savings.
The frozen natural orbitals are able to achieve this speed-up by tailoring the basis set to a molecule at a particular geometry. Therefore, when one wants to calculate forces there is a nontrivial component due to the changes in the underlying structure of the frozen natural orbitals. To account for these changes, one must introduce a set of coupled-perturbed frozen natural orbital equations, similar to the coupled-perturbed Hartree-Fock (CPHF) equations, and rearrange the terms in a computationally efficient manner, to avoid calculating many perturbation-dependent quantities.
High-energy gas phase chemistry is an area that both tests and takes advantage of computational chemistry techniques. These reactions are especially susceptible to small errors in the treatment of correlation both in energetics and structures.34 To be predictive, one must use high-levels of correlation and large basis sets, which tax computational resources. Nitroethane is a prototype for the decomposition of the nitroalkane class of high-energy materials.35,36 When compared to the better studied nitromethane,37–39 the decomposition of nitroethane introduces one important additional pathway: The elimination of HONO.40,41 This pathway is apparently the kinetically favored one for thermal decomposition of nitroethane, and is important in the decomposition of more complicated materials, such as 1,3,5-trinitrohexahydro-1,3,5-triazine (RDX). The energetics of decomposition for nitroethane have been studied using the density functional theory (DFT) (using the B3LYP functional) (Ref. 42) but to be able to understand the relative importance of the various pathways with confidence high-level correlated calculations should be performed at the appropriately optimized stationary points. Nitroethane has five heavy atoms and, to use a large enough basis set to be definitive, one needs several hundred basis functions. To perform multiple optimizations and transition state searches with in that size of basis is a computational challenge; instead, we apply the FNO procedure, with analytical gradients to calculate the potential energy surface for nitroethane decomposition. We use both and the more recent ,39,43–45 which, by using information from the left-hand CCSD eigenvector, improves the description of bond breaking.
II. THEORY
A. Frozen natural orbital CC
The FNO-CC method has been summarized before in Ref. 20. Below, , , and indicate occupied orbitals, , , and indicate virtual orbitals, and , , and indicate arbitrary orbitals. A set of improved virtual orbitals is generated through a series of relatively simple operations. First, the conventional HF equations are solved and then the MBPT(2) density matrix is computed in the resulting virtual orbital space. This density matrix is defined as
where the denominator
is composed of diagonal Fock matrix elements . The density matrix is diagonalized yielding a set of natural orbitals whose occupied space has been frozen to the original Hartree-Fock orbitals (hence the name “frozen natural orbitals”). Associated with each natural orbital is an approximate occupation number. Then, based on keeping those orbitals with highest occupation, the virtual space is partitioned into two subspaces: A set of kept orbitals and a set of dropped orbitals. The Fock matrix is formed in these new orbitals, and is separately diagonalized in each of the two subspaces. Therefore, at the end of the process, one has three sets of orbitals: Canonical Hartree-Fock occupied orbitals, a kept set of Hartree-Fock virtual orbitals that are canonical among themselves, and a dropped set of Hartree-Fock virtual orbitals that are also canonical among themselves. Orbitals in the original Hartree-Fock basis will be uncapitalized, while orbitals after the FNO transformation will be denoted by capitals. Kept virtuals are indicated by , , and , dropped virtuals are , , and , and arbitrary virtuals (the union of the kept and dropped orbitals) are , , and .
Hartree-Fock orbitals in a given basis set are completely defined by the following Brillouin condition:
Similarly, at the end of the FNO procedure, the additional “Brillouin” condition is
This equation is similar to the Hartree-Fock condition for noncanonical orbitals as the off-diagonal elements and can be nonzero. The combination of the Brillouin condition and the equivalent FNO condition will be used to calculate gradients.
The overall set of FNO molecular orbital coefficients can be expressed in the following series of equalities. The elements are the overall transformations, while is the original Hartree-Fock transformation and is the additional FNO transformation in the virtual space, where are atomic orbitals,
B. Gradients
A general expression for CC gradients is46–48
where , , , and run over all correlated orbitals. and are the one- and two-particle coupled-cluster response density matrices in the active space. The derivatives and are total derivatives of the molecular orbital Fock operator and two-electron integrals. These total derivatives can be separated into a piece due to the atomic orbitals and a piece due to the molecular orbital coefficients; to calculate the gradient efficiently, it is necessary to distinguish between these two.
The derivative of an active FNO orbital with respect to an external perturbation is
Focusing on the first (molecular orbital) term one can parametrize the response as
The coefficient is the equivalent of a coupled-perturbed Hartree-Fock coefficient for the FNOs, which we will refer to as a CPFNO coefficient. It is important to note that the CPFNO coefficients have contributions from all orbitals, including those dropped during the FNO procedure.
The atomic orbital piece from Eq. (8) contributes to several perturbed integrals that are transformed into the FNO basis (here and below we are assuming real orbitals),
where the atomic orbital part of the Fock matrix derivative is
Then the full partial derivatives of the Fock matrix and the two-electron integrals can be written as
Substituting these definitions into Eq. (7),
where the intermediate matrix is
with if is not an occupied orbital and if is occupied. Unlike the energy, the derivative depends on orbitals that are dropped by the FNO procedure due to the presence of terms such as and .
By requiring orthonormality of the perturbed orbitals, the CPFNO coefficients satisfy
The perturbed integral is known, so one can solve for ,
Therefore, there are only independent equations for . Expanding the last term of Eq. (16),
where
Now, we must address how to calculate the CPFNO coefficients. As is the case for CPHF,49 in CC theory their direct calculation is avoided by using the Dalgarno-Stewart interchange theorem,50,51 and sometimes called the -vector method for CPHF.52,53 The governing equations of the FNOs are those expressed in Eqs. (3) and (4). Differentiating these equations, one has the requirements that
This choice then can be inserted into Eq. (26d), yielding
The definition of will be dependent on choosing canonical or noncanonical perturbed orbitals. Satisfying the Hartree-Fock perturbed Brillouin conditions is unchanged due to the FNO procedure, because the FNOs are still a (noncanonical) set of Hartree-Fock orbitals. Because the Brillouin condition is still satisfied, we do not need to include single excitation contributions to the MBPT(2) density matrix. On the other hand, focusing on the perturbed density matrix reveals some additional complexities.
The form of the density matrix illustrated in Eq. (1) only holds for canonical Hartree-Fock orbitals. Therefore, to directly use that equation to derive a perturbed density matrix, as is needed to impose the CPFNO condition [Eq. (23b)] one needs to require canonical perturbed underlying. Hartree-Fock orbitals. This restriction would necessitate solving the CPHF equations before constructing the perturbed density matrix for each perturbation. We instead construct the response independently of solving the CPHF coefficients by first working in the original Hartree-Fock basis (that defined by the coefficients ). The general form of the MBPT(2) density matrix for noncanonical Hartree-Fock orbitals is
where the first-order amplitudes satisfy the following equation:
with where interchanges orbitals and . In the case that the Hartree-Fock orbitals are canonical, the last two terms vanish, allowing the following solution:
which, when inserted into the general expression for the density matrix, returns the original result from Eq. (1). Introducing an external perturbation yields
where the perturbed amplitudes are defined by the following perturbed amplitude equation:54
Using the fact that the underlying unperturbed orbitals are canonical (even if the perturbed orbitals are not), one can simplify the amplitude equation to
The unperturbed amplitudes are known (since they correspond to canonical orbitals), leading to the following final expression:
This set of equations depends on the occupied-virtual block of CPHF coefficients through the perturbed two-electron integrals and the perturbed Fock operator, as can be seen by expanding the full perturbed density matrix,
Expanding the integral derivatives,
where the intermediate quantities and are defined in Table I and the perturbed density matrix is defined as
This term will be further discussed below.
These equations have been derived in the original Hartree-Fock basis, but the CPFNO equation is in the FNO basis. To transform the results, one uses
Differentiating this expression,
The perturbed FNO density can therefore be written, using and the relationship expressed in Eq. (30),
where the quantities and have been transformed to the FNO basis. The perturbed quantities have therefore been completely separated from the CPFNO coefficients, which allows for a perturbation-independent solution of the CPFNO equations. To go further, one must choose between noncanonical and canonical perturbed FNOs.
1. Noncanonical perturbed orbitals
For the choice of noncanonical perturbed orbitals, we are free to define
Then CPFNO equations in matrix form are
where
and the elements of matrix are in Table II.
The interchange theorem can be written
Therefore, one can solve the perturbation-independent equation below, instead of Eq. (45),
The second of these equations, which determines the orbital response of the uncorrelated molecular orbitals, can be solved independently of the first, using a standard linear equation solver. Substituting this result into the equation for the virtual-occupied block of the orbital response contribution to the density matrix,
This modified -vector equation can then be solved by the standard method.52 Using these orbital relaxation components, one can form the final full density matrices,
This object can now be contracted with the perturbation independent pieces of ,
This term can only be formed in the Hartree-Fock basis, leading to the following final expression for noncanonical perturbed orbital gradients:
Because of the extra term, a separate back-transformation is necessary to write the term in the atomic orbital basis before contraction with the derivative integrals.
2. Canonical gradients
In the case of , it is highly advantageous to impose the condition that the perturbed orbitals remain semicanonical.2,55 When frozen occupied or virtuals are used, the derivative is calculated using canonical perturbed orbitals as well.48 (This condition is actually more stringent than strictly necessary; as long as mixing occurs only within the frozen and active subsets of orbitals, they do not need to be maintained canonical.) Therefore, one must formulate the FNO orbital relaxation terms in semicanonical orbitals. Supplementing the conventional Brillouin condition (and the FNO condition) are the requirements that
There is no need to impose canonicality on the uncorrelated orbitals because the computational advantage lies in determining the CC contribution to the density matrices, which does not involve the uncorrelated orbitals. By imposing this requirement one can no longer choose and . However, one can choose , since the dropped virtuals can be noncanonical. Therefore, no iterative equations have to be solved in the uncorrelated-uncorrelated sector.
The new CPHF equations can be written in matrix form as follows:
The right-hand side of this equation is given by
The matrix elements of are given in Table III.
Solving the linear equation
yields the orbital response contribution to the overall density matrix. In this form, it is obvious that the solution for the active virtual–active virtual block does not couple to the other blocks, yielding
The FNO block can then be determined by inserting the new orbital relaxation terms from the active virtual block,
Furthermore, after solving for the inactive-active virtual block, the orbital response for the occupied-occupied block can be solved,
Finally, the response of the occupied-occupied block and the virtual-virtual blocks can be inserted into the equation for the occupied-virtual block,
This equation now fits the standard form of the -vector equations.
After solving for all of the orbital response components of the density matrices, one can define the full, relaxed, density matrices via
A summary of the steps necessary to calculate a derivative using the FNOs is shown in Fig. 1.
C. Smoothness of the potential energy surface
The FNO procedure developed here will not necessarily yield rigorously smooth potential energy surfaces (PESs). Note that the FNO truncation is performed point by point on the PES, without consideration of the connection between that point and other points on the potential energy surface. Therefore, if the structure or size of the space spanned by the correlated set of virtual orbitals changes as a function of the geometry, it is possible that the energy could change discontinuously.
To minimize the impact of discontinuities, the code recognizes orbitals that are close in occupation to the correlated orbitals. Those within a certain tolerance of the cutoff occupation are considered to be quasidegenerate and are retained. Assuming that the geometry steps are not too large, this procedure should smooth changes in the FNO structure. It should be clear that this problem is not unique to the frozen natural orbital truncation procedure, but exists for all procedures (such as localized orbitals methods) that truncate the correlation space in a geometry dependent way.56–58 To the best of the authors’ knowledge, there is no fully satisfactory solution to this problem.
III. IMPLEMENTATION
The FNO-CC gradients have been implemented within the ACES II program system.59 It takes advantage of real Abelian point group symmetry, and all equations are fully spin-summed and applicable to closed- or open-shells using single determinant relativistic Hartree-Fock (RHF) or spin-polarized unrestricted Hartree-Fock (UHF) references. In a FNO energy calculation, a partial integral transformation is performed before the FNO truncation, and then a full integral transformation is performed in the resultant truncated basis. This computational advantage is unachievable for gradients; instead, one must perform a full integral transformation for both the truncated and full basis sets, requiring the storage of more integrals. The correlated calculations are then performed within the truncated basis. The formation of the density matrices proceeds in two parts: First, the correlated contributions are formed within the truncated basis, then these density matrices are expanded to the full basis, and the orbital relaxation terms are calculated and included. One calculates the and in the HF basis and then stores them in the truncated FNO basis, so that they can be added to the orbital relaxation equations. The back-transformation of the FNO density matrix and the term are performed separately. These terms are then summed before contraction with derivative integrals.
Compared to gradient calculations that do not use FNOs, the largest added expense is the necessity of calculating and storing several new intermediates of a dimension similar to that of the two-electron integrals. The computational cost is far less than the cost of the CC procedure, though, and does not change the overall scaling of the coupled-cluster, but, instead reduces its cost in applications. However, the additional storage costs could be problematic for some combinations of computer and molecule.
All gradient calculations were verified by comparing the analytical gradient expression to those obtained by numerical differentiation of the energy.
IV. RESULTS AND DISCUSSION
A. Calibration
To determine the capability of FNO truncated gradients to reproduce structures, we applied FNO and to the set of well-characterized molecules from Bak et al.60 Comparative statistics are shown in Tables IV–VII. We have chosen to show the dependence of geometrical properties versus the percentage of the virtual space retained in the truncated calculations. In some ways, this way of choosing a truncation is unsatisfying; it would be better if one were able to examine the MBPT(2) occupation numbers, and then chose proper cutoffs based on these values. However, while we have looked into this issue, we have not been able to determine any consistent truncation criterion: The occupation numbers go smoothly from high to low occupation, without any sharp changes that would indicate a place to truncate. Because the goal of the method is to reduce the computational cost of the calculation, at this point it seems better to use a truncation scheme where the speed-up can be predicted, even if it is less satisfying theoretically.
Comparison of optimized equilibrium bond lengths for different correlation-consistent basis sets Refs. 65 and 66 for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Averages were calculated over the set of molecules from Ref. 60. Only valence electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of pm.
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.02 | 0.66 | 2.30 | 0.80 | 0.05 | 0.63 | 1.93 | 0.74 |
40 | 0.54 | 1.73 | 0.70 | 0.54 | 1.85 | 0.71 | ||
60 | 0.54 | 2.43 | 0.63 | 0.52 | 2.3 | 0.60 | ||
80 | 0.30 | 3.45 | 0.51 | 0.29 | 3.22 | 0.65 | ||
100a | 1.72 | 1.72 | 4.51 | 0.82 | 1.69 | 1.69 | 4.12 | 0.76 |
cc-pVTZ | ||||||||
20 | 0.30 | 0.64 | 5.50 | 1.21 | 0.32 | 0.63 | 5.47 | 1.20 |
40 | 0.15 | 0.21 | 0.87 | 0.25 | 0.16 | 0.22 | 0.88 | 0.24 |
60 | 0.14 | 0.57 | 0.19 | 0.14 | 0.53 | 0.19 | ||
80 | 0.09 | 0.45 | 0.13 | 0.09 | 0.41 | 0.13 | ||
100a | 0.05 | 0.22 | 0.90 | 0.29 | 0.02 | 0.22 | 0.71 | 0.27 |
cc-pVQZ | ||||||||
20 | 0.05 | 0.18 | 0.68 | 0.24 | 0.07 | 0.18 | 0.77 | 0.23 |
40 | 0.18 | 1.11 | 0.27 | 0.17 | 1.01 | 0.26 | ||
60 | 0.00 | 0.05 | 0.26 | 0.07 | 0.02 | 0.05 | 0.26 | 0.07 |
80 | 0.00 | 0.03 | 0.10 | 0.04 | 0.00 | 0.02 | 0.10 | 0.03 |
100a | 0.13 | 0.71 | 0.19 | 0.14 | 0.71 | 0.19 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.02 | 0.66 | 2.30 | 0.80 | 0.05 | 0.63 | 1.93 | 0.74 |
40 | 0.54 | 1.73 | 0.70 | 0.54 | 1.85 | 0.71 | ||
60 | 0.54 | 2.43 | 0.63 | 0.52 | 2.3 | 0.60 | ||
80 | 0.30 | 3.45 | 0.51 | 0.29 | 3.22 | 0.65 | ||
100a | 1.72 | 1.72 | 4.51 | 0.82 | 1.69 | 1.69 | 4.12 | 0.76 |
cc-pVTZ | ||||||||
20 | 0.30 | 0.64 | 5.50 | 1.21 | 0.32 | 0.63 | 5.47 | 1.20 |
40 | 0.15 | 0.21 | 0.87 | 0.25 | 0.16 | 0.22 | 0.88 | 0.24 |
60 | 0.14 | 0.57 | 0.19 | 0.14 | 0.53 | 0.19 | ||
80 | 0.09 | 0.45 | 0.13 | 0.09 | 0.41 | 0.13 | ||
100a | 0.05 | 0.22 | 0.90 | 0.29 | 0.02 | 0.22 | 0.71 | 0.27 |
cc-pVQZ | ||||||||
20 | 0.05 | 0.18 | 0.68 | 0.24 | 0.07 | 0.18 | 0.77 | 0.23 |
40 | 0.18 | 1.11 | 0.27 | 0.17 | 1.01 | 0.26 | ||
60 | 0.00 | 0.05 | 0.26 | 0.07 | 0.02 | 0.05 | 0.26 | 0.07 |
80 | 0.00 | 0.03 | 0.10 | 0.04 | 0.00 | 0.02 | 0.10 | 0.03 |
100a | 0.13 | 0.71 | 0.19 | 0.14 | 0.71 | 0.19 |
Relative to experiment.
Comparison of optimized equilibrium bond lengths for different correlation-consistent basis sets Refs. 65 and 66 for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Averages were calculated over the set of molecules from Ref. 60. All electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of pm.
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.66 | 2.21 | 0.81 | 0.62 | 1.84 | 0.74 | ||
40 | 0.63 | 2.96 | 0.73 | 0.60 | 2.86 | 0.69 | ||
60 | 0.33 | 2.34 | 0.49 | 0.31 | 2.14 | 0.45 | ||
80 | 0.17 | 0.99 | 0.24 | 0.17 | 0.98 | 0.24 | ||
100a | 1.66 | 1.66 | 4.42 | 0.80 | 1.63 | 1.63 | 4.03 | 0.74 |
cc-pVTZ | ||||||||
20 | 0.26 | 2.37 | 0.50 | 0.25 | 2.15 | 0.47 | ||
40 | 0.32 | 0.92 | 0.34 | 0.31 | 0.91 | 0.33 | ||
60 | 0.33 | 1.34 | 0.43 | 0.32 | 1.34 | 0.42 | ||
80 | 0.11 | 0.33 | 0.12 | 0.11 | 0.33 | 0.12 | ||
100a | 0.19 | 0.26 | 1.04 | 0.28 | 0.15 | 0.25 | 0.86 | 0.27 |
cc-pVQZ | ||||||||
20 | 0.25 | 0.71 | 0.23 | 0.25 | 0.70 | 0.22 | ||
40 | 0.07 | 0.33 | 0.10 | 0.08 | 0.30 | 0.09 | ||
60 | 0.09 | 0.52 | 0.13 | 0.09 | 0.49 | 0.12 | ||
80 | 0.00 | 0.01 | 0.05 | 0.02 | 0.00 | 0.01 | 0.05 | 0.02 |
100a | 0.09 | 0.63 | 0.17 | 0.10 | 0.64 | 0.18 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.66 | 2.21 | 0.81 | 0.62 | 1.84 | 0.74 | ||
40 | 0.63 | 2.96 | 0.73 | 0.60 | 2.86 | 0.69 | ||
60 | 0.33 | 2.34 | 0.49 | 0.31 | 2.14 | 0.45 | ||
80 | 0.17 | 0.99 | 0.24 | 0.17 | 0.98 | 0.24 | ||
100a | 1.66 | 1.66 | 4.42 | 0.80 | 1.63 | 1.63 | 4.03 | 0.74 |
cc-pVTZ | ||||||||
20 | 0.26 | 2.37 | 0.50 | 0.25 | 2.15 | 0.47 | ||
40 | 0.32 | 0.92 | 0.34 | 0.31 | 0.91 | 0.33 | ||
60 | 0.33 | 1.34 | 0.43 | 0.32 | 1.34 | 0.42 | ||
80 | 0.11 | 0.33 | 0.12 | 0.11 | 0.33 | 0.12 | ||
100a | 0.19 | 0.26 | 1.04 | 0.28 | 0.15 | 0.25 | 0.86 | 0.27 |
cc-pVQZ | ||||||||
20 | 0.25 | 0.71 | 0.23 | 0.25 | 0.70 | 0.22 | ||
40 | 0.07 | 0.33 | 0.10 | 0.08 | 0.30 | 0.09 | ||
60 | 0.09 | 0.52 | 0.13 | 0.09 | 0.49 | 0.12 | ||
80 | 0.00 | 0.01 | 0.05 | 0.02 | 0.00 | 0.01 | 0.05 | 0.02 |
100a | 0.09 | 0.63 | 0.17 | 0.10 | 0.64 | 0.18 |
Relative to experiment.
Comparison of optimized equilibrium bond angles for different correlation-consistent basis sets (Refs. 65 and 66) for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Averages were calculated over the set of molecules from Ref. 60. Only valence electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of degrees.
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.18 | 0.34 | 0.66 | 0.36 | 0.20 | 0.29 | 0.64 | 0.30 |
40 | 0.64 | 0.77 | 1.36 | 0.61 | 0.62 | 0.76 | 1.36 | 0.63 |
60 | 0.53 | 0.60 | 1.00 | 0.46 | 0.51 | 0.58 | 0.96 | 0.46 |
80 | 0.13 | 0.17 | 0.49 | 0.21 | 0.12 | 0.16 | 0.47 | 0.21 |
100a | 1.99 | 4.97 | 1.53 | 1.97 | 4.92 | 1.52 | ||
cc-pVTZ | ||||||||
20 | 0.10 | 0.36 | 0.89 | 0.46 | 0.09 | 0.34 | 0.85 | 0.45 |
40 | 0.01 | 0.19 | 0.47 | 0.27 | 0.01 | 0.18 | 0.47 | 0.27 |
60 | 0.12 | 0.19 | 0.41 | 0.22 | 0.12 | 0.19 | 0.41 | 0.22 |
80 | 0.07 | 0.10 | 0.27 | 0.11 | 0.07 | 0.10 | 0.28 | 0.11 |
100a | 0.91 | 4.26 | 1.31 | 0.89 | 4.20 | 1.29 | ||
cc-pVQZ | ||||||||
20 | 0.31 | 0.68 | 0.40 | 0.30 | 0.65 | 0.40 | ||
40 | 0.08 | 0.18 | 0.40 | 0.23 | 0.08 | 0.18 | 0.41 | 0.23 |
60 | 0.04 | 0.07 | 0.14 | 0.08 | 0.04 | 0.07 | 0.13 | 0.08 |
80 | 0.01 | 0.02 | 0.09 | 0.03 | 0.01 | 0.02 | 0.09 | 0.03 |
100a | 0.69 | 3.90 | 1.23 | 0.68 | 3.83 | 1.21 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pVDZ | ||||||||
20 | 0.18 | 0.34 | 0.66 | 0.36 | 0.20 | 0.29 | 0.64 | 0.30 |
40 | 0.64 | 0.77 | 1.36 | 0.61 | 0.62 | 0.76 | 1.36 | 0.63 |
60 | 0.53 | 0.60 | 1.00 | 0.46 | 0.51 | 0.58 | 0.96 | 0.46 |
80 | 0.13 | 0.17 | 0.49 | 0.21 | 0.12 | 0.16 | 0.47 | 0.21 |
100a | 1.99 | 4.97 | 1.53 | 1.97 | 4.92 | 1.52 | ||
cc-pVTZ | ||||||||
20 | 0.10 | 0.36 | 0.89 | 0.46 | 0.09 | 0.34 | 0.85 | 0.45 |
40 | 0.01 | 0.19 | 0.47 | 0.27 | 0.01 | 0.18 | 0.47 | 0.27 |
60 | 0.12 | 0.19 | 0.41 | 0.22 | 0.12 | 0.19 | 0.41 | 0.22 |
80 | 0.07 | 0.10 | 0.27 | 0.11 | 0.07 | 0.10 | 0.28 | 0.11 |
100a | 0.91 | 4.26 | 1.31 | 0.89 | 4.20 | 1.29 | ||
cc-pVQZ | ||||||||
20 | 0.31 | 0.68 | 0.40 | 0.30 | 0.65 | 0.40 | ||
40 | 0.08 | 0.18 | 0.40 | 0.23 | 0.08 | 0.18 | 0.41 | 0.23 |
60 | 0.04 | 0.07 | 0.14 | 0.08 | 0.04 | 0.07 | 0.13 | 0.08 |
80 | 0.01 | 0.02 | 0.09 | 0.03 | 0.01 | 0.02 | 0.09 | 0.03 |
100a | 0.69 | 3.90 | 1.23 | 0.68 | 3.83 | 1.21 |
Relative to experiment.
Comparison of optimized equilibrium bond angles for different correlation-consistent basis sets (Refs. 65 and 66) for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Averages were calculated over the set of molecules from Ref. 60. All electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of degrees.
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pCVDZ | ||||||||
20 | 0.39 | 0.40 | 1.06 | 0.41 | 0.38 | 0.38 | 1.01 | 0.39 |
40 | 0.43 | 0.61 | 1.26 | 0.65 | 0.42 | 0.60 | 1.25 | 0.65 |
60 | 0.37 | 0.37 | 0.65 | 0.18 | 0.37 | 0.37 | 0.66 | 0.18 |
80 | 0.10 | 0.24 | 0.99 | 0.41 | 0.10 | 0.24 | 0.99 | 0.41 |
100a | 1.99 | 4.97 | 1.53 | 1.98 | 4.93 | 1.52 | ||
cc-pCVTZ | ||||||||
20 | 0.17 | 0.29 | 0.72 | 0.34 | 0.16 | 0.29 | 0.73 | 0.34 |
40 | 0.38 | 0.49 | 1.02 | 0.46 | 0.38 | 0.50 | 1.02 | 0.47 |
60 | 0.32 | 0.37 | 1.28 | 0.44 | 0.32 | 0.37 | 1.28 | 0.44 |
80 | 0.20 | 0.53 | 0.26 | 0.20 | 0.52 | 0.26 | ||
100a | 1.01 | 4.26 | 1.28 | 1.00 | 4.20 | 1.26 | ||
cc-pCVQZ | ||||||||
20 | 0.02 | 0.28 | 0.63 | 0.37 | 0.01 | 0.27 | 0.62 | 0.36 |
40 | 0.09 | 0.16 | 0.39 | 0.21 | 0.09 | 0.16 | 0.39 | 0.21 |
60 | 0.07 | 0.11 | 0.28 | 0.12 | 0.07 | 0.10 | 0.27 | 0.12 |
80 | 0.01 | 0.04 | 0.02 | 0.03 | 0.10 | 0.04 | ||
100a | 0.70 | 3.92 | 1.23 | 0.69 | 3.84 | 1.21 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
cc-pCVDZ | ||||||||
20 | 0.39 | 0.40 | 1.06 | 0.41 | 0.38 | 0.38 | 1.01 | 0.39 |
40 | 0.43 | 0.61 | 1.26 | 0.65 | 0.42 | 0.60 | 1.25 | 0.65 |
60 | 0.37 | 0.37 | 0.65 | 0.18 | 0.37 | 0.37 | 0.66 | 0.18 |
80 | 0.10 | 0.24 | 0.99 | 0.41 | 0.10 | 0.24 | 0.99 | 0.41 |
100a | 1.99 | 4.97 | 1.53 | 1.98 | 4.93 | 1.52 | ||
cc-pCVTZ | ||||||||
20 | 0.17 | 0.29 | 0.72 | 0.34 | 0.16 | 0.29 | 0.73 | 0.34 |
40 | 0.38 | 0.49 | 1.02 | 0.46 | 0.38 | 0.50 | 1.02 | 0.47 |
60 | 0.32 | 0.37 | 1.28 | 0.44 | 0.32 | 0.37 | 1.28 | 0.44 |
80 | 0.20 | 0.53 | 0.26 | 0.20 | 0.52 | 0.26 | ||
100a | 1.01 | 4.26 | 1.28 | 1.00 | 4.20 | 1.26 | ||
cc-pCVQZ | ||||||||
20 | 0.02 | 0.28 | 0.63 | 0.37 | 0.01 | 0.27 | 0.62 | 0.36 |
40 | 0.09 | 0.16 | 0.39 | 0.21 | 0.09 | 0.16 | 0.39 | 0.21 |
60 | 0.07 | 0.11 | 0.28 | 0.12 | 0.07 | 0.10 | 0.27 | 0.12 |
80 | 0.01 | 0.04 | 0.02 | 0.03 | 0.10 | 0.04 | ||
100a | 0.70 | 3.92 | 1.23 | 0.69 | 3.84 | 1.21 |
Relative to experiment.
One immediate conclusion is that the FNO convergence behavior is identical for both and . Mean absolute errors (probably the best single measure of the results) are almost identical, especially for larger basis sets. The convergence with respect to truncation of the FNO geometries is not monotonic; while there is generally a trend that less truncation leads to better , there are exceptions. Even more dramatic are the maximum errors, which do not show a clear convergence behavior. These results are not necessarily surprising. Especially for the double- basis sets, the truncated basis sets can become so small that one cannot consider them meaningful points for extrapolation of the convergence behavior. Unlike the convergence of the energy, the convergence of geometric properties will tend to be less clear-cut: Optimized geometries are dependent not just on the energy at a point, but rather the relative energy at a point to the points around it. There is, therefore, a delicate balance to the best choice of basis and method for geometry prediction, leading to more complicated convergence behavior.
Examining the tables of bond angles (Tables VI and VII), it is clear that both methods underestimate bond angles, even with all the different truncations. Importantly, the FNO truncations do not significantly affect the standard deviations of the geometries.
To better understand the convergence behavior of the FNOs, the mean absolute errors from experiment for this data set are plotted in Figs. 2–5. In these plots, each point represents retaining 20%, 40%, 60%, 80%, or 100% of the virtual space of the corresponding basis. On the horizontal axis is a measure of the relative size of the truncated basis set as compared to the largest basis in the calculation (100% quadruple-). For bond lengths, all choices of basis sets 60% triple- or larger perform similarly. The picture in the bond angle plots is more mixed, with full convergence not achieved until 40% of the quadruple- basis, though the 60% triple- basis performs quite well. Double- basis sets are inadequate at every truncation. These plots provide a guide for the choice of an optimal basis set of a given size. For example, 20% of a cc-pVQZ basis or 40% of a cc-pVTZ basis yield results that are approximately the same for bond lengths (as shown in Fig. 2) and have the same cost (at the correlated level) as the inferior untruncated cc-pVDZ basis set.
Mean deviation from experiment for bond lengths (in pm) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pVQZ. Only valence orbitals were correlated.
Mean deviation from experiment for bond lengths (in pm) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pVQZ. Only valence orbitals were correlated.
Mean deviation from experiment for bond lengths (in pm) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent core-valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pCVQZ. All electrons were correlated.
Mean deviation from experiment for bond lengths (in pm) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent core-valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pCVQZ. All electrons were correlated.
Mean deviation from experiment for bond angles (in degrees) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pVQZ. Only valence orbitals were correlated.
Mean deviation from experiment for bond angles (in degrees) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pVQZ. Only valence orbitals were correlated.
Mean deviation from experiment for bond angles (in degrees) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent core-valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pCVQZ. All electrons were correlated.
Mean deviation from experiment for bond angles (in degrees) for the equilibrium geometries of the set of molecules from Ref. 60 as a function of correlation-consistent core-valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis cc-pCVQZ. All electrons were correlated.
The tables provide the required information to rezero the results to those in the complete basis, to isolate the FNO effect from the experimental values. Such plots would reach zero deviation much more rapidly as the tables show.
Even more sensitive to the electron structure method than geometries are vibrational frequencies. In Tables VIII and IX the data for vibrational frequencies as compared to the untruncated basis set results for aug-cc-pVDZ and aug-cc-pVTZ basis sets are shown. For the closed-shell molecules used to calculate the averages in Table VIII, the mean absolute errors are acceptable for basis set truncations of 40% or more, with mean errors of or less. This stands in stark contrast to the open-shell results in Table IX, where 80% of the given basis sets are required to reproduce the untruncated results. For the open-shell molecules, we use UHF reference functions because we have not yet implemented FNOs for restricted open-shell Hartree-Fock (ROHF) reference functions. For cyanide radical, it is known that a ROHF reference function provides significantly better results than UHF (Ref. 61) for perturbation theory, which may be skewing the averages. However, even the results show more dependence than the closed-shell molecules. It is possible that the UHF reference function is the source of this discrepancy, but further work is necessary to verify that conjecture.
Comparison of vibrational frequencies for the selected closed-shell molecules (Ref. 67), (Ref. 68), (Refs. 69 and 70), and (Ref. 71) at equilibrium with different augmented correlation-consistent basis sets (Refs. 65, 66, and 72) for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Only valence electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of .
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
aug-cc-pVDZ | ||||||||
20 | 27 | 86 | 37 | 27 | 86 | 37 | ||
40 | 10 | 37 | 10 | 10 | 36 | 10 | ||
60 | 7 | 34 | 8 | 7 | 34 | 8 | ||
80 | 1 | 4 | 1 | 1 | 4 | 1 | ||
100a | 18 | 26 | 70 | 27 | 17 | 25 | 70 | 27 |
aug-cc-pVTZ | ||||||||
20 | 18 | 54 | 22 | 18 | 53 | 22 | ||
40 | 8 | 20 | 8 | 8 | 20 | 7 | ||
60 | 9 | 48 | 15 | 9 | 48 | 15 | ||
80 | 1 | 5 | 1 | 1 | 5 | 1 | ||
100a | 4 | 18 | 49 | 25 | 3 | 17 | 48 | 25 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
aug-cc-pVDZ | ||||||||
20 | 27 | 86 | 37 | 27 | 86 | 37 | ||
40 | 10 | 37 | 10 | 10 | 36 | 10 | ||
60 | 7 | 34 | 8 | 7 | 34 | 8 | ||
80 | 1 | 4 | 1 | 1 | 4 | 1 | ||
100a | 18 | 26 | 70 | 27 | 17 | 25 | 70 | 27 |
aug-cc-pVTZ | ||||||||
20 | 18 | 54 | 22 | 18 | 53 | 22 | ||
40 | 8 | 20 | 8 | 8 | 20 | 7 | ||
60 | 9 | 48 | 15 | 9 | 48 | 15 | ||
80 | 1 | 5 | 1 | 1 | 5 | 1 | ||
100a | 4 | 18 | 49 | 25 | 3 | 17 | 48 | 25 |
Relative to experiment.
Comparison of vibrational frequencies for the selected open-shell radicals CN (Ref. 73) and (Ref. 74) at equilibrium with different augmented correlation-consistent basis sets (Refs. 65, 66, and 72) for multiple FNO truncations for and . The percentage indicates what percent of the virtual space of each molecule was active. For truncated basis sets (20%–80%) errors are relative to the untruncated basis set result; for 100%, errors are relative to experiment. Only valence electrons were correlated. is the (signed) mean error, is the mean absolute error, is the maximum absolute error, and is the standard deviation. All numbers are in units of .
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
aug-cc-pVDZ | ||||||||
20 | 99 | 280 | 122 | 105 | 309 | 137 | ||
40 | 30 | 101 | 48 | 27 | 91 | 43 | ||
60 | 16 | 31 | 10 | 17 | 34 | 12 | ||
80 | 3 | 8 | 3 | 1 | 2 | 1 | ||
100a | 69 | 121 | 52 | 76 | 122 | 44 | ||
aug-cc-pVTZ | ||||||||
20 | 47 | 126 | 63 | 34 | 76 | 41 | ||
40 | 17 | 45 | 20 | 20 | 58 | 27 | ||
60 | 12 | 30 | 12 | 13 | 33 | 15 | ||
80 | 3 | 6 | 4 | 8 | 30 | 15 | ||
100a | 105 | 171 | 52 | 114 | 173 | 51 |
Basis set(%) . | . | . | ||||||
---|---|---|---|---|---|---|---|---|
. | . | . | . | . | . | . | . | |
aug-cc-pVDZ | ||||||||
20 | 99 | 280 | 122 | 105 | 309 | 137 | ||
40 | 30 | 101 | 48 | 27 | 91 | 43 | ||
60 | 16 | 31 | 10 | 17 | 34 | 12 | ||
80 | 3 | 8 | 3 | 1 | 2 | 1 | ||
100a | 69 | 121 | 52 | 76 | 122 | 44 | ||
aug-cc-pVTZ | ||||||||
20 | 47 | 126 | 63 | 34 | 76 | 41 | ||
40 | 17 | 45 | 20 | 20 | 58 | 27 | ||
60 | 12 | 30 | 12 | 13 | 33 | 15 | ||
80 | 3 | 6 | 4 | 8 | 30 | 15 | ||
100a | 105 | 171 | 52 | 114 | 173 | 51 |
Relative to experiment.
Deviations from experiment for these sets of molecules are shown in Figs. 6 and 7. More so than the geometries, the deviations in the vibrational frequencies are nonuniform, with different percentages exhibiting radically different agreements with experiment. In the open-shell set, what is immediately clear is that the results agree much more poorly (at all basis sets sizes) with experiment than the closed-shell set. One surprising feature of the open-shell figure is that the augmented double- basis set results are significantly better than the triple- results. This behavior holds for all FNO truncations maintaining more than 40% of the basis set.
Mean deviation from experiment for vibrational frequencies (in ) for the equilibrium geometries of the closed-shell molecules , , , and as a function of augmented correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis aug-cc-pVTZ. Only valence orbitals are correlated.
Mean deviation from experiment for vibrational frequencies (in ) for the equilibrium geometries of the closed-shell molecules , , , and as a function of augmented correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis aug-cc-pVTZ. Only valence orbitals are correlated.
Mean deviation from experiment for vibrational frequencies (in ) for the equilibrium geometries of the open-shell molecules CN and as a function of augmented correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis aug-cc-pVTZ. Only valence orbitals are correlated.
Mean deviation from experiment for vibrational frequencies (in ) for the equilibrium geometries of the open-shell molecules CN and as a function of augmented correlation-consistent valence basis set and FNO truncation for and . The horizontal axis is the average number of virtual basis functions as a percentage of the virtual space of the largest basis aug-cc-pVTZ. Only valence orbitals are correlated.
These results are more difficult to interpret than those for geometries and energies. Especially when compared to experiment, the results are much less uniform and show more dependence on the degree of FNO truncation than other properties. This fact should not be surprising; a hessian depends more strongly on the energy differences around the equilibrium structure than does a (first) derivative. A note of caution: It is possible for the FNO procedure to show discontinuities in the vibrational frequencies. We did not see this appear in the results for the set of molecules used here, but in other cases small changes in the truncation level can lead to larger changes in the vibrational frequencies. This dependence illustrates the problem of local smoothness around any given point once the FNO procedure has been applied. We are currently looking more closely at these issues in an attempt to provide less truncation dependent vibrational frequencies for all molecules. There is also the issue of vibrational frequencies from atomic natural orbital basis sets62,63 compared to those from correlation-consistent ones, which will be considered in future work.
B. Nitroethane
The decomposition of nitroethane can occur via several different pathways.42 Schematics of the possible reaction paths are shown in Figs. 8–10. The numbering of transition states and intermediates corresponds to that used in Ref. 42. To sort out the relative importance of each of the individual pathways, all of the relevant species are optimized using FNO and FNO . In the main pathways, there is one reactant (nitroethane), five intermediates, ten transition states, and a total of twelve products. Each of these 28 critical points are fully optimized in a cc-pVTZ basis set with 60% of the virtual space kept using FNOs. This basis sets both performed well in the calibration tests and are small enough to allow the calculations to be completed using our computational resources. Calculations were performed both locally, on our SG1 Altix, as well as at Department of Defense Major Shared Resource Centers. The core occupied orbitals and corresponding core virtual orbitals are dropped as well. For nitroethane and its isomers, this yielded a total of 15 active occupied orbitals and 117 active virtual orbitals. The expected savings per geometry optimization step of each critical point, as compared to a full basis set calculation, is approximately 75%. RHF references are used for closed-shell species, and for open-shell species UHF references are used. At the optimized critical points, finite-difference Hessians are calculated to verify that the geometries did, in fact, correspond to either minima or first-order transition states, as well as to determine the vibrational frequencies, allowing zero-point energy (ZPE) corrections to be included.
Schematic of the one-step HONO elimination and direct fission pathways for decomposition of nitroethane. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the one-step HONO elimination and direct fission pathways for decomposition of nitroethane. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the decomposition pathway of nitroethane through isomerization to ethylnitrite. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the decomposition pathway of nitroethane through isomerization to ethylnitrite. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the decomposition pathway of nitroethane through isomerization to ethyl hydroxy nitroxide. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the decomposition pathway of nitroethane through isomerization to ethyl hydroxy nitroxide. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
The decomposition of nitroethane can be broken into four main classes of pathways: Direct fission of nitroethane to form ethyl radical and nitrogen dioxide (Fig. 8), single-step elimination of HONO (Fig. 8), isomerization to ethylnitrite (denoted INT3) and subsequent decomposition (Fig. 9), and isomerization to ethyl hydroxy nitroxide [CH3CHN(OH)O] (denoted INT5) and then further decomposition (Fig. 10). In the figures mentioned, we have used the notation from Ref. 42 for the intermediates, transition states, and some products (P4 and P8 are two cyclic isomers of nitroethane). When compared to nitromethane, analogies of each of these pathways exists—except for the HONO elimination. For the set of pathways beginning with isomerization to ethylnitrite, we focus on the mechanism that yields the lowest energy products . For isomerization through ethyl hydroxy nitroxide, we choose to focus on the thermodynamically minimum set of products , elimination of water. To provide an estimate of the importance of these different paths, in Fig. 11 we plot a qualitative picture of their relative energies.
Schematic of the most important pathways for each possible isomerization for the decomposition of nitroethane. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Schematic of the most important pathways for each possible isomerization for the decomposition of nitroethane. The vertical axis measures the ZPE corrected energies (in kcal/mol) relative to nitroethane from calculations with 60% of the virtual space of the cc-pVTZ basis set retained via FNOs.
Table X compares the results from B3LYP in a basis and the 60% FNO calculations with and at their respective optimized geometries. Focusing first on the B3LYP results from Ref. 42, the energy differences between the different pathways are relatively small. To appropriately model the kinetics of the decomposition of these reactions, it is important that the stationary point energies are converged with respect to electronic structure—small changes in barrier heights can lead to large differences in kinetics.
Relative energies of important stationary points for the decomposition of nitroethane in kcal/mol. All species are at their appropriately optimized structures and energies are relative to that of nitroethane including zero-point energy corrections. The B3LYP DFT results are from Ref. 42 and use the basis. Results for and are from this work using a cc-pVTZ basis set with 60% of the virtual orbitals kept by the FNO procedure. Only valence electrons were correlated. Species labels correspond to those in Figs. 8–10.
Species . | B3LYP . | . | . |
---|---|---|---|
Transition states | |||
TS5 | 42.11 | 48.29 | 48.32 |
TS6 | 59.40 | 64.83 | 64.94 |
TS8 | 35.35 | 38.07 | 38.46 |
TS9 | 52.50 | 57.62 | 57.72 |
TS10 | 63.08 | 60.15 | 60.81 |
TS11 | 64.74 | 67.27 | 67.42 |
TS12 | 55.93 | 67.10 | 68.32 |
TS13 | 61.41 | 60.48 | 63.30 |
TS14 | 31.83 | 31.07 | 31.31 |
TS15 | 70.41 | 66.20 | 68.26 |
Intermediates | |||
INT3 | 1.60 | ||
INT5 | 9.64 | 14.66 | 14.61 |
INT7 | |||
Products | |||
52.32 | 57.12 | 56.95 | |
15.62 | 18.35 | 18.26 | |
8.62 | 5.26 | 4.98 | |
1.57 | 1.34 | ||
36.22 | 34.53 | 34.00 | |
P4 | 54.40 | 53.88 | 54.61 |
P8 | 24.47 | 21.99 | 21.90 |
Species . | B3LYP . | . | . |
---|---|---|---|
Transition states | |||
TS5 | 42.11 | 48.29 | 48.32 |
TS6 | 59.40 | 64.83 | 64.94 |
TS8 | 35.35 | 38.07 | 38.46 |
TS9 | 52.50 | 57.62 | 57.72 |
TS10 | 63.08 | 60.15 | 60.81 |
TS11 | 64.74 | 67.27 | 67.42 |
TS12 | 55.93 | 67.10 | 68.32 |
TS13 | 61.41 | 60.48 | 63.30 |
TS14 | 31.83 | 31.07 | 31.31 |
TS15 | 70.41 | 66.20 | 68.26 |
Intermediates | |||
INT3 | 1.60 | ||
INT5 | 9.64 | 14.66 | 14.61 |
INT7 | |||
Products | |||
52.32 | 57.12 | 56.95 | |
15.62 | 18.35 | 18.26 | |
8.62 | 5.26 | 4.98 | |
1.57 | 1.34 | ||
36.22 | 34.53 | 34.00 | |
P4 | 54.40 | 53.88 | 54.61 |
P8 | 24.47 | 21.99 | 21.90 |
Before considering the differences between the coupled-cluster results and those from DFT, note that the results for and agree closely, with minimal changes in energy ordering to the different species, despite the fact that does much better for RHF-based CC bond breaking. Because of this similarity, we will simply refer to the CC results when comparing against B3LYP rather than choosing one or another. Qualitatively, the results from CC and DFT seem to agree quite well; products and intermediates are ordered the same in CC and DFT, and transition states are not radically rearranged. As is noted in Ref. 42, B3LYP tends to underestimate energy barriers; our coupled-cluster results support this conclusion, as the majority of the transition states were determined to be higher in energy than predicted by DFT. The shifts are not uniform, however, leading to a reordering of several of the high-lying transition states.
The lower-lying transition states were left unchanged in order, leading to the same conclusions about the kinetically favored channel. The transition state for the elimination of HONO via a concerted reaction has the lowest barrier by in B3LYP and by for both and . The concerted nature of this transition state might raise concern about the applicability of the perturbative method, which fails for RHF-based bond breaking, but recent work39 shows, surprisingly, that and (which ameliorates the RHF failure) tend to reproduce transition states with equal accuracy.
From the B3LYP calculations, the elimination of water is the most thermodynamically stable product by more than . On the other hand, the coupled-cluster calculations predict an energy gap between the elimination of water and the elimination of HNO of only or . The elimination of water is exoenergetic in B3LYP by more than , while it is endoenergetic by by both CC methods. When comparing to the energies of the intermediates, the global minimum on the CC potential energy surface is now 1,1-nitrosoethanol (INT7) and ethylnitrite (INT3) is slightly lower in energy than nitroethane. The coupled-cluster calculations also suggest that the elimination of HNO is less favorable kinetically, as the barriers along the reaction pathway are higher relative to those from B3LYP.
V. CONCLUSION
The application of methods that reduce basis set size will always be limited unless analytical gradients are available. For methods such as FNO-CC, where the basis set reduction is based on an auxiliary calculation for the molecule at a particular geometry, the inclusion of orbital relaxation terms is substantially more complicated than it is for more traditional methods that simply modify the orbital eigenvalue equations. In our case, because of the dependence on a MBPT(2) density matrix, there is an orbital relaxation contribution to the two-particle density matrix that is new. Because of the one- and two-particle natures of all the interactions in the Hamiltonian, the most general such truncation procedure should only contribute orbital relaxation effects to both density matrices.
Despite the complexity of the orbital relaxation terms, we are able to show that just as in the case for Hartree-Fock orbitals one can separate the perturbation-dependent integral derivatives from the perturbation-independent orbital relaxation. Therefore, one needs to solve the CPFNO equations (or equivalently, the -vector equations) once instead of for each perturbation. Then the CC results follow with substantial savings in time that can approach an order of magnitude, depending upon the level of CC correlation. Unfortunately, the price paid for this computational saving is the need to store several quantities of the dimension of two-electron integrals. Proper combination of the terms in an integral-direct formalism may be able to circumvent that complication.
The FNO procedure initiates the optimized virtual space (OVOS) method,16,17,21,22 which imposes the additional constraint of trying to obtain the lowest MBPT(2) energy16 or maximizes the overlap between the truncated and untruncated MBPT(2) wavefunctions.22,64 This constraint can easily be added to the analytical FNO gradient procedure presented here to enable OVOS structures and hessians to be obtained analytically. In fact, the OVOS method is an example of the general issue of imposing additional conditions on a virtual space to fulfill a desired objective.
The application of the FNO truncation methods to the test set of molecules showed that while a cc-pVDZ or cc-pCVDZ basis is inadequate to be predictive for geometries a truncated cc-pVTZ basis of the same number of active orbitals is substantially better. It is always preferable to use the largest possible basis set and then reduce its effective virtual orbital space dimension via the FNO method than to compromise on the size of the underlying basis set. Results for vibrational frequencies are more mixed, without the clear preference for FNO truncations over untruncated smaller basis sets. This conclusion may partly be due to the limited set of molecules studied, but it also suggests that some further developments of the proper treatment of vibrational frequencies within FNO-CC may be needed.
Our results support the general conclusions reached by Denis et al.42 about the decomposition of nitroethane. The one-step elimination of HONO appears to be favored kinetically, with the barrier for that reaction which is lower than that for the direct bond fission. However, there are important differences in the energetics, with the energies along the pathways initiated by the isomerization to ethylnitrite being most affected. The gap between the thermodynamically favored products and is reduced to roughly versus from B3LYP calculations.
ACKNOWLEDGMENTS
The authors would like to recognize support for this work from the Army Research Office through a MURI grant. Computational facilities for our group were supported by the Air Force Office of Scientific Research through DURIP funding. One of the authors (A.G.T.) would also like to acknowledge support from a Department of Defense graduate fellowship.