The average energy curvature as a function of the particle number is a molecule-specific quantity, which measures the deviation of a given functional from the exact conditions of density functional theory. Related to the lack of derivative discontinuity in approximate exchange-correlation potentials, the information about the curvature has been successfully used to restore the physical meaning of Kohn–Sham orbital eigenvalues and to develop non-empirical tuning and correction schemes for density functional approximations. In this work, we propose the construction of a machine-learning framework targeting the average energy curvature between the neutral and the radical cation state of thousands of small organic molecules (QM7 database). The applicability of the model is demonstrated in the context of system-specific gamma-tuning of the LC-ωPBE functional and validated against the molecular first ionization potentials at equation-of-motion coupled-cluster references. In addition, we propose a local version of the non-linear regression model and demonstrate its transferability and predictive power by determining the optimal range-separation parameter for two large molecules relevant to the field of hole-transporting materials. Finally, we explore the underlying structure of the QM7 database with the t-SNE dimensionality-reduction algorithm and identify structural and compositional patterns that promote the deviation from the piecewise linearity condition.

The extension of Hohenberg–Kohn density functional theory (HK-DFT)1 to non-integer particle numbers led to the determination of two fundamental properties of the exact density functional.2 The first is the piecewise linearity condition, which imposes that the total energy as a function of the (fractional) particle number [E(N)] must evolve as a series of straight-line segments.2–5 The second is the derivative discontinuity, which establishes that the exact exchange-correlation potential is characterized by sudden jumps while varying across integer particle numbers.2,6–10

Approximate density functionals do not fulfill these requirements. Instead, they are generally characterized by a convex E(N) curvature and by continuously derivable exchange-correlation potentials.3,11–16 As demonstrated by Kronik, Baer, and co-workers,4 these two quantities are related, and therefore, the knowledge of the first is sufficient to quantify the extent of the second. Using the same argument, the minimization of the energy curvature has the consequence of correcting the effects of the missing derivative discontinuity, restoring the compliance of approximate functionals to the exact conditions of DFT. Failure to comply with these requirements exacerbates the effects of the delocalization error,3,14,17–19 leads to an incorrect dissociation behavior of heterodimers,2,12,20 and causes the Kohn–Sham frontier orbital eigenvalues to deviate from the ionization potential and the electronic affinity, respectively.21–24 

The existence of a relationship between the curvature and the derivative discontinuity is especially convenient, as the first can be readily evaluated for a given functional and chemical system according to the following expression:4 

(1)

where CavgN represents the average curvature between two integer points with N and N − 1 electrons, while εHOMON and εLUMON1 are the eigenvalues of the frontier orbitals of the N and N − 1 particle states for a fixed molecular geometry. Equation (1) is exact and it is a direct consequence of Janak’s theorem.21,25

Straightforward accessibility to information about the energy curvature, as well as its relation to some fundamental pitfalls of approximate density functionals, has already proven to be fertile ground and is used in numerous applications. For instance, the minimization of CavgN serves as a formally motivated criterion for the compound-specific optimal tuning of range-separated hybrid density functionals.26,27 The accuracy of such functionals has been largely demonstrated in the computations of outer-valence spectra,28 optical rotations,29 and fundamental and optical gaps.30–32 The energy curvature has also been applied as a criterion to assess the extent of delocalization error in approximate functionals and to rationalize, on this basis, their relative accuracy.33–38 Within a different context, knowledge of the curvature played a central role in the validation of ensemble generalizations of standard density functionals,5,39–41 carefully designed to retrieve the correct piecewise-linearity behavior of E(N) and the derivative discontinuities in the exchange-correlation potential. Finally, information on the curvature is exploited to develop correction schemes for existing exchange-correlation density functionals.4,42–45

The relevance of the information encoded in the energy curvature, corroborated by the extent of possible applications, contrasts with the modest chemical complexity and relatively low number of molecules for which the curvature has been reported.4,35

Recently, machine-learning (ML) techniques have been redefining the scale and the complexity of achievable quantum chemical tasks.46–54 Supported by the construction of large molecular databases of quantum chemical benchmarks,46–48,55–58 supervised learning approaches promote the large-scale screening of any targeted molecular quantity ranging from simple ground-state properties59 to complex objects such as electron densities60–63 and the many-body-wavefunction.64 In addition, machine-learning techniques have been intensively used to promote the access to system-specific quantities such as atomic parameters for semi-empirical computations,65 atomic and molecular multipoles moments, polarizabilities, and overlap integrals in the context of intermolecular potentials.66,67 Tackling the scaling-up problem with artificial intelligence (AI) techniques is especially appealing. Specifically, AI results in a reduction in the computational cost of accessing molecular properties,59,68,69 facilitates the extrapolation of acquired information to larger and more complex chemical systems,70–72 and promotes the analysis and identification of non-trivial similarity patterns in otherwise unimaginably large amounts of data.73 

Within this context, we here report the construction of a machine-learning model of the average energy curvature (CavgN) of a set of 7165 organic molecules taken from the QM7 database.46,47 In this work, the focus is placed on the curvature between the neutral and the first radical cation state of each molecule, as its minimization leads to compliance with Koopmans’ theorem.26 The applicability of the regression framework is demonstrated by performing system-specific optimal tuning of the LC-ωPBE functional13,74,75 on the basis of the predicted curvatures. In addition, the transferability of the model is tested by predicting the optimal range–separation parameters and estimating the ionization potential of two larger molecules of practical use, relevant for the field of hole transport materials. Finally, we address the question whether specific chemical patterns are more prone to deviation from piecewise linearity using unsupervised dimensionality reduction algorithms to draw statistically robust relationships between the structure/composition of the molecules and their average energy curvature.

The training set for the non-linear regression of CavgN was selected by randomly choosing 6465 molecules out of the QM7 database, leaving the remaining 10% for out-of-sample (oos) predictions. The hyperparameters of the model have been tuned for each functional by 10-fold cross-validation on a randomly selected set containing 10% of the 6465 molecules. The parameters were optimized using a simplex algorithm and an array of 42 different initial conditions sampling a large scale of possible hyperparameter values. At each iteration of the simplex algorithm and for each initial condition, a new set for cross-validation containing 10% of the 6465 molecules was randomly selected. The performance of the model was then evaluated by training on five subsets of different sizes (100, 500, 1000, 2000, and 5000 molecules) while predicting on a test set of fixed size (645 molecules). The final learning curves (Fig. 1) are obtained by repeating this procedure 10 times, each time shuffling the training and the test set among the 6465 initial molecules. The regression model reported in the figure uses the spectrum of London and Axilrod–Teller–Muto (SLATM) molecular representation,76,77 as it was the best performing for the largest training set size (further details about the global and local framework used herein are given in the supplementary material).

FIG. 1.

Learning curves of the average energy curvature (CavgN) as a function of the training set size. The learning exercise is reported for three functionals and Hartree–Fock (HF) using the def2-SVP basis set. The error bars correspond to the standard deviation of the error over 10 random shuffling repetitions of the test set. The models were built using the SLATM molecular representation. In the upper panels, we report the distribution of CavgN across the QM7 database at HF and PBE0.

FIG. 1.

Learning curves of the average energy curvature (CavgN) as a function of the training set size. The learning exercise is reported for three functionals and Hartree–Fock (HF) using the def2-SVP basis set. The error bars correspond to the standard deviation of the error over 10 random shuffling repetitions of the test set. The models were built using the SLATM molecular representation. In the upper panels, we report the distribution of CavgN across the QM7 database at HF and PBE0.

Close modal

As shown in Fig. 1, the difficulty of the learning exercise largely depends on the electronic structure level at which the energy curvature is computed. The learning of PBE078,79/def2-SVP and LC-ωPBE13,74,75/def2-SVP is the most straightforward, followed by PBE80,81/def2-SVP and finally Hartree–Fock (HF/def2-SVP). As already apparent in the upper panels of Fig. 1, this specific ordering is directly related to the amount of variation of the target quantity (CavgN) within each functional and method. As shown in Fig. 2, the mean absolute error of the model trained on 5000 molecules correlates nearly perfectly with the standard deviation of CavgN for each level of theory.

FIG. 2.

MAE of the model at a training set size of 5000 molecules as a function of the standard deviation of CavgN using the three functionals and HF.

FIG. 2.

MAE of the model at a training set size of 5000 molecules as a function of the standard deviation of CavgN using the three functionals and HF.

Close modal

Following Eq. (1), the energy curvature depends on the HOMO eigenvalue of the neutral molecule [N-HOMO] and on the LUMO eigenvalue of its radical cation [(N-1)-LUMO]. Therefore, the relative robustness of each functional in describing these two quantities could be invoked to rationalize the overall spread of its CavgN. However, as shown in the top panel of Fig. 3, the individual variations of the frontier orbital energies are not sufficient to explain the overall trend found for CavgN. All the functionals are characterized by standard deviations of similar orbital energies, whereas HF shows larger deviations. Importantly, the spread of the individual orbital eigenvalues within each method (Fig. 3 upper panel) is much larger than the standard deviation of their difference (CavgN, x-axis of Fig. 2), with the sole exception of HF for which the two are comparable. The narrower distribution of the curvature across the dataset effectively makes this quantity a simpler learning target when compared to previously published efforts to learn individual orbital energies.50,56,82,83

FIG. 3.

(Top) Standard deviations of N-HOMO and (N − 1)-LUMO through QM7 with three functionals and HF. (Bottom) Pearson’s correlation coefficient between the N-HOMO and (N − 1)-LUMO energies at different levels of theory.

FIG. 3.

(Top) Standard deviations of N-HOMO and (N − 1)-LUMO through QM7 with three functionals and HF. (Bottom) Pearson’s correlation coefficient between the N-HOMO and (N − 1)-LUMO energies at different levels of theory.

Close modal

The ordering of Fig. 2 is retrieved only after combining the information about the variation of the orbital energies with the one about their correlation (Fig. 3, bottom panel). In particular, the frontier orbital eigenvalues correlate almost perfectly in LC-ωPBE and PBE0, while their correlation is lower in PBE and very poor within HF. Consequently, the difficulty of the learning exercise ultimately depends on the consistency of a method in describing the orbital energies both of the neutral and the radical cation state of a molecule. Both the spread in orbital eigenvalues and their covariance do not change while performing the computation with a larger basis set (see the supplementary material)

The poor covariance between the frontier orbital eigenvalues in Hartree–Fock is the consequence of the different way in which the occupied and the unoccupied manifolds are treated within the method. In particular, the orbital energies of the occupied manifold, hence the HOMO eigenvalue of the neutral molecule, are determined in Hartree–Fock by the effective potential of N − 1 particles, as the exchange cancels out the self-interaction contribution. This is not the case for the unoccupied manifold, where the effective potential originates from the totality of the particles. In contrast, the energies of both the occupied and the unoccupied orbitals in density functional theory are determined by a N − 1 particle effective potential, as the (approximate) exchange-correlation hole exclude a single electron from each and every orbital.

The energy curvature predicted for each molecule by using the machine-learning model can be readily applied as a criterion for system-specific γ-tuning of range-separated hybrid density functionals. Usually, the tuning procedure consists in adjusting the range–separation parameter to satisfy as closely as possible Koopmans’ theorem for both the neutral and the anionic state of a targeted molecule.26 This method is by far the most commonly used and relies on the knowledge of the ionization potential either from a computational/experimental reference or from an on-the-fly estimation using ΔSCF procedures.26,27,84 As already demonstrated by Kronick, Baer, and co-workers,4 the minimization of CavgN in approximate functionals implies their compliance with Koopmans’ theorem. Therefore, the optimal range-separation parameter for a specific compound can be found by imposing the curvature to be identically zero.

Figure 4 schematically illustrates a modification of the regression framework as presented in Sec. II, which uses the curvature information to determine the optimal γ parameter for a given chemical system. In particular, the procedure consists of nine independent kernel ridge regression models, each targeting CavgN at different values of the range-separation parameter (learning curves in the supplementary material). In the last step, a cubic spline interpolation of the predicted curvatures leads to the optimal γ parameter (i.e., the γ value for which CavgN=0) for a given molecule.

FIG. 4.

Schematic representation of the regression framework for the prediction of the optimal range separation parameter per compound. For each molecule, nine independent models predict the energy curvature at LC-ωPBE at nine γ values ranging from 0.1 to 0.9 bohr−1. The system-specific optimal γ parameter, for which CavgN=0, is then found by a cubic spline interpolation.

FIG. 4.

Schematic representation of the regression framework for the prediction of the optimal range separation parameter per compound. For each molecule, nine independent models predict the energy curvature at LC-ωPBE at nine γ values ranging from 0.1 to 0.9 bohr−1. The system-specific optimal γ parameter, for which CavgN=0, is then found by a cubic spline interpolation.

Close modal

To avoid the introduction of unpredictable noise in the data, we considered here only those compounds for which all the computations converged. In consequence, the model was trained using the energy curvature of 5754 small organic molecules taken from the QM7 database and used to predict the system-specific optimal LC-ωPBE γ values for a test set of 640 molecules. Upon a single point computation using the tuned functional, the ionization potential of each molecule is evaluated as −εHOMO and compared to the corresponding value at IP-EOM-CCSD. For consistency, all computations are performed with the def2-SVP basis set. Figure 5 shows the accuracy of estimated IPs averaged over the test set for the standard LC-ωPBE and its γ-tuned variant. The error bars show the maximum and the minimum deviation from the IP-EOM-CCSD reference registered among the 640 molecules.

FIG. 5.

Absolute error between −εHOMO of LC-ωPBE and its γ-tuned variant and the ionization potential at IP-EOM-CCSD across the 640 molecules of the test set. Optimal γ values derive from the model described in Fig. 4. The height of the histogram represents the mean absolute error, while the bars show the maximum and the minimum deviations.

FIG. 5.

Absolute error between −εHOMO of LC-ωPBE and its γ-tuned variant and the ionization potential at IP-EOM-CCSD across the 640 molecules of the test set. Optimal γ values derive from the model described in Fig. 4. The height of the histogram represents the mean absolute error, while the bars show the maximum and the minimum deviations.

Close modal

The tuning procedure on the basis of the predicted curvatures results in a fivefold decrease in the average ionization potential error compared to the standard functional. The robustness of the predictions is further demonstrated by the fact that the worst error made with the γ-tuned variant is only as high as the average error made with the standard LC-ωPBE.

Including several hundreds of different molecules, the test set represents a sufficiently large ensemble for a statistically relevant analysis of the optimal range-separation parameter in LC-ωPBE. By registering the frequency of appearance of the predicted γ values, it is shown that their distribution tends to a Gaussian function centered around 0.32 bohr−1 (Fig. 6). Out of the 640 molecules, only 12 are characterized by an optimal γ parameter close to the 0.4 bohr−1 of the standard functional. In all those cases where system-specific γ-tuning is not possible, for instance, in the computation of dimer binding energies,85 the distribution in Fig. 6 demonstrates that fixing the range-separation parameter of LC-ωPBE to 0.32 bohr−1 would reduce the curvature for the majority of molecules.

FIG. 6.

Distribution of the optimal γ parameters (bohr−1) across the 640 molecules of the test set as predicted by the model described in Fig. 4. The red line shows the value of the range-separation parameter in the standard LC-ωPBE.

FIG. 6.

Distribution of the optimal γ parameters (bohr−1) across the 640 molecules of the test set as predicted by the model described in Fig. 4. The red line shows the value of the range-separation parameter in the standard LC-ωPBE.

Close modal

The discrepancy with the original parametrization of LC-ωPBE has to be interpreted as the results of a different optimization strategy. Here, the suggested 0.32 bohr−1 minimizes the energy curvature for the highest number of compounds in a comprehensive dataset of organic molecules. Following the works of Baer,4,26 fixing the γ parameter by minimization of the energy curvature is a formally motivated procedure, as it leads to compliance with Koopmans’ theorem and exact conditions of DFT. The original approach used for the parametrization of LC-ωPBE is more pragmatic and seeks to minimize the error of the functional against different energy-based benchmark databases.74 The formal issue associated with this second strategy is that the range-separation parameter inevitably compensates for unrelated deficiencies in the rest of the approximated exchange-correlation functional.

The machine-learning models presented in the previous paragraphs rely on a global molecular representation, i.e., each vector in the feature space characterizes one specific compound. As the energy curvature is a molecular property, this class of representations is highly suitable and easily applicable to the regression problem. On the other hand, a model based on a global representation is not transferable, i.e., it cannot be trained on smaller compounds and used to make predictions on larger molecules.86 This issue can be tackled using local representations, which encode the molecular information as a collection of atoms in their environments. By establishing similarity measures between local atomic environments, rather than between whole molecules, local representations lead to more transferable models, applicable to larger and more diverse molecules than those included in the training set (see Refs. 63, 87, and 88). The regression framework shown in Fig. 4 is general and can be readily extended to local, atom-centered molecular representations. More details about the modification of the learning framework to accommodate locality and transferability are given in the supplementary material.

Figure 7 shows the application of the local regression framework to predict the optimal γ values of two large molecules commonly used in hole-transporting materials:91,92N,N′-Bis(3-methylphenyl)-N,N′-diphenylbenzidine (TPD) and 4,4′,4″-Tris[(3-methylphenyl)phenylamino]triphenylamine (m-MTDATA). The model was exclusively trained on the local environments of the small organic molecules of the QM7 database using the atomic spectrum of London and Axilrod–Teller–Muto (aSLATM)76,77 representation combined with an orthogonal matching pursuit93,94 algorithm for sparse regression (see the supplementary material).

FIG. 7.

Extrapolation: optimal γ parameter derived from the model described in Fig. 4 for two large molecules relevant for the field of hole-transporting materials compared to the value obtained by ab initio optimal tuning. The value of −εHOMO for both the standard LC-ωPBE and its γ-tuned variant (ML and ab initio) is reported along with reference ionization potentials. Experimental IPs are taken from Refs. 89 and 90.

FIG. 7.

Extrapolation: optimal γ parameter derived from the model described in Fig. 4 for two large molecules relevant for the field of hole-transporting materials compared to the value obtained by ab initio optimal tuning. The value of −εHOMO for both the standard LC-ωPBE and its γ-tuned variant (ML and ab initio) is reported along with reference ionization potentials. Experimental IPs are taken from Refs. 89 and 90.

Close modal

The −εHOMO computed with standard LC-ωPBE is a rather poor approximation of the ionization potential of TPD and m-MTDATA with errors around 1 eV compared to the ab initio references (bt-PNO-IP-EOM-CCSD and ΔSCF at DLPNO-CCSD). Upon ML-based γ-tuning, the error with respect to the wavefunction based methods is reduced to 0.1–0.2 eV for both molecules. In addition, the optimal γs from machine learning and the corresponding IPs are in very close agreement with the optimal parameters obtained ab initio by minimizing the difference between −εHOMO and the neutral-cation ΔSCF energy (Δγ = 0.025 bohr−1 and 0.01 bohr−1). These results, obtained on compounds four times larger than the largest molecule in the training set, demonstrate the transferability of the local model and its applicability to targeted complex molecules. Interestingly, the optimal γ parameters for both TPD and m-MTDATA are much lower than any value obtained on the smaller molecules of the QM7 test set (Fig. 6). This behavior is consistent with the results of the existing literature95–98 and further supports the conclusion that γ can be interpreted as the inverse of an effective conjugation length dependent on the system size. Finally, the HOMO eigenvalue of PBE0 is the farthest from the ab initio reference, but the closest to the experimental values obtained by cyclic voltammetry in organic solution (TPD)89 or by ultraviolet photoemission spectroscopy in amorphous solid state (m-MTDATA).90 This result is not unexpected (see Refs. 99 and 100) and shows that the error made by the global hybrid mimics the effects of the condensed phase environment (e.g., solvent and crystal field).101,102

The large chemical diversity contained in the QM7 database promotes a thorough assessment of the relation between the energy curvature computed with a given functional and the system-specific structural and compositional patterns. However, drawing such a relationship for thousands of molecules inevitably leads to a high-dimensional problem, which is unsuitable for analysis and visualization. In this context, non-linear dimensionality reduction algorithms reveal the underlying structure of high-dimensional data by projecting complex vectors into lower dimensions.103–117Figure 8 shows a two-dimensional representation of the chemical diversity of the database using t-distributed Stochastic Neighbor Embedding (t-SNE).118 This algorithm converts the similarity between molecules, which is defined herein as the Euclidean distance between their SLATM representation, to the probability of being each other’s neighbors. The embedding of high-dimensional data into lower dimensions is then performed by ensuring that the joint probability between molecules should not change upon projection. While the two axis (dimensions) obtained after a t-SNE transformation have no formal physical or chemical meaning, it is still possible to identify at least a qualitative correlation between chemical properties and the dimensions in Fig. 8vide infra.

FIG. 8.

Two-dimensional t-SNE map of the QM7 database on the basis of the SLATM representation. Each point represents a compound colored by its average energy curvature computed with PBE/def2-SVP. The diverging color map highlights the data with the highest and the lowest average energy curvature. Each of the clusters contains molecules with similar patterns, which are defined by the corresponding numbering.

FIG. 8.

Two-dimensional t-SNE map of the QM7 database on the basis of the SLATM representation. Each point represents a compound colored by its average energy curvature computed with PBE/def2-SVP. The diverging color map highlights the data with the highest and the lowest average energy curvature. Each of the clusters contains molecules with similar patterns, which are defined by the corresponding numbering.

Close modal

The application of t-SNE to QM7 reveals clusters of compounds with similar chemical patterns, mainly defined by the presence or the absence of heteroatoms and their connectivity. In particular, the vertical axis (Dim. 2) somehow correlates with the number of heteroatoms from zero (alkanes, bottom) to two or more non-carbon atoms (hydroxyamines and oxyamines, top). The horizontal axis follows instead a gradient of chemical composition going from the oxygen-based compounds (left) to nitrogen containing molecules (right), passing from mixed species. Each point is color-coded by its average energy curvature computed at PBE/def2-SVP to establish a global, qualitative connection between these macro-families of compounds and the degree of their deviation from piecewise linearity. The choice of PBE is motivated by the fact that the absence of Hartree–Fock exchange leads to a curvature that represents an upper limit for the other functionals.

Figure 8 highlights seven key families characterized by at least one region of high average energy curvature (in red). Out of those clusters, three contains only oxygen as heteroatom [alcohols (7), ethers (15), and acids/esters (8)], two includes sp-hybridized carbons [cyano groups (11) and alkynes (13)], one contains only nitrogen [amines (10)], and the last group includes the amides (9). In contrast, alkanes (with the exception of the smallest methane and ethane, see discussion below) (16), diamines separated by long carbon chains (14), all sulfur containing compounds (5), and amidines (6) are all characterized by lower curvatures. These trends suggest that the presence of increasingly electron-rich heteroatoms tends to increase the average energy curvature. In particular, the presence of oxygen atoms is especially sensitive as shown by the qualitative difference between amides and amidines. The low average energy curvature that characterizes all sulfur containing compounds suggests that the presence of heteroatoms beyond the second row of the periodic table does not have a critical impact on the deviation from piecewise linearity. In addition to heteroatoms, the hybridization of the carbon centers is also a relevant factor as illustrated by the contrast between alkane and alkyne groups. These conclusions are consistent with previous work on charge transfer complexes119,120 and delocalization error.37 In particular, the results presented here are comparable with the work of Kronik and Baer,4 who report the average energy curvature for a set of nine small molecules, whose order can be rationalized in terms of the presence of electron-rich heteroatoms, their hybridization, and the molecular size.

Although not explicitly evident in the mapping of Fig. 8, the molecular size is in fact crucial to determine the extent of the average energy curvature. To emphasize this point, we report in Fig. 9 the correlation of the curvature at PBE/def2-SVP and the size of the molecules, upon averaging CavgN over all the molecules with the same number of non-hydrogen atoms (Nheavy). Although the mean values for Nheavy = 1 and Nheavy = 2 are not statistically significant (i.e., these categories include only 1 and 3 molecules, respectively), the robust inverse size/curvature relationship justifies the high curvature of the smallest alkanes (cluster 16, Fig. 8).

FIG. 9.

Correlation between the average energy curvature at PBE/def2-SVP and the size of the molecules. The mean values (dots) are computed averaging CavgN over all the compounds with the same number of non-hydrogen atoms. The error bar represents one standard deviation from the mean. The inset shows the average energy curvature of all the compounds in QM7 with 3 non-hydrogen atoms. The color code in the inset highlights the presence of oxygen (red), nitrogen (blue), or carbon only compounds (black).

FIG. 9.

Correlation between the average energy curvature at PBE/def2-SVP and the size of the molecules. The mean values (dots) are computed averaging CavgN over all the compounds with the same number of non-hydrogen atoms. The error bar represents one standard deviation from the mean. The inset shows the average energy curvature of all the compounds in QM7 with 3 non-hydrogen atoms. The color code in the inset highlights the presence of oxygen (red), nitrogen (blue), or carbon only compounds (black).

Close modal

The error bars in Fig. 9 show that within every Nheavy, there is a distribution of curvatures that reflect the chemical composition. The analysis of the subset with 3 non-hydrogen atoms is especially suitable as it contains sufficient compounds to reflect general trends but is simultaneously small enough to list all its molecules. The inset of Fig. 9 shows the energy curvature of all the compounds with Nheavy = 3 ordered from the highest to the lowest. This plot validates the conclusions drawn from the t-SNE map, as the curvature decreases with the electron-richness of the heteroatom (O > N > C). One exception due to the effects of hybridization is acetonitrile, which has a slightly higher but comparable curvature to methyl ether. Complementing the information of the t-SNE map, the inset of the figure reveals the high-energy curvature of 3-membered rings (oxirane, aziridine, and cyclopropane), which are generally considered to act as unsaturated systems.121 The previous arguments remain valid to explain the relative order of the electron rich 5-membered conjugated heterocycles [thiophene (6.3) < pyrrole (6.5) < furane (6.7)], whose curvatures are as expected higher than benzene (6.0).

The average energy curvature with respect to the particle number is a crucial system-dependent property of density functionals, which quantify their deviation from the exact conditions of DFT and therefore affects their accuracy. Related to the lack of derivative discontinuity in the exchange-correlation potential and thus to the degree of severity of the delocalization error, the information about this quantity has been successfully used for optimal tuning of long-range corrected functionals and to correct Kohn–Sham orbital eigenvalues to match ionization potentials and electron affinities. In this work, we have proposed the construction of a machine-learning model of the average energy curvature and shown its applications for the system-specific tuning of the LC-ωPBE functional. In parallel, unsupervised learning techniques have been applied to obtain qualitative information about particular chemical patterns and molecular properties, which results in highly convex curvatures.

As the curvature is both a system-specific and a functional dependent quantity, we have first shown that the learning exercise is not equally difficult for any given functional, but it depends on its ability to describe on equal footing the neutral and the radical cation state of a molecule. This result implies that the possible spread of value for the average energy curvature is not equal for all methods. In particular, the largest standard deviation for the curvature is registered for Hartree–Fock due to the poor correlation between the neutral molecule HOMO eigenvalue and the LUMO of the radical cation.

Training several independent models to target the curvature at LC-ωPBE for different values of its range-separation parameter led to the construction of a second framework dedicated to the system-dependent optimal tuning of the functional. The use of the predicted γ parameters resulted in a fivefold increase in the accuracy when estimating the first ionization potential (IP) with −εHOMO with respect to the standard functional. The distribution of the predicted range-separation parameters on the QM7 database shows that the original 0.4 value of LC-ωPBE is far from optimal to minimize energy curvature. As a generalization of the framework, we use a local molecular representation for the training and demonstrate the transferability of the modified model by estimating the optimal γ-values and computing the ionization potentials of two larger molecules, relevant for the field of hole-transporting materials.

Finally, projecting the high dimensional SLATM representation of QM7 in two dimensions with a t-SNE algorithm revealed the underlying structure of the database. In particular, the mapping showed several distinct clusters enclosing molecules similar to each other in terms of their scaffold and the presence of heteroatoms. The curvature values across these clusters were found to assume the highest values for compounds with second row heteroatoms, most frequently oxygen, or for compounds with sp-hybridization. Additional analysis of the data supports the existence of an inverse correlation between the molecular size and the average energy curvature.

See the supplementary material for the learning curves of the energy curvature with different molecular representations, a comparison of the model performance between test and out-of-sample set, a more detailed description of the local model, the demonstration of the robustness of the results when changing the basis set, the learning curves for each range-separation parameter and for the numerical data associated to the results reported in the main text..

The authors acknowledge Kun-Han Lin for helpful discussions and the National Centre of Competence in Research (NCCR) “Materials’ Revolution: Computational Design and Discovery of Novel Materials (MARVEL)” of the Swiss National Science Foundation (SNSF) and the EPFL for financial support.

The data and the model that support the findings of this study are openly available in Materials Cloud at https://doi.org/10.24435/materialscloud:2020.0031/v1.

The molecular geometries for all species were taken as published in the QM7 database.46,47 The curvatures were computed according to Eq. (1) using the orbital eigenvalues of the neutral and the first radical cation state of each molecule. All the computations using PBE, PBE0, LC-ωPBE, and Hartree–Fock were performed in Gaussian16,122 in combination with the def2-SVP123 basis set. The first ionization potential energies at IP-EOM-CCSD124,125 and bt-PNO-IP-EOM-CCSD,126,127 as well as at ΔSCF (DLPNO-CCSD),128 were obtained with Orca 4.0129 using the def2-SVP basis set for consistency with the DFT values. The density fitting approximation was applied in the DLPNO-CCSD and bt-PNO-IP-EOM-CCSD computations. The machine-learning representations and similarity kernels were obtained using the Quantum Machine Learning toolkit QMLcode130 with the exception of SOAP131 (see the supplementary material), which was computed using DScribe 0.3.2.132 The mathematical form of the similarity kernels was chosen as the standard procedure according to the specific representation (see the supplementary material for more details). The two-dimensional map of the QM7 database was generated using the t-SNE118 algorithm as implemented in the scikit-learn package.133 

1.
P.
Hohenberg
and
W.
Kohn
, “
Inhomogeneous electron gas
,”
Phys. Rev.
136
,
B864
B871
(
1964
).
2.
J. P.
Perdew
,
R. G.
Parr
,
M.
Levy
, and
J. L.
Balduz
, “
Density functional theory for the fractional particle number: Derivative discontinuities of the energy
,”
Phys. Rev. Lett.
49
,
1691
(
1982
).
3.
A. J.
Cohen
,
P.
Mori-Sanchez
, and
W.
Yang
, “
Insights into current limitations of density functional theory
,”
Science
321
,
792
794
(
2008
).
4.
T.
Stein
,
J.
Autschbach
,
N.
Govind
,
L.
Kronik
, and
R.
Baer
, “
Curvature and frontier orbital energies in density functional theory
,”
J. Phys. Chem. Lett.
3
,
3740
3744
(
2012
).
5.
E.
Kraisler
and
L.
Kronik
, “
Piecewise linearity of approximate density functionals revisited: Implications for frontier orbital energies
,”
Phys. Rev. Lett.
110
,
126403
(
2013
).
6.
M.
Mundt
and
S.
Kümmel
, “
Derivative discontinuities in time-dependent density-functional theory
,”
Phys. Rev. Lett.
95
,
203004
(
2005
).
7.
P.
Mori-Sánchez
,
A. J.
Cohen
, and
W.
Yang
, “
Discontinuous nature of the exchange-correlation functional in strongly correlated systems
,”
Phys. Rev. Lett.
102
,
066403
(
2009
).
8.
A.
Mirtschink
,
M.
Seidl
, and
P.
Gori-Giorgi
, “
Derivative discontinuity in the strong-interaction limit of density-functional theory
,”
Phys. Rev. Lett.
111
,
126402
(
2013
).
9.
W.
Yang
,
A. J.
Cohen
, and
P.
Mori-Sánchez
, “
Derivative discontinuity, bandgap and lowest unoccupied molecular orbital in density functional theory
,”
J. Chem. Phys.
136
,
204111
(
2012
).
10.
P.
Mori-Sánchez
and
A. J.
Cohen
, “
The derivative discontinuity of the exchange-correlation functional
,”
Phys. Chem. Chem. Phys.
16
,
14378
14387
(
2014
); arXiv:1404.0902.
11.
P.
Mori-Sánchez
,
A. J.
Cohen
, and
W.
Yang
, “
Many-electron self-interaction error in approximate density functionals
,”
J. Chem. Phys.
125
,
201102
(
2006
).
12.
A.
Ruzsinszky
,
J. P.
Perdew
,
G. I.
Csonka
,
O. A.
Vydrov
, and
G. E.
Scuseria
, “
Density functionals that are one- and two- are not always many-electron self-interaction-free, as shown for H2+, He2+, LiH+, and Ne2+
,”
J. Chem. Phys.
126
,
104102
(
2007
).
13.
O. A.
Vydrov
,
G. E.
Scuseria
, and
J. P.
Perdew
, “
Tests of functionals for systems with fractional electron number
,”
J. Chem. Phys.
126
,
154109
(
2007
).
14.
P.
Mori-Sánchez
,
A. J.
Cohen
, and
W.
Yang
, “
Localization and delocalization errors in density functional theory and implications for band-gap prediction
,”
Phys. Rev. Lett.
100
,
146401
(
2008
).
15.
A. J.
Cohen
,
P.
Mori-Sánchez
, and
W.
Yang
, “
Fractional charge perspective on the band gap in density-functional theory
,”
Phys. Rev. B
77
,
115123
(
2008
).
16.
R.
Haunschild
,
T. M.
Henderson
,
C. A.
Jiménez-Hoyos
, and
G. E.
Scuseria
, “
Many-electron self-interaction and spin polarization errors in local hybrid density functionals
,”
J. Chem. Phys.
133
,
134116
(
2010
).
17.
D. J.
Tozer
, “
Relationship between long-range charge-transfer excitation energy error and integer discontinuity in Kohn-Sham theory
,”
J. Chem. Phys.
119
,
12697
12699
(
2003
).
18.
A. J.
Cohen
,
P.
Mori-Sánchez
, and
W.
Yang
, “
Challenges for density functional theory
,”
Chem. Rev.
112
,
289
320
(
2012
).
19.
Q.
Zhao
,
E. I.
Ioannidis
, and
H. J.
Kulik
, “
Global and local curvature in density functional theory
,”
J. Chem. Phys.
145
,
054109
(
2016
).
20.
Y.
Zhang
and
W.
Yang
, “
A challenge for density functionals: Self-interaction error increases for systems with a noninteger number of electrons
,”
J. Chem. Phys.
109
,
2604
2608
(
1998
).
21.
J. P.
Perdew
and
M.
Levy
, “
Physical content of the exact Kohn-Sham orbital energies: Band gaps and derivative discontinuities
,”
Phys. Rev. Lett.
51
,
1884
1887
(
1983
).
22.
M. J.
Allen
and
D. J.
Tozer
, “
Eigenvalues, integer discontinuities and NMR shielding constants in Kohn-Sham theory
,”
Mol. Phys.
100
,
433
439
(
2002
).
23.
S.
Kümmel
and
L.
Kronik
, “
Orbital-dependent density functionals: Theory and applications
,”
Rev. Mod. Phys.
80
,
3
60
(
2008
).
24.
A.
Görling
, “
Exchange-correlation potentials with proper discontinuities for physically meaningful Kohn-Sham eigenvalues and band structures
,”
Phys. Rev. B
91
,
245120
(
2015
).
25.
J. F.
Janak
, “
Proof that Eni=εi in density-functional theory
,”
Phys. Rev. B
18
,
7165
7168
(
1978
).
26.
R.
Baer
,
E.
Livshits
, and
U.
Salzner
, “
Tuned range-separated hybrids in density functional theory
,”
Annu. Rev. Phys. Chem.
61
,
85
109
(
2010
).
27.
J. D.
Gledhill
,
M. J. G.
Peach
, and
D. J.
Tozer
, “
Assessment of tuning methods for enforcing approximate energy linearity in range-separated hybrid functionals
,”
J. Chem. Theory Comput.
9
,
4414
4420
(
2013
).
28.
D. A.
Egger
,
S.
Weissman
,
S.
Refaely-Abramson
,
S.
Sharifzadeh
,
M.
Dauth
,
R.
Baer
,
S.
Kümmel
,
J. B.
Neaton
,
E.
Zojer
, and
L.
Kronik
, “
Outer-valence electron spectra of prototypical aromatic heterocycles from an optimally tuned range-separated hybrid functional
,”
J. Chem. Theory Comput.
10
,
1934
1952
(
2014
).
29.
M.
Srebro
and
J.
Autschbach
, “
Tuned range-separated time-dependent density functional theory applied to optical rotation
,”
J. Chem. Theory Comput.
8
,
245
256
(
2012
).
30.
S.
Refaely-Abramson
,
S.
Sharifzadeh
,
N.
Govind
,
J.
Autschbach
,
J. B.
Neaton
,
R.
Baer
, and
L.
Kronik
, “
Quasiparticle spectra from a nonempirical optimally tuned range-separated hybrid density functional
,”
Phys. Rev. Lett.
109
,
226405
(
2012
); arXiv:1203.2357.
31.
L.
Kronik
,
T.
Stein
,
S.
Refaely-Abramson
, and
R.
Baer
, “
Excitation gaps of finite-sized systems from optimally tuned range-separated hybrid functionals
,”
J. Chem. Theory Comput.
8
,
1515
1531
(
2012
).
32.
H.
Sun
and
J.
Autschbach
, “
Electronic energy gaps for π-conjugated oligomers and polymers calculated with density functional theory
,”
J. Chem. Theory Comput.
10
,
1035
1047
(
2014
).
33.
E. R.
Johnson
,
A.
Otero-de-la-Roza
, and
S. G.
Dale
, “
Extreme density-driven delocalization error for a model solvated-electron system
,”
J. Chem. Phys.
139
,
184116
(
2013
).
34.
J.
Autschbach
and
M.
Srebro
, “
Delocalization error and “functional tuning” in Kohn-Sham calculations of molecular properties
,”
Acc. Chem. Res.
47
,
2592
2602
(
2014
).
35.
S. R.
Whittleton
,
X. A.
Sosa Vazquez
,
C. M.
Isborn
, and
E. R.
Johnson
, “
Density-functional errors in ionization potential with increasing system size
,”
J. Chem. Phys.
142
,
184106
(
2015
).
36.
E. I.
Ioannidis
and
H. J.
Kulik
, “
Towards quantifying the role of exact exchange in predictions of transition metal complex properties
,”
J. Chem. Phys.
143
,
034104
(
2015
).
37.
T. Z. H.
Gani
and
H. J.
Kulik
, “
Where does the density localize? Convergent behavior for global hybrids, range separation, and DFT+U
,”
J. Chem. Theory Comput.
12
,
5931
5945
(
2016
).
38.
D.
Hait
and
M.
Head-Gordon
, “
Delocalization errors in density functional theory are essentially quadratic in fractional occupation number
,”
J. Phys. Chem. Lett.
9
,
6280
6288
(
2018
); arXiv:1808.02770.
39.
E.
Kraisler
and
L.
Kronik
, “
Fundamental gaps with approximate density functionals: The derivative discontinuity revealed from ensemble considerations
,”
J. Chem. Phys.
140
,
18A540
(
2014
).
40.
E.
Kraisler
and
L.
Kronik
, “
Elimination of the asymptotic fractional dissociation problem in Kohn-Sham density-functional theory using the ensemble-generalization approach
,”
Phys. Rev. A
91
,
032504
(
2015
).
41.
E.
Kraisler
,
T.
Schmidt
,
S.
Kümmel
, and
L.
Kronik
, “
Effect of ensemble generalization on the highest-occupied Kohn-Sham eigenvalue
,”
J. Chem. Phys.
143
,
104105
(
2015
).
42.
M.
Cococcioni
and
S.
de Gironcoli
, “
Linear response approach to the calculation of the effective interaction parameters in the LDA+U method
,”
Phys. Rev. B
71
,
035105
(
2005
); arXiv:0405160 [cond-mat].
43.
I.
Dabo
,
A.
Ferretti
,
N.
Poilvert
,
Y.
Li
,
N.
Marzari
, and
M.
Cococcioni
, “
Koopmans’ condition for density-functional theory
,”
Phys. Rev. B
82
,
115121
(
2010
); arXiv:1008.1934.
44.
J.-D.
Chai
and
P.-T.
Chen
, “
Restoration of the derivative discontinuity in Kohn-Sham density functional theory: An efficient scheme for energy gap correction
,”
Phys. Rev. Lett.
110
,
033002
(
2013
).
45.
A.
Ferretti
,
I.
Dabo
,
M.
Cococcioni
, and
N.
Marzari
, “
Bridging density-functional and many-body perturbation theory: Orbital-density dependence in electronic-structure functionals
,”
Phys. Rev. B
89
,
195134
(
2014
).
46.
L. C.
Blum
and
J.-L.
Reymond
, “
970 million druglike small molecules for virtual screening in the chemical universe database GDB-13
,”
J. Am. Chem. Soc.
131
,
8732
(
2009
).
47.
M.
Rupp
,
A.
Tkatchenko
,
K.-R.
Müller
, and
O. A.
von Lilienfeld
, “
Fast and accurate modeling of molecular atomization energies with machine learning
,”
Phys. Rev. Lett.
108
,
058301
(
2012
); arXiv:1109.2618.
48.
R.
Ramakrishnan
,
P. O.
Dral
,
M.
Rupp
, and
O. A.
von Lilienfeld
, “
Quantum chemistry structures and properties of 134 kilo molecules
,”
Sci. Data
1
,
140022
(
2014
).
49.
F. A.
Faber
,
A.
Lindmaa
,
O. A.
von Lilienfeld
, and
R.
Armiento
, “
Machine learning energies of 2 million elpasolite (ABC2D6) crystals
,”
Phys. Rev. Lett.
117
,
135502
(
2016
); arXiv:1508.05315.
50.
F. A.
Faber
,
L.
Hutchison
,
B.
Huang
,
J.
Gilmer
,
S. S.
Schoenholz
,
G. E.
Dahl
,
O.
Vinyals
,
S.
Kearnes
,
P. F.
Riley
, and
O. A.
von Lilienfeld
, “
Prediction errors of molecular machine learning models lower than hybrid DFT error
,”
J. Chem. Theory Comput.
13
,
5255
5264
(
2017
).
51.
O. A.
von Lilienfeld
,
K.-R.
Müller
, and
A.
Tkatchenko
, “
Exploring chemical compound space with quantum-based machine learning
,” arXiv:1911.10084 [physics.chem-ph] (
2019
).
52.
J.
Liang
,
Y.
Xu
,
R.
Liu
, and
X.
Zhu
, “
QM-sym, a symmetrized quantum chemistry database of 135 kilo molecules
,”
Sci. Data
6
,
213
(
2019
).
53.
K. T.
Butler
,
D. W.
Davies
,
H.
Cartwright
,
O.
Isayev
, and
A.
Walsh
, “
Machine learning for molecular and materials science
,”
Nature
559
,
547
555
(
2018
).
54.
A.
Fabrizio
,
B.
Meyer
,
R.
Fabregat
, and
C.
Corminboeuf
, “
Quantum chemistry mets machine learning
,”
Chim. Int. J. Chem.
73
,
983
989
(
2019
).
55.
L.
Ruddigkeit
,
R.
van Deursen
,
L. C.
Blum
, and
J.-L.
Reymond
, “
Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17
,”
J. Chem. Inf. Model.
52
,
2864
2875
(
2012
).
56.
G.
Montavon
,
M.
Rupp
,
V.
Gobre
,
A.
Vazquez-Mayagoitia
,
K.
Hansen
,
A.
Tkatchenko
,
K.-R.
Müller
, and
O.
Anatole von Lilienfeld
, “
Machine learning of molecular electronic properties in chemical compound space
,”
New J. Phys.
15
,
095003
(
2013
); arXiv:1305.7074.
57.
R. T.
McGibbon
,
A. G.
Taube
,
A. G.
Donchev
,
K.
Siva
,
F.
Hernández
,
C.
Hargus
,
K.-H.
Law
,
J. L.
Klepeis
, and
D. E.
Shaw
, “
Improving the accuracy of Møller-Plesset perturbation theory with neural networks
,”
J. Chem. Phys.
147
,
161725
(
2017
).
58.
L. A.
Burns
,
J. C.
Faver
,
Z.
Zheng
,
M. S.
Marshall
,
D. G. A.
Smith
,
K.
Vanommeslaeghe
,
A. D.
MacKerell
,
K. M.
Merz
, and
C. D.
Sherrill
, “
The BioFragment Database (BFDb): An open-data platform for computational chemistry analysis of noncovalent interactions
,”
J. Chem. Phys.
147
,
161727
(
2017
).
59.
O. A.
von Lilienfeld
, “
Quantum machine learning in chemical compound space
,”
Angew. Chem., Int. Ed.
57
,
4164
4169
(
2018
).
60.
F.
Brockherde
,
L.
Vogt
,
L.
Li
,
M. E.
Tuckerman
,
K.
Burke
, and
K.-R.
Müller
, “
Bypassing the Kohn-Sham equations with machine learning
,”
Nat. Commun.
8
,
872
(
2017
).
61.
A.
Grisafi
,
A.
Fabrizio
,
B.
Meyer
,
D. M.
Wilkins
,
C.
Corminboeuf
, and
M.
Ceriotti
, “
Transferable machine-learning model of the electron density
,”
ACS Cent. Sci.
5
,
57
64
(
2019
).
62.
A.
Chandrasekaran
,
D.
Kamal
,
R.
Batra
,
C.
Kim
,
L.
Chen
, and
R.
Ramprasad
, “
Solving the electronic structure problem with machine learning
,”
npj Comput. Mater.
5
,
22
(
2019
).
63.
A.
Fabrizio
,
A.
Grisafi
,
B.
Meyer
,
M.
Ceriotti
, and
C.
Corminboeuf
, “
Electron density learning of non-covalent systems
,”
Chem. Sci.
10
,
9424
9432
(
2019
).
64.
K. T.
Schütt
,
M.
Gastegger
,
A.
Tkatchenko
,
K. R.
Müller
, and
R. J.
Maurer
, “
Nat. Commun.
10
,
5024
(
2019
).
65.
P. O.
Dral
,
O. A.
von Lilienfeld
, and
W.
Thiel
, “
Machine learning of parameters for accurate semiempirical quantum chemical calculations
,”
J. Chem. Theory Comput.
11
,
2120
2125
(
2015
).
66.
T.
Bereau
,
D.
Andrienko
, and
O. A.
von Lilienfeld
, “
Transferable atomic multipole machine learning models for small organic molecules
,”
J. Chem. Theory Comput.
11
,
3225
3233
(
2015
).
67.
T.
Bereau
,
R. A.
DiStasio
,
A.
Tkatchenko
, and
O. A.
von Lilienfeld
, “
Non-covalent interactions across organic and biological subsets of chemical space: Physics-based potentials parametrized from machine learning
,”
J. Chem. Phys.
148
,
241706
(
2018
); arXiv:1710.05871.
68.
M.
Rupp
, “
Machine learning for quantum mechanics in a nutshell
,”
Int. J. Quantum Chem.
115
,
1058
1073
(
2015
).
69.
R.
Ramakrishnan
and
O. A.
von Lilienfeld
,
Machine Learning, Quantum Chemistry, and Chemical Space
(
Wiley Online Library
,
2017
), pp.
225
256
.
70.
M.
Rupp
,
R.
Ramakrishnan
, and
O. A.
von Lilienfeld
, “
Machine learning for quantum mechanical properties of atoms in molecules
,”
J. Phys. Chem. Lett.
6
,
3309
3313
(
2015
); arXiv:1505.00350.
71.
A. P.
Bartók
,
S.
De
,
C.
Poelking
,
N.
Bernstein
,
J. R.
Kermode
,
G.
Csányi
, and
M.
Ceriotti
, “
Machine learning unifies the modeling of materials and molecules
,”
Sci. Adv.
3
,
e1701816
(
2017
); arXiv:1706.00179.
72.
M.
Welborn
,
L.
Cheng
, and
T. F.
Miller
, “
Transferability in machine learning for electronic structure via the molecular orbital basis
,”
J. Chem. Theory Comput.
14
,
4772
4779
(
2018
); arXiv:1806.00133.
73.
M.
Ceriotti
, “
Unsupervised machine learning in atomistic simulations, between predictions and understanding
,”
J. Chem. Phys.
150
,
150901
(
2019
); arXiv:1902.05158.
74.
O. A.
Vydrov
,
J.
Heyd
,
A. V.
Krukau
, and
G. E.
Scuseria
, “
Importance of short-range versus long-range Hartree-Fock exchange for the performance of hybrid density functionals
,”
J. Chem. Phys.
125
,
074106
(
2006
).
75.
O. A.
Vydrov
and
G. E.
Scuseria
, “
Assessment of a long-range corrected hybrid functional
,”
J. Chem. Phys.
125
,
234109
(
2006
).
76.
B.
Huang
and
O. A.
von Lilienfeld
, “
Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity
,”
J. Chem. Phys.
145
,
161102
(
2016
); arXiv:1608.06194.
77.
B.
Huang
and
O. A.
von Lilienfeld
, “
The “DNA” of chemistry: Scalable quantum machine learning with “amons”
,” arXiv:1707.04146 (
2017
).
78.
C.
Adamo
and
V.
Barone
, “
Toward reliable density functional methods without adjustable parameters: The PBE0 model
,”
J. Chem. Phys.
110
,
6158
(
1999
); arXiv:1011.1669v3.
79.
M.
Ernzerhof
and
G. E.
Scuseria
, “
Assessment of the Perdew-Burke-Ernzerhof exchange-correlation functional
,”
J. Chem. Phys.
110
,
5029
(
1999
).
80.
J. P.
Perdew
,
K.
Burke
, and
M.
Ernzerhof
, “
Generalized gradient approximation made simple
,”
Phys. Rev. Lett.
77
,
3865
3868
(
1996
); arXiv:0927-0256(96)00008 [10.1016].
81.
J. P.
Perdew
,
K.
Burke
, and
M.
Ernzerhof
, “
Erratum: Generalized gradient approximation made simple [Phys. Rev. Lett. 77, 3865 (1996)]
,”
Phys. Rev. Lett.
78
,
1396
(
1997
); arXiv:0927-0256(96)00008 [10.1016].
82.
F. A.
Faber
,
A. S.
Christensen
,
B.
Huang
, and
O. A.
von Lilienfeld
, “
Alchemical and structural distribution based representation for universal quantum machine learning
,”
J. Chem. Phys.
148
,
241717
(
2018
); arXiv:1712.08417.
83.
K.
Ghosh
,
A.
Stuke
,
M.
Todorović
,
P. B.
Jørgensen
,
M. N.
Schmidt
,
A.
Vehtari
, and
P.
Rinke
, “
Deep learning spectroscopy: Neural networks for molecular excitation spectra
,”
Adv. Sci.
6
,
1801367
(
2019
).
84.
I.
Tamblyn
,
S.
Refaely-Abramson
,
J. B.
Neaton
, and
L.
Kronik
, “
Simultaneous determination of structures, vibrations, and frontier orbital energies from a self-consistent range-separated hybrid functional
,”
J. Phys. Chem. Lett.
5
,
2734
2741
(
2014
).
85.
A.
Karolewski
,
L.
Kronik
, and
S.
Kümmel
, “
Using optimally tuned range separated hybrid functionals in ground-state calculations: Consequences and caveats
,”
J. Chem. Phys.
138
,
204115
(
2013
).
86.
B.
Huang
,
N. O.
Symonds
, and
O. A.
von Lilienfeld
, “
Quantum machine learning in chemistry and materials
,” in
Handbook of Materials Modeling
(
Springer International Publishing
,
2018
), pp.
1
27
.
87.
S.
Fias
,
F.
Heidar-Zadeh
,
P.
Geerlings
, and
P. W.
Ayers
, “
Chemical transferability of functional groups follows from the nearsightedness of electronic matter
,”
Proc. Natl. Acad. Sci. U. S. A.
114
,
11633
11638
(
2017
).
88.
O. T.
Unke
and
M.
Meuwly
, “
A reactive, scalable, and transferable model for molecular energies from a neural network approach based on local information
,”
J. Chem. Phys.
148
,
241708
(
2018
).
89.
K.
Okumoto
and
Y.
Shirota
, “
Development of high-performance blue-violet-emitting organic electroluminescent devices
,”
Appl. Phys. Lett.
79
,
1231
1233
(
2001
).
90.
P. B.
Deotare
,
W.
Chang
,
E.
Hontz
,
D. N.
Congreve
,
L.
Shi
,
P. D.
Reusswig
,
B.
Modtland
,
M. E.
Bahlke
,
C. K.
Lee
,
A. P.
Willard
,
V.
Bulović
,
T.
Van Voorhis
, and
M. A.
Baldo
, “
Nanoscale transport of charge-transfer states in organic donor-acceptor blends
,”
Nat. Mater.
14
,
1130
1134
(
2015
).
91.
J.-H.
Jou
,
S.
Kumar
,
A.
Agrawal
,
T.-H.
Li
, and
S.
Sahoo
, “
Approaches for fabricating high efficiency organic light emitting diodes
,”
J. Mater. Chem. C
3
,
2974
3002
(
2015
).
92.
S.
Shahnawaz
,
S.
Sudheendran Swayamprabha
,
M. R.
Nagar
,
R. A. K.
Yadav
,
S.
Gull
,
D. K.
Dubey
, and
J.-H.
Jou
, “
Hole-transporting materials for organic light-emitting diodes: An overview
,”
J. Mater. Chem. C
7
,
7144
7158
(
2019
).
93.
S.
Mallat
and
Z.
Zhang
, “
Adaptive time-frequency decomposition with matching pursuits
,” in
Proceedings of IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis
(
IEEE
,
1992
), pp.
7
10
.
94.
R.
Rubinstein
,
M.
Zibulevsky
, and
M.
Elad
, “
Efficient implementation of the K-SVD algorithm using batch orthogonal matching pursuit
,” Technical Report CS-2008-08, Computer Science Department of Technion,
Israel Institute of Technology
,
Haifa, Israel
(
2008
).
95.
T.
Körzdörfer
,
J. S.
Sears
,
C.
Sutton
, and
J.-L.
Brédas
, “
Long-range corrected hybrid functionals for π-conjugated systems: Dependence of the range-separation parameter on conjugation length
,”
J. Chem. Phys.
135
,
204107
(
2011
).
96.
S.
Refaely-Abramson
,
R.
Baer
, and
L.
Kronik
, “
Fundamental and excitation gaps in molecules of relevance for organic photovoltaics from an optimally tuned range-separated hybrid functional
,”
Phys. Rev. B
84
,
075144
(
2011
).
97.
H.
Sun
and
J.
Autschbach
, “
Influence of the delocalization error and applicability of optimal functional tuning in density functional calculations of nonlinear optical properties of organic donor-acceptor chromophores
,”
ChemPhysChem
14
,
2450
2461
(
2013
).
98.
D.
Jacquemin
,
B.
Moore
,
A.
Planchat
,
C.
Adamo
, and
J.
Autschbach
, “
Performance of an optimally tuned range-separated hybrid functional for 0-0 electronic excitation energies
,”
J. Chem. Theory Comput.
10
,
1677
1685
(
2014
).
99.
W.-J.
Chi
,
P.-P.
Sun
, and
Z.-S.
Li
, “
How to regulate energy levels and hole mobility of spiro-type hole transport materials in perovskite solar cells
,”
Phys. Chem. Chem. Phys.
18
,
27073
27077
(
2016
).
100.
W.-J.
Chi
,
Q.-S.
Li
, and
Z.-S.
Li
, “
Exploring the electrochemical properties of hole transport materials with spiro-cores for efficient perovskite solar cells from first-principles
,”
Nanoscale
8
,
6146
6154
(
2016
).
101.
D.
Ghosh
,
O.
Isayev
,
L. V.
Slipchenko
, and
A. I.
Krylov
, “
Effect of solvation on the vertical ionization energy of thymine: From microhydration to bulk
,”
J. Phys. Chem. A
115
,
6028
6038
(
2011
).
102.
N. B.
Kotadiya
,
A.
Mondal
,
S.
Xiong
,
P. W. M.
Blom
,
D.
Andrienko
, and
G. J. A. H.
Wetzelaer
, “
Rigorous characterization and predictive modeling of hole transport in amorphous organic semiconductors
,”
Adv. Electron. Mater.
4
,
1800366
(
2018
).
103.
P.
Das
,
M.
Moll
,
H.
Stamati
,
L. E.
Kavraki
, and
C.
Clementi
, “
Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction
,”
Proc. Natl. Acad. Sci. U. S. A.
103
,
9885
9890
(
2006
).
104.
P. H.
Nguyen
, “
Complexity of free energy landscapes of peptides revealed by nonlinear principal component analysis
,”
Proteins: Struct., Funct., Bioinf.
65
,
898
913
(
2006
).
105.
W. M.
Brown
,
S.
Martin
,
S. N.
Pollock
,
E. A.
Coutsias
, and
J.-P.
Watson
, “
Algorithmic dimensionality reduction for molecular structure analysis
,”
J. Chem. Phys.
129
,
064118
(
2008
).
106.
A. L.
Ferguson
,
A. Z.
Panagiotopoulos
,
P. G.
Debenedetti
, and
I. G.
Kevrekidis
, “
Systematic determination of order parameters for chain dynamics using diffusion maps
,”
Proc. Natl. Acad. Sci. U. S. A.
107
,
13597
13602
(
2010
).
107.
V.
Spiwok
and
B.
Králová
, “
Metadynamics in the conformational space nonlinearly dimensionally reduced by Isomap
,”
J. Chem. Phys.
135
,
224504
(
2011
).
108.
A. L.
Ferguson
,
A. Z.
Panagiotopoulos
,
I. G.
Kevrekidis
, and
P. G.
Debenedetti
, “
Nonlinear dimensionality reduction in molecular simulation: The diffusion map approach
,”
Chem. Phys. Lett.
509
,
1
11
(
2011
).
109.
M.
Ceriotti
,
G. A.
Tribello
, and
M.
Parrinello
, “
Simplifying the representation of complex free-energy landscapes using sketch-map
,”
Proc. Natl. Acad. Sci. U. S. A.
108
,
13023
13028
(
2011
).
110.
G. A.
Tribello
,
M.
Ceriotti
, and
M.
Parrinello
, “
Using sketch-map coordinates to analyze and bias molecular dynamics simulations
,”
Proc. Natl. Acad. Sci. U. S. A.
109
,
5196
5201
(
2012
).
111.
M.
Ceriotti
,
G. A.
Tribello
, and
M.
Parrinello
, “
Demonstrating the transferability and the descriptive power of sketch-map
,”
J. Chem. Theory Comput.
9
,
1521
1532
(
2013
).
112.
S.
De
,
A. P.
Bartók
,
G.
Csányi
, and
M.
Ceriotti
, “
Comparing molecules and solids across structural and alchemical space
,”
Phys. Chem. Chem. Phys.
18
,
13754
13769
(
2016
); arXiv:1601.04077.
113.
M.
Reutlinger
and
G.
Schneider
, “
Nonlinear dimensionality reduction and mapping of compound libraries for drug discovery
,”
J. Mol. Graphics Modell.
34
,
108
117
(
2012
).
114.
A. M.
Virshup
,
J.
Chen
, and
T. J.
Martínez
, “
Nonlinear dimensionality reduction for nonadiabatic dynamics: The influence of conical intersection topography on population transfer rates
,”
J. Chem. Phys.
137
,
22A519
(
2012
).
115.
M.
Duan
,
J.
Fan
,
M.
Li
,
L.
Han
, and
S.
Huo
, “
Evaluation of dimensionality-reduction methods from peptide folding-unfolding simulations
,”
J. Chem. Theory Comput.
9
,
2490
2497
(
2013
).
116.
P.
Schneider
,
A. T.
Müller
,
G.
Gabernet
,
A. L.
Button
,
G.
Posselt
,
S.
Wessler
,
J. A.
Hiss
, and
G.
Schneider
, “
Hybrid network model for “deep learning” of chemical data: Application to antimicrobial peptides
,”
Mol. Inf.
36
,
1600011
(
2017
).
117.
T.
Lemke
and
C.
Peter
, “
EncoderMap: Dimensionality reduction and generation of molecule conformations
,”
J. Chem. Theory Comput.
15
,
1209
1215
(
2019
).
118.
L.
Van Der Maaten
and
G.
Hinton
, “
Visualizing data using t-SNE
,”
J. Mach. Learn. Res.
9
,
2579
2605
(
2008
).
119.
E.
Ruiz
,
D. R.
Salahub
, and
A.
Vela
, “
Defining the domain of density functionals: Charge-transfer complexes
,”
J. Am. Chem. Soc.
117
,
1141
1142
(
1995
).
120.
E.
Ruiz
,
D. R.
Salahub
, and
A.
Vela
, “
Charge-transfer complexes: Stringent tests for widely used density functionals
,”
J. Phys. Chem.
100
,
12265
12276
(
1996
).
121.
M. J. S.
Dewar
and
G. P.
Ford
, “
Relationship between olefinic π complexes and three-membered rings
,”
J. Am. Chem. Soc.
101
,
783
791
(
1979
).
122.
M. J.
Frisch
,
G. W.
Trucks
,
H. B.
Schlegel
,
G. E.
Scuseria
,
M. A.
Robb
,
J. R.
Cheeseman
,
G.
Scalmani
,
V.
Barone
,
G. A.
Petersson
,
H.
Nakatsuji
,
X.
Li
,
M.
Caricato
,
A. V.
Marenich
,
J.
Bloino
,
B. G.
Janesko
,
R.
Gomperts
,
B.
Mennucci
,
H. P.
Hratchian
,
J. V.
Ortiz
,
A. F.
Izmaylov
,
J. L.
Sonnenberg
,
D.
Williams-Young
,
F.
Ding
,
F.
Lipparini
,
F.
Egidi
,
J.
Goings
,
B.
Peng
,
A.
Petrone
,
T.
Henderson
,
D.
Ranasinghe
,
V. G.
Zakrzewski
,
J.
Gao
,
N.
Rega
,
G.
Zheng
,
W.
Liang
,
M.
Hada
,
M.
Ehara
,
K.
Toyota
,
R.
Fukuda
,
J.
Hasegawa
,
M.
Ishida
,
T.
Nakajima
,
Y.
Honda
,
O.
Kitao
,
H.
Nakai
,
T.
Vreven
,
K.
Throssell
,
J. A.
Montgomery
, Jr.
,
J. E.
Peralta
,
F.
Ogliaro
,
M. J.
Bearpark
,
J. J.
Heyd
,
E. N.
Brothers
,
K. N.
Kudin
,
V. N.
Staroverov
,
T. A.
Keith
,
R.
Kobayashi
,
J.
Normand
,
K.
Raghavachari
,
A. P.
Rendell
,
J. C.
Burant
,
S. S.
Iyengar
,
J.
Tomasi
,
M.
Cossi
,
J. M.
Millam
,
M.
Klene
,
C.
Adamo
,
R.
Cammi
,
J. W.
Ochterski
,
R. L.
Martin
,
K.
Morokuma
,
O.
Farkas
,
J. B.
Foresman
, and
D. J.
Fox
, Gaussian16 Revision B.01 (
2016
).
123.
F.
Weigend
and
R.
Ahlrichs
, “
Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy
,”
Phys. Chem. Chem. Phys.
7
,
3297
3305
(
2005
).
124.
D.
Sinha
,
S. K.
Mukhopadhyay
,
R.
Chaudhuri
, and
D.
Mukherjee
, “
The eigenvalue-independent partitioning technique in Fock space: An alternative route to open-shell coupled-cluster theory for incomplete model spaces
,”
Chem. Phys. Lett.
154
,
544
549
(
1989
).
125.
J. F.
Stanton
and
J.
Gauss
, “
Analytic energy derivatives for ionized states described by the equation-of-motion coupled cluster method
,”
J. Chem. Phys.
101
,
8938
8944
(
1994
).
126.
A. K.
Dutta
,
F.
Neese
, and
R.
Izsák
, “
Towards a pair natural orbital coupled cluster method for excited states
,”
J. Chem. Phys.
145
,
034102
(
2016
).
127.
A. K.
Dutta
,
M.
Nooijen
,
F.
Neese
, and
R.
Izsák
, “
Exploring the accuracy of a low scaling similarity transformed equation of motion method for vertical excitation energies
,”
J. Chem. Theory Comput.
14
,
72
91
(
2018
).
128.
C.
Riplinger
and
F.
Neese
, “
An efficient and near linear scaling pair natural orbital based local coupled cluster method
,”
J. Chem. Phys.
138
,
034106
(
2013
).
129.
F.
Neese
, “
The ORCA program system
,”
Wiley Interdiscip. Rev.: Comput. Mol. Sci.
2
,
73
78
(
2012
).
130.
A. S.
Christensen
,
F. A.
Faber
,
B.
Huang
,
L.
Bratholm
,
A.
Tkatchenko
,
K.
Muller
, and
O. A.
von Lilienfeld
, “
QML: A Python toolkit for quantum machine learning
” (
2017
), https://github.com/qmlcode/qml.
131.
A. P.
Bartók
,
R.
Kondor
, and
G.
Csányi
, “
On representing chemical environments
,”
Phys. Rev. B
87
,
184115
(
2013
); arXiv:1209.3140.
132.
L.
Himanen
,
M. O. J.
Jäger
,
E. V.
Morooka
,
F.
Federici Canova
,
Y. S.
Ranawat
,
D. Z.
Gao
,
P.
Rinke
, and
A. S.
Foster
, “
DScribe: Library of descriptors for machine learning in materials science
,”
Comput. Phys. Commun.
247
,
106949
(
2020
); arXiv:1904.08875.
133.
F.
Pedregosa
,
G.
Varoquaux
,
A.
Gramfort
,
V.
Michel
,
B.
Thirion
,
O.
Grisel
,
M.
Blondel
,
P.
Prettenhofer
,
R.
Weiss
,
V.
Dubourg
,
J.
Vanderplas
,
A.
Passos
,
D.
Cournapeau
,
M.
Brucher
,
M.
Perrot
, and
E.
Duchesnay
, “
Scikit-learn: Machine learning in Python
,”
J. Mach. Learn. Res.
12
,
2825
2830
(
2011
).

Supplementary Material