Computation of intermolecular interactions is a challenge in drug discovery because accurate ab initio techniques are too computationally expensive to be routinely applied to drug–protein models. Classical force fields are more computationally feasible, and force fields designed to match symmetry adapted perturbation theory (SAPT) interaction energies can remain accurate in this context. Unfortunately, the application of such force fields is complicated by the laborious parameterization required for computations on new molecules. Here, we introduce the component-based machine-learned intermolecular force field (CLIFF), which combines accurate, physics-based equations for intermolecular interaction energies with machine-learning models to enable automatic parameterization. The CLIFF uses functional forms corresponding to electrostatic, exchange-repulsion, induction/polarization, and London dispersion components in SAPT. Molecule-independent parameters are fit with respect to SAPT2+(3)δMP2/aug-cc-pVTZ, and molecule-dependent atomic parameters (atomic widths, atomic multipoles, and Hirshfeld ratios) are obtained from machine learning models developed for C, N, O, H, S, F, Cl, and Br. The CLIFF achieves mean absolute errors (MAEs) no worse than 0.70 kcal mol−1 in both total and component energies across a diverse dimer test set. For the side chain–side chain interaction database derived from protein fragments, the CLIFF produces total interaction energies with an MAE of 0.27 kcal mol−1 with respect to reference data, outperforming similar and even more expensive methods. In applications to a set of model drug–protein interactions, the CLIFF is able to accurately rank-order ligand binding strengths and achieves less than 10% error with respect to SAPT reference values for most complexes.
Skip Nav Destination
Article navigation
14 May 2021
Research Article|
May 14 2021
CLIFF: A component-based, machine-learned, intermolecular force field
Jeffrey B. Schriber
;
Jeffrey B. Schriber
a)
1
Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, Georgia Institute of Technology
, Atlanta, Georgia 30318, USA
Search for other works by this author on:
Daniel R. Nascimento;
Daniel R. Nascimento
1
Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, Georgia Institute of Technology
, Atlanta, Georgia 30318, USA
Search for other works by this author on:
Alexios Koutsoukas;
Alexios Koutsoukas
2
Molecular Structure and Design, Bristol Myers Squibb Company
, P.O. Box 5400, Princeton, New Jersey 08543, USA
Search for other works by this author on:
Steven A. Spronk
;
Steven A. Spronk
2
Molecular Structure and Design, Bristol Myers Squibb Company
, P.O. Box 5400, Princeton, New Jersey 08543, USA
Search for other works by this author on:
Daniel L. Cheney;
Daniel L. Cheney
2
Molecular Structure and Design, Bristol Myers Squibb Company
, P.O. Box 5400, Princeton, New Jersey 08543, USA
Search for other works by this author on:
C. David Sherrill
C. David Sherrill
b)
3
Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry and School of Computational Science and Engineering, Georgia Institute of Technology
, Atlanta, Georgia 30318, USA
b)Author to whom correspondence should be addressed: sherrill@gatech.edu
Search for other works by this author on:
a)
Electronic mail: jschriber7@gatech.edu
b)Author to whom correspondence should be addressed: sherrill@gatech.edu
J. Chem. Phys. 154, 184110 (2021)
Article history
Received:
January 05 2021
Accepted:
April 27 2021
Citation
Jeffrey B. Schriber, Daniel R. Nascimento, Alexios Koutsoukas, Steven A. Spronk, Daniel L. Cheney, C. David Sherrill; CLIFF: A component-based, machine-learned, intermolecular force field. J. Chem. Phys. 14 May 2021; 154 (18): 184110. https://doi.org/10.1063/5.0042989
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00