Gradient-domain machine learning (GDML) is an accurate and efficient approach to learn a molecular potential and associated force field based on the kernel ridge regression algorithm. Here, we demonstrate its application to learn an effective coarse-grained (CG) model from all-atom simulation data in a sample efficient manner. The CG force field is learned by following the thermodynamic consistency principle, here by minimizing the error between the predicted CG force and the all-atom mean force in the CG coordinates. Solving this problem by GDML directly is impossible because coarse-graining requires averaging over many training data points, resulting in impractical memory requirements for storing the kernel matrices. In this work, we propose a data-efficient and memory-saving alternative. Using ensemble learning and stratified sampling, we propose a 2-layer training scheme that enables GDML to learn an effective CG model. We illustrate our method on a simple biomolecular system, alanine dipeptide, by reconstructing the free energy landscape of a CG variant of this molecule. Our novel GDML training scheme yields a smaller free energy error than neural networks when the training set is small, and a comparably high accuracy when the training set is sufficiently large.
Skip Nav Destination
,
,
,
,
CHORUS
Article navigation
21 May 2020
Research Article|
May 20 2020
Ensemble learning of coarse-grained molecular dynamics force fields with a kernel approach
Special Collection:
Machine Learning Meets Chemical Physics
Jiang Wang
;
Jiang Wang
1
Center for Theoretical Biological Physics, Rice University
, Houston, Texas 77005, USA
2
Department of Chemistry, Rice University
, Houston, Texas 77005, USA
Search for other works by this author on:
Stefan Chmiela;
Stefan Chmiela
3
Machine Learning Group, Technische Universität Berlin
, 10587 Berlin, Germany
Search for other works by this author on:
Klaus-Robert Müller;
Klaus-Robert Müller
3
Machine Learning Group, Technische Universität Berlin
, 10587 Berlin, Germany
4
Department of Brain and Cognitive Engineering, Korea University
, Seoul 02841, South Korea
5
Max Planck Institute for Informatics
, Saarbrücken 66123, Germany
Search for other works by this author on:
Frank Noé
;
Frank Noé
a)
1
Center for Theoretical Biological Physics, Rice University
, Houston, Texas 77005, USA
2
Department of Chemistry, Rice University
, Houston, Texas 77005, USA
6
Department of Mathematics and Computer Science, Freie Universität Berlin
, Arnimallee 6, 14195 Berlin, Germany
7
Department of Physics, Freie Universität Berlin
, Arnimallee 14, 14195 Berlin, Germany
Search for other works by this author on:
Cecilia Clementi
Cecilia Clementi
b)
1
Center for Theoretical Biological Physics, Rice University
, Houston, Texas 77005, USA
2
Department of Chemistry, Rice University
, Houston, Texas 77005, USA
6
Department of Mathematics and Computer Science, Freie Universität Berlin
, Arnimallee 6, 14195 Berlin, Germany
7
Department of Physics, Freie Universität Berlin
, Arnimallee 14, 14195 Berlin, Germany
8
Department of Physics, Rice University
, Houston, Texas 77005, USA
b)Author to whom correspondence should be addressed: [email protected]
Search for other works by this author on:
Jiang Wang
1,2
Stefan Chmiela
3
Klaus-Robert Müller
3,4,5
Frank Noé
1,2,6,7,a)
Cecilia Clementi
1,2,6,7,8,b)
1
Center for Theoretical Biological Physics, Rice University
, Houston, Texas 77005, USA
2
Department of Chemistry, Rice University
, Houston, Texas 77005, USA
3
Machine Learning Group, Technische Universität Berlin
, 10587 Berlin, Germany
4
Department of Brain and Cognitive Engineering, Korea University
, Seoul 02841, South Korea
5
Max Planck Institute for Informatics
, Saarbrücken 66123, Germany
6
Department of Mathematics and Computer Science, Freie Universität Berlin
, Arnimallee 6, 14195 Berlin, Germany
7
Department of Physics, Freie Universität Berlin
, Arnimallee 14, 14195 Berlin, Germany
8
Department of Physics, Rice University
, Houston, Texas 77005, USA
a)
Electronic mail: [email protected]
b)Author to whom correspondence should be addressed: [email protected]
Note: This paper is part of the JCP Special Topic on Machine Learning Meets Chemical Physics.
J. Chem. Phys. 152, 194106 (2020)
Article history
Received:
March 11 2020
Accepted:
May 01 2020
Citation
Jiang Wang, Stefan Chmiela, Klaus-Robert Müller, Frank Noé, Cecilia Clementi; Ensemble learning of coarse-grained molecular dynamics force fields with a kernel approach. J. Chem. Phys. 21 May 2020; 152 (19): 194106. https://doi.org/10.1063/5.0007276
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
DeePMD-kit v2: A software package for deep potential models
Jinzhe Zeng, Duo Zhang, et al.
CREST—A program for the exploration of low-energy molecular chemical space
Philipp Pracht, Stefan Grimme, et al.
Related Content
Coarse graining molecular dynamics with graph neural networks
J. Chem. Phys. (November 2020)
Neural potentials of proteins extrapolate beyond training data
J. Chem. Phys. (August 2023)
FCHL revisited: Faster and more accurate quantum machine learning
J. Chem. Phys. (January 2020)
Molecular force fields with gradient-domain machine learning (GDML): Comparison and synergies with classical force fields
J. Chem. Phys. (September 2020)
Efficient force field and energy emulation through partition of permutationally equivalent atoms
J. Chem. Phys. (May 2022)