Machine learning of atomic-scale properties is revolutionizing molecular modeling, making it possible to evaluate inter-atomic potentials with first-principles accuracy, at a fraction of the costs. The accuracy, speed, and reliability of machine learning potentials, however, depend strongly on the way atomic configurations are represented, i.e., the choice of descriptors used as input for the machine learning method. The raw Cartesian coordinates are typically transformed in “fingerprints,” or “symmetry functions,” that are designed to encode, in addition to the structure, important properties of the potential energy surface like its invariances with respect to rotation, translation, and permutation of like atoms. Here we discuss automatic protocols to select a number of fingerprints out of a large pool of candidates, based on the correlations that are intrinsic to the training data. This procedure can greatly simplify the construction of neural network potentials that strike the best balance between accuracy and computational efficiency and has the potential to accelerate by orders of magnitude the evaluation of Gaussian approximation potentials based on the smooth overlap of atomic positions kernel. We present applications to the construction of neural network potentials for water and for an Al–Mg–Si alloy and to the prediction of the formation energies of small organic molecules using Gaussian process regression.
Skip Nav Destination
Article navigation
28 June 2018
Research Article|
April 30 2018
Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials
Special Collection:
Data-Enabled Theoretical Chemistry
Giulio Imbalzano;
Giulio Imbalzano
1
Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne
, 1015 Lausanne, Switzerland
Search for other works by this author on:
Andrea Anelli
;
Andrea Anelli
1
Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne
, 1015 Lausanne, Switzerland
Search for other works by this author on:
Daniele Giofré
;
Daniele Giofré
1
Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne
, 1015 Lausanne, Switzerland
Search for other works by this author on:
Sinja Klees;
Sinja Klees
2
Lehrstuhl für Theoretische Chemie, Ruhr-Universität Bochum
, 44801 Bochum, Germany
Search for other works by this author on:
Jörg Behler
;
Jörg Behler
2
Lehrstuhl für Theoretische Chemie, Ruhr-Universität Bochum
, 44801 Bochum, Germany
3
Universität Göttingen, Institut für Physikalische Chemie, Theoretische Chemie
, Tammannstr. 6, 37077 Göttingen, Germany
Search for other works by this author on:
Michele Ceriotti
Michele Ceriotti
1
Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne
, 1015 Lausanne, Switzerland
Search for other works by this author on:
J. Chem. Phys. 148, 241730 (2018)
Article history
Received:
February 02 2018
Accepted:
April 10 2018
Citation
Giulio Imbalzano, Andrea Anelli, Daniele Giofré, Sinja Klees, Jörg Behler, Michele Ceriotti; Automatic selection of atomic fingerprints and reference configurations for machine-learning potentials. J. Chem. Phys. 28 June 2018; 148 (24): 241730. https://doi.org/10.1063/1.5024611
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
A theory of pitch for the hydrodynamic properties of molecules, helices, and achiral swimmers at low Reynolds number
Anderson D. S. Duraes, J. Daniel Gezelter
DeePMD-kit v2: A software package for deep potential models
Jinzhe Zeng, Duo Zhang, et al.