Machine learning potentials (MLPs) have attracted significant attention in computational chemistry and materials science due to their high accuracy and computational efficiency. The proper selection of atomic structures is crucial for developing reliable MLPs. Insufficient or redundant atomic structures can impede the training process and potentially result in a poor quality MLP. Here, we propose a local-environment-guided screening algorithm for efficient dataset selection in MLP development. The algorithm utilizes a local environment bank to store unique local environments of atoms. The dissimilarity between a particular local environment and those stored in the bank is evaluated using the Euclidean distance. A new structure is selected only if its local environment is significantly different from those already present in the bank. Consequently, the bank is then updated with all the new local environments found in the selected structure. To demonstrate the effectiveness of our algorithm, we applied it to select structures for a Ge system and a Pd13H2 particle system. The algorithm reduced the training data size by around 80% for both without compromising the performance of the MLP models. We verified that the results were independent of the selection and ordering of the initial structures. We also compared the performance of our method with the farthest point sampling algorithm, and the results show that our algorithm is superior in both robustness and computational efficiency. Furthermore, the generated local environment bank can be continuously updated and can potentially serve as a growing database of feature local environments, aiding in efficient dataset maintenance for constructing accurate MLPs.
Skip Nav Destination
CHORUS
Article navigation
21 February 2024
Research Article|
February 21 2024
Local-environment-guided selection of atomic structures for the development of machine-learning potentials
Renzhe Li
;
Renzhe Li
(Formal analysis, Investigation, Methodology, Writing – original draft)
1
Shenzhen Key Laboratory of Micro/Nano-Porous Functional Materials (SKLPM), Department of Materials Science and Engineering, Southern University of Science and Technology
, Shenzhen 518055, People's Republic of China
2
College of Chemistry, Xiangtan University
, Xiangtan 411105, Hunan Province, People's Republic of China
Search for other works by this author on:
Chuan Zhou
;
Chuan Zhou
(Formal analysis, Investigation, Methodology, Writing – review & editing)
1
Shenzhen Key Laboratory of Micro/Nano-Porous Functional Materials (SKLPM), Department of Materials Science and Engineering, Southern University of Science and Technology
, Shenzhen 518055, People's Republic of China
Search for other works by this author on:
Akksay Singh
;
Akksay Singh
(Investigation, Writing – review & editing)
1
Shenzhen Key Laboratory of Micro/Nano-Porous Functional Materials (SKLPM), Department of Materials Science and Engineering, Southern University of Science and Technology
, Shenzhen 518055, People's Republic of China
3
Department of Chemistry, The University of Texas at Austin
, Austin, Texas 78712, USA
4
Institute for Computational Engineering and Sciences, The University of Texas at Austin
, Austin, Texas 78712, USA
Search for other works by this author on:
Yong Pei
;
Yong Pei
(Writing – review & editing)
2
College of Chemistry, Xiangtan University
, Xiangtan 411105, Hunan Province, People's Republic of China
Search for other works by this author on:
Graeme Henkelman
;
Graeme Henkelman
a)
(Conceptualization, Funding acquisition, Project administration, Supervision, Validation, Writing – review & editing)
3
Department of Chemistry, The University of Texas at Austin
, Austin, Texas 78712, USA
4
Institute for Computational Engineering and Sciences, The University of Texas at Austin
, Austin, Texas 78712, USA
Search for other works by this author on:
Lei Li
Lei Li
a)
(Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Writing – review & editing)
1
Shenzhen Key Laboratory of Micro/Nano-Porous Functional Materials (SKLPM), Department of Materials Science and Engineering, Southern University of Science and Technology
, Shenzhen 518055, People's Republic of China
Search for other works by this author on:
J. Chem. Phys. 160, 074109 (2024)
Article history
Received:
November 17 2023
Accepted:
January 26 2024
Citation
Renzhe Li, Chuan Zhou, Akksay Singh, Yong Pei, Graeme Henkelman, Lei Li; Local-environment-guided selection of atomic structures for the development of machine-learning potentials. J. Chem. Phys. 21 February 2024; 160 (7): 074109. https://doi.org/10.1063/5.0187892
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
DeePMD-kit v2: A software package for deep potential models
Jinzhe Zeng, Duo Zhang, et al.
CREST—A program for the exploration of low-energy molecular chemical space
Philipp Pracht, Stefan Grimme, et al.
Rubber wear: Experiment and theory
B. N. J. Persson, R. Xu, et al.
Related Content
Pair-distribution-function guided optimization of fingerprints for atom-centered neural network potentials
J. Chem. Phys. (June 2020)
Sparse Gaussian process based machine learning first principles potentials for materials simulations: Application to batteries, solar cells, catalysts, and macromolecular systems
Chem. Phys. Rev. (November 2024)
Machine learning for potential energy surfaces: An extensive database and assessment of methods
J. Chem. Phys. (June 2019)
ABFML: A problem-oriented package for rapidly creating, screening, and optimizing new machine learning force fields
J. Chem. Phys. (February 2025)