Crystallographic group is an important character to describe the crystal structure, but it is difficult to identify the crystallographic group of crystal when only chemical composition is given. Here, we present a machine-learning method to predict the crystallographic group of crystal structure from its chemical formula. 34528 stable compounds in 230 crystallographic groups are investigated, of which 72% of data set are used as training set, 8% as validation set, and 20% as test set. Based on the results of machine learning, we present a model which can obtain correct crystallographic group in the top-1, top-5, and top-10 results with the estimated accuracy of 60.8%, 76.5%, and 82.6%, respectively. In particular, the performance of deep-learning model presents high generalization through comparison between validation set and test set. Additionally, 230 crystallographic groups are classified into 19 new labels, denoting 18 heavily represented crystallographic groups with each containing more than 400 compounds and one combination group of remaining compounds in other 212 crystallographic groups. A deep-learning model trained on 19 new labels yields a promising result to identify crystallographic group with the estimated accuracy of 72.2%. Our results provide a promising approach to identify crystallographic group of crystal structures only from their chemical composition.
Skip Nav Destination
Article navigation
Research Article|
February 01 2023
Crystallographic groups prediction from chemical composition via deep learning
Special Collection:
Virtual Issue on Machine Learning for Computational Chemistry
Da-yong Wang;
Da-yong Wang
Hefei National Laboratory for Physical Sciences at the Microscale, School of Chemistry and Materials Sciences, CAS Key Laboratory of Materials for Energy Conversion, and CAS Center for Excellence in Nanoscience, Synergetic Innovation Center of Quantum Information & Quantum Physics, University of Science and Technology of China
, Hefei 230026, China
.
Search for other works by this author on:
Hai-feng Lv;
Hai-feng Lv
Hefei National Laboratory for Physical Sciences at the Microscale, School of Chemistry and Materials Sciences, CAS Key Laboratory of Materials for Energy Conversion, and CAS Center for Excellence in Nanoscience, Synergetic Innovation Center of Quantum Information & Quantum Physics, University of Science and Technology of China
, Hefei 230026, China
.
Search for other works by this author on:
Xiao-jun Wu
Xiao-jun Wu
*
Hefei National Laboratory for Physical Sciences at the Microscale, School of Chemistry and Materials Sciences, CAS Key Laboratory of Materials for Energy Conversion, and CAS Center for Excellence in Nanoscience, Synergetic Innovation Center of Quantum Information & Quantum Physics, University of Science and Technology of China
, Hefei 230026, China
.*Author to whom correspondence should be addressed. E-mail: xjwu@ustc.edu.cn
Search for other works by this author on:
*Author to whom correspondence should be addressed. E-mail: xjwu@ustc.edu.cn
Chin. J. Chem. Phys. 36, 66–74 (2023)
Article history
Received:
July 26 2021
Accepted:
August 19 2021
Citation
Da-yong Wang, Hai-feng Lv, Xiao-jun Wu; Crystallographic groups prediction from chemical composition via deep learning. Chin. J. Chem. Phys. 1 February 2023; 36 (1): 66–74. https://doi.org/10.1063/1674-0068/cjcp2107124
Download citation file:
Citing articles via
Ultrafast intrinsic excited state localization m 2D layered As2S3 by interlayer bond formation
Xufeng Li, Li Yao, et al.
Crystallographic groups prediction from chemical composition via deep learning
Da-yong Wang, Hai-feng Lv, et al.
Quantum dynamics calculations on isotope effects of hydrogen transfer isomerization in formic acid dimer
Fengyi Li, Xiaoxi Liu, et al.
Related Content
Investigating phase transitions from local crystallographic analysis based on statistical learning of atomic environments in 2D MoS2-ReS2
Appl. Phys. Rev. (March 2021)
Multimodal learning of heat capacity based on transformers and crystallography pretraining
J. Appl. Phys. (April 2024)
Comparative crystallographic study of La2NiTiO6 double pervoskite structure using crystallographic software’s
AIP Conf. Proc. (April 2019)
High-performance chemical information database towards accelerating discovery of metal-organic frameworks for gas adsorption with machine learning
Chin. J. Chem. Phys. (August 2021)
Effect of crystallographic orientation on the nanohardness of Hadfield steel single crystals
AIP Conference Proceedings (December 2020)