Structures and properties of many inorganic compounds have been collected historically. However, it only covers a very small portion of possible inorganic crystals, which implies the presence of numerous currently unknown compounds. A powerful machine-learning strategy is mandatory to discover new inorganic compounds from all chemical combinations. Herein we propose a descriptor-based recommender-system approach to estimate the relevance of chemical compositions where crystals can be formed [i.e., chemically relevant compositions (CRCs)]. In addition to data-driven compositional similarity used in the literature, the use of compositional descriptors as a prior knowledge is helpful for the discovery of new compounds. We validate our recommender systems in two ways. First, one database is used to construct a model, while another is used for the validation. Second, we estimate the phase stability for compounds at expected CRCs using density functional theory calculations.

1.
G.
Bergerhoff
and
I. D.
Brown
, in
Crystallographic Databases
, edited by et al (
International Union of Crystallography
,
Chester
,
1987
).
2.
Y.
Xu
,
M.
Yamazaki
, and
P.
Villars
,
Jpn. J. Appl. Phys., Part 1
50
,
11RH02
(
2011
).
3.
R. T.
Downs
and
M.
Hall-Wallace
,
Am. Mineral.
88
,
247
(
2003
).
4.
A.
Jain
,
S. P.
Ong
,
G.
Hautier
,
W.
Chen
,
W. D.
Richards
,
S.
Dacek
,
S.
Cholia
,
D.
Gunter
,
D.
Skinner
,
G.
Ceder
 et al,
APL Mater.
1
,
011002
(
2013
).
5.
S.
Curtarolo
,
W.
Setyawan
,
S.
Wang
,
J.
Xue
,
K.
Yang
,
R. H.
Taylor
,
L. J.
Nelson
,
G. L.
Hart
,
S.
Sanvito
,
M.
Buongiorno-Nardelli
 et al,
Comput. Mater. Sci.
58
,
227
(
2012
).
6.
J. E.
Saal
,
S.
Kirklin
,
M.
Aykol
,
B.
Meredig
, and
C.
Wolverton
,
JOM
65
,
1501
(
2013
).
7.
D.
Pettifor
,
J. Phys. C: Solid State Phys.
19
,
285
(
1986
).
8.
G.
Hautier
,
C. C.
Fischer
,
A.
Jain
,
T.
Mueller
, and
G.
Ceder
,
Chem. Mater.
22
,
3762
(
2010
).
9.
G.
Hautier
,
C.
Fischer
,
V.
Ehrlacher
,
A.
Jain
, and
G.
Ceder
,
Inorg. Chem.
50
,
656
(
2010
).
10.
A.
Seko
,
H.
Hayashi
,
H.
Kashima
, and
I.
Tanaka
,
Phys. Rev. Mater.
2
,
013805
(
2018
).
11.
B.
Meredig
,
A.
Agrawal
,
S.
Kirklin
,
J. E.
Saal
,
J. W.
Doak
,
A.
Thompson
,
K.
Zhang
,
A.
Choudhary
, and
C.
Wolverton
,
Phys. Rev. B
89
,
094104
(
2014
).
12.
L.
Ward
,
A.
Agrawal
,
A.
Choudhary
, and
C.
Wolverton
,
npj Comput. Mater.
2
,
16028
(
2016
).
13.
P.
Resnick
and
H. R.
Varian
,
Commun. ACM
40
,
56
(
1997
).
14.
C. C.
Aggarwal
,
Recommender Systems
(
Springer
,
2016
).
15.
A.
Seko
,
H.
Hayashi
,
K.
Nakayama
,
A.
Takahashi
, and
I.
Tanaka
,
Phys. Rev. B
95
,
144110
(
2017
).
16.
D. R.
Cox
,
J. Roy. Stat. Soc. Ser. B
20
,
215
(
1958
).
17.
T.
Hastie
,
R.
Tibshirani
, and
J.
Friedman
,
The Elements of Statistical Learning
, 2nd ed. (
Springer
,
New York
,
2009
).
18.
L.
Breiman
, “
Arcing the edge
,” Technical Report No. 486,
Statistics Department, University of California
,
Berkeley
,
1997
.
19.
T. K.
Ho
, in
Proceedings of the Third International Conference on Document Analysis and Recognition
(
IEEE
,
1995
), Vol. 1, pp.
278
282
.
20.
ICDD
, in
PDF-4/Organics 2011 (Database)
, edited by
S.
Kabekkodu
(
International Centre for Diffraction Data
,
Newtown Square, PA, USA
,
2010
).
21.
N.
Kamaya
,
K.
Homma
,
Y.
Yamakawa
,
M.
Hirayama
,
R.
Kanno
,
M.
Yonemura
,
T.
Kamiyama
,
Y.
Kato
,
S.
Hama
,
K.
Kawamoto
 et al,
Nat. Mater.
10
,
682
(
2011
).
22.
A.
Kuhn
,
V.
Duppel
, and
B. V.
Lotsch
,
Energy Environ. Sci.
6
,
3548
(
2013
).
23.
P. E.
Blöchl
,
Phys. Rev. B
50
,
17953
(
1994
).
24.
G.
Kresse
and
D.
Joubert
,
Phys. Rev. B
59
,
1758
(
1999
).
25.
J. P.
Perdew
,
K.
Burke
, and
M.
Ernzerhof
,
Phys. Rev. Lett.
77
,
3865
(
1996
).
26.
G.
Kresse
and
J.
Hafner
,
Phys. Rev. B
47
,
558
(
1993
).
27.
G.
Kresse
and
J.
Furthmüller
,
Phys. Rev. B
54
,
11169
(
1996
).

Supplementary Material

You do not currently have access to this content.