Accurate and explainable artificial-intelligence (AI) models are promising tools for accelerating the discovery of new materials. Recently, symbolic regression has become an increasingly popular tool for explainable AI because it yields models that are relatively simple analytical descriptions of target properties. Due to its deterministic nature, the sure-independence screening and sparsifying operator (SISSO) method is a particularly promising approach for this application. Here, we describe the new advancements of the SISSO algorithm, as implemented into SISSO++, a C++ code with Python bindings. We introduce a new representation of the mathematical expressions found by SISSO. This is a first step toward introducing “grammar” rules into the feature creation step. Importantly, by introducing a controlled nonlinear optimization to the feature creation step, we expand the range of possible descriptors found by the methodology. Finally, we introduce refinements to the solver algorithms for both regression and classification, which drastically increase the reliability and efficiency of SISSO. For all these improvements to the basic SISSO algorithm, we not only illustrate their potential impact but also fully detail how they operate both mathematically and computationally.
Skip Nav Destination
Article navigation
21 September 2023
Research Article|
September 18 2023
Recent advances in the SISSO method and their implementation in the SISSO++ code
Special Collection:
Software for Atomistic Machine Learning
Thomas A. R. Purcell
;
Thomas A. R. Purcell
a)
(Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing)
1
The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin
, Faradayweg 4–6, D-14195 Berlin, Germany
Search for other works by this author on:
Matthias Scheffler
;
Matthias Scheffler
(Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing)
1
The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin
, Faradayweg 4–6, D-14195 Berlin, Germany
Search for other works by this author on:
Luca M. Ghiringhelli
Luca M. Ghiringhelli
b)
(Conceptualization, Formal analysis, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review & editing)
1
The NOMAD Laboratory at the FHI of the Max-Planck-Gesellschaft and IRIS-Adlershof of the Humboldt-Universität zu Berlin
, Faradayweg 4–6, D-14195 Berlin, Germany
2
Physics Department and IRIS-Adlershof, Humboldt Universität zu Berlin
, Zum Großen Windkanal 2, D-12489 Berlin, Germany
b)Author to whom correspondence should be addressed: ghiringhelli@fhi-berlin.mpg.de
Search for other works by this author on:
b)Author to whom correspondence should be addressed: ghiringhelli@fhi-berlin.mpg.de
a)
Electronic mail: purcell@fhi-berlin.mpg.de
J. Chem. Phys. 159, 114110 (2023)
Article history
Received:
May 01 2023
Accepted:
August 21 2023
Citation
Thomas A. R. Purcell, Matthias Scheffler, Luca M. Ghiringhelli; Recent advances in the SISSO method and their implementation in the SISSO++ code. J. Chem. Phys. 21 September 2023; 159 (11): 114110. https://doi.org/10.1063/5.0156620
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00
157
Views