An active learning strategy using sampling based on uncertainties shows the promise of accelerating the development of new materials. We study the efficiencies of the active learning iteration loop with different uncertainty estimators to find the “best” material in four different experimental datasets. We use a bootstrap approach aggregating with support vector regression as the base learner to obtain uncertainties associated with model predictions. If the bootstrap replicate number is small, the variance estimated by the empirical standard error estimator is found to be close to the true variance, whereas the jackknife based estimators give an upward or downward biased estimation of variance. As increases, the bias of the jackknife based estimators decreases and the variance estimated finally converges to the true one. Therefore, the empirical standard error estimator needs the least number of iteration loops to find the best material in the datasets, especially when the bootstrap replicate number is small. Our work demonstrates that an appropriate Bootstrap replicate is conducive to minimizing calculation costs during the materials property optimization by active learning.
Role of uncertainty estimation in accelerating materials development via active learning
Yuan Tian, Ruihao Yuan, Dezhen Xue, Yumei Zhou, Xiangdong Ding, Jun Sun, Turab Lookman; Role of uncertainty estimation in accelerating materials development via active learning. J. Appl. Phys. 7 July 2020; 128 (1): 014103. https://doi.org/10.1063/5.0012405
Download citation file: