Future materials-science research will involve autonomous synthesis and characterization, requiring an approach that combines machine learning, robotics, and big data. In this paper, we highlight our recent experiments in autonomous synthesis and resistance minimization of Nb-doped TiO2 thin films. Combining Bayesian optimization with robotics, these experiments illustrate how the required speed and volume of future big-data collection in materials science will be achieved and demonstrate the tremendous potential of this combined approach. We briefly discuss the outlook and significance of these results and advances.
INTRODUCTION
Materials are a powerhouse of innovation,1 and accelerating the development of new materials is vital for a sustainable society. Currently, new materials development in laboratories involves repeated cycles of conception, synthesis, and characterization, manually performed by researchers. The inclusion of machine learning, robotics, and big data into these cycles promises to revolutionize materials research and beyond.
To begin, let us sketch the concept of a next-generation materials research laboratory. Future lab equipment and instruments should be Connected, Autonomous, Shared, and operate in a High-throughput manner (CASH, Fig. 1). With the ongoing evolution of machine learning and robotics, the CASH movement is steadily spreading around the world and is expected to have a transformative effect on basic research.
Concept of CASH (Connected, Autonomous, Shared, and High-throughput) in materials science. The blue dashed square shows a conceptual diagram of a closed-looped cycle. The synthesis conditions are predicted by Bayesian optimization, the synthesis is performed automatically under these conditions, and the evaluation results are used to predict the next set of conditions. The closed-loop system and the big data obtained from this system are shared with researchers (green dashed square). The predictions of materials using big data and the closed-loop system contribute to high-throughput materials research.
Concept of CASH (Connected, Autonomous, Shared, and High-throughput) in materials science. The blue dashed square shows a conceptual diagram of a closed-looped cycle. The synthesis conditions are predicted by Bayesian optimization, the synthesis is performed automatically under these conditions, and the evaluation results are used to predict the next set of conditions. The closed-loop system and the big data obtained from this system are shared with researchers (green dashed square). The predictions of materials using big data and the closed-loop system contribute to high-throughput materials research.
If CASH was implemented in a materials laboratory, then new materials with the desired properties could be rapidly synthesized using a closed-loop approach.2,3 Researchers would first prepare the raw material, let the machine-learning-driven apparatus and robotics synthesize and evaluate the material properties, and publish the results. Here, publishing would involve much more than writing a manuscript; all results—from the synthesis conditions (explanatory variables) to the physical properties (response variables)—would be automatically collected into a central database and shared. In addition, any information collected during the synthesis would be accumulated in a central database. In essence, future CASH laboratories would become data production factories.
To implement CASH, we first consider the materials development and optimization process as the following stepwise process: (1) determination of the synthesis conditions, (2) synthesis, (3) evaluation, and (4) determination of the next synthesis conditions (Fig. 1). Traditionally, each task in this cycle is manually performed by researchers. Even when the process is automated, the determination of the next conditions requires the judgment of researchers. This determination is a critical step that reduces throughput. The development of machine learning and robotics promises to make the autonomous closed-loop cycle possible, removing all researcher intervention after the start of the process, at least for routine work.
While the closed-loop concepts are beginning to be applied to biosystems4–6 and organic synthetic chemistry,7,8 few examples focus on solid-state materials such as polymers or inorganic materials.9 Focusing on inorganic materials, storing synthesis conditions in a database is critical. For example, TiO2 can be either a semiconductor or an insulator depending on specific synthesis conditions. Thus, with the ever-increasing demand for complex materials and exotic properties,10 the materials database should incorporate synthesis condition-dependent properties.
In this paper, we highlight our recent experiments that serve as examples of the CASE approach and demonstrate its great promise in materials discovery in materials science. We first describe the experimental design necessary for CASH implementation, including the apparatus based on machine learning and robotics. We then demonstrate the autonomous deposition of a TiO2 thin film with a targeted physical property using the apparatus. Finally, we discuss the significance of the inclusion of machine learning, robotics, and big data into future materials research and present our vision on how to integrate human knowledge, experience, and intuition.
OUTLINE OF THE SYSTEM AND EXPERIMENT
We have designed and custom-built a closed-loop system that combines Bayesian optimization,11–14 automatic synthesis, and automatic physical property evaluation [Fig. 2(a)]. The system consists of a robotic arm located at the center of a hexagonal chamber, which is attached to six satellite chambers equipped with an automatic film deposition apparatus and automatic evaluation system [Fig. 2(b)]. The robotic arm performs all sample transfers between the satellite chambers. The deposition conditions needed to obtain optimal physical properties are predicted using Bayesian optimization. The predicted conditions are fed-back to the automatic film deposition apparatus, achieving full autonomous closed-loop cycles.
(a) Photograph and (b) schematic diagram of the system. The system synthesizes a thin film according to the deposition conditions specified by Bayesian optimization, and then, the resistance of the film is evaluated automatically. The closed-loop cycle of deposition, evaluation, and optimization autonomously synthesize materials with the desired property. Blue circles denote modules for synthesis and evaluation of materials. Video is available in the supplementary material (closed_loop_movie.mp4).
(a) Photograph and (b) schematic diagram of the system. The system synthesizes a thin film according to the deposition conditions specified by Bayesian optimization, and then, the resistance of the film is evaluated automatically. The closed-loop cycle of deposition, evaluation, and optimization autonomously synthesize materials with the desired property. Blue circles denote modules for synthesis and evaluation of materials. Video is available in the supplementary material (closed_loop_movie.mp4).
To demonstrate the CASH approach, we aimed to minimize the electrical resistance of anatase Nb-doped TiO2 thin films.15 The minimization is achieved by varying the oxygen partial pressure (Po2) during the deposition of the TiO2 thin film. The closed-loop cycles proceed as follows. Before starting, a stock of glass substrates is loaded into the load-lock chamber. This is the only manual step of the process; every step hereafter is automated. Next, the robot arm transfers a substrate to a film deposition chamber, where a TiO2 thin film is deposited under the synthesis conditions indicated by Bayesian optimization. After the deposition, the film is transferred to the physical-property evaluation chamber, where a probe electrode is pressed against the film for resistance evaluation. The evaluated results are sent to the computer, which adds the latest Po2 and the electrical resistance values to the previous data. The system then uses Bayesian optimization to predict the Po2 that leads to minimal electrical resistance. The learning continues by repeating this cycle until convergence is reached, i.e., a thin film with minimal electrical resistance is obtained. We demonstrate the closed-loop optimization for one variable (one-dimensional experiment).
Anatase TiO2 film deposition
We used reactive magnetron sputter deposition, a method that bombards the TiO2 target with high-energy Ar gas to scatter TiO2 for deposition onto a glass substrate. Two sets of experiments were performed: one with a Ti0.94Nb0.06O2 target (T1) and the other with a Ti1.98Nb0.02O3 target (T2); the significant difference between T1 and T2 is the Ti:O ratio, with T2 being a more reduced condition (both targets have a purity of 99.9% and 2 in. diameter, Toshima Manufacturing Co., Ltd.). The Nb doping was used to increase TiO2 conductivity.16 The value of Po2 was controlled by changing the flow ratio of two gases: one was pure Ar gas and the other was a gas mixture of Ar (99 mol. %) and O2 (1 mol. %). The range of oxygen partial pressures was set to 2 × 10−4 Pa–5 × 10−3 Pa, considering the results of earlier experiments.17,18 The total gas flow rate and gas pressure were fixed at 10 SCCM and 0.50 Pa throughout the deposition, respectively. The radio-frequency power supply at the target was maintained at 100 W. The typical film thickness is ∼120 nm with a growth time of 1 h. For the Ti0.94Nb0.06O2 (T1) target, the thickness of the film deposited at the lowest PO2 (2 × 10−4 Pa) and at the highest PO2 (5 × 10−3 Pa) was ∼120 nm and ∼100 nm, respectively. Because the two thickness values are close, the difference in the thickness does not affect the minimization of electrical resistances, which change by orders of magnitude depending on PO2. The substrate temperature was maintained at room temperature during the deposition. After deposition, the samples were annealed in a high vacuum with a background pressure of ∼10−5 Pa at 450 °C for 30 min to crystallize the film.
Electrical resistance evaluation
An automatic system was custom-built for evaluating the electrical resistance of the thin films. After film deposition, the sample was allowed to cool for about 20 min to room temperature. Then, the sample was transferred to this system, where a motor-driven electrode probe came into contact with the sample to evaluate its electrical resistance.
Bayesian optimization
Bayesian optimization is a powerful tool for finding the global maximum/minimum of black-box functions with a small number of trials. In this work, we used a standard Bayesian optimization algorithm with Gaussian process regression for the prediction of the black-box function.
The main task of the Bayesian optimization is to predict the Po2 that produces thin films with minimal electrical resistance. This prediction is based on Gaussian process regression. To initiate Bayesian optimization, we first collected electrical resistance data for films deposited under preset Po2 conditions, namely, the smallest and the largest values and middle of the PO2 range. The optimization process was then performed in cycles. Each cycle consists of three steps: (1) selecting a value of Po2 for the next deposition using the data obtained so far, (2) depositing the thin film under that PO2, and (3) evaluating electrical resistance, followed by addition of that data to the dataset. Steps (1)–(3) are iterated repeatedly, and optimization is completed when convergence is achieved.
In this system, human intervention is only needed to decide whether the cycle has converged. In the future, this decision will be made by the computer as well. This system enables new methods for producing thin films with desired physical properties within a reasonable amount of time. It can also be used for multi-dimensional optimization involving several film formation parameters (oxygen partial pressure, substrate temperature, annealing temperature, nitrogen partial pressure, film formation rate, etc.).
We used the Python package GPy for Gaussian process regression. The grid for PO2 was linearly divided into 128 grids in the range from 2.0 × 10−4 Pa to 5.0 × 10−3 Pa. The radial-basis function was used as a kernel, where the variance and the length scale were set to 0.3 and 6 × 10−4 (3 grids), respectively. The lower confidence bound was adopted as the acquisition function, where its hyperparameter was set to 5.0. and the preset three conditions were sparsely selected, at both ends (2.0 × 10−4 Pa and 5.0 × 10−3 Pa) and at the center (2.7 × 10−3 Pa) of the grid. These conditions were set manually in advance. The fourth and subsequent deposition conditions were determined by Bayesian optimization sequentially, and previously used conditions were not selected again.
RESULTS AND DISCUSSION
Figures 3(a)–3(c) illustrate the closed-loop cycles performed to minimize the T1 thin film resistance. As shown with the red dots, the Bayesian optimization algorithm first searches across a wide range of PO2 [Fig. 3(a)] before converging toward the optimal Po2 = 7.29 × 10−4 Pa [Fig. 3(b)], reaching it at the fourteenth cycle [Fig. 3(c)]. After the fourteenth cycle, the Bayesian optimization selected PO2 stayed near the optimal value, suggesting the convergence. The trend of PO2 vs electrical resistance from previous experiments indicates that there is only one minimum within the searching space.17,18 Therefore, we conclude that the obtained minimum is the global minimum.
Illustration of the closed-loop cycles for both T1 [(a)–(c)] and T2 [(d)–(f)] thin films. The red dots in each figure show the latest deposition oxygen partial pressures (PO2) and evaluated electrical resistance, as sequenced by the corresponding numbers. The black dots in each figure show the points that appear as red in the previous figure. The blue curve shows the predicted curve, while the pink region indicates the Bayesian confidence intervals. The global resistance minima are obtained at the fourteenth and the eighteenth depositions for T1 and T2, respectively. Video is available in the supplementary material (TiO2_movie.mp4 and Ti2O3_movie.mp4).
Illustration of the closed-loop cycles for both T1 [(a)–(c)] and T2 [(d)–(f)] thin films. The red dots in each figure show the latest deposition oxygen partial pressures (PO2) and evaluated electrical resistance, as sequenced by the corresponding numbers. The black dots in each figure show the points that appear as red in the previous figure. The blue curve shows the predicted curve, while the pink region indicates the Bayesian confidence intervals. The global resistance minima are obtained at the fourteenth and the eighteenth depositions for T1 and T2, respectively. Video is available in the supplementary material (TiO2_movie.mp4 and Ti2O3_movie.mp4).
Similarly, the closed-loop cycles for minimizing the T2 thin film resistance are illustrated by Figs. 3(d)–3(f). Here, the Bayesian optimization converged toward the optimum PO2 = 2.43 × 10−3 Pa, reaching it at the eighteenth cycle [Fig. 3(f)]. Compared to T1, the entire curve for T2 is shifted to the right to a more oxidizing condition, as is consistent with the lower oxygen content of T2.19
The experiments on T1 and T2 yielded two positive results. The first is technical: the use of Bayesian optimization identified the global resistance minimum from several local minima caused by the reduced (e.g., Ti2O3) and Magneli phases [TinO2n−1 (n = 3–9)].
The second is temporal: the use of the autonomous closed-loop cycle has improved the experiment throughput by an order of magnitude. As the time stamps in Table I show, the system is capable of fabricating thin films at a rate of ∼2 h per sample. Thus, ideally, ∼12 samples can be fabricated within a day when the closed-loop cycles are operated continuously. The present system took more time to complete the convergence because the substrates had to be resupplied by the researchers. Manually, researchers can perform two film depositions per day on average, taking seven days to complete the fourteen-cycle optimization. Adding three days to account for the weekend and other work- and lifestyle-related essentials, ten days is a reasonable estimate. Accordingly, throughput has increased by tenfold.
Oxygen partial pressures (PO2) and the resistance for each deposition using Ti0.94Nb0.06O2 (T1) and Ti1.98Nb0.02O3 (T2) targets. Bold stamps highlight the minimum resistance reached.
Target: Ti0.94Nb0.06O2 . | Target: Ti1.98Nb0.02O3 . | ||||||
---|---|---|---|---|---|---|---|
No. | Date and time | PO2 (Pa, ×10−3) | Resistance (Ω) | No. | Date and time | PO2 (Pa, ×10−3) | Resistance (Ω) |
1 | 2020/02/25 22:46 | 5.000 | 1.000 × 108 | 1 | 2020/02/05 02:43 | 5.000 | 2.889 × 106 |
2 | 2020/02/26 01:09 | 0.200 | 3.576 × 103 | 2 | 2020/02/05 12:26 | 2.500 | 5.207 × 102 |
3 | 2020/02/26 03:33 | 2.544 | 1.028 × 104 | 3 | 2020/02/16 03:55 | 0.200 | 1.626 × 103 |
4 | 2020/02/26 05:57 | 0.238 | 3.187 × 103 | 4 | 2020/02/16 06:20 | 0.238 | 7.184 × 102 |
5 | 2020/02/26 08:22 | 1.259 | 3.923 × 102 | 5 | 2020/02/16 08:44 | 0.276 | 8.650 × 102 |
6 | 2020/02/26 10:46 | 1.523 | 8.298 × 102 | 6 | 2020/02/16 11:08 | 1.372 | 3.534 × 103 |
7 | 2020/02/26 22:53 | 1.183 | 3.416 × 102 | 7 | 2020/02/16 13:31 | 3.602 | 1.259 × 104 |
8 | 2020/02/27 01:16 | 3.942 | 1.817 × 104 | 8 | 2020/02/16 15:54 | 2.468 | 1.967 × 102 |
9 | 2020/02/27 03:40 | 3.337 | 1.280 × 104 | 9 | 2020/02/16 18:18 | 0.314 | 1.425 × 103 |
10 | 2020/02/27 06:05 | 1.750 | 1.469 × 103 | 10 | 2020/02/16 20:43 | 0.351 | 6.263 × 102 |
11 | 2020/02/27 08:30 | 0.843 | 1.279 × 102 | 11 | 2020/02/16 23:08 | 2.544 | 2.046 × 102 |
12 | 2020/02/27 11:17 | 0.692 | 8.865 × 101 | 12 | 2020/02/17 01:38 | 0.389 | 9.407 × 102 |
13 | 2020/02/27 15:45 | 4.471 | 6.373 × 105 | 13 | 2020/02/17 04:01 | 2.581 | 2.880 × 102 |
14 | 2020/02/27 20:05 | 0.729 | 7.960 × 101 | 14 | 2020/02/17 06:25 | 2.619 | 2.724 × 102 |
15 | 2020/02/27 22:36 | 0.654 | 9.247 × 101 | 15 | 2020/02/17 08:50 | 2.695 | 4.243 × 102 |
16 | 2020/02/28 01:00 | 0.276 | 3.521 × 103 | 16 | 2020/02/17 11:14 | 0.427 | 8.198 × 102 |
17 | 2020/02/28 03:24 | 0.805 | 9.805 × 101 | 17 | 2020/02/17 17:24 | 2.733 | 5.568 × 102 |
18 | 2020/02/28 05:47 | 0.767 | 9.200 × 101 | 18 | 2020/02/17 19:48 | 2.430 | 1.918 × 102 |
19 | 2020/02/17 22:11 | 2.392 | 2.321 × 102 | ||||
20 | 2020/02/18 00:34 | 2.355 | 1.938 × 102 | ||||
21 | 2020/02/18 02:59 | 2.317 | 2.175 × 102 | ||||
22 | 2020/02/18 05:23 | 4.320 | 1.296 × 105 |
Target: Ti0.94Nb0.06O2 . | Target: Ti1.98Nb0.02O3 . | ||||||
---|---|---|---|---|---|---|---|
No. | Date and time | PO2 (Pa, ×10−3) | Resistance (Ω) | No. | Date and time | PO2 (Pa, ×10−3) | Resistance (Ω) |
1 | 2020/02/25 22:46 | 5.000 | 1.000 × 108 | 1 | 2020/02/05 02:43 | 5.000 | 2.889 × 106 |
2 | 2020/02/26 01:09 | 0.200 | 3.576 × 103 | 2 | 2020/02/05 12:26 | 2.500 | 5.207 × 102 |
3 | 2020/02/26 03:33 | 2.544 | 1.028 × 104 | 3 | 2020/02/16 03:55 | 0.200 | 1.626 × 103 |
4 | 2020/02/26 05:57 | 0.238 | 3.187 × 103 | 4 | 2020/02/16 06:20 | 0.238 | 7.184 × 102 |
5 | 2020/02/26 08:22 | 1.259 | 3.923 × 102 | 5 | 2020/02/16 08:44 | 0.276 | 8.650 × 102 |
6 | 2020/02/26 10:46 | 1.523 | 8.298 × 102 | 6 | 2020/02/16 11:08 | 1.372 | 3.534 × 103 |
7 | 2020/02/26 22:53 | 1.183 | 3.416 × 102 | 7 | 2020/02/16 13:31 | 3.602 | 1.259 × 104 |
8 | 2020/02/27 01:16 | 3.942 | 1.817 × 104 | 8 | 2020/02/16 15:54 | 2.468 | 1.967 × 102 |
9 | 2020/02/27 03:40 | 3.337 | 1.280 × 104 | 9 | 2020/02/16 18:18 | 0.314 | 1.425 × 103 |
10 | 2020/02/27 06:05 | 1.750 | 1.469 × 103 | 10 | 2020/02/16 20:43 | 0.351 | 6.263 × 102 |
11 | 2020/02/27 08:30 | 0.843 | 1.279 × 102 | 11 | 2020/02/16 23:08 | 2.544 | 2.046 × 102 |
12 | 2020/02/27 11:17 | 0.692 | 8.865 × 101 | 12 | 2020/02/17 01:38 | 0.389 | 9.407 × 102 |
13 | 2020/02/27 15:45 | 4.471 | 6.373 × 105 | 13 | 2020/02/17 04:01 | 2.581 | 2.880 × 102 |
14 | 2020/02/27 20:05 | 0.729 | 7.960 × 101 | 14 | 2020/02/17 06:25 | 2.619 | 2.724 × 102 |
15 | 2020/02/27 22:36 | 0.654 | 9.247 × 101 | 15 | 2020/02/17 08:50 | 2.695 | 4.243 × 102 |
16 | 2020/02/28 01:00 | 0.276 | 3.521 × 103 | 16 | 2020/02/17 11:14 | 0.427 | 8.198 × 102 |
17 | 2020/02/28 03:24 | 0.805 | 9.805 × 101 | 17 | 2020/02/17 17:24 | 2.733 | 5.568 × 102 |
18 | 2020/02/28 05:47 | 0.767 | 9.200 × 101 | 18 | 2020/02/17 19:48 | 2.430 | 1.918 × 102 |
19 | 2020/02/17 22:11 | 2.392 | 2.321 × 102 | ||||
20 | 2020/02/18 00:34 | 2.355 | 1.938 × 102 | ||||
21 | 2020/02/18 02:59 | 2.317 | 2.175 × 102 | ||||
22 | 2020/02/18 05:23 | 4.320 | 1.296 × 105 |
In this study, we used a constant length scale of 3 grids to optimize an experimentally well-known relationship between electrical resistance and PO2. In general, to find the global maximum (or minimum) in a small number of trials, it is critical to tune the length scale. For this purpose, one may use a relatively larger length scale at the initial stage and a smaller length scale as the number of observation data is increased.
The actual magnitude of the improvements discussed above will only be evident when the CASH approach is applied to novel materials development. Unlike the T1 and T2 experiments, which only considered PO2 to minimize resistance, discoveries of novel materials with exotic properties will need to consider other dimensions such as materials composition, deposition temperature, sputtering pressure, and power input to the target. Manual optimization of such multidimensional systems will involve too many sets of manual experiments to be feasible. The CASH approach will use Bayesian optimization to consider every dimension at once and robotics to fabricate and evaluate samples, making the synthesis of new materials achievable in a short time. This approach will usher in the next paradigm shift in materials research and beyond.
DISCUSSION AND PERSPECTIVES
Materials science is headed toward a data-driven future. In this evolution, machine learning, robotics, and big data are turning a previously inconceivable idea into a real possibility: the understanding of materials in their entirety. As discussed above, traditional research usually synthesized and characterized a new material for a few exotic properties and ignored other properties for lack of resources. As illustrated in Table II, databases of substances and properties remain incomplete. The completion of the database will not only uncover hidden properties, but it will also lead to the discovery of new rules that determine material properties through the analysis of correlations and trends. Filling these gaps to create a database is one of the important objectives of materials science.
Illustration of the property and synthesis parameter database of a group of materials showing incompleteness. Because researchers cannot measure all kinds of physical properties because of limited resources, the database is incomplete. Only material D has a complete set of data.
. | Ionic . | . | Thermal . | Optical . | Synthesis . | Synthesis . |
---|---|---|---|---|---|---|
Material . | conductivity . | Magnetism . | properties . | properties . | parameter 1 . | parameter 2 . |
A | ✓ | ✓ | ||||
B | ✓ | ✓ | ||||
C | ✓ | ✓ | ✓ | |||
D | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
. | Ionic . | . | Thermal . | Optical . | Synthesis . | Synthesis . |
---|---|---|---|---|---|---|
Material . | conductivity . | Magnetism . | properties . | properties . | parameter 1 . | parameter 2 . |
A | ✓ | ✓ | ||||
B | ✓ | ✓ | ||||
C | ✓ | ✓ | ✓ | |||
D | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
To fill the gaps, the autonomous closed-loop system is a powerful tool. Currently, we are expanding the instrument in Fig. 2 into a multiunit system that can automatically characterize a list of properties such as electron transport, optical, dielectric, and magnetic properties (Fig. 4). With this new system, once a material is synthesized, many physical properties will be collected automatically to construct a database (Table II). This database will contain reliable non-best data useful for improving the performance of machine learning.
Schematic illustrating the expanded system designed to evaluate many physical properties. Video is available in the supplementary material (multiunit_system_movie.mp4). Blue circles denote modules for synthesis and characterization of materials.
Schematic illustrating the expanded system designed to evaluate many physical properties. Video is available in the supplementary material (multiunit_system_movie.mp4). Blue circles denote modules for synthesis and characterization of materials.
The construction of such databases enables us to discover trends that would not emerge with current research practices. Here, we have demonstrated two experiments with ten times the human throughput. However, the true revolution will occur when individual instruments are expanded into multiunit systems, and data from all multiunit systems are shared as big data to be processed by advanced machine learning (materials informatics).20–22
OUTLOOK
The example experiments presented here use the simplicity of thin film materials to demonstrate how machine learning and robotics can be applied for materials science. In practice, further advances in machine learning and robotics are required. It is clear that machine learning will play a key role in the formulation of future scientific theories. By including technologies such as explainable machine learning23 and symbolic regression,24 big data can be reduced down to simple, human-understandable expressions. This reduction would stimulate researchers to provide physical insights into materials and discover new laws of physics (human-in-the-loop). For robotics, a teaching technology to learn the motion of experimentalists is critical to incorporate researcher’s experience. Taken together, breakthrough occurs when researchers of machine learning, robotics, and materials collaborate to inspire materials science.
Finally, we note that machine learning and robotics will not work without expert materials scientists asking the right questions from the start. To this end, the training of future materials scientists must evolve; they need to understand what machine learning can solve and set the problem appropriately. With good problem setting, machine learning and robotics will significantly accelerate materials research. It is vital to educate future materials scientists with the abilities of problem setting and foresight to take on this challenge.
CONCLUSIONS
In this paper, we used our recent experiments in autonomous synthesis and resistance minimization of thin films as an example to illustrate how the speed and the volume of future big-data collection in materials science will be achieved. The approach that combines machine learning, robotics, and big data demonstrates the tremendous potential in materials science. It is only through coevolution with such technologies that future researchers can work on more creative research, leading to the acceleration of materials science research.
SUPPLEMENTARY MATERIAL
See the supplementary material for schematic videos of our closed-loop system in Fig. 2 (closed_loop_movie.mp4) and the expanded system designed to evaluate many physical properties in Fig. 4 (multiunit_system_movie.mp4) and animations of the closed-loop cycles for both Ti0.94Nb0.06O2 thin films in Figs. 3(a)–3(c) (TiO2_movie.mp4) and Ti0.99Nb0.01O2 thin films in Figs. 3(d)–3(f) (Ti2O3_movie.mp4).
DATA AVAILABILITY
The data that support the findings of this study are available within the article (and its supplementary material).
ACKNOWLEDGMENTS
This research was supported by JST-CREST (Grant No. JPMJCR1523) and JST-PRESTO (Grant No. JPMJPR17N6). We thank Dr. Daniel Packwood for the critical reading of this manuscript. We also thank Dr. Patrick Han from SayEdit.com for manuscript editing.