The lattice thermal conductivity (LTC) of is an important property due to the challenge in the thermal management of high-power devices. In this work, we develop machine-learned neuroevolution potentials (NEPs) for single-crystalline and and demonstrate their accuracy in modeling thermal transport properties. Combining NEP-driven homogeneous non-equilibrium molecular dynamics simulations with tensor analysis, we determine the spatial distributions of LTCs for two crystals, showing dissimilar thermal behaviors. Specifically, shows isotropic thermal transport properties, with the LTCs along [100], [010], and [001] directions being predicted to be , , and W/(m K), respectively, consistent with previous experimental measurements. For , our predictions suggest nearly isotropic thermal transport properties, with the LTCs along [100], [010], and [001] being estimated to be , , and W/(m K). The reduced LTC of vs stems from its restricted low-frequency phonons up to 5 THz. Furthermore, we find that the phase exhibits a typical temperature dependence slightly stronger than , whereas the phase shows a weaker temperature dependence, ranging from to .
I. INTRODUCTION
Ultra-wide bandgap semiconductors, such as , diamond, and AlN, have also become the focus of attention materials for next-generation electronics and optoelectronics. Owing to a bandgap of about 5 eV, exceptional breakdown electrical field, and cost-effective production, offers considerable promise for ultrahigh power device applications.1–3 However, the lattice thermal conductivity (LTC) of is subpar, leading to pronounced heat dissipation issues in certain semiconductor devices.1 Understanding the phonon thermal transport in is crucial for its practical applications.
The crystal actually exists in five distinct phases: , , , , and (sometimes referred to as ). Of these, the phase (space group C2/m) is the most stable one, which has been extensively explored for applications in deep-ultraviolet transparent conductive electrodes,4 solar blind detectors,5,6 high-performance field effect transistors,7 Schottky rectifiers,8 and high temperature gas sensors.1 In recent years, further efforts have been made to overcome the poor thermal stability and immature synthesis methods of other phases. Hexagonal crystal (space group P63mc) was reported to be the second most stable phase obtained at low temperatures,9,10 whereas subsequent studies claimed that the crystal structure of the polycrystalline form at low temperatures has an orthorhombic structure at the nanoscale (5–10 nm), named (space group Pna21).11,12 The strong polarization in is a prominent feature that does not possess, which may benefit potential device applications. For example, can form heterojunctions with other semiconductors such as GaN or AlN, and its polarization was utilized to regulate interface transport while effectively alleviating thermal problems.13,14 Considering the distinct crystal structures of and , a comparative understanding of the LTC for both phases is essential for their device thermal design, particularly as the phase remains unexplored.
Currently, growing large-scale single-crystalline is a challenging endeavor experimentally, and assessing its LTC is anticipated to be even more demanding. From a computational perspective, its LTC can be predicted using atomistic simulation techniques, such as molecular dynamics (MD) simulations15–17 and the combination of Boltzmann transport equation (BTE) with the anharmonic lattice dynamics (ALD) method.18–20 While atomic interactions can be derived from either empirical potentials or quantum mechanical density functional theory (DFT) calculations, intricate crystals like , which possess 40 atoms in their primitive cell, introduce substantial hurdles—accuracy concerns for empirical potentials and computational overhead for DFT. Recently, the prediction of thermal transport properties in materials has been significantly accelerated through the combined use of machine learning approaches and atomistic simulation methods.21 Specifically, machine-learned potential (MLP)-based large-scale MD simulations have been demonstrated to be a reliable approach to calculate the LTC of complex crystals including amorphous silicon,22 amorphous ,23 amorphous silica,24 metal-organic frameworks,25 and violet phosphorene,26 which can account for phonon anharmonicity to arbitrary order. Several MLPs for have been developed, addressing both the perfect bulk system15,19,27 and more intricate, disordered structures.28,29
In this work, we apply the neuroevolution potential (NEP) framework30–32 to develop two MLPs on demand for against quantum-mechanical DFT calculations, one for phase and one for phase. We choose the NEP approach here, because this method has been demonstrated to be highly efficient.32 We apply the developed NEPs to perform extensive homogeneous non-equilibrium molecular dynamics (HNEMD) simulations to investigate the LTC of the two phases of . Our results show that the LTC of the phase is much lower than that of the phase and exhibits low anisotropy and weak temperature dependence.
II. METHODS
A. Structural model and DFT calculations
All DFT calculations were performed using projected augmented wave method33 with Perdew–Burke–Ernzerhof functional of generalized gradient approximation34 implemented in the VASP package.35,36 The kinetic energy cutoff was set to 520 eV, and the convergence value for the total energy was eV. The stopping criteria for structural optimizations were that the maximum residual Hellmann–Feynman force on atoms was less than . A and -centered k-point grid was employed for the primitive cell of and , respectively. Additionally, to validate the accuracy of the MLP, the phonon dispersions of and were further calculated on and supercells, respectively, using the density functional perturbation theory together with the Phonopy package.37
Meanwhile, according to widely used conventions, we have optimized the corresponding conventional unit cell as the initial cell for the MD simulation. The lattice structure remains unchanged, but the number of atoms in the conventional unit cell of is higher than that in the primitive cell. The calculated lattice constants of are , , and Å, while these values for are , , and Å, which are close to the reported values.12,38 As shown in Fig. 1, the conventional unit cell of has 20 atoms, while has a relatively large number of 40 atoms. In the calculations of the reference datasets for NEP training, the and supercells were used for and , respectively, with and k-point grids.
B. The NEP model training
We used the identical method to construct the reference datasets for the and phases of . The reference structures were obtained by ab initio molecular dynamics (AIMD) simulations and random perturbations. For each phase, the AIMD simulations were run under an isothermal ensemble with temperature increasing linearly from 10 to 1000 K for 10 000 steps and a time step of 1 fs. We uniformly extracted 1000 structures from AIMD simulations for each case. For random perturbations, 200 structures were generated with random cell deformations ranging from 4% to 4% and atomic displacements less than 0.1 Å, all based on optimized structures. For both phases, our total dataset has 1200 structures including energy, atomic forces, and virials obtained from DFT calculations as outlined in Sec. II A. We randomly divided the total dataset into a training set of 1000 structures and a test set of 200 structures.
After obtaining the training set and test set, we applied the third generation of the NEP framework32 implemented in GPUMD (version 3.5) to train the MLP for and phases of , which were denoted as NEP- and NEP- , respectively. NEP used a feedforward neural network to represent the site energy with atomic cluster expansion (ACE)39-like descriptor components including radial and angular terms. The parameters of NEP models were optimized using the separable natural evolution strategy (SNES),40 with the loss function defined as a weighted sum over the root mean square error (RMSE) values of energy, atomic force, and virial. As shown in Fig. S1 (see the supplementary material), the RMSEs of energy, force, and virial converge well with increasing training step.
We used the same hyperparameters for these two NEP models. After extensive testing, the selected hyperparameters were determined as follows: the radial and angular cutoffs are Å and , respectively. The number of radial and angular descriptor components is and , respectively. For angular parameters, we have for three-body and for four-body terms, respectively. The number of neurons in the hidden layer of the neural network is 50. The size of the population is , and the number of generations is . The weights of energy, force, and virial RMSEs in the loss function were set to 1.0, 1.0, and 0.1, respectively.
C. Thermal conductivity calculations
All MD simulations were conducted at the target temperature using the Nose–Hoover chain method49 with a time step of 1.0 fs. Initially, the simulation was run for 0.1 fs in the isothermal–isobaric ensemble and subsequently in the isothermal ensemble for 1.0 ns to achieve equilibrium. Afterward, HNEMD simulations were performed in the isothermal ensemble for 10.0 ns to calculate the running LTCs. We employed and supercells for and , respectively, containing 10 240 and 9000 atoms. The simulation sizes were validated to be sufficiently large to eliminate finite-size effects. For each case, the predicted LTC was calculated as the average of five independent simulations, and the corresponding standard error was also estimated.
III. RESULTS AND DISCUSSION
A. Validation of the NEP models
Figures 2(a)–2(c) show a comparison between the predicted energy, force, and virial values obtained from the NEP models and the corresponding DFT reference values for both the training and test sets of the two phases. In all scenarios, the RMSEs for energy, force, and virial are below 0.5 meV/atom, 40 meV/Å, and 5 meV/atom, respectively. These results demonstrate a very high accuracy of our NEP models. Additionally, in Fig. S2 of the supplementary material, we present the energy changes in and by employing single atom perturbation, as predicted by NEP and DFT. We randomly select an oxygen atom, move it along the [100] direction, and calculate the change in total energy. The predictions made by NEP models closely align with the results from DFT calculations, further demonstrating the precision of our developed NEP models. In addition to their high accuracy, our NEPs are remarkably efficient. For a system comprising 10 000 atoms, our NEPs can attain a computational speed of approximately atom-step per second in the GPUMD package using a single GeForce RTX 3090 GPU card. Compared with the BTE-ALD approach using ShengBTE50 package with force constants obtained from DFT calculations,51 our GPUMD + NEP approach is much more computationally efficient (see Fig. S2 of the supplementary material), which enables us to perform extensive simulations to study the LTCs of the crystals.
The NEP predictions of [(a) and (d)] energy, [(b) and (e)] force, and [(c) and (f)] virial for the (top) and (bottom) train and test sets against the DFT reference values.
The NEP predictions of [(a) and (d)] energy, [(b) and (e)] force, and [(c) and (f)] virial for the (top) and (bottom) train and test sets against the DFT reference values.
B. Phonon dispersions
To assess the reliability of our NEP models in capturing the phonon transport properties of and , we compare the calculated phonon dispersions using both NEP and DFT methods, as shown in Fig. 3. It can be seen that for both phases, the acoustic branches predicted by NEP and DFT are very close, while the optical branches show slight deviations, especially at high frequencies. It should be noted that high-frequency optical phonons (>10 THz) contribute minimally to LTC,18 which is also confirmed by subsequent calculations of the spectrally decomposed LTC (see Fig. 6). Thus, we believe that the NEP models can reliably predict the LTC of the two phases of .
The phonon dispersions of (a) and (b) predicted by NEP and DFT. In order to accurately describe the splitting of LO-TO at the point, NEP adopts the same method as DFT to add non-analytical terms of polar charge interaction to correct the dynamic matrix, that is, to provide the Born effective charges of each atom in the structure.
The phonon dispersions of (a) and (b) predicted by NEP and DFT. In order to accurately describe the splitting of LO-TO at the point, NEP adopts the same method as DFT to add non-analytical terms of polar charge interaction to correct the dynamic matrix, that is, to provide the Born effective charges of each atom in the structure.
C. Thermal conductivity
After confirming the accuracy of the NEP models, we apply them to calculate LTCs using the HNEMD method. Figure 4 presents the LTC for both - and using the HNEMD method at 300 K. Notably, Fig. 1(a) reveals that possesses a monoclinic crystal structure, suggesting the potential for non-zero values in the off-diagonal elements in the LTC tensor. Fortunately, the HNEMD approach [see Eq. (1)] allows us to fully determine the LTC tensor as listed in Table I, by applying the external driving force along three orthogonal axes sequentially. As anticipated, the phase exhibits minor couplings between the and directions. In contrast, the off-diagonal elements of the LTC tensor for the phase are zero, attributed to its orthorhombic crystal structure.
Running LTC tensors from HNEMD simulations at 300 K for (a) - and (b) . Driving forces for top, middle, and bottom are along the , , and directions, respectively.
Running LTC tensors from HNEMD simulations at 300 K for (a) - and (b) . Driving forces for top, middle, and bottom are along the , , and directions, respectively.
The predicted LTC tensor [W/(m K)] of (a) β − Ga2O3 and (b) κ − Ga2O3 from HNEMD simulations at 300 K.
LTC components . | β − Ga2O3 . | κ − Ga2O3 . |
---|---|---|
κxx | 10.3 (0.2) | 4.5 (0.1) |
κyy | 19.9 (0.2) | 3.9 (0.1) |
κzz | 12.5 (0.2) | 4.0 (0.1) |
κxy | 0.2 (0.2) | 0.0 (0.1) |
κxz | 0.4 (0.2) | 0.0 (0.1) |
κyz | 0.0 (0.1) | 0.0 (0.1) |
LTC components . | β − Ga2O3 . | κ − Ga2O3 . |
---|---|---|
κxx | 10.3 (0.2) | 4.5 (0.1) |
κyy | 19.9 (0.2) | 3.9 (0.1) |
κzz | 12.5 (0.2) | 4.0 (0.1) |
κxy | 0.2 (0.2) | 0.0 (0.1) |
κxz | 0.4 (0.2) | 0.0 (0.1) |
κyz | 0.0 (0.1) | 0.0 (0.1) |
(a) The polar coordination used for the transformation of the LTC tensor. (b) and (c) The 3D distributions of the LTC for and . [(d)–(f)] The corresponding 2D projections onto the [100], [010], and [001] planes, respectively. All LTCs are in units of W/(m K).
(a) The polar coordination used for the transformation of the LTC tensor. (b) and (c) The 3D distributions of the LTC for and . [(d)–(f)] The corresponding 2D projections onto the [100], [010], and [001] planes, respectively. All LTCs are in units of W/(m K).
The spectrally decomposed LTC of (a) -Ga O and (b) -Ga O as a function of phonon frequency.
The spectrally decomposed LTC of (a) -Ga O and (b) -Ga O as a function of phonon frequency.
For the phase, the LTC is highest along the [010] direction with W/(m K), followed by [001] with W/(m K) [see the blue dotted line in Fig. 5(e)] and [100] at W/(m K). As shown in Table II, our HNEMD results are consistent with previous experimental results52,53 using time-domain thermoreflectance and theoretical predictions based on the BTE-ALD method18–20,51,54 or the EMD method.15,27 This further demonstrates the reliability of our HNEMD approach based on machine-learned NEP in characterizing the LTC of crystals.
The LTC [W/(m K)] of β − Ga2O3 at 300 K predicted by our HNEMD simulations and previously reported values. The values in parentheses denote the standard errors.
Method . | [100] . | [010] . | [001] . |
---|---|---|---|
Current HNEMD | 10.3 (0.2) | 19.9 (0.2) | 12.6 (0.2) |
Experiment52 | 9.5 (1.8) | 22.5 (2.5) | 13.3 (1.8) |
Experiment53 | 10.9 (1.0) | 27.0 (2.0) | 15.0 |
EMD15 | 10.7 | 20.8 | 12.6 |
EMD27 | 9.44 | 21.46 | 11.86 |
BTE-ALD18 | 12.7 | 20.0 | 17.8 |
BTE-ALD19 | 13.9 | 24.8 | 19.8 |
BTE-ALD20 | 10.02 | 23.74 | 12 |
BTE-ALD54 | 16.1 | 21.5 | 21.2 |
BTE-ALD51 | 10.63 | 18.93 | 13.9 |
Method . | [100] . | [010] . | [001] . |
---|---|---|---|
Current HNEMD | 10.3 (0.2) | 19.9 (0.2) | 12.6 (0.2) |
Experiment52 | 9.5 (1.8) | 22.5 (2.5) | 13.3 (1.8) |
Experiment53 | 10.9 (1.0) | 27.0 (2.0) | 15.0 |
EMD15 | 10.7 | 20.8 | 12.6 |
EMD27 | 9.44 | 21.46 | 11.86 |
BTE-ALD18 | 12.7 | 20.0 | 17.8 |
BTE-ALD19 | 13.9 | 24.8 | 19.8 |
BTE-ALD20 | 10.02 | 23.74 | 12 |
BTE-ALD54 | 16.1 | 21.5 | 21.2 |
BTE-ALD51 | 10.63 | 18.93 | 13.9 |
Furthermore, we predict that the LTC for to be , , and W/(m K) along the [100], [010], and [001] directions, respectively. The results are consistent with those of a previous work which combines BTE with ALD to study the thermodynamic properties of .51 These values are about one-fifth to half of those of . While the phase displays a significant LTC anisotropy with an anisotropy index of 1.94 [the maximum-to-minimum ratio of the LTC, see spatial distribution in Fig. 5(b)], the phase exhibits a nearly isotropic LTC pattern with an anisotropy index of 1.15 [see Fig. 5(c)].
Figure 6 presents the spectrally decomposed LTC of - and calculated from Eq. (5). Taken as a whole, only phonon modes with frequencies in the 0–10 THz range are really involved in thermal transport. It also can be found that, although in both phases the low-frequency acoustic phonons contribute significantly to the LTC, the phase has a much broader distribution of phonon frequencies contributing to its LTC than the phase. This is also related to the phonon dispersions as shown in Fig. 3. The branches of the phase with a phonon frequency in the range of 5–10 THz are much flatter than those of the phase, leading to much lower phonon group velocities. Moreover, in the phase, we observe significant overlaps between multiple bands for phonon frequencies greater than 3 THz, indicating the presence of multiple scattering channels, a phenomenon previously observed in Violet phosphorene.26
Furthermore, Fig. 7 shows the temperature-dependent LTC for - and . In the temperature range of 200–600 K, the LTCs of both phases decrease with increasing temperature. The LTC of the phase exhibits a temperature dependence slightly stronger than , as typical for systems dominated by three-phonon scattering processes.55 However, the LTC of shows a clearly weaker temperature dependence, which is along the [100] direction and along the [010] and [001] directions, suggesting a significant impact of higher-order anharmonic phonon scatterings attributable to its complex crystal structures.56
LTC of and phases of as a function of temperature along three directions. All the LTCs are normalized by their values at 300 K, respectively.
LTC of and phases of as a function of temperature along three directions. All the LTCs are normalized by their values at 300 K, respectively.
IV. CONCLUSIONS
In summary, we have developed machine-learned NEP models trained against quantum-mechanical DFT data for the and phases of , which have been demonstrated to be accurate and efficient in predicting energy, atomic forces, virial, and phonon dispersions in both phases. Based on large-scale HNEMD simulations, we reached a consistent prediction with previous experimental measurements for the phase and predicted the LTC of the phase. We found that the phase has a much lower LTC than the phase, due to its phonon frequency contributions being limited to below 5 THz compared to the 0–10 THz range of the phase. Furthermore, based on the tensor analysis, the phase exhibits an almost isotropic spatial distribution with an anisotropy index of 1.15 at 300 K, in contrast to the pronounced anisotropy index of 1.94 for the phase. We also examined the temperature dependence of the LTC in the two phases and found that the LTC of the phase follows a temperature dependence slightly stronger than , whereas the phase shows a weaker temperature dependence from to , indicating a significant effect of high-order anharmonicity as in low-LTC materials. Our work demonstrates that the machine-learned NEP-driven HNEMD simulations can reliably and effectively characterize phonon thermal transport properties for complex crystals such as , and thus we expect that this approach can be used to explore the LTCs of other phases of as well.
SUPPLEMENTARY MATERIAL
See the supplementary material for the evolution of various loss functions during the training process, a comparison of computational efficiency between the GPUMD + NEP potential and the DFT + BTE-ALD method, and the validation of energy changes with single atom perturbation against DFT calculations.
ACKNOWLEDGMENTS
This work was supported by the Key-Area Research and Development Program of Guangdong Province (Grant No. 2020B010169002), the Guangdong Special Support Program (Grant No. 2021TQ06C953), and the Science and Technology Planning Project of Shenzhen Municipality (Grant No. JCYJ20190806142614541).
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Xiaonan Wang: Data curation (lead); Formal analysis (lead); Investigation (equal); Methodology (equal); Software (equal); Visualization (lead); Writing – original draft (lead). Jinfeng Yang: Investigation (supporting); Methodology (supporting); Software (supporting); Writing – review & editing (supporting). Penghua Ying: Investigation (supporting); Methodology (equal); Software (equal); Validation (supporting); Writing – original draft (supporting); Writing – review & editing (supporting). Zheyong Fan: Methodology (supporting); Software (supporting); Writing – review & editing (equal). Jin Zhang: Writing – review & editing (supporting). Huarui Sun: Conceptualization (lead); Funding acquisition (lead); Project administration (lead); Resources (lead); Supervision (lead); Writing – review & editing (equal).
DATA AVAILABILITY
Complete input and output files for the NEP training of and crystals are freely available at https://gitlab.com/brucefan1983/nep-data. The source code and documentation for GPUMD are available at https://github.com/brucefan1983/GPUMD and https://gpumd.org, respectively.