This work extends the adjoint-deep learning framework for runaway electron (RE) evolution, developed by McDevitt et al. [Phys. Plasmas 32, 042503 (2025)], to account for large-angle collisions. By incorporating large-angle collisions, the framework allows the avalanche of REs to be captured, an essential component of RE dynamics. This extension is accomplished by using a Rosenbluth–Putvinski approximation to estimate the distribution of secondary electrons generated by large-angle collisions. By evolving both the primary and multiple generations of secondary electrons, the present formulation can capture both the detailed temporal evolution of a RE population beginning from an arbitrary initial momentum space distribution, along with providing approximations to the saturated growth and decay rates of the RE population. Predictions of the adjoint-deep learning framework are verified against a traditional RE solver, with good agreement present across a broad range of parameters.
I. INTRODUCTION
The description of runaway electron (RE) formation and evolution in magnetized fusion devices poses a substantial scientific challenge. While reduced analytic,2–6 semi-analytic,7–9 or machine learning10–13 models are often used to infer RE generation rates, threshold electric fields, or the number of seed REs, such models are unable to recover the detailed dynamics of RE evolution. Instead, the description of RE dynamics is often performed using continuum14–18 or particle-based RE solvers.19–22 In Paper I,1 the decay rate of a population of primary electrons was evaluated using a parametric solution to the adjoint of the relativistic Fokker–Planck equation. By targeting the adjoint to the relativistic Fokker–Planck equation, a single solution to this equation for a given set of parameters allowed the RE density to be evolved forward in time beginning from an arbitrary initial momentum space distribution of electrons. The framework was further generalized by utilizing a physics-informed neural network (PINN) to identify a parametric solution to the adjoint of the relativistic Fokker–Planck equation. Hence, once trained, the combined adjoint-deep learning framework enabled the dynamics of the RE density to be inferred beginning from an arbitrary initial momentum space distribution for a range of physics parameters. While the offline training time of the PINN was substantially greater than the solution time of a traditional RE solver, its online inference time was orders of magnitude faster, suggesting its potential as a surrogate model for RE evolution in integrated descriptions of tokamak disruptions.
Reference 1, however, did not describe the exponential growth of the RE population due to the avalanche mechanism,23 an essential component when describing the conversion of Ohmic to RE current in tokamak devices.24,25 The present paper seeks to remove this limitation by extending the adjoint-deep learning framework to incorporate large-angle collisions, thus allowing for the amplification of an initial seed population. A second generalization of the framework will be achieved by training across a broader range of physics parameters compared to Ref. 1. This is accomplished by introducing a normalized time and energy coordinate that allows features of the runaway probability function (RPF) to be accurately tracked while using a comparable number of training points as the narrow parameter range employed in Ref. 1. The derived adjoint-deep learning approach will thus provide an efficient framework through which RE dynamics can be accurately described.
The present paper is not the first to evaluate the avalanche growth rate of REs using an adjoint approach,26 nor the first to use a PINN for evaluating the avalanche growth rate.13 With regard to Ref. 13, that paper employed a PINN to solve the steady state adjoint of the relativistic Fokker–Planck equation. Using that solution, the saturated avalanche growth rate of the RE population was inferred. A limitation of Ref. 13, however, was that it computed the rate that secondary electrons are generated and run away, without accounting for the decay of the primary distribution. Thus, when below the threshold, Ref. 13 simply predicted a negligibly small avalanche growth, without accounting for the decay of the overall RE population. This resulted in a slight ambiguity in the determination of the RE avalanche threshold, together with the inability to correctly describe RE decay when below the threshold. Furthermore, Ref. 13 focused on the saturated growth rate of REs, and thus did not treat the transient evolution of the RE population. Results from the present model will fill these gaps by providing an efficient means of evaluating the temporal evolution of the RE density both above and below the threshold. The near and below threshold limits are of particular importance during the RE plateau, where accounting for the decay of the RE distribution is essential for accurately predicting the dissipation of the RE beam. While analytic approximations to this rate are available, see for example, Ref. 4, such formulas are highly inaccurate when below the threshold.1 When carrying out these extensions, we will consider the limit of a completely screened plasma for a constant electric field. A more comprehensive treatment of REs during a disruption will require including a temporally varying electric field and partial screening corrections,27 which will be left to a future work.
The remainder of this paper is organized as follows. Section II discusses an extension of the adjoint-deep learning formulation of the relativistic Fokker–Planck equation to include a secondary source term. Physics-informed neural networks are briefly described in Sec. III. The temporal evolution of the number density of an avalanching RE population both above and below the threshold is given in Sec. IV. Section V evaluates the avalanche growth and decay rates across a broad range of plasma conditions and compares with a traditional RE solver. A brief discussion along with conclusions are given in Sec. VI.
II. ADJOINT FRAMEWORK FOR RELATIVISTIC ELECTRON EVOLUTION
A. The adjoint of the relativistic Fokker–Planck equation
B. Runaway electron avalanche
III. PHYSICS-INFORMED NEURAL NETWORKS
IV. TEMPORAL EVOLUTION OF AN AVALANCHING RUNAWAY ELECTRON POPULATION
Training and test loss history. The solid blue curve indicates the training loss of the PDE, the solid red curve indicates the training loss for the boundary condition, and the “×” markers indicate the test loss (the test and training losses are the same for the boundary term). 3 000 000 training and testing points are used. The training points initially obey a Hammersley sequence, with the test distribution being uniform random. The training points are resampled using the residual-based adaptive distribution described in Ref. 37, every 50 000 epochs of L-BFGS.
Training and test loss history. The solid blue curve indicates the training loss of the PDE, the solid red curve indicates the training loss for the boundary condition, and the “×” markers indicate the test loss (the test and training losses are the same for the boundary term). 3 000 000 training and testing points are used. The training points initially obey a Hammersley sequence, with the test distribution being uniform random. The training points are resampled using the residual-based adaptive distribution described in Ref. 37, every 50 000 epochs of L-BFGS.
Two example time histories are shown in Fig. 2. Here, the time evolution of the total number of REs (solid blue curve), initial seed population (dashed black curve), and several generations of secondary REs (multicolored dashed curves) are shown. For panel (a) of Fig. 2, the electric field was chosen to be above the threshold ( ), with the total number of REs increasing exponentially (solid blue curve). Considering each generation, the seed electron population is normalized to have a value of one at , where its value remains largely unchanged since ( for these parameters). For each avalanche time step, a small number of secondary electrons are born, where each generation grows rapidly until plateauing later in time. The initial rapid increase in the secondary electron population is due to the energy and pitch distribution of secondaries immediately after they are born obeying Eq. (10), where the majority of these electrons will have energies below the 1 MeV threshold energy for an electron to be counted as a RE. For cases above the threshold, a large number of the secondary electrons will be quickly accelerated above 1 MeV, resulting in an increase in the total number of REs. This process is repeated for each generation of secondaries, with each generation having a larger magnitude due to the number of REs increasing according to Eq. (17b). Summing together the number of primary electrons with all generations of secondary electrons results in an exponentially increasing number of REs, as indicated by the solid blue curve in Fig. 2. Turning to Fig. 2(b), here, the electric field was chosen to be below the threshold ( ), with the total number of REs initially increasing slightly before decaying slowly in time. Considering first the seed runaway population, the number of seed electrons initially remains approximately constant, but then begins to decay at a later time. The finite number of seed electrons leads to the generation of secondary electrons, and hence an initial increase in the total number of REs. However, since each secondary generation decays exponentially in time, this does not lead to the long-term growth of the overall population. At late times, the total number of REs decays exponentially at a rate that is similar, though not identical, to the decay of the seed RE population.
Runaway electron evolution over time for two different values of the electric field. The number of total REs is shown by the blue curves; the seed population is indicated by the black dashed curve; and each generation of secondaries are indicated by the multicolor dashed curves. Panel (a) is for (above marginality), whereas panel (b) is for (below marginality). The other parameters are given by , , and , with 19 generations of secondaries included (i.e., 20 time steps). The seed electron distribution was taken to be Eq. (23) with .
Runaway electron evolution over time for two different values of the electric field. The number of total REs is shown by the blue curves; the seed population is indicated by the black dashed curve; and each generation of secondaries are indicated by the multicolor dashed curves. Panel (a) is for (above marginality), whereas panel (b) is for (below marginality). The other parameters are given by , , and , with 19 generations of secondaries included (i.e., 20 time steps). The seed electron distribution was taken to be Eq. (23) with .
The RE density histories for several different electric fields are shown in Fig. 3, where the PINN's predictions (blue curves) are compared with the RE solver RAMc (black curves). RAMc is a particle-based RE solver21 that evolves guiding center orbits together with small and large angle collisions, as well as synchrotron radiation. For the present case, the electrons were initialized on the magnetic axis for a large ITER-like device, such that neoclassical effects arising from magnetic trapping4,38–40 along with spatial transport41 are negligible. This will allow for comparisons with the present formulation, which does not account for toroidal geometry or spatial transport. Furthermore, RAMc uses a Møller source [Eqs. (8) and (9)] to describe secondary electron generation, a higher physics fidelity treatment of large-angle collisions compared to the Rosenbluth–Putvinski source used in the adjoint-deep learning approach. The present comparison will thus test the accuracy of the adjoint-deep learning framework along with regimes where a Rosenbluth–Putvinski source provides an adequate approximation to the large-angle collision operator. In all cases, the electron distribution was initialized to be a block of particles at high energies with pitch near [see Fig. 3(f)] to be consistent with the initial RE distribution implemented in the RAMc code. From Figs. 3(a) and 3(b), it is evident that when well above the threshold, the two approaches are in good agreement. However, when approaching the threshold [panels (c) and (d) of Fig. 3], more substantial differences between the two approaches are evident, with the PINN systematically overpredicting the RE density. The cause of this discrepancy can be linked to the Rosenbluth–Putvinski secondary source, which assumes the primary electrons to have asymptotically large energy. While very high energy primary electrons will be present when the system is above marginality, no such electrons are expected to be present when at or below marginality. As a result, the Rosenbluth–Putvinski source will prove inaccurate when near the threshold. Comparing the time evolution of the primary electron population [dashed curves in Figs. 3(d) and 3(e)] with the result from RAMc, it is evident that the decay rate of the primary electron population is in better agreement with the results of RAMc. The reason for this is that RAMc employs a Møller secondary source term that accounts for the energy and pitch of the primary electrons. When below the threshold, the energy of the primary distribution will be modest, such that the secondary electrons generated by the Møller source will have little impact on the RE number density, with the majority of them being born with too low an energy to run away. Thus, as the system drops further below marginality, neglecting the Rosenbluth–Putvinski source in the PINN leads to an increasingly good approximation to the RE density evolution.
Comparison of RE density evolution with the Monte Carlo code RAMc for electric fields [panel (a)], [panel (b)], [panel (c)], [panel (d)], and [panel (e)]. The initial electron distribution assumed for both RAMc and the PINN is shown in panel (f), where a rectangular block of electrons were inserted. The other parameters were taken to be , , and .
Comparison of RE density evolution with the Monte Carlo code RAMc for electric fields [panel (a)], [panel (b)], [panel (c)], [panel (d)], and [panel (e)]. The initial electron distribution assumed for both RAMc and the PINN is shown in panel (f), where a rectangular block of electrons were inserted. The other parameters were taken to be , , and .
V. PARAMETRIC VARIATION OF RUNAWAY ELECTRON AVALANCHE GROWTH AND DECAY RATE
(a) Training and test loss history. The solid blue curve indicates the training loss of the PDE, the solid red curve indicates the training loss for the boundary condition, and the “×” markers indicate the test loss (the test and training losses are the same for the boundary term). 1 000 000 training and test points were used. (b) Final training point distribution.
(a) Training and test loss history. The solid blue curve indicates the training loss of the PDE, the solid red curve indicates the training loss for the boundary condition, and the “×” markers indicate the test loss (the test and training losses are the same for the boundary term). 1 000 000 training and test points were used. (b) Final training point distribution.
Time slices of the RPF evolution (top row) along with associated residuals (bottom row) for and . The bottom row indicates the residual to Eq. (1a) multiplied by , such that the low energy divergence is removed. The first column indicates the terminal condition at , the second column and the last column . The dashed white curves are the location of secondary injection defined by Eq. (11). The other parameters are given by and , and we chose , , , along with a consistent with .
Time slices of the RPF evolution (top row) along with associated residuals (bottom row) for and . The bottom row indicates the residual to Eq. (1a) multiplied by , such that the low energy divergence is removed. The first column indicates the terminal condition at , the second column and the last column . The dashed white curves are the location of secondary injection defined by Eq. (11). The other parameters are given by and , and we chose , , , along with a consistent with .
Time slices of the RPF evolution (top row) along with the associated residual (bottom row) for and . The bottom row indicates the residual to Eq. (1a) multiplied by , such that the low energy divergence is removed. The first column indicates the terminal condition at , the second column , and the last column . The dashed white curves are the location of secondary injection defined by Eq. (11). The other parameters are given by and , and we chose , , , along with a consistent with .
Time slices of the RPF evolution (top row) along with the associated residual (bottom row) for and . The bottom row indicates the residual to Eq. (1a) multiplied by , such that the low energy divergence is removed. The first column indicates the terminal condition at , the second column , and the last column . The dashed white curves are the location of secondary injection defined by Eq. (11). The other parameters are given by and , and we chose , , , along with a consistent with .
Considering now a case well above marginality ( , , , see Fig. 6), the RPF quickly reaches a steady state, with the RPF at having nearly the same form as at , with the total time of the simulation being 2.19. The steady state solution has a contour of roughly one half at an energy of , suggesting that the maximum energy used in this case of is sufficient to capture the critical portion of the RPF.
Using the PINN described in Figs. 4–6, we will be interested in comparing predictions of the RE avalanche growth or decay rate with Monte Carlo simulations carried out with RAMc. Before carrying out this comparison, we note that the use of a Rosenbluth–Putvinski source to describe large-angle collisions introduces an ambiguity in the evaluation of the avalanche growth rate. In particular, since the Rosenbluth–Putvinski source only depends on the number of REs, and not on the energy distribution, it is necessary to introduce a criterion for classifying which electrons should be counted as REs. In the present formulation, the quantity defined by Eq. (3) acts as the effective definition of which electrons are counted as REs. In Fig. 7, this value is roughly for the hyperparameters selected, where is the critical energy to run away given by . This definition ensures that only electrons located substantially above the critical energy to run away are included in the large-angle collision operator.
Comparison of avalanche growth rates between the Monte Carlo code RAMc with a Møller secondary source and predictions of the PINN. The solid blue curve includes large-angle collisions, whereas the solid red curve includes large-angle collisions when above the threshold, but uses the primary decay rate when below the threshold, and the black “×” markers are values from RAMc. The other parameters were , , and . The initial electron distribution was defined by Eq. (23) with .
Comparison of avalanche growth rates between the Monte Carlo code RAMc with a Møller secondary source and predictions of the PINN. The solid blue curve includes large-angle collisions, whereas the solid red curve includes large-angle collisions when above the threshold, but uses the primary decay rate when below the threshold, and the black “×” markers are values from RAMc. The other parameters were , , and . The initial electron distribution was defined by Eq. (23) with .
An additional subtlety that arises when using an adaptive energy region is that, for the case of a large electric field, the modest value of the upper momentum boundary results in the integral used to evaluate the number of secondary electrons [Eq. (17a)] being prematurely cutoff. This can result in underestimating the number of secondary electrons generated. To account for this, we extended the energy integral in Eq. (17a) to infinity, where we take the value of the RPF to be for the extended region of the integral. For cases well above threshold, we will have (see the dashed white curves in Fig. 6, for example), and we would expect this quantity to remain near unity for . In contrast, when near or below the threshold, will no longer be unity, and will likely vary significantly at higher momentum. For this latter case, however, the high energy boundary will be located at several MeV (see Fig. 5, for example), where at such high energies, the Rosenbluth–Putvinski source has a negligibly small value. In this limit, the contribution from the high energy extension of the integral defined by Eq. (17a) will make a negligible contribution to the rate of RE growth or decay, such that our extension of the integral defined in Eq. (17a) past the boundary will have little impact on the predicted number of secondary electrons.
Finally, as evident in Fig. 5(c), when well below the threshold, a spurious contribution to the RPF can emerge at the low energy boundary. To remove this contribution, we will note two momenta. The first is the critical momentum for an electron to overcome drag, defined by , for , or otherwise. The second is a hybrid momentum formed by taking the weighted average of and , i.e., . To remove the spurious contribution to the RPF near , we will multiply the integrand appearing in Eq. (17a) by a Heaviside function for , thus removing electrons below the critical momentum, or otherwise. In so doing, contributions from momenta below will be removed when near or below the threshold, but when far above the threshold (i.e., when ), only momenta below are removed. Thus, in both limits, the spurious contribution to the RPF near is removed, but when well above the threshold, only very small values of momenta are impacted.
Using the above algorithm, the avalanche growth or decay of an initial seed RE population has been evaluated for cases both well above and below the threshold, with the result compared with RAMc simulations that employ a Møller secondary source (see Fig. 7). Good agreement is found both for a strongly decaying RE distribution along with a RE distribution that is rapidly growing. Considering a broader comparison between predictions from the PINN and RAMc, 50 values of the parameters were randomly sampled over the range of parameters the PINN was trained across. Cases that decayed too rapidly for a good exponential fit to be identified, which corresponded to cases far below the threshold as discussed in Ref. 1, were discarded leaving 44 cases that will be compared with PINN predictions. The resulting comparison between the two approaches is shown in Fig. 8(a). Generally, good agreement is evident, suggesting that the PINN is able to describe the growth or decay of a RE population across a range of plasma conditions. We note that the rapid increase in the magnitude of the RE decay rate as the electric field drops below the threshold (i.e., the steep drop off evident in Fig. 7) resulted in only a few data points that were substantially below the threshold, but not so far below the threshold to prevent an exponential fit, being present in our randomly sampled dataset.
Comparison of avalanche growth and decay rates between the Monte Carlo code RAMc and the PINN over a broad range of plasma conditions. Panel (a) includes a Rosenbluth–Putvinski secondary source term for all the PINN's predictions, whereas panel (b) uses the decay rate of the primary distribution when the PINN predicts RE decay. These latter points are highlighted in red. The electric field, effective charge , and synchrotron radiation were selected randomly over the region the PINN was trained. The Coulomb logarithm was taken to be in this scan. The initial electron distribution was defined by Eq. (23) with .
Comparison of avalanche growth and decay rates between the Monte Carlo code RAMc and the PINN over a broad range of plasma conditions. Panel (a) includes a Rosenbluth–Putvinski secondary source term for all the PINN's predictions, whereas panel (b) uses the decay rate of the primary distribution when the PINN predicts RE decay. These latter points are highlighted in red. The electric field, effective charge , and synchrotron radiation were selected randomly over the region the PINN was trained. The Coulomb logarithm was taken to be in this scan. The initial electron distribution was defined by Eq. (23) with .
The largest disagreement between the predictions of the PINN and RAMc occurs when below the threshold. As noted in Fig. 3, and the ensuing discussion, when below the threshold, the Rosenbluth–Putvinski source overpredicts the impact of secondary electron generation on the RE density, where a better approximation can be achieved by neglecting the secondary source and computing the decay rate of the primary distribution. This is done in Fig. 8(b), where the decay rate of the values (red “×” markers) that the PINN predicted were below the threshold, is now computed using the decay rate of the primary distribution, yielding improved agreement with predictions from RAMc, particularly for the strongest decaying cases.
VI. CONCLUSION
An adjoint-deep learning framework for evaluating the time evolution of the RE density incorporating large-angle collisions was derived. By utilizing a PINN to identify the parametric solution to the adjoint problem, this enables the time evolution of the RE density to be rapidly inferred across a broad range of parameters for an arbitrary initial momentum space distribution. This framework was applied to evaluate time histories of the RE density, along with saturated growth and decay rates across a broad range of physics parameters. Excellent agreement was found between predictions of the adjoint-deep learning framework and a traditional RE solver for the saturated avalanche growth rate. It was found that near and below marginality, the Rosenbluth–Putvinski approximation to the secondary source term lead to quantitatively inaccurate time histories of the RE density, though the qualitative behavior was captured. Furthermore, it was found that when below marginality, the decay rate of the RE distribution is well approximated by the decay rate of the primary distribution function (i.e., neglecting large-angle collisions). We anticipate that extending the present framework to include a more comprehensive set of physics, including time varying electric fields, temporally varying plasma compositions, and partial screening corrections to collisional coefficients, will allow the present approach to provide an efficient, yet accurate tool for evaluating saturated RE growth and decay rates, as well as describing RE dynamics during rapid variations in the background plasma. Finally, the present approach to describing RE generation can be extended to describe other RE generation mechanisms including nuclear seeds arising from tritium decay or Compton scattering,6 thus, providing a comprehensive treatment of RE formation across a range of plasma scenarios. Such extensions will be the topic of future work.
ACKNOWLEDGMENTS
This work was supported by the Department of Energy, Office of Fusion Energy Sciences, at the University of Florida under Award Nos. DE-SC0024649 and DE-SC0024634, and at Los Alamos National Laboratory (LANL) under Contract No. 89233218CNA000001. The authors acknowledge the University of Florida Research Computing for providing computational resources that have contributed to the research results reported in this publication. This research used resources of the National Energy Research Scientific Computing Center (NERSC), a Department of Energy Office of Science User Facility, using NERSC Award No. FES-ERCAP0028155.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Christopher J. McDevitt: Conceptualization (lead); Data curation (equal); Formal analysis (lead); Funding acquisition (equal); Investigation (equal); Methodology (equal); Project administration (equal); Resources (equal); Software (equal); Supervision (equal); Visualization (equal); Writing – original draft (lead); Writing – review & editing (equal). Jonathan S. Arnaud: Conceptualization (equal); Data curation (lead); Software (equal); Visualization (equal); Writing – review & editing (equal). Xian-Zhu Tang: Conceptualization (equal); Formal analysis (equal); Funding acquisition (equal); Methodology (equal); Project administration (equal); Writing – review & editing (equal).
DATA AVAILABILITY
The data that support the findings of this study are available from the corresponding author upon reasonable request.