Particle dynamics in Earth's outer radiation belt can be modeled using a diffusion framework, where large-scale electron movements are captured by a diffusion equation across a single adiabatic invariant, . While ensemble models are promoted to represent physical uncertainty, as yet there is no validated method to analyze radiation belt ensembles. Comparisons are complicated by the domain dependent diffusion, since diffusion coefficient is dependent on L. We derive two tools to analyze ensemble members: time to monotonicity and mass/energy moment quantities . We find that the Jacobian ( ) is necessary for radiation belt error metrics. Components of are explicitly calculated to compare the effects of outer and inner boundary conditions, and loss, on the ongoing diffusion. Using , , and , we find that: (a) different physically motivated choices of outer boundary condition and location result in different final states and different rates of evolution; (b) the gradients of the particle distribution affect evolution more significantly than ; (c) the enhancement location, and the amount of initial background particles, are both significant factors determining system evolution; (d) loss from pitch-angle scattering is generally dominant; it mitigates but does not remove the influence of both initial conditions and outer boundary settings, which are due to the L-dependence of . We anticipate that this study will promote renewed focus on the distribution gradients, on the location and nature of the outer boundary in radiation belt modeling, and provide a foundation for systematic ensemble modeling.
I. INTRODUCTION
Radial diffusion is a phenomenon studied in both space and fusion plasmas. In this work, we investigate how the radial dependence of that diffusion interacts with initial and boundary conditions. We provide here a guide for different readers to navigate this paper. First, in the introduction, we provide a broad introduction of the motivations behind understanding the uncertainty in radial diffusion modeling in near-Earth space. Plasma physicists without a radiation belt background may wish to review the application of radial diffusion in near-Earth space in Sec. II. For all readers, we explicitly list our goals in Sec. III, introducing labels which are used throughout the manuscript as we develop each research question, find relevant results, and then discuss our conclusions. Section IV contains details of the numerical models and develops the properties we require in an analysis metric, thereby motivating our use of time to monotonicity and the mass- and energy-like quantities . We find and present the most significant ways in which these quantities vary in Sec. V. Radiation belt modelers and space weather physicists may be particularly interested in the suggestions we make for future based on our findings in Sec. VI, where we also compare our results to current modeling practices. Some of our more significant conclusions relate to the importance of the outer boundary and the particle gradients instead, which are also discussed in Sec. VI. Where possible, each section is self-contained, to enable those with different interest to find the relevant sections useful.
Earth's radiation belts are a region of highly energized particles, magnetically trapped by the Earth's magnetosphere. The trapped particles undergo several types of periodic motion, the slowest of these being the drift around the Earth. Electromagnetic perturbations on timescales of the drift of electrons around the Earth will scatter those electrons onto orbits closer to, or more distant from, the Earth; this is radial diffusion. Drift timescales can vary with particle energy from minutes to days but is typically considered to be on the order of a few hours. Radial diffusion is one of the major drivers of Earth's radiation belts and due to the variation in timescales of radiation belt particles motions, an approximation for radiation belt modeling can be made using solely this mechanism to reproduce the broad dynamics, although not shorter timescale phenomena such as dropouts.1,2 This simple yet effective model enables us to explore ways of including uncertainty in our radiation belt models using ensembles. A full model of the radiation belts would require us to acknowledge that they are part of a complex, interdependent system, with consequently larger amounts of uncertainty. Radial diffusion modeling using the Fokker–Planck formalism is reviewed in more detail in Sec. II; briefly, radiation belt modeling using Fokker–Planck simulations does not use the motion of individual particles but instead averages over motion on larger scales, where wave–particle interactions across many scales are included using diffusion coefficients. Where the wave–particle interactions are well understood, variability should be properly characterized in order to capture the correct uncertainty. Where these interactions are not well understood, large uncertainties can indicate inadequate modeling. We see uncertainty across orders of magnitude when estimating in units of . For example, using different models on diffusion estimated by particle tracing in an MHD simulation spans four orders of magnitude at (i.e., spanning minutes to days),3 while empirical models parameterized by geomagnetic activity still vary by more than one order of magnitude even at the same location and geomagnetic activity level.4 Unsurprisingly, varying the spatiotemporal distribution of diffusion coefficients changes the final particle distribution.5 Previous work to address the uncertainty in diffusion coefficients shows that probabilistic inference of these coefficients outperforms our current empirical models.6 Uncertainty can be both inherent to the system and an indicator of a poorly described system; both these situations can be addressed using appropriate modeling techniques.
Ensembles can be used to quantify uncertainty in radiation belt modeling. Ensemble modeling involves running a simulation multiple times, with a variety of input conditions or parameter settings, to represent the unknowns in a given system. The impact of these unknowns on the final state can be quantified; alternatively, variability across model outputs provide a measure of uncertainty on that final state. Probability distribution of model outputs provide us with a better understanding of the uncertainty associated with our models. There are several sources of uncertainty in radiation belt modeling, including uncertainty due to approximations or physical unknowns, uncertainty in observations (i.e., in the measured value and in the spacecraft location), uncertainty in boundary conditions or model settings, and physical uncertainty inherent to the system. Physical uncertainty arises from the fact that deterministic models may exhibit chaotic behavior if they are particularly sensitive to initial conditions. Given that the magnetosphere is a complex system and that observations are spare compared to spatiotemporal scales of interest, it is extremely likely that we will need some way to include this chaotic deterministic character. Furthermore, the computational requirements of our modeling mean that we have sub-grid physics: physics on smaller scales that must be included, but cannot be fully modeled numerically. Uncertainty in driving parameters, such as upstream solar wind and ultra-low-frequency (ULF) waves driving radial diffusion, can also affect the accuracy of the model. Example sources of uncertainty include the properties of the solar wind and how this drives ULF waves. In turn, the uncertainty in magnitude and spatiotemporal occurrence of ULF waves will result in uncertainty in radiation belt models. In addition to this uncertainty chain, modeling of each physical process adds further sources of uncertainty, for example, to include the impact of ULF waves, it is typically assumed that the ULF azimuthal mode number is , which does not hold in in situ observations.7,8 As radiation belt modeling improves, it becomes increasingly important to understand how all of these sources of uncertainty impact our final output to be able to analyze ensembles.
As uncertainty becomes increasingly important in radiation belt modeling, methods to account for uncertainty such as data assimilation and statistical methods are becoming the state-of-the-art method for modeling radiation belts,9–11 while complex systems approaches are being applied to understand the underlying physics.12 Ref. 13 states that the field of space physics needs ensembles and methods to deal with and analyze ensembles, in order to manage uncertainty in parameterizations and in various parts of numerical space weather prediction. Ensembles, and other probabilistic or statistical methods, are already becoming the norm.6,14–16
Satellite operators and national meteorological organizations are increasingly compelled to include space weather forecasting as part of their services, including the radiation environment. Geomagnetically trapped particles in the radiation belts have been established as a hazard to spacecraft due to processes such as surface charging, deep dielectric charging, and single upset events.17,18 The loss of services, such as the Internet, would cause severe socio-economic issues such as losses to navigation, finances (including card-based payments), and tracking/allocation of emergency services and commercial aircraft. These economic motivations for adapting modeling this fascinating system have also encouraged the space plasma physics community to adopt techniques mastered by meteorologists. However, the domain dependence of the diffusion coefficient means that using and analyzing ensemble members is not simple, rendering ensembles less meaningful than desired.
To understand ensemble models for radial diffusion (and for radiation belt modeling more generally), we need to understand how simple model changes and initial and boundary conditions change the outputs, before we include variability to represent uncertainty in physical conditions.
II. BACKGROUND: RADIAL DIFFUSION MODELING IN EARTH'S RADIATION BELTS
Radiation belt modeling is typically done using three adiabatic invariants; three quantities associated with the periodic motions of trapped, highly charged particles in Earth's magnetosphere. At relatively low energies (e.g., electrons of 1-hundreds of keV), particles are better described using the ring current paradigm. At higher energies (e.g., relativistic electrons MeV), we use the geomagnetically trapped radiation belt description.19,20 Periodic motions of particles in a slowly changing conservative field (such as electromagnetic fields) correspond to conserved quantities; adiabatic invariants. The first adiabatic invariant, , corresponds to the magnetic moment as particles gyrate in an electromagnetic field. The second invariant corresponds to momentum along the bounce path, as the gyrating particle travels up and down the field line, reflected by the stronger magnetic field “magnetic bottle”). These motions are on scales of microseconds and seconds, respectively. Under the adiabatic approximation, if the system is changing slowly, then these quantities are conserved. On the other hand, if the system is perturbed on timescales comparable to these motions, the quantities or J would not be conserved. On a much longer timescale, the periodic motion of electrons drifting around the Earth corresponds to the third adiabatic invariant , the magnetic flux enclosed by this drift path. This can be written as for particles remaining at the equator, with the equatorial distance, the magnetic field strength at the Earth's surface, and the radius of the Earth. One can, therefore, model the radiation belts using these conserved quantities, by tracking the distribution of particles in this adiabatic invariant space and assuming that the change in this space can be modeled using diffusion—where diffusion between these occurs when electromagnetic perturbations occur on the temporal and spatial scales of the corresponding periodic motions. Diffusion is described using diffusion coefficients . Typically, cross-terms corresponding to the third adiabatic invariant are considered to be zero; on the whole, changes to the third adiabatic invariant (“radial diffusion”) is considered to be on a separate timescale and can, therefore, be approximated alone (e.g., Ref. 21) The full diffusion model of the radiation belts has been used in a number of places, e.g., Refs. 21–24.
For the third adiabatic invariant, one can equivalently consider the drift shells instead of the magnetic flux conserved by the drift orbit. Since the drift shell is uniquely defined by its intersection with the equatorial magnetic field, it is often easier and more intuitive to parameterize the third adiabatic invariant using some form of parameter, which roughly translates to which nested drift shell a given electron is confined to. (From this point on we will refer to L-parameters, rather than , for simplicity in notation). The drift shells a given L correspond to will depend on the specific magnetic field model used, and multiple methods of defining L exist depending on both the magnetic field and the approximations used to specify the drift shells (see, e.g., Refs. 19, 20, 25, and 26). If one considers the L parameter as a proxy for the relative radius of each drift shell, it becomes clear why diffusion across the third adiabatic invariant is called radial diffusion: particles diffusing to different L values are diffusing onto drift shells closer to, or further from the Earth. On the whole, radial diffusion is related to the large scale movement of particles toward and away from the Earth, acting to reduce gradients in the phase space density profile. The inward diffusion also corresponds to an increase in energy, if the first adiabatic invariant is conserved. Meanwhile, smaller scale processes breaking the first and second adiabatic invariants can result in local acceleration of particles (e.g., Refs. 27 and 28).
The diffusion coefficient contains the combined effect of electromagnetic perturbations on timescales corresponding to the electron drift. These perturbations consequently break the third adiabatic invariant, causing the diffusion to nearby (or, equivalently, L). A comprehensive review of the current state of knowledge of radial diffusion coefficients can be found in Ref. 29. The first attempts to quantify this into a diffusion coefficient assumed that such perturbations were stochastic; small scale continual ripples in the magnetosphere. The contributions from magnetic and electric potential perturbations were considered separately, using asymmetric and symmetric perturbations from a simple magnetic field model.30 Unfortunately these theoretical diffusion coefficients are problematic to apply; the magnetosphere is significantly more dynamic, rendering these assumptions invalid, while in practice one cannot observe these quantities to estimate them more accurately. There exists a gap between the theory and application of radial diffusion; accurate diffusion coefficients would require knowledge of the entire magnetosphere at each time step to be able to calculate electron drift paths. Theoretical approaches must use magnetic field models and tend to focus on determining the validity of the underlying assumptions in order to derive a more appropriate method of calculating diffusion coefficients.26,31,32 Any estimations of used in modeling must make a significant number of approximations, most often based on the techniques in Refs. 33 and 34 (particularly operational models).
Reference 34 derives the diffusion coefficients for a particular storm using a formalism based on:30 for a given magnetic field model, find the deviation from drift contours due to azimuthally symmetric and asymmetric electromagnetic perturbations, and from this find and hence Ref. 34 uses a compressed dipole magnetic field, using an asymmetry factor .Ref. 34 also splits the diffusion coefficients into diffusion due to magnetic and electric perturbations, rather than into perturbations from induced electric and electric potential fields.Ref. 34 used the larger component of the two to avoid counting the effect of the same perturbation twice, but the component which was dominant during their case study is not always the strongest one.35,36 In many subsequent studies, these components are simply added together to estimate . This makes the diffusion coefficients much easier to find and apply, but the resultant double-counting of perturbations means the final values could be off by around a factor of 2.31 However, uncertainty in spans at least one and sometimes up to four orders of magnitude, e.g., across similar time periods,36 between observations and models (Ref. 35, Fig. 6), and across different models at the same Kp and L (Ref. 3, Fig. 6) (Ref. 4, Fig. 4). Given this uncertainty, and also the uncertainty in the relative size of induced electric and electric potential components, double-counting is considered an acceptable compromise. Other choices made by Ref. 34 based on properties of their storm-time magnetosphere are also replicated in subsequent methods, simply to have any ability to model the radiation belts at all. This may be why so many of these methods perform similarly - well on average, but with little specificity. A list of sources of uncertainty in can be found in Ref. 37.
III. RESEARCH GOALS
Here, we pause to explicitly identify research goals. Having a separate section for this allows us to report back on unfruitful research avenues and to directly label our goals to conclusion in the discussion, making the logic easy to navigate across several key results. No research paper has only a single important result, and we hope this will bring out subtleties that may otherwise be missed.
We split these research goals up into:
-
Primary goals (research questions identified when scoping out the project and applying for funding)
-
Secondary goals (research questions that arose when developing our methodology)
-
Additional goals (further questions that arose as part of the analysis which we realized can answer as part of this work)
We choose to present our work in this way as it reflects our cyclic methodology far more accurately, where we constantly revisited concepts and experiments until we reached a coherent structure explaining our results.
A. Primary goals
[P1] Benchmarking for ensembles.
-
How does one analyze ensembles for radiation belt modeling? How can we quantify differences between ensemble runs?
-
Do changes in initial condition affect the differences between ensemble members? If so, which aspects of the initial condition have the strongest impacts?
-
Do changes in model settings (such as ) affect the differences between ensemble members?
[P2] What is the timescale of radial diffusion?
-
What is a useful definition of “timescale”?
-
What is the general radial diffusion timescale?
-
Does timescale vary with initial conditions?
B. Secondary goals
[S1] What do we do if we are not using a data-driven outer boundary?
-
What boundary conditions would be physical to use? (rather than what is convenient for our observations)
-
Where there are multiple potentially physical boundary conditions, how do they affect the timescale and evolution of radial diffusion?
-
What boundary conditions can we use in practice to balance physical boundaries (S1a) with observational and operational modeling constraints?
-
How does the choice of outer boundary condition and location interact with the current uncertainty on the outer boundary of radiation belt models?
-
How does the outer boundary location interact with the increased diffusion coefficient at high L?
[S2] What happens to an enhancement from local acceleration under radial diffusion (rather than just inwards diffusion of the “background” distribution, i.e., a source at high L?)
[S3] How can time to monotonicity/morphology of the particle distribution be used to analyze ensemble members?
[S4] Do we need to include loss from precipitation via pitch-angle scattering to represent radial diffusion?
C. Additional goals
[A1] What analytical methods can be adapted to understand radial diffusion?
[A2] How important are PSD gradients vs the L-dependence of ?
[A3] Is diffusion limited by the smallest value of in the domain, i.e., the diffusion coefficient at lower L?
In Sec. VI, we will discuss our findings for each of these questions and outline future questions that are unanswered.
IV. METHODS
In this Sec., we will outline the scheme used for the numerical experiments in our ensemble and the metrics we use to analyze our experiments. We choose not to compare variability across the final phase space density distributions after a set time period, but to compare how several useful quantities vary across ensemble members. We use a proxy (time to monotonicity, ) that indicates when radial diffusion has finished changing the shape of the distribution, and mass-like and energy-like techniques from analysis of dynamical systems to understand the ongoing evolution of the distribution. These tools allow us to see how radial diffusion is still contributing to radiation belt dynamics.
In Sec. IV A, we discuss our equation for the initial condition, the parameters we will vary in our ensemble and the diffusion model we use. In Sec. IV B, we will motivate time to monotonicity as a metric for timescale. In Sec. IV C, we derive the energy density-like and mass-like quantities we use to verify and interpret our simulations.
A. Numerical experiments
1. The diffusion model
2. Diffusion model with loss
Following initial experiments, we investigated whether our simple model required additional terms to validly represent the physics of the outer radiation belt. An important loss mechanism in the outer radiation belt is pitch-angle scattering (e.g., Ref. 44) where the interaction between electromagnetic waves and electrons results in changes to the velocity vector relative to the magnetic field direction. In the relatively dense region of the plasmasphere, observed plasma and whistler-mode wave conditions are such that pitch-angle scattering is enhanced for high-energy electrons (e.g., Ref. 45).
3. Details of the numerical scheme
Reference 49 shows that this scheme is unconditionally stable in the case where diffusion coefficients vary in time, but does not consider where the diffusion coefficients vary in space, as the coefficient matrix is then no longer symmetric. However, verification for our model can be found in the supplementary material of Ref. 5 in addition to the initial SpacePy verification of Ref. 48. The time step and spatial resolution of the simulations are 1 s and 0.1L, respectively.
4. Initial conditions
This reflects typical phase space densities using a peak and step, which represents a state where inward radial diffusion has been occurring for some time (the step) and an enhancement of locally energized particles (the Gaussian). These distributions were chose to reflect the phase space density distributions observed in Ref. 50. Individual parameters are
-
A amplitude
-
B step size (strength of error function)
-
width of density peak
-
location of phase space density peak in L-space
Demonstrations of these initial settings can be found in Fig. 1. Note that would result in an initial PSD distribution of only a Gaussian, while increasing B represents a decreasing difference between height of the peak and height of the step. results in a step of comparable size to the enhancement, where the PSD at high L is the same or greater than the pre-peak PSD. Although at the plateau asymptotes to the height of the Gaussian, since the two terms are summed together this is not a flat plateau. Our default settings are and The inner boundary is always set at . We vary the location of the outer boundary ( ) to reflect the fact that radial diffusion models often have different and to investigate the impact of different outer boundary locations when we know that diffusion varies in space as well as time.
5. Outer boundary condition
The model in Ref. 5 uses a constant value (Dirichlet) inner boundary to characterize the inner edge of the radiation belt, where particles are lost to the atmosphere. Choices of the outer boundary are less clear. Physically, one expects the PSD to be smooth across the boundary; hence, a Neumann (constant gradient/zero flux) boundary may be appropriate. By default, the model uses a constant gradient (Neumann) outer boundary, set at zero, which matches the physics of a slow injection from high-L as represented in our initial condition. However, a Dirichlet boundary allows the modeler to input PSD values, and indeed large-scale models tend to interpolate time-varying outer boundary values from available data. Neither of these methods are designed to use the true edge of the outer radiation belt, which varies considerably when an outer edge is distinct enough to observe at all.51 Since both methods should be physically appropriate for the underlying plasma, in our results we investigate the impact of both choices of outer boundary condition. We will review this outer boundary in the discussion, following our results [S1a].
B. Metric requirements to compare ensemble members: Time to monotonicity
Ensemble modeling for predictive purpose use the variability across model runs in the given ensemble, using a given error metric or loss function. We are also interested in examining timescale of radial diffusion, for which we need a quantitative measure.
However, error metrics are not an ideal tool for comparing the evolution of phase space density. They do not tell us about the evolution of the system state, or properties of that state we are interested in. Therefore, one of the secondary goals of this work was to select and investigate potential metrics. Initially, simple error metrics such as mean square error (MSE) were selected. MSE would quantify a scalar difference between distributions, which would be useful for analysis of variation and uncertainty across ensembles. However, the significant variation in scales covered in this problem make it difficult to generalize or compare different cases. We found this regardless of using linear- or log-based scales; thresholds usually would need to be specified for when radial diffusion was “finished enough” or for when two distributions were “similar enough,” and the results became dependent on that choice of threshold. Instead, our experiments were analyzed with a property that captured the physics of the system we were interested in: time to monotonicity, . This choice of metric is motivated below. Requirements of the metric used for analysis are [A1]:
-
Robustness. The metric used must be insensitive to any thresholds used. It must, therefore, be scale independent (i.e., work across multiple orders of magnitude, because of Kp dependence)
-
Interpretable The metric must aid in understanding the system.
-
Radiation belt system specific. The metric must be related to radiation belt modeling; it should provide insight into either the system state, or specific properties related to physical processes.
-
Time-series informative. The metric must enable analysis of the evolution of system.
Initially, potential measures were tested, such as the decreasing MSE between distributions at each time step, the evolution of maximum gradients, the total area, and the proportional change in amplitude. All these required an arbitrary threshold to determine when a given experiment was “finished,” the choice of which strongly impacted the time until distributions became similar, particularly for larger Kp. For example, MSE-based metrics varied by orders of magnitude depending on several model choices. No MSE-based metric could be found that worked robustly. Furthermore, many of these metrics ended up being dominated by the inner boundary condition, whereas we wanted to know about the evolution of the entire phase space density distribution.
To find an appropriate metric of the how the ongoing radial diffusion affected the evolution of the system, we turned to properties of the PSD distribution under radial diffusion. Figure 2(a) shows the phase space density profiles we expect on the timescale of radial diffusion. A quiet time, or background distribution, is shown in solid black lines: a low level, high L source of particles feeds the system from constant substorms. Due to the L dependence of the diffusion, the drop-off at low L is quite sharp. In dashed lines, an enhancement of particles due to local acceleration is shown at an intermediate L. Following a single enhancement, we expect the distribution to gradually return to a monotonic state via radial diffusion.27,28
Therefore, since diffusion acts to even out gradients in the PSD, we are more interested in changes in the shape of the PSD distribution than the total PSD (i.e., the integrated area under the curve). Hence, we also include the following criteria:
-
Insensitive to total particle population. The metric must be insensitive to shifting the distribution up and down the y-axis.
We chose to work with time to monotonicity, . When the distribution has become monotonic, the PSD distribution no longer has a peak for radial diffusion to smooth out. , therefore, indicates when the PSD distribution has stopped changing shape, when radial diffusion is no longer changing the properties of the radiation belts. is a proxy for whether radial diffusion is still significantly affecting the evolution of the particle distribution. The potential monotonic distributions are shown in Fig. 2(b); the “background” distribution which is nonzero but unchanging in time (i.e., the influx of particles balanced by constant movement of particles inwards), a shrinking version of this when more particles are lost than enter the domain, and finally a zero profile.
In the results section, we use to explore how idealized radial diffusion models vary when changing initial settings. represents our physical expectations; our intuition that after a localized enhancement, the PSD will eventually relax to monotonic distribution. We can test our expectations and how the changing parameters affect these expectations. See Sec. VI for evaluation of our chosen measure and for alternative approaches.
C. Analytical approach to comparing evolution of ensemble members
In this section, we derive the terms that comprise the changing energy ( ) and mass ( ). The figures comparing these terms, evaluated explicitly for each of our boundary conditions, can be found in Sec. V where they best support our analysis.
The third term in is one that will give us insight into how the dynamics of the distribution will evolve to minimize our energy. This term has an integral, which depends on the square of the derivative in L. As this is a non-negative quantity, this ensures that the integral contributes to energy loss (as it is preceded by a negative sign) while gradients in the distribution function exist, up to a point where the contributions from the boundary conditions balance this out. This is an property identical to Cahn–Hilliard diffusive dynamics, where diffusive terms correspond to energetics that penalize the formation of (sharp) gradients.53,54 This penalty is enhanced for gradients occurring at higher values of L, as their contribution to the integral will be increased by a positive power of L, suggesting that the distribution function can minimize this integral by moving gradients to lower values of L. Thus, decreases toward a long-term solution through diffusion to reduce gradients and through the population moving toward lower L, and this movement of the population will be increased for higher powers n.
V. RESULTS
Our methodology was to begin with , to find out how long it takes a given initial distribution to reach a monotonic state, and how this varies with initial conditions. We first compared with Kp for each parameter in Eq. (10). For , we note that from Eq. (2), we expect timescale to vary significantly with Kp. Kp is a proxy for strength of radial diffusion, even though is unlikely. Since we expect time to monotonicity to depend on Kp we primarily use a heatmap for the ensemble used to investigate each parameter, demonstrating how varies with each (Kp, parameter) pair. To aid understanding, these results are also presented in an alternate format, where the for each Kp are plotted as a line. Each experiment ran for a week; model runs where monotonicity were not reached are left empty.
Following our initial analysis, we then investigated any noteworthy results, for example by looking at the evolution of the phase space density of a specific simulation. In general, runs with a Neumann outer boundary condition (zero flux) are shown on the left, while the right hand column corresponds to a Dirichlet outer boundary condition (constant value).
Finally, we incorporated our analysis from Sec. IV C for each parameter to understand and generalize any patterns we saw. Our experiments are also run for a week, using in our diffusion coefficient. Just as for our analysis, we calculated for each returned time step (every 6 h).
A. Results part 1: Without loss rate
Each parameter in the initial condition was systematically investigated. Selected results are presented in the main text in an order chosen to best convey our conclusions; the full set of individual , and experimental figures can be consulted in the supplementary material, labeled as Figure. SX. As a one dimensional simulation, the number of particles in a given phase space bin (PSD) is in units of [SI units, per (unit speed unit length)]. This can be scaled to any of the other units systems used for PSD elsewhere.
1. The difference between the two outer boundary conditions
Overall, it is clear from Fig. S1 that generally, more runs with a Neumann (fixed gradient) outer boundary reach a monotonic state within a week, compared to runs with a Dirichlet (fixed value) outer boundary. We will investigate the two outer boundary conditions before examining the effect of each parameter in the initial condition. To understand the evolution of f in these experiments, in Fig. 3, we show the phase space density for both Neumann and Dirichlet outer boundary conditions, with an outer boundary located at . We use default settings for the initial PSD, a Kp of 4 and run for a week. Panels (a) and (b) show the heatmap, while waterfall plots are in panels (c) and (d).
In both experiments, the peak remains high and moves inwards. However, there is a difference in the outer part of the simulation, as demonstrated in Fig. 4. The plateau becomes significant for the Neumann boundary but not for the Dirichlet boundary. This makes sense as the outer boundary value is able to rise for the Neumann case, reflecting outward radial diffusion. For Dirichlet runs, the outer boundary value cannot rise, but particles can be lost. Note that the amplitude of the Neumann peak is still comparable to the Dirichlet case (4.55 and 4.52 at and 3.7, respectively); it is the plateau that has changed. More Neumann experiments reach monotonicity as the plateau can rise instead of having to wait until the peak diffuses completely inwards. This inability for the high-L (to the right of the peak) PSD to reach (positive) monotonicity independently of the left part is one reason why it takes longer for the Dirichlet runs to reach monotonicity. This corresponds to outward radial diffusion varying depending on the outer boundary condition [S1b].
For a more realistic outer boundary condition, we need to consider these, along with the fact that we do not currently have a clear outer boundary location. We discuss all these factors together in Sec. VI D.
We can also examine how the evolution of our mass- and energy-like densities varies with Neumann and Dirichlet outer boundary conditions. Figure 5 shows across the week, plus the components of . Panels (b) and (d) show the absolute value of components on a log scale to make order of magnitude comparisons earlier; in rare cases in later analyses where the change becomes positive (i.e., a gain in mass or energy, rather than a loss), this is specified.
By definition, for Neumann experiments , mass is only lost through the inner boundary. For the Dirichlet experiments, it is clear that the outer boundary dominates loss, with up to two orders of magnitude larger than ; hence, the number of particles decreases more rapidly. We find that less than 0.8% of the original mass is lost with a Neumann outer boundary, while around 24% of the mass is lost with a Dirichlet boundary [S1b].
Comparing the individual terms contributing to changes in mass in Fig. 5(b), we see that loss from the inner boundary is the same across the week regardless of the outer boundary (and therefore independent of the different interior distribution as the system evolves). For inner boundary loss to be big enough to vary, one must run experiments with very large diffusion coefficients [e.g., using in Eq. (2)] or for a much longer time.
The results for our norm are somewhat counterintuitive. In total, we know that experiments with a Neumann outer boundary can eventually reach a lower- state (zero everywhere) than experiments with a fixed outer boundary value, where the minimum-energy state will have the same fixed value at as the initial condition. However, we find that Neumann simulations appear to be reaching a limiting state, where is increasingly smaller and relatively unchanged.
For a Neumann outer boundary, . While the Dirichlet outer boundary can contribute to changing energy ( ), we can see from Fig. 5(d) that the dominant mechanism for energy loss is mostly from the reconfiguration term . However, this term accounts for more energy loss when the outer boundary is Dirichlet rather than Neumann, because there are more steeper gradients when the outer boundary is fixed. Since depends on gradients ( ), is larger for Dirichlet runs as there are gradients both sides of the peak, rather than just to a plateau. For Neumann experiments, the gradients rapidly flatten into a plateau at higher L. Remember that our equation for has a factor of in it: the same PSD at a higher L contributes less to the norm, because it can be moved around more easily. Hence, Neumann runs have a lower . Once the plateau has been reached, the only way to lose energy is for material to move down the gradient (and then out through the inner boundary). This process is slow and so is effectively limited. On the other hand, the Dirichlet experiments are more effectively moving material to higher L, and then out of the domain. Having an outer boundary that allows flux in/out means that norm is reducing more quickly than when we allow the PSD at the outer boundary to change. Hence, Neumann appears to be reaching a limiting state first, where and is unchanging, even though we know that if left forever, it can reach a lower-energy state than Dirichlet [S1b;S1e].
2. Significant properties of the initial condition ( )
The results of systematically investigating each parameter, using first time to monotonicity and then our quantities , are shown here. Those properties we have considered to have a significant impact on time to monotonicity are presented in detail, while remaining properties are covered briefly in Sec. V A 3 [P1b;S3a].
Step size B: The step size B corresponds to a system where the phase space density is higher further out in the radiation belts; i.e., a situation where inwards radial diffusion from a distant source has already occurred. A higher step, therefore, corresponds to radiation belts that have more material before the enhancement occurs.
Figures 6(a) and 6(b) show how time to monotonicity varies with both Kp and increasing step size. Blank values indicate monotonicity was not reached within a week, the length of the experiments. When starting with a larger step size, we find that monotonicity is reached sooner for both outer boundary conditions. This behavior is as expected as with an increased B, and f is already closer to a monotonic state. Again, more ensemble runs with Neumann outer boundary reach monotonicity. Looking at this information in the alternative format in panels (c) and (d), we see that for both outer boundary conditions, appears to increase exponentially for smaller step sizes.
Our results show that the evolution is not necessarily straightforward, however. Figure 6(e) shows the total mass in f across the week simulated, for the larger step size for both Neumann and Dirichlet outer boundary conditions (the comparison against the default value can be found in Figs. S3 and S4). As expected, the total number of particles changes very little for a zero flux (Neumann) outer boundary. However, the overall mass response changes when mass can flow across the outer boundary (Dirichlet experiments). We see that for a larger step size , the experiment starts to gain mass at around 80 h. From the mass change terms in Fig. 6(f), this is clearly from the outer boundary, when the distribution drops below the fixed outer boundary point . At this point, the gradient will become positive, and material will flow into the domain. Inner boundary loss varies with B but is again independent of the outer boundary condition.
The norm is much higher for a larger step, and the norm reduces over several orders of magnitude [see Fig. 6(g)]. The Dirichlet run started out losing more energy than Neumann (as we also see using default settings above) but later in the week, energy loss drops off and the Neumann case has lower energy (and is, therefore, closer to a point where the dynamics have stopped changing). This is because there is an increase in the norm ( ) with the reversed outer boundary flow; however, the corresponding energy change term reaches a comparable magnitude to the dominant reconfiguration term . Therefore, the total change for the Dirichlet case with becomes very small, while the Neumann case is still reducing in norm because the PSD can be reconfigured (diffused).
Physically, this means that with a higher step, we are finding that the Neumann case reaches a lower energy state by the end of the week than the Dirichlet experiment, unlike our default settings Fig. 5. For the Dirichlet case, the constant outer boundary value is higher than the peak, allowing material to come in through the outer boundary. This experiment is in a state where the main dynamic is a constant churn of mass coming in and then being diffused to lower L to reduce the entire distribution to a lower energy state. Physically, this corresponds to an infinite source at the outer boundary if we were to run this indefinitely. See Sec. VI for our overall conclusions on more suitable outer boundary settings.
Enhancement location : The Gaussian in the first term of Eq. (10) corresponds to an enhancement, and corresponds to the location of this enhancement in L.
Considering across a variety of enhancement locations in Fig. 7, we find that monotonicity is reached more quickly for higher for both outer boundary conditions, although the shape of the dependence is seen to be very different in Figs. 7(c) and 7(d). A sooner for higher makes sense as will be larger at higher L, so when the peak is located at higher L, diffusion to flatten this peak happens more quickly. Again, far more runs reach monotonicity with a Neumann outer boundary.
Figures 7(a) and 7(b) show us the general relationship between each parameter and , while (c) and (d) show us the specifics of each relationship. We see that the relationships between and for each Kp are quite different for the different outer boundary conditions. This disparity could be due to the different mechanism to the right of the enhancement; a zero flux outer boundary condition allows the plateau to rise and reach monotonicity to the right of the peak, while the same region with a constant value outer boundary cannot reach monotonicity until the enhancement has completely diffused, although material can be lost through the boundary.
The bottom two rows of Fig. 7 compare the evolution of across the simulation for the default value of to a more distant peak at . The total number of particles is higher when is lower, which is as expected since the step extends further inwards. There is little change in mass with for Neumann runs, also as expected. There appears to be different amounts of mass lost in Dirichlet simulations, so we consider the outer and inner boundary particles losses and in (f). These are always negative (particles are only lost, not gained) but look quite different. The outer boundary loss dominates for both values of . With a higher-L enhancement, the inner boundary loss is less. Therefore, while the outer boundary loss is comparable, the higher the , the more strongly that outer boundary loss dominates over the loss from the inner boundary. Figure 7(g) shows the evolution of . Lower values of (enhancements at lower L) actually result in higher norms, because you have more PSD total (for the same reason as the mass above) and more of this mass is at lower L. In both cases, the Neumann experiments reach a configuration where is very small and stops changing. The Dirichlet experiment with a higher-L enhancement rapidly loses but by the end of the week, is no longer changing much. In (h), we can see the terms of for each experiment. For readibility, we show only the terms for here. The reconfiguration energy loss ( ) evens out more quickly for Neumann experiment with than with the standard initial condition, presumably because it is easier for the step to rise up and plateau in a monotonic state. For the Dirichlet experiment, the reconfiguration term dominates, but rapidly drops off until it is comparable with energy loss at the outer boundary.
We find that an initial distribution with more material at high L (i.e., step size B) and with an enhancement at high L (i.e., ) diffuse more quickly. B and are the most significant initial conditions, yet the specific evolution of the system varies depending on interaction with boundary conditions [P1b].
3. Minor properties of the initial condition ( )
Enhancement width : Figures 8(a) and 8(b) show that for both Neumann and Dirichlet boundaries, is reached more quickly for narrower peaks, i.e., for smaller values of . This difference is only slight. There are two aspects at work here; a wider will have access to larger diffusion rates at high L, but the gradient will be less steep. We can examine these components using and to determine which is more significant.
An enhancement across more L has somewhat more mass and appears to lose mass more quickly for both Neumann and Dirichlet outer boundary conditions. Despite the fact that experiments with a larger also start with higher , at the end of the week, 98% and 75% of the mass remains from the initial population (for Neumann and Dirichlet experiments, respectively), compared to 99% and 76% remaining from our default experiments. Both experiments with a wider enhancement lose more from the inner boundary [Fig. 8(d)], again showing that the initial condition controls more loss from the inner boundary than changes in the outer boundary condition. Loss from the outer boundary is slightly more with a larger , but is roughly comparable.
Total is higher for a larger , which makes sense as the experiment has more mass and hence larger ; the distribution f is further from a steady state. All with are slightly larger, but overall very similar. Unsurprisingly, with similar but a larger starting , experiments starting with a larger end up retaining proportionally more of the initial energy (37% and 58% for Dirichlet and Neumann, respectively, rather than 30% and 47% of the initial energy when using default ). Overall, with a wider , the Dirichlet experiment loses more , even though is slower. The results from the investigation of indicate that the trade-off between gradient in the PSD and the L-dependence of is subtle and nuanced (a wider enhancement has a less steep gradient but samples higher ).
4. Gradients vs the L-dependence of
The results suggest that we should compare the role of the spatial (i.e., L) dependence and the gradients in the distribution function on the overall amount of diffusion. We consider their role in reconfiguration term , since this generally dominates . With a constant diffusion coefficient, only the gradient term would contribute to the . With an L-dependent , both and will contribute to . We simply compare the order of magnitude of these components via the ratio' of to for , shown in Fig. 9. Overwhelmingly, it is the gradient component that dominates. This is unsurprising once one considers that s−1.
Throughout this analysis, we have found that one must consider the whole domain; for example, the amount of diffusion is not limited by the smallest but also the shape of the distribution, loss, the choice of domain, etc. This is because is the dominant component of , which determines the PSD evolution. is an integral over the entire simulation domain.
Figure 9 indicates that the gradients have more impact than the L-dependence of the diffusion coefficient. However, with a longer spatial (L) domain, this will begin to change, particularly for a Neumann (zero flux) outer boundary, where there are fewer gradients. Using an idealized , gradients almost always dominate over the effect of increasing with L. This will be why wider enhancements (larger ) take longer to reach monotonicity when using our operational (Ozeke) : there are consequently shallower gradients [S1e].
We find that the PSD gradient contributes more to the evolution of the system than the diffusion coefficient . Note that this idealized scaling was chosen to be comparable with the Ozeke diffusion coefficient at a given L and Kp; there are many other models, often more sophisticated, yet all have a significant L-dependence. This is discussed further in Sec. VI E [A2].
5. Outer edge of domain,
In this experiment, we varied the domain for the simulation to see what difference it made. A Dirichlet (fixed value) condition is used in the majority of operational radiation belt models, to reflect observations e.g., Refs. 24, 55, and 56. The simulation domain is curtailed to the location of the spacecraft; different values then correspond to using data from different spacecraft missions to set this outer boundary. Both types of outer boundary condition are investigated in this phase of experiments, and the Dirichlet experiments retain the outer boundary value fixed in the initial condition.
For a Neumann outer boundary condition, a more distant outer boundary (larger ) took longer to reach monotonicity; i.e., a smaller domain reached monotonicity more quickly [Fig. 10(a)]. This suggests that the choice of outer boundary location changes the shape of the PSD distribution, especially the height of the plateau. For Dirichlet conditions, was independent of domain size.
To investigate this, we compare two Neumann runs where we vary the outer boundary location to be and 7.5. (Equivalent plots for Dirichlet can be found in the supplementary material, Fig. S9). Figure 11 shows the PSD distribution at eight equally distant times throughout the week, with Kp = 4 and using Ozeke .
The difference in PSD between peak, and plateau edge (both indicated with dotted lines) by the end of the week is much larger for the run with a wider domain, in Fig. 11(b). The effect of this can be seen more clearly by considering the PSD at the final time step, in Fig. 11(c). The run with is more close to monotonic because the value of the outer boundary has raised higher. Despite the larger at high L, more reconfiguration of the distribution is needed for a longer domain governed by the same equation and as was seen in Sec. V A 4, the gradients are still more important than the up to . Although the material in the peak being diffused outwards can be spread across more L when there is a more distant and large values at high L encourage this, the material has to travel farther before the plateau rises and monotonicity is reached. The Neumann dependence on domain (i.e., on arises because the shape of the distribution varies with changes in the simulation domain).
Analysis of in Fig. 12(b) indicates that neither the outer boundary location or condition affects loss from the inner boundary. For Dirichlet experiments with varying , mass loss from the outer boundary is of comparable order within a few hours, regardless of where that outer boundary is. This corroborates findings in Sec. V A 4 that the gradients in the PSD distribution dominate diffusion over the entire domain, rather than higher diffusion coefficients located in one region. While the extra mass at time was obvious for a longer domain, the change in initial is negligible, as can be seen in Fig. 12(c). However, the evolution of the norm is nuanced; despite having terms of similar order [Fig. 12(d)], with a longer domain, the Neumann experiments reaches a lower level of , while the Dirichlet experiment has a greater value of .
The mechanism behind these results is, unsurprisingly, the rising plateau for Neumann and the outer boundary flux for Dirichlet experiments. In order for Neumann experiments to reach a configuration of lower , the high-L plateau rises. With a longer domain, this means that there are more particles at high L; hence, can be lower than with the same number of particles at a lower L. Additionally, is slightly larger for than 6.5, which can be attributed to outward radial diffusion due to the longer domain and higher at . For the Dirichlet case, the reconfiguration energy change is almost exactly the same. However, more is lost to the outer boundary for than 6.5. Even though they lose roughly the same each hour after 70 h, this is enough to make slightly lower for
Using , we find that the choice of outer boundary location changes the shape of the PSD distribution for Neumann; in exactly the same manner, varies depending on the outer boundary location. For different outer boundary conditions, a different outer boundary location could result in heading faster or slower to a state where the dynamics are minimally changing (i.e., to minimum ). In Sec. VI, we discuss outer boundary choices, including the applicability of using a Dirichlet outer boundary that is fixed, but not using observations [P1c].
B. Results part 2: Including loss rate
Loss from pitch angle scattering is significant and should be included to see how it relates to the timescale of radial diffusion. We include this loss by modeling the electron lifetime.
1. The difference between the two outer boundary conditions, with loss
In general, with loss it is no longer always true that more Neumann experiments reach monotonicity; nevertheless, they still have shorter than Dirichlet experiments. All the plots can be found in supplementary material; again we select the results that inform us about the overall pattern.
Figure 13 show experiments with pitch angle loss for the default initial condition. Neumann and Dirichlet runs look very similar, suggesting that pitch angle loss may control more of the dynamics than the outer boundary condition. The analytic quantities for these runs confirms this; Loss from the pitch angle scattering approximation dominates over loss from the outer or inner boundary Fig. 14(b), and the evolution of with loss looks similar regardless of outer boundary condition, unlike the runs without loss [Fig. 14(a)]. The experiments including pitch angle loss are heading much more quickly toward a steady state, and after around a hundred hours the experiments are not changing much in , as .
is more complex to analyze. Over the span of the week, enough mass is lost that the Dirichlet run begins to gain mass. Just as without loss, particles entering the domain are contributing to an increase in , which results in a final that is higher for Dirichlet than Neumann. As a result, the Neumann case is closer to a steady state. Loss from pitch angle scattering effectively mimics the reduction in PSD that would occur over a very long timescale of radial diffusion, as it is strongest near the edge of the plasmapause (which is located around the bulk of the enhancement). With a constant inflow of particles, Dirichlet runs have a higher but do not come closer to monotonicity. Electron lifetime loss is creating a new local minimum in the PSD, and so the distribution is not monotonic. To demonstrate what is happening in Fig. 14 for both Neumann and Dirichlet runs, the diagram in Fig. 15(a) shows an example phase space density distribution with this additional minimum. This physical profile is corroborated by the fact that and therefore loss dominates over reconfiguration in the system evolution. We will expand on the consequences of this below. Finally, there is a difference in inner boundary flux; all experiments including lifetime loss have the same, lower flux [S1b;S4].
2. When can never be reached: How affects monotonicity
Although experiments with loss that reach monotonicity do so quicker than they did without , for all parameters there are several initial values that reached monotonicity without loss but no longer do once loss is included, usually at lower Kp (i.e., weaker radial diffusion relative to the same loss). The reverse is rarely true; only for narrow (e.g., 0.2, 0.25) and is a found with where none was found before. In this section, we explain the physical mechanism behind this general pattern; again, all individual plots can be found in the supplementary material, Fig. S5.
The existence of experimental setups that will never reach monotonicity is most clearly demonstrated by following a case where [Fig. 15(b)]. There will be loss everywhere left of . In the region , if loss is strong enough, it can work against monotonicity by creating a minimum. The second row of Fig. 15 demonstrates how this case can evolve for Neumann and Dirichlet outer boundary conditions, using default initial conditions, the same loss with the default plasmapause at and in the diffusion coefficients. The Neumann and Dirichlet [Figs. 15(c) and 15(d), respectively] experiments show that even when the distribution is much reduced (over 24 and 72 h, respectively), it is not monotonic [Note that where the loss does not dominate over radial diffusion, then instead the effect of the loss is for the overall distribution to reduce more quickly, but still maintain the characteristic diffusion distribution (e.g., the intermediate distribution in Fig. 2(b))] [S3a;S3b].
3. Loss affects the evolution of diffusion more than most properties of the initial condition
In general, initial conditions impact the diffusion in a similar manner to without loss, for example, varies significantly with . In this section, variation of each parameter is compared to the case without loss. Again, we select the most significant results here, while all the figures can be found in the supplementary material Figs. S5–S8. In Sec. V A, each was normalized using initial values for the phase space density using all default parameter values for the initial condition. To compare the effect of loss, we instead choose normalization values from the initial phase space density of the higher parameter value in each experiment pair, e.g., we normalize using the initial mass and energy density with rather than , rather than , etc. This normalization was chosen to ensure that the effect of each parameter is extracted, rather than repeating experiments comparing the default initial condition with and without loss, which was explored in Secs. V B 1 and V B 2 [P1b;S3a;S4].
Step B: Just as without loss, a higher step size B results in a shorter as the distribution is already closer to monotonic. However, this effect is much smaller. As can be seen in Figs. 16(a) and 16(b), the relationship to changing B is still exponential, but stops changing once . In fact, a step this large quickly results in gains from the outer boundary; in Figs. 16(d) and 16(f), and for experiments with loss are always positive. The Dirichlet experiment also finds that at around 72 h, the reconfiguration term and outer boundary terms dominate over the loss term for the norm, . As a result, the high-B Dirichlet experiment reaches a state where the constant churn of material being brought into the domain and diffused inwards dominates over the loss from pitch angle scattering. The Neumann experiment obviously does not experience this. Because the Dirichlet simulation has quickly reached a point where the dynamics are no longer changing (due to the large fixed value of ), the Neumann and Dirichlet experiments diverge in even though in general, the outer boundary condition has less effect than loss.
Enhancement location : As without loss, a higher means that is reached more quickly. [We also note the minor effect from Fig. S8(f) that for the first few hours, the loss experiments actually find that is dominated by reconfiguration —this will be because the enhancement is at higher L so more diffusion is possible. Then, loss becomes dominant—for Neumann with loss, the reconfiguration and loss contributions to become comparable after around 70 h. “Reconfiguration” is not as strong as without loss.]
Amplitude A: There is no change with overall amplitude parameter A—the same as without loss. We see the same general patterns in as discussed in Sec. V A 3.
Enhancement width : Again, a narrower width (i.e., lower ) reaches quicker, the same as without loss. From analysis [shown in supplementary material, Figs. S7(g), S7(h), S8(g), and S8(h)] we find the same overall results as discussed in Sec. V B 1. With loss, the Neumann and Dirichlet runs are more similar to each other than to the runs without loss. With loss, less is lost through the inner boundary.
4. Outer edge of domain,
Without loss, we found that varied when we changed the location of a Neumann outer boundary condition, but not a Dirichlet condition. With loss, we find that dependence on is very small for both a Neumann boundary [Fig. 17(a)] and a Dirichlet boundary, particularly when the domain boundary and the plasmapause are close [Fig. 17(b)][P1c].
5. Plasmapause location
When including loss, the extent of the lossy region is a new parameter to consider. The outer limit of this region is the plasmapause, . By default our simulations have , which is higher in L than the default peak location, . Figures 18(a) and 18(b) show time to monotonicity across a variety of plasmapause locations. The effect for a Dirichlet outer boundary condition is small, with an effect for the lowest plasmapause values (for which is sooner), which quickly drops off to reach a value of that no longer changes with . The effect of plasmapause location with Neumann outer boundary condition is more complex; overall, a plasmapause closer to the Earth reaches monotonicity sooner. However, there is a cutoff in L after which monotonicity is not reached at all; our experiments place this around . This is unlikely to be a “hard” cutoff but instead due to interactions with and . These relationships are explored in the following paragraphs.
In Sec. V B 2, we noted that could result in the PSD being unable to reach a monotonic state. This relationship is explored by using three plasmapause locations in the analysis: These results are shown in the final three rows of Fig. 18.
As the default enhancement location is , a plasmapause at is at a lower L than the enhancement. Figure 18(e) indicates that material lost from pitch angle scattering is quickly similar to loss from the outer boundary, and Fig. 18(h) demonstrates that is dominated by the reconfiguration term (although this becomes comparable to for the Neumann experiment by the end of the week). With less overall loss, the Neumann and Dirichlet experiments still have distinct values of .
For a plasmapause at or 6, much more material is lost, as expected since the proportion of particles lost increases with L. The norm for is quickly very low [Fig. 18(d)]. For , the Neumann experiment reaches a lower energy state, while for the Dirichlet experiment has a lower norm. This is due to more material coming in the outer boundary with a higher .
The relative location of peak and plasmapause is the determiners of whether a local minimum is at all possible, while details of , , and combine to see whether it occurs in a given simulation. For example, despite the entire enhancement lying inside the plasmapause for , we see that the local minimum in Fig. 19(e) is very small, even though a local minimum very clearly occurs when the peak is just inside the plasmapause (e.g., , Fig. 19 row 2). In this case, the loss from pitch angle scattering results in the peak rapidly moving inwards, and a local PSD minimum arising between and [Figs. 19(c) and 19(d)]. Peak width may change the rate at which a local minimum arises (e.g., a wider enhancement could mean a higher PSD to lose before reaching a local minimum, while a narrower enhancement could mean that diffusion occurs more quickly, offsetting the pitch angle loss), but it is the relative location of and that determine whether this is possible.
For a higher plasmapause location, becomes increasingly larger. When , diffusion cannot prevent the formation of the extra minimum demonstrated in Sec. V B 2. Indeed, this is the relationship observed when plotting several intermediate time instances of the experiments with , shown in Fig. 19. For , this minimum does not form (first row of Fig. 19) while the minimum is larger as increases (Fig. 19 second and third rows).
Note that since depends on the gradient of the PSD, the growth of this minima (i.e., when ) is dependent on the initial conditions. Loss may or may not dominate the evolution of the system over reconfiguration. Overall, our results emphasize that the plasmapause location relative to the enhancement is important and that a more distant plasmapause has so much more loss that this can totally change the dynamics [P1b; S1b; S2; S4].
C. Summary of results: The role of the initial condition
Monotonicity was easier to obtain when there was already a significant background PSD, corresponding to a large high-L source (i.e., high B). Therefore, corresponds to the timescale of radially diffusing a local enhancement, with respect to the background PSD (the background particle population). If the enhancement location occurs at high L, it does not last as long before the distribution becomes monotonic, although including loss from pitch angle scattering means that the relative location of the peak and plasmapause strongly interact to affect . Furthermore, a narrower enhancement will be reduced more quickly than a wider one and using , we attributed this to the gradients of the PSD, which will be discussed in Sec. VI E. Less intuitively, we found that the time for our enhancement to “fade” into the background varies significantly with the outer boundary location if we used a Neumann boundary. This is a key result, and we explore the consequences of this and future avenues in Sec. VI D. Finally, we found that loss from pitch angle scattering has a strong effect on the final shape of the distribution (and therefore the timescale for radial diffusion). Indeed, some numerical experiments never reach monotonicity, casting some questions about future appropriateness of , which we discuss in Sec. VI A [P1b;S1b;S2;S3a;A2].
Using the evolution of , we could work out what processes were going on, and from we could work out how the system was evolving to reach a maximally diffused state. We found that although theoretically an experiment with a Neumann OBC can reach a state with a lower norm (i.e., zero everywhere), the majority of the time Dirichlet experiments were reaching a state of lower , because mass could be lost from both boundaries. The Neumann experiments were reaching a state where ; where the dynamics were changing very little once the plateau had risen up, while the Dirichlet experiments were still diffusing material from the enhancement by the end of the week [S1b;A1].
Other specific results from the use of include the importance of gradients in the distribution; using we can estimate the comparative effect of the dependence of and the gradients in the distribution on the evolution of f. We found that gradients are more significant, a result worthy of its own discussion section Sec. VI E. We can also see in Fig. 12(c) that although using a different domain (i.e., a different ) does not significantly impact the starting value of , it does affect the rate at which the system moves toward a steady state (more in Secs. VI D and VI B). Finally, we can quantify that loss from pitch-angle scattering has a stronger effect on the evolution of f than radial diffusion does [S4;A2].
VI. DISCUSSION: IMPLICATIONS FOR FUTURE WORK
Understanding of the research goals, and the context of our results, has developed throughout the research lifetime of this work. The initial questions shaped the methodology and investigation of the results shaped the narrative of the analysis. Here, we put the results into context with existing literature and with future modeling choices. To aid navigation, relevant paragraphs are labeled with our initial research questions in Sec. III. Each research goal may be addressed in multiple paragraphs.
A. Evaluation of and as analysis tools
Error metrics such as the log-accuracy ratio57 are often the first tools considered for comparing distributions; this would be a suitable method to compare the deviation between two phase space density (PSD) distributions f, such as between observations and models (although weighting by the Jacobian may be necessary, as it has been here). When no “truth” is available to compare against, error metrics become an unsuitable tool as it would require a threshold (e.g., when two distributions are “close enough,” or when radial diffusion is “done”), which would be difficult to motivate objectively. In this section, we discuss the performance of tools (time for PSD reach a monotonic state), [mass density, Eq. (11)], and [Eq. (12), energy density, or “distance from zero state”] when comparing distributions and how the distribution morphology became so integral to our analysis.
and are found to be complementary measures of the shape and evolution of the distribution. They are related, as the state of monotonicity and the state of lowest possible both have PSDs predominantly weighted toward the outer edge of the domain. is a state where the only existing gradients are to the left hand side of the enhancement. Indeed, particularly penalizes high f at low L [e.g., at results in a higher (1/16) than at (1/25)]. (Note, however, that a monotonic distribution can be gaining particles from the outer boundary and, therefore, be increasing in ). Despite the similarity between low- and monotonic states, evolution is determined by ; the steady state reached may not be the lowest configuration possible. is in turn usually dominated by reconfiguration term , which is an integral across L that is also dependent on L. This term tells us that gradients in f (and their location in L) are the strongest factors dominating . As a result, the distribution will change more rapidly toward the low- , monotonic-like state when there are steeper gradients, and when those gradients are situated at low L.
The dominance of consequently suggests that quantities capturing the shape of the distribution and the location of features in it are going to be the most useful tools when analyzing the output - as we have found. We conclude that measures using the distribution morphology across the entire domain are necessary to capture the system evolution. Indeed, the outer boundary condition affects the monotonicity of the final state. The shape of the distribution and the ongoing evolution change with the location and condition of the outer boundary. The act of “reducing gradients” means that in general, Neumann experiments reach monotonicity quickly, while the steep gradients remaining in the Dirichlet experiments means that they are actually at lower energy states and still changing more rapidly (losing more norm and mass) toward a steady state.
We suggest that we finally settled on this pair of tools because they are (a) both domain dependent and (b) include information about the morphology of the PSD distribution. Both tell us about the entire system, i.e., tells us about the system across all L at each time step; inform us about the ongoing evolution of the system at each time step, and is a measure of the long-term evolution. Initially, we searched for domain in-dependent measures; but as the diffusion coefficient, is domain dependent, so should be our method of analysis. Both tools relate, predominantly, to gradients. The existence of steep gradients is the largest contributor to a rapidly changing ; a distribution rapidly heading toward a steady state. A moderate limitation of is that one can only use idealized diffusion coefficients rather than the empirical ones from Ref. 39 used in calculating . Unfortunately, a limitation of is that some parameter combinations never reach a monotonic state, i.e., once loss is included at higher L values (see Sec. V B 2 and Fig. 19). The extra minimum in these cases suggest that we may need to characterize by convexity (i.e., number and location of extrema) rather than simply monotonicity and . Nevertheless, we found to be an extremely useful measure to compare the different experiments, which could be related back to radiation belt processes, to a “quiet” state and to quantifiable properties of the PSD f. Using both and together gives us a nuanced description of timescale (Sec. VI B). These quantities allowed us a clear and efficient means of comparing experiments and understanding the processes behind the evolution of each [S3a;S3b].
B. Timescale for radial diffusion
A radial diffusion “timescale” is poorly defined when depends on L; one cannot simply take as the characteristic timescale. This is an open question both for idealized models and the real radiation belts, because of the number of underlying approximations. Potential measures of timescale include the autocorrelation or dimensional analysis of self-similar solutions (e.g., Ref. 32) Unfortunately, self-similar timescales are difficult to find for the 1D radial diffusion equation Eq. (1) with diffusion coefficients Eq. (2) or Eq. (5) because of the high order of L. Typically, one makes “by eye” judgments of storm time radiation belt simulations. One could numerically solve for the equilibrium solution and then find the time taken to reach this (e.g., Ref. 58), but this will also depend on the metric or threshold one chooses to define as close enough to this equilibrium solution. Alternatively, suggestions were to take either the minimum or maximum in the system, as this would at least give you a “fastest possible” timescale (various personal communications). We have explored methods of quantifying timescale using our tools.
Using , the concept of timescale reduces to “how long until an enhancement is diffused away.” The results in Sec. V A, therefore, include how the initial condition (e.g., size and location of enhancement) relates to timescale without loss. There is a large variability with Kp , etc. From at a Kp of 4–5, we conclude that the timescale is on the order of days, or tens of hours (for all the plots together, see Fig. S1). Some combinations of initial conditions will keep close to one day, others closer to a week. We show some example timescales using each of these measures in Table I. Based on these results, we conclude that time to monotonicity is a more representative measure of timescale, but has practical limitations based on the outer boundary and on the fact that not all distributions can reach a monotonic distribution. This definition is not particularly suited once one considers loss from pitch angle scattering; although the timescale drops to hours or days, monotonicity is no longer guaranteed it is a poor indicator of timescale [P2a,b,c].
. | Min. . | Max. . | (N) . | (D) . |
---|---|---|---|---|
9 h | 18 668 h | 40 h | ⋯ | |
0.2 s | 18 668 h | 75 h | ⋯ | |
(loss) | 9 h | 18 668 h | 24 h | 92 h |
(loss) | 0.2 s | 18 668 h | 39 h | 111 h |
. | Min. . | Max. . | (N) . | (D) . |
---|---|---|---|---|
9 h | 18 668 h | 40 h | ⋯ | |
0.2 s | 18 668 h | 75 h | ⋯ | |
(loss) | 9 h | 18 668 h | 24 h | 92 h |
(loss) | 0.2 s | 18 668 h | 39 h | 111 h |
is even less suited to extracting a timescale as one would need to set a threshold. Nevertheless, we can draw some qualitative conclusions. Without loss, within a few days we see that the Neumann ensembles for all initial conditions have a relatively flat —the dynamics are not changing. On the other hand, most Dirichlet experiments are still reducing in at this point. Obviously this timescale is difficult to use as it has such a strong dependence on the outer boundary condition—which reflects the importance of the outer boundary condition. With loss, we see that initial conditions matter far less; with 72 h, most ensemble members have reached a very low energy, even though they are still losing mass; the timescale of the dynamic processes is only a few days before becoming “quiet” [P2a].
Our measures of timescales tell us (a) when the enhancement is diffused to the background via (although needs adapting for the case with loss) and (b) whether the dynamics are still changing (i.e., whether there are rapidly changing gradients being diffused away) via . We find that pitch angle loss generally dominates over the radial diffusion timescale (and note that in practice, pitch angle scattering timescales also vary with L and energy58).
As a result, from our work, we can suggest only that the radial diffusion timescale is on a scale of hours to days, depending on the parameters one uses (e.g., Kp, etc.). We note that using either the minimum or maximum value of is not a good indicator of timescale, for two reasons: (a) our conclusion that one should consider the entire domain, and (b) our result that gradients in the PSD affect the evolution of the system more than the value of , at least for a reasonable activity magnetosphere ( ) and up to . The definition of timescale is also still poorly defined and should be specified for a given purpose in order for any quantitative conclusions to be reached, e.g., time for an enhancement to be diffused away, for the radiation belts to drop below a certain energy, or return to a specific state. One interesting potential timescale would be the time taken for loss or reconfiguration terms ( , ) to dominate evolution (i.e., to be the dominant term of ) [P2a;P2b;A3].
C. Ensembles for radiation belt modeling
The goal of ensemble modeling needs to be more carefully specified, before a method of comparing ensemble members can be analyzed. For example, an error metric would work to compare variation from a “truth” (e.g., observation) or from a baseline forecast (i.e., before one creates ensemble members by varying parameterizations). We make several observations on ensemble modeling for future use.
A simple use of ensemble modeling would be to sample unknown quantities. Our results suggest that this would need to be done carefully to avoid bias; for example, sampling a range of values would err on the of faster diffusion and earlier monotonicity, because does not change linearly with . Furthermore, one should be wary of determining an “average” from an ensemble, simply averaging PSD values at each L across many ensemble members would not be meaningful when one needs to consider the whole domain first. This domain dependence raises a further problem; given an L-dependent , there is no clear way to compare simulations with a different outer boundary location or condition.
Finally, we note that one possible goal for ensemble modeling could be to characterize the influence of chaotic or stochastic processes. This would need to be carefully thought through, using the inherent properties of such a complex system; simply sampling from the initial conditions above (or from similar values of 5) is not likely to represent either a chaotic or stochastic underlying nature, but only to reinforce any bias toward higher amounts of diffusion. Poorly defined averaging of underlying properties may be why existing diffusion coefficients vary drastically.42
In Sec. IV B, we motivated by considering the properties required of a metric to analyze our ensemble. We note that our additional, analytic tools meet the proposed initial requirements (i.e., excluding the requirement for insensitivity to the total particle population) and suggest that these requirements may be useful in finding other qualitative tools for analyzing ensembles [A1].
We conclude that and are good tools for qualitatively understanding what an ensemble is doing, but not necessarily a good tool for comparing ensemble members for modeling or forecasting purpose [P1a;S3b].
D. The simulation outer boundary
We used both Neumann and Dirichlet outer boundary conditions as both have physical motivations. We tested multiple options as the true radiation belt outer boundary is both (a) poorly defined and (b) poorly represented in current models. The radiation belt outer boundary is poorly defined because particles do not stop existing beyond the last closed drift shell, making the last closed drift shell difficult to identify, and because the last closed orbits themselves are not clearly defined; they may be “split” by drift-orbit bifurcations.59,78 The outer boundary can be poorly represented in current models for different reasons, including that this outer boundary changes in time, and that for practical reasons the outer boundary is often placed where observations exist in order to drive that outer boundary. Operational geostationary satellites, such as GOES, have good coverage around the Earth and for many years and so are very practical for outer boundary conditions, e.g., Refs. 55 and 56. See Ref. 56 and references therein for other examples of models using in situ spacecraft to drive the outer boundary. Unfortunately, GOES is situated around , far short of the true outer boundary, which can vary considerably but is statistically placed at . These constraints are considered in more detail later in this section, with respect to our results. We have found that both the outer boundary location and condition affects the evolution of the system and the final PSD distribution after a week. The outer boundary condition used in our experiments to identify this effect is very idealized; here, we discuss what impacts this may have on outer boundaries used in practice.
showed a clear difference between Neumann and Dirichlet conditions; Neumann conditions reached monotonicity first. also varied with the choice of , especially for a Neumann boundary without loss from pitch angle scattering but also for a Dirichlet condition with loss, when the plasmapause is nearby. Using , we also found differences with both outer boundary location and condition. We expected a different long term solution for Neumann and Dirichlet conditions and found that evolution toward these steady states was very different (i.e., Neumann could reach a lower- state, but Dirichlet lost more quickly, heading toward their steady state more rapidly). While the initial did not change significantly with , the evolution of did vary; a smaller (and hence a shorter domain) had that diverged more with time between Neumann and Dirichlet outer boundary conditions. Loss mitigated, but did not remove, the differences between simulations with outer boundary location and condition. We conclude that if these options give different dynamics and different PSD distributions over the week, then we need to find the correct boundary conditions [P1c,S1a,S1b].
Typically, models of the outer radiation belt use a Dirichlet outer boundary to make use of spacecraft observations. These are at positions well short of the true outer edge of the radiation belt, for example, they may be curtailed to 23 or extrapolated to higher L values. (This is also close to the typical plasmapause location, which interacts with the outer boundary location and condition.)24,60 Unfortunately, most missions are limited to a few years; for a consistent set of flux observations, one must use geostationary data, for example GOES, observed daily between and 6.4 but mapped to a constant L to enable modeling.55 While at first this appears to be more physical than the idealized boundary conditions tested in this work, we have identified that just the inclusion of observations does not remove all the problems associated with the outer boundary [S1d].
Either outer boundary condition, imposed incorrectly, can correspond to erroneous sinks or sources. For example, in our experiments, a Dirichlet condition where the outer boundary value is higher than in the bulk of the simulation domain represents an infinite source of material. Since our Neumann boundary experiments have a plateau that can freely rise, this actually represents an increased source at the outer boundary, which is also not physical. Using a data-driven outer boundary (such as used in most real-life models) would reduce the imposition of unphysical sinks and sources, but we have shown that it does not resolve the problem as these boundaries are placed at a low that curtails the domain, resulting in different evolution (as we are removing the stronger high-L diffusion). The effect of this curtailment changes with outer boundary condition [see, e.g., the diverging in Figs. 12(c) and 12(e)]. From Fig. 17, it is clear that once one includes loss from pitch angle scattering, the evolution differences between outer boundary conditions are reduced, but the shape of the distribution still changes with , as is not independent of . Using an outer boundary location determined by the different trajectories of different spacecraft is not ideal - the model corresponding to each spacecraft would then have a different , and each of these models would respond differently, reaching monotonicity at different times, because outward radial diffusion is not being properly captured.[P1c,S1b]
Would using observations at every time step sufficiently constrain the simulation so as to remove the variability we see when changing and the outer boundary condition? This is not clear. It may be that using observations approximates the missing outer boundary processes well enough; results from Ref. 56 using both Dirichlet and Neumann outer boundaries at several outer boundary locations show that using GOES data are clearly superior to a Neumann outer boundary to capture long-term behavior. Our results suggest that the poor performance of the Neumann boundary in Ref. 56 represent inadequate characterizations of the constraining physics and subsequently unphysical PSD behavior (i.e., a rising plateau). However, while relying on observations to correct improper boundary conditions may perform well for event reproduction (i.e., hindcasting), observations are naturally not available for the future. This may limit how far in advance we can model radiation belt behavior. Neither Neumann nor Dirichlet boundary conditions are suitable for predictive purpose. Ideally we would not have to pick between these when we do not have clear values with which to constrain this outer boundary (as is the case here). Currently, we are reduced to comparing empirically which simulation settings account for variation across more orders of magnitude, rather than solely using physical motivations to understand the resulting uncertainty [S1d].
Options would, therefore, be to used mixed (Robin) boundaries that relate the flux from the domain to the outer boundary values, and/or some way to include the observations in the middle of the domain such as data assimilation or source terms. Methods such as these are already under investigation to resolve the fact that current modeling misses processes such as dropouts61,62 [S1a,S1c].
However, even a Robin boundary condition would not fix the fact that a true outer boundary location is both difficult to define and difficult to find. One could find the empirical extent of the highly-charged particles. However, this is not the same as the last closed drift shell (LCDS), which is the outer limit of adiabatically trapped particles. Particles within a closed drift shell will continue to drift on their path (of constant magnetic field) around the Earth. However, if a drift shell is open then sections of the drift path lie on open field lines outside the magnetosphere. Here, particles can be lost to the solar wind. The last closed drift shell is the last point in which particles are trapped rather than being lost to the solar wind. Therefore, the LCDS could be considered the “true” edge of the diffusion domain (but not the edge of the particle population) [S1a].
There is significant uncertainty in the location of the LCDS as modeled using state-of-the-art global magnetosphere models (e.g., Ref. 63), and there is significant variability in the location of the LCDS (e.g., Ref. 51) due to motion of the magnetopause and spatiotemporal variability of the magnetic field in Earth's outer magnetosphere.51 estimated the typical outer boundary to be at - significantly more distant than models used in practice [S1d].
In this work, we have shown that in simulations, both the outer boundary condition and location change the rate of evolution and the shape of the phase space density distribution. We have argued that using a curtailed domain to set a Dirichlet outer boundary may remove these problems for historical event studies, but are not suitable for predictive purpose. Finally, as radial diffusion is about the diffusion of particles across different drift paths, radial diffusion is not well defined when drift paths are open. Therefore, a theoretical limit to the radiation belts is the last closed drift shell (LCDS). However, this is (a) a dynamic boundary, (b) difficult to identify in practice, (c) not a closed outer boundary (i.e., particles can be lost to or gained through it), and (d) still an approximation, as in reality the last closed drift orbits are not uniquely defined. It may be impossible to set a “true” outer boundary; in the meantime, we do not know what level of accuracy is needed in the outer boundary location and conditions to adequately reflect the radiation belts. We conclude that significant work is required to identify reasonable outer boundary conditions and location for modeling; the outer boundary in real life is very variable, and we have shown that simulations are sensitive to several outer boundary choices [S1a,c,d].
E. Improving vs improving PSD distribution
The overwhelming recent focus to improve radial diffusion is through the diffusion coefficient , for example the theory behind ,26 the strength of the electromagnetic perturbations driving (both generally,64 or for event specific diffusion coefficients43,65) parameterizations of for use operationally,42 the effect of plumes on diffusion coefficients,66 or even radial diffusion vs radial transport.67 However, our results suggest that the gradients of the underlying phase space density distribution may have more effect on the radial diffusion. This is an unexpected result to radiation belt modelers and will need to be tested further, for example using more complex diffusion coefficients and/or expanding from just radial diffusion to include other radiation belt dynamics [A2].
Comparatively less effort has gone into finding the underlying PSD distribution. Typically, this is built up on a case-by-case basis for event studies, or to specifically understand the mechanism behind enhancements, rather than as a foundation for radiation belt modeling. There are many difficulties with the construction of radial PSD profiles, particularly as these are made from observations which are sparse in time and space, often on scales much slower than enhancements.68,69 includes a statistical study and emphasizes the need to include error and uncertainty. Our results above (Sec. V A 4) indicate that it is the gradients in the radial profile, rather than the exact , that determines radial diffusion, to several orders of magnitude. We have not explored the extent of this result: if there is a limit to this with a stronger L dependence, a longer domain or the choice of boundary condition. Nevertheless, as radial diffusion is the large scale, bulk mechanism behind radiation belt evolution, and recent PSD profiles are shown to regularly contain gradients from enhancements,68,70,71 finding the radial PSD profiles may be a more valuable future route of study; the form of may be less important than getting the gradients of local enhancements right. If this is the case, then given that the uncertainty in is also of orders of magnitude (see discussion in Secs. I and II), improved s are still likely to result in improved radiation belt modeling. Either way, this work demonstrates that analytical tools based on the principles here could result in valuable ways to test radiation belt models and to quantify the impact of model components such as gradients and diffusion coefficients [P1a; A1]
F. The Role of the Plasmapause in Loss and Radial Diffusion
The plasmapause is already known to contribute to radiation belt dynamics, especially through wave–particle interactions faster than radial diffusion (i.e., affecting the first and second adiabatic invariants). Plasmaspheric structure is not typically considered a significant component to radial diffusion outside the loss due to pitch-angle scattering, which was the original expectation here. However, we have shown that although the loss from pitch-angle diffusion dominates over radial diffusion, the spatial limit (in L) of this loss has significant implications for the PSD evolution.
Furthermore, the plasmaspheric structure may can also affect the radial diffusion directly in a way that is not incorporated to radiation belt models today. Recent work has shown that ULF waves can vary in structure when plasmaspheric plumes arise, which will consequently affect on the radial diffusion.66,72,73 Our work has demonstrated that even a simple plasmapause interacts with the initial morphology of the PSD distribution (especially the location of the central peak of an enhancement) and the outer boundary of the simulation to produce very different final PSD profiles with different maxima. A more realistic scenario would include (a) a plasmapause varies azimuthally around the Earth, (b) plume structure instead of a single plasmapause and (c) increased radial diffusion inside that plume. Developing the interactions, we have identified here for this more realistic scenario may have significant implications for the evolution of electron PSD in the radiation belts in practice.
G. Limitations of our numerical experiments
The strengths and weaknesses of our study arise from the same principle: idealized experiments. By examining the fundamentals of radial diffusion modeling, we aimed to understand the results of ensembles. To do so, we made many simplifications, which we shall review.
Our experiments showed that the “background” initial condition used above should be skewed further to lower L, for example, the final monotonic distributions shown in Fig. 4, rather than the step-and-bump of our initial conditions. The “background” higher PSD at the outer boundary is assumed to be from substorms. Given that the inter-substorm time is relatively fast, with a mode of 3 h (Refs. 74 and 75), this “background” level is a reasonable initial condition. We also used a Gaussian to represent an enhancement. However, if the enhancements occur on the order of days, single events (and therefore “time for an enhancement to diffuse away”) should be replaced by compounded events.71
We did not explore the result of variance of the inner boundary. While the outer boundary condition and location have been thoroughly explored, they are unrealistic. They do not include observations as per most operational models and despite being able to physically motivate these settings initially, they essentially relate to an unphysical source or sink at the boundary. This poses several questions about how radiation belt modeling should be attempted in future, which are covered in Sec. VI D; a different methodology than ours would be needed to investigate these questions and to examine the relative effect of different driving conditions at the outer boundary, which was out of the scope of this investigation.
The radial diffusion equation Eq. (7) is well-known and widely used. Our results confirm that it may not be meaningful to consider radial diffusion without loss from pitch angle scattering. Our equation for loss is very simple, in order to match the number of parameters used in our experiments; more sophisticated parameterizations exist (e.g., Ref. 46) To keep our problem tractable, we also only considered a single value for the first and second adiabatic invariants. A downside of our simple loss equation is the lack of time dependence; unlike real life, the amount of particles lost does not vary, except when we set a different plasmapause location. Of course, exploring all these options is a difficult problem; in order for the problem to remain tractable, we chose not to vary these. We have also only included loss from the interior of the domain (precipitation). Loss across the magnetopause, also known as magnetopause shadowing, is necessary for realistic radiation belt models but again requires estimates of the radiation belt outer boundary.
A major choice in this investigation was the Ozeke and idealized versions of [Eqs. (2) and (5)]. Unfortunately, we could not use a single throughout the investigation; while the Ozeke model is a simple parameterization that is widely used operationally, it does not admit easy solutions for and . The parameterization of Kp is very rough and is a coarse average over many different processes that all affect radial diffusion (e.g., ULF waves, compressions, plumes). Nevertheless, both models are comparable to diffusion coefficients used elsewhere, given the dependence on , etc., multiplied by a proxy for the effect of electromagnetic perturbations, e.g., Refs. 29, 33, and 34. The assumption that some form of radial diffusion is always ongoing is a reasonable one as there is always some level of ULF waves; a summary of several other limitations of currently applied radial diffusion modeling can be found in Ref. 37. In this manuscript, we have only discussed limitations of the quasilinear techniques typically used, yet recent work indicates that current diffusive models cannot contain all radial mechanisms and nonlinear contributions to radial diffusion are needed.67
Despite the large number of idealizations required for this work, we nevertheless reached some general and significant conclusions that will inform future ensemble modeling and suggest that new directions are needed for radiation belt modeling. These are summarized below.
VII. CONCLUSION
We used numerical experiments to investigate how initial conditions and basic radial diffusion modeling techniques affect the final phase space density of radiation belt ensembles. As yet, there are a few standard methods of analyzing ensembles; hence, we explore some in this work. Despite the radial diffusion equation arising from the simple heat equation, the space and time dependence of the diffusion coefficient make analysis of this system rather complicated, even while space weather modeling demands are increasing. Our initial goals included defining a radial diffusion timescale and the effect of model settings across ensemble members, yet as part of this work we have identified significant questions in current practice for radiation belt modeling. To aid navigation of this paper, each goal has been explicitly stated in Sec. III and tagged throughout.
The key findings are summarized here and briefly expanded below:
-
Evolution of the system depended on the outer boundary condition and location. A shorter domain evolved at a different rate than a longer one; this will be due to the L dependence of . It is not clear what outer boundaries should be used and this may have consequences for modeling the radiation belts [P1c;S1].
-
Using an analytical quantity (the energy moment, or norm ), we found that the gradient of the phase space density distribution contributed more to the evolution of the system than the diffusion coefficient [A2].
-
Flaws in typical metrics included (a) threshold-based metrics which gave results highly threshold-dependent, and (b) the requirement of a Jacobian factor due to the co-ordinate system [P1a].
-
Time to monotonicity and mass/energy moments were developed to analyze radial diffusion models [P1a].
These metrics were appropriate because they consider the whole domain and are L dependent,
is intuitively interpretable as time for an enhancement to diffuse away [S2],
the system can be considered as continually moving to a low energy moment (or norm) state [A1],
could be easily adapted to other radiation belt models [S1].
-
Loss from pitch angle scattering generally dominated over radial diffusion [S4].
-
Taking an average over ensembles where the enhancement location varied in L would result in a PSD biased toward additional radial diffusion; linear increases in result in nonlinear decreases in . This analysis could be extended to find how mass and energy density vary across gradually changing ensemble members [P1b,S3].
The methodology of this study was to perturb simulations, selecting ensemble members based on sampling idealized conditions from Earth's radiation belts. Initial conditions were a “background” phase space density (PSD) distribution, plus a localized enhancement. These properties were varied, and their impact on the evolving PSD distribution assessed using time to monotonicity and two analytic mass- and energy-like quantities . Multiple physical outer boundary conditions and locations were tested, using both empirically fitted and idealized radial diffusion coefficients. Finally, loss from pitch angle scattering was included. Throughout this paper, the impact of all these factors on the PSD has been extracted. Here, we summarize significant findings and important discussions.
The final PSD, and evolution toward that state, varied with both the outer boundary condition and location. Both constant flux (Neumann) and fixed value (Dirichlet) outer boundaries can be physically motivated, although neither can be easily implemented to reflect real-life conditions. Current operational methods generally involve either a diffusion domain shortened to where observations are available, or extrapolations from this point to a distant outer boundary. We have shown that while observations could constrain simulation errors from a curtailed boundary for historical events, this is far from assured when modeling future behavior. The interplay of domain-dependent and outer boundary condition and location as an additional source of error has not been heretofore considered. We suggest several ways forward, including mixed boundaries and data assimilation. Our work also suggests that identifying an outer edge to the real radiation belts—currently poorly defined and difficult to identify from observations—will have significant implications for modeling. In particular, as modeling of the radiation belts becomes more realistic, the outer boundary location (and therefore the size of the domain) will vary more in time in our model [P1c;S1a;S1b;S1c;S1d;S2e].
The bulk of previous work on improving radial diffusion estimates has focused on finding and characterizing . By comparing the components of , which determines the ongoing evolution of the system, we find that it is instead the gradients of the PSD distribution that determine the ongoing amount of diffusion. This result was for an idealized 1-D (radial only) diffusion and idealized diffusion coefficients and needs to be examined in more realistic contexts. This result also emphasizes the need to integrate any comparative measures across the entire domain to capture the full impact of the L dependence of radial diffusion. Diffusion is not well characterized by either the largest or smallest in the domain [A2;A3].
Upon analysis of our measures and , we find that their effectiveness is partly due to their relationship to gradients and hence the morphology of the PSD distribution. Our two measures are related, yet complementary. Together they include information on both the system state and the ongoing diffusion. They are both interpretable; has a clear physical interpretation (how long until an enhancement has been completely diffused away) while the components of and describe the mass in the simulation and the ongoing evolution to a steady state, respectively. Using an idealized diffusion coefficient in enables an explicit comparison of the inner boundary, outer boundary, and loss terms on the system. A limitation of is the requirement of an idealized , while ideally should be adapted to enable better application for loss from pitch angle scattering. These are excellent tools for qualitatively understanding what an ensemble is doing, but they are not error metrics. For error metrics, it is likely that L will need to be accounted for, e.g., with the Jacobian . Both measures are domain- and boundary type-dependent; this is a desirable property as it highlights the fact that for radial diffusion, different domain sizes and boundary types are effectively modeling different physical systems [P1a;P1b;P2a;S2;S3;A1].
Using and all the components of a typical radial diffusion model were compared. Loss from pitch-angle scattering (using an extremely simple loss model) generally had more effect on the PSD distribution than diffusion, although this was highly dependent on the location of the plasmapause. A more distant plasmapause results in so much more loss that the dynamics could change totally, and the system never be able to reach a monotonic state. Of the initial condition, step size parameterized by B (corresponding to the quiet background PSD distribution) and enhancement location had the greatest effect on the timescale of diffusion and the state of the system; these results agree with our findings that gradients control the diffusion. Roughly, we found that for the impact on evolution, by order of magnitude; extreme values change this ordering. The initial condition controlled loss at the inner boundary [P2c;S3;S4;A1].
To summarize, in this paper we have presented a methodology to analyze the components of a numerical model of radial diffusion in Earth's radiation belt and compare the impact of those components on the shape and evolution of the particle phase space density distribution. Our work emphasizes the need to consider the entire modeling domain when comparing or analyzing radial diffusion simulations. Furthermore, we find that the L-dependence of the diffusion coefficient results in a model that is domain dependent; curtailing the outer boundary or using inappropriate boundary conditions effectively models very different systems. We make several suggestions for the outer boundary in future models. We find that it is the gradients of the phase space density that mainly control diffusion, contrary to the focus of most radial diffusion studies on .
SUPPLEMENTARY MATERIAL
See the supplementary material for contain plots for all numerical experiments run for this paper, i.e., the time to monotonicity ensembles for each parameter A,B,μ,σ,Louter,Lp with and without loss, and the calculated quantities E,Et,N,Nt for each parameter also. In the main body of the manuscript, selected results were shown to convey significant results.
ACKNOWLEDGMENTS
We would like to thank the reviewer for their thoughtful comments and attention to detail in making this manuscript more accessible. J.S. was supported by a RAS Summer Bursary. R.L.T. was supported by the Engineering and Physical Sciences Research Council (EPSRC) (Grant No. EP/L016613/1). S.N.B. was supported in part by STFC Grant ST/R000921/1. C.E.J.W. was supported in part by STFC ST/W000369/1 and NERC NE/V002759/1. Both S.N.B. and C.E.J.W. were supported in part by STFC ST/X001008/1.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
S. N. Bentley: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Funding acquisition (lead); Investigation (equal); Methodology (equal); Project administration (lead); Software (supporting); Supervision (equal); Validation (lead); Visualization (equal); Writing – original draft (lead); Writing – review & editing (lead). J. Stout: Formal analysis (supporting); Investigation (equal); Methodology (supporting); Software (equal); Visualization (equal); Writing – review & editing (supporting). R. Thompson: Conceptualization (equal); Methodology (equal); Resources (lead); Software (equal); Supervision (equal); Writing – review & editing (supporting). D. J. Ratliff: Formal analysis (supporting); Methodology (equal); Writing – review & editing (supporting). C. E. J. Watt: Conceptualization (supporting); Methodology (supporting); Writing – review & editing (supporting).
DATA AVAILABILITY
The data that support the findings of this study are openly available in Zenodo at https://zenodo.org/doi/10.5281/zenodo.13314166 (Ref. 76) using the Python notebook on Zenodo/GitHub https://zenodo.org/doi/10.5281/zenodo.13336681 (Ref. 77).