Deep learning (DL) has recently become a powerful tool for optimizing parameters and predicting phenomena to boost laser-driven ion acceleration. We developed a neural network surrogate model using an ensemble of 355 one-dimensional particle-in-cell simulations to validate the theory of phase-stable acceleration (PSA) driven by a circularly polarized laser driver. Our DL predictions confirm the PSA theory and reveal a discrepancy in the required target density for stable ion acceleration at larger target thicknesses. We discuss the physical reasons behind this density underestimation based on our DL insights.
I. INTRODUCTION
Laser-driven ion acceleration is a topic of significant importance in the field of laser–plasma interaction. The generation of high-energy ion beams holds great potential for applications in cancer therapy as well as for fundamental science studies.1–3 Various acceleration schemes have been proposed. One widely studied scheme is target-normal sheath acceleration (TNSA), which involves irradiating a solid target with an intense laser pulse. The laser–plasma interaction creates a plasma sheath at the target rear, causing ions to be accelerated away from the target.4–7 Another notable scheme is radiation pressure acceleration (RPA), where the radiation pressure of the laser pulse drives the acceleration of ions. The interaction between the laser and the plasma generates a strong electrostatic field, which aids in the ion acceleration process.8–12 Breakout afterburner (BOA) is another acceleration scheme that involves a two-stage process using intense laser pulses and skin-depth scale targets.13,14 During the initial phase of laser absorption at the front surface of the target, rapid evolution occurs, leading to the volumetric heating of electrons throughout the target volume beneath the laser focal point. As a consequence, the nanometer-scale target quickly transitions to a state of relativistic transparency, allowing the laser to penetrate through the dense target plasma. During the subsequent phase, the co-motion of the accelerating electric field and ions throughout the acceleration period is instrumental. This mechanism results in a substantial enhancement in beam energy and conversion efficiency when compared to traditional TNSA. Magnetized-vortex-acceleration (MVA) is a scheme that exploits the interaction between a magnetized plasma and a laser pulse. The presence of a magnetic field enhances the ion acceleration by enabling self-focusing of the laser pulse and reducing the electron energy loss.15,16 Collisionless shock acceleration (CSA) is a scheme where a shock wave is generated in a plasma through the interaction of an intense laser pulse. This shock wave can efficiently accelerate ions to high energies without significant heating.17,18
The existing acceleration schemes are primarily based on one-dimensional (1D) theory and demonstrate good agreement with particle-in-cell simulations (PICs), but multi-dimensional PIC simulations reveal that the complex physical processes involved19 and transverse effects introduced instabilities significantly reduce the efficiency of ion acceleration.20–23 Moreover, numerous simulations indicate that the ion energy and spectrum are highly sensitive to the laser and target parameters, such as beam size, energy, focal position, wavelength, target thickness, density, and structure. Despite achieving the highest recorded proton energy of 94 MeV in 2018,24 which nearly meets the requirements for radio-biological in vivo studies, further enhancements in proton energy and numbers are necessary for general tumor treatment. In 2022, an experiment of tumor irradiation in mice with a laser-accelerated proton beam was conducted,25 and thus, clinical experiments by laser-driven proton beam will be expected in the coming future.26
The progress of laser-driven ion acceleration has been limited and Badziak discussed the challenges and prospects,27 e.g., understanding and controlling ion beam properties, mitigating instabilities, optimizing laser and target parameters, and developing diagnostic techniques to characterize the ion beams accurately. Therefore, finding an optimized parameter set and understanding the underlying physics are critical. However, the consumed-cost associated with conducting experiments or performing multi-dimensional PIC simulations presents a challenge in parameter optimization.
Recently, deep learning (DL) has emerged as a powerful tool for scientists to address the challenges of the optimization problem because28 DL possesses the potential to explore the vast parameter space associated with ion acceleration. By leveraging large datasets generated from simulations and experiments, DL algorithms can capture intricate patterns, correlations, and nonlinear relationships that may not be easily discernible through traditional analytical approaches. This data-driven approach allows scientists to extract valuable insight and develop predictive models for ion acceleration.29 For example, in recent work by Djordjevic et al., in 2021, an ensemble of over 1000 1D-particle-in-cell (PIC) simulations was used to develop a neural network (NN) surrogate model to study the dependence of ion energy on the pre-plasma gradient length scale in target-normal sheath acceleration (TNSA) and conclude that the pre-plasma is critical issue in the interaction between the laser and the target.30 This paper suggests carefully measuring the laser pre-pulse and amplified spontaneous emission during experiments for modeling the pre-plasma. Hence, varying the pre-plasma profile would be their future study. Therefore, the approach allowed for a comprehensive exploration of the parameter space and provided valuable insight into the underlying physics of ion acceleration.
In addition to the NN surrogate model, Dolier et al. demonstrated the optimization of laser-driven proton acceleration PIC simulations controlled by a Bayesian algorithm.31 Moreover, Goodman et al. also indicated the utility of applying a machine learning-based optimization approach providing new insight into the physics of radiation generation in laser–foil interactions, which will inform the design of experiments in the quantum electrodynamics (QED)-plasma regime.32 An essential development in the integration of physics and machine learning is physics-informed neural networks (PINNs).33 A PINN is a specialized class of neural networks that leverage the principles of physics, frequently expressed in the form of ordinary differential equations or partial differential equations, to act as a regularization mechanism. This ensures that the model's solutions align harmoniously with the fundamental laws of physics. Furthermore, Scheinker and Pokharel recently presented a physics-constrained neural network (PCNN) approach to solving Maxwell's equations for the electromagnetic fields of intense relativistic charged particle beams.34 In 2023, Döpp et al. published a review article that presents an overview of relevant machine learning methods with a focus on applicability to laser-plasma physics and its important sub-fields of laser-plasma acceleration and inertial confinement fusion.35 Hence, data-driven science and machine-learning methods as a new approach to research have been a trend.
Our NN surrogate model can predict the maximum proton energy by the given parameter set to reveal the validity of the PSA theory. To verify the predictions, we performed 1D-PIC to compare with the prediction to demonstrate the successful establishment of a PSA scenario, where the radiation pressure and the electrostatic field achieve a delicate equilibrium, resulting in highly efficient and controlled ion acceleration. Furthermore, our NN surrogate model also indicates a discrepancy between the PSA theory and PIC simulations at larger target thickness region ( ). By exploring scenarios with larger target thicknesses, we observed the disruption of the electron layer due to an imbalance between the radiation pressure and the electrostatic force, ultimately leading to the termination of the PSA process. We performed 1D-PIC to investigate the physical reason and found that the collapse of the electron layer occurs when the plasma skin depth surpasses the thickness of the pulse-compressed electron layer during the interaction process, causing the termination of the PSA process.
II. METHOD
A. The dataset from 1D PIC
In our study, we constructed a dataset consisting of scalar outputs extracted from 355 one-dimensional (1D) particle-in-cell (PIC) simulations performed using the EPOCH PIC code.37 The simulation setup is illustrated in Fig. 1. We considered a circularly polarized laser pulse with a wavelength of . The laser pulse was modeled as a square pulse with a duration of and a Gaussian profile with a width of on both the rising and falling edges, where TL represents the laser period. The target was a pure hydrogen plasma with a step density profile, and the front edge was located at x = 0. The target thickness (D) and density n0 were normalized to the laser wavelength λL and the critical density nc, respectively. We employed a grid number with NX = 50 000 cells and a grid size of . Each cell contained 500 macro-particles. Here, we aimed to use DL to verify the PSA formula so the pulse duration is purposely fixed, which is the same as Yan's PSA paper.36 The simulation time for each run was because it is long enough to distinguish the successes and the failures of the PSA scheme.
The configuration of 1D PIC simulation. aL is the normalized laser amplitude, D is the target thickness, is the laser wavelength in vacuum, n0 is the target density, nc is the critical density of the incident laser pulse, τ is the pulse duration, and TL is the laser period. The propagating direction is in x and the left edge of the target is at .
The configuration of 1D PIC simulation. aL is the normalized laser amplitude, D is the target thickness, is the laser wavelength in vacuum, n0 is the target density, nc is the critical density of the incident laser pulse, τ is the pulse duration, and TL is the laser period. The propagating direction is in x and the left edge of the target is at .
To construct the dataset, we varied the laser amplitudes a0 as 5, 10, 15, and 20. The target thickness ranged from 0.01 to 1.5, while the target density ranged from 0.5 to 15. At , we recorded the maximum kinetic energy Ek of protons in each simulation run. The 1D-PIC simulation results for the dataset of deep learning and the phase-stable acceleration (PSA) theory (illustrated by black-dashed lines) are shown in Fig. 2. The higher proton energies were obtained near the region of the black-dashed line showing that the proton was efficiently accelerated when the PSA criterion was satisfied.36 Moreover, the maximum kinetic energy of protons Ek increases with laser strength aL. In Subsection II B, we aim to develop a neural network (NN) surrogate model to verify the PSA theory and predict the maximum proton energy EK in laser-driven ion acceleration instead of running the consumed-cost PIC.
The maximum proton energies from different parameters set (aL, ). The black-dashed lines in each figure are the PSA theory from Eq. (1).
The maximum proton energies from different parameters set (aL, ). The black-dashed lines in each figure are the PSA theory from Eq. (1).
B. Neural network architecture
To construct the NN surrogate model, we employ the PyTorch framework, a powerful tool for deep learning that has gained widespread recognition in the scientific community.38 The input dataset, as shown in Fig. 1, used for training and validation consists of input parameters (aL, ) paired with their corresponding maximum proton energy Ek. By leveraging this dataset, we can guide the NN surrogate model to learn the intricate relationships between the input parameters and the target variable. Our NN architecture follows a four-layer structure, as depicted in Fig. 3. PyTorch's flexible and intuitive interface enables us to easily define the layers and activation functions within the network, allowing for seamless customization and fine-tuning.
The four-layer neural network architecture. The predicted value is the maximum proton energy Ek.
The four-layer neural network architecture. The predicted value is the maximum proton energy Ek.
The initial layer serves as the input layer, where the dataset's three features are fed into the network. Leveraging the SELU activation function (Scaled Exponential Linear Unit), this layer outputs nine features. Subsequently, the second layer applies the LeakyReLU activation function (leaky rectified linear unit) to the nine input features, resulting in the transformation into 27 output features. In the third layer, the 27 input features are further expanded to 81 output features using the ELU activation function (exponential rectified linear unit). Finally, the output layer takes the 81 input features and generates a single target value Ek employing the ReLU activation function (rectified linear unit). ReLU, widely adopted in regression tasks, effectively handles non-linearity by passing positive values while disregarding negative ones which guarantees none of the negative proton energy output.
III. RESULTS
A. Validation and training result
To further validate the performance of our NN surrogate model, we conducted a comparative analysis by contrasting its predicted values with those derived from an ensemble of 355 1D-PIC simulations, depicted as the blue data points in Fig. 5. This comparative examination serves as a pivotal evaluation of the predictive capabilities inherent in our NN surrogate model. In Fig. 5, the reference line with a slope of 1, representing the ideal predictions by the NN surrogate model, indicates a scenario where predicted values align perfectly with the simulation data. The blue data points exhibit exceptional concordance between the PIC-derived values and the NN-predicted values, resulting in a corresponding R-squared value of .
Comparison of maximum proton energy between the PIC-derived values and NN-predicted values: The black line (slope = 1) represents ideal model predictions, perfectly aligned with data. Blue points show NN model predictions for the input PIC dataset (refer to Fig. 2), with an . Red and green points depict the additional PIC simulations compared within and outside the training domain ( , and ), yielding and .
Comparison of maximum proton energy between the PIC-derived values and NN-predicted values: The black line (slope = 1) represents ideal model predictions, perfectly aligned with data. Blue points show NN model predictions for the input PIC dataset (refer to Fig. 2), with an . Red and green points depict the additional PIC simulations compared within and outside the training domain ( , and ), yielding and .
To further scrutinize the reliability of our NN model, we conducted additional PIC simulations using datasets both within and beyond the confines of the training domain, defined as , and . These are represented by the red (within) and green (beyond) data points depicted in Fig. 5. The computed R2 values for the red and green datasets are 0.83 and –1.56, respectively. In the case of the red data points, the majority of predictions exhibit a favorable alignment with the PIC simulations, albeit with some instances of underestimation or overestimation. The scattering of red points corresponds to the predictive accuracy of our model. This accuracy, defined as the percentage of sources with errors less than 10%, reaches 0.6 in the final epoch, as shown in Fig. 4(b). Out of a total of 270 red points, 162 exhibit errors below 10%. Consequently, the calculated predictive accuracy for the red points is 162/270 = 0.6, aligning with the overall model accuracy.
Conversely, for the green data points, most predictions tend to be overestimated, and the negative R2 value signifies a challenging extrapolation task for the NN surrogate model, where the data points fall outside the training domain, defined as , and , which may cause the green points are always beyond the ideal model prediction (black line). Notwithstanding the rudimentary NN surrogate model presenting the inherent limitations of extrapolation predictions in this paper, numerous scientists are still dedicated to the refinement of extrapolation techniques and the reduction of extrapolation error.39–42
B. Parameter scans using deep learning model
The well-established NN surrogate model enables the construction of a continuous data map based on a discrete ensemble of datasets derived from sparse PIC simulations. Following the training of our NN surrogate model using the 355 input PIC datasets, we leverage it to conduct an extensive parameter sweep, accompanied by the corresponding predicted proton energy Ek, as depicted in Fig. 6. In the figure, a discernible layer becomes evident, signifying the presence of an optimal parameter set. This narrow layer exhibits two distinctive characteristics:1 an increase in the maximum proton energy Ek with rising aL;2 a symmetric hyperbolic layer presents due to the structure of Eq. (1), but the predicted energy distribution is asymmetric on the hyperbolic layer. For example, in the hyperbolic curve at aL = 20, it is clear that the higher energy (red) and the lower energy (yellow-green) are obtained by the thin-high density target and the thick-low density target, respectively.
Considering as a variable, the PSA formula can be represented as a linear equation with a slope of π, as depicted in Fig. 7. It becomes evident that the PSA theory aligns well with the predictions generated by our NN surrogate model, indicating efficient proton acceleration when Eq. (1) is satisfied. Consider aL = 20 and to illustrate. In this case, increasing target density while reducing thickness is equivalent to decreasing target density while increasing thickness. However, when comparing Fig. 7 with Fig. 6, it becomes apparent that the symmetry property of the parameter set exists only within a limited region of the parameter space. This implies that some information remains obscured in the representation of Fig. 7.
The data map of the predicted maximum proton energies. The black line with a slope of π is the PSA theory from Eq. 1.
The data map of the predicted maximum proton energies. The black line with a slope of π is the PSA theory from Eq. 1.
To compare the predictions generated by the NN surrogate model with the input PIC data, as illustrated in Fig. 2, we have performed parameter slice analyses on the parameter cube for values of 5, 10, 15, and 20, as demonstrated in Fig. 8. Some filament patterns in the region below the black lines are observed because of the sparse input PIC data.
The data map of the predicted maximum proton energies for the comparison of Fig. 2. The black lines in each figure are the PSA theory from Eq. (1).
For a specific slice of aL = 5, the NN surrogate model predicts a maximum proton energy of approximately Ek = 1200 MeV at the coordinates . To validate this prediction, an additional PIC simulation was conducted using these parameter settings, resulting in an energy measurement of MeV. This observation highlights the potential for the NN surrogate model to occasionally overestimate or underestimate proton energy, aligning with the presentation in Fig. 5. In addition, Fig. 8 showcases a promising level of agreement between the predicted values and the PSA theory, particularly in cases, where . Nevertheless, certain discrepancies between the predictions and the theoretical expectations are apparent especially in the region of , warranting further investigation.
The density distributions (blue is electron and red is proton) and energy spectra at t = 500 fs for initial target density (a) and (b) .
The density distributions (blue is electron and red is proton) and energy spectra at t = 500 fs for initial target density (a) and (b) .
In the case of , the density distribution exhibits a double-layer structure, where the electron layer is propelled forward by radiation pressure, while the proton layer is consistently drawn by the established electrostatic field, as illustrated in Fig. 9(a). We introduce a negative sign to the electron density to differentiate between the proton and electron layers. The accumulated densities of the proton and electron layers ( ) are much higher than the initial target density. Moreover, the mono-energetic proton energy spectrum with a central energy of 1300 MeV is observed which is evidence of the efficient acceleration of the proton by the PSA mechanism. Conversely, for , the density distribution demonstrates layer destruction without a mono-energetic proton energy spectrum, leading to the failure of the PSA mechanism, as depicted in Fig. 9(b).
IV. DISCUSSION
A. The physical reason causing the failure of the PSA mechanism
To elucidate the underlying physical mechanisms leading to the success or failure of the PSA mechanism, we examine the density distribution snapshots for both scenarios, as depicted in Fig. 10.
The snapshots of density distributions at t = 0, 70, 80, 90, 95, and 100 fs for initial target densities and .
The snapshots of density distributions at t = 0, 70, 80, 90, 95, and 100 fs for initial target densities and .
Figure 11 showcases the density distributions at t = 105 fs for both and . In this depiction, we define D as the initial target thickness and as the pulse-compressed electron layer's thickness. It is important to acknowledge that precisely defining proves challenging due to its dynamic nature throughout the pulse-pushing process. In Fig. 11, the designated region represents the skin depth (depicted in gray), computed using the maximum electron density. A comparison is made between the calculated skin depth ls and the compressed electron layer thickness . For , the compressed electron layer thickness surpasses the calculated skin depth ls. As a result, the laser field (depicted by the black dashed line) experiences a steep drop to near-zero values after traversing the electron layer ( ), as exemplified in Fig. 11(a). This occurrence signifies complete reflection of the laser field, effectively channeling energy into the electron layer and indirectly facilitating proton bunch acceleration via the electrostatic field between the double-layer structure. Conversely, for , the dimensions of the sharp electron density structure prove smaller than the skin depth ls, thereby enabling the transmission of the laser field (indicated by the black dashed line) past the electron layer ( ), as depicted in Fig. 11(b). Consequently, the radiation pressure resulting from partial reflection fails to counterbalance the electrostatic force originating from the sharp electron density structure.
The snapshots of density distributions and laser electric field (green line) at t = 105 fs for initial target density (a) and (b) .
The snapshots of density distributions and laser electric field (green line) at t = 105 fs for initial target density (a) and (b) .
B. Future work
By employing the PSA mechanism as the benchmark, we have already ensured that the DL can extract the physical feature instead of performing numerous consumed-cost PIC if the NN model is trained appropriately. However, the situation is predictably different from 1D to 2D/3D PIC simulations because a critical parameter, focal spot size, is included. Even though we obtain the optimal conditions based on the 1D PSA scheme and apply the parameter set to 2D/3D PIC simulations, the 2D/3D simulation result would not present the same performance (e.g., energy, spectra, or density distributions) as the 1D PIC simulation since the transverse effects or instabilities cause the damage of the accumulated electron layer, such as Rayleigh-Taylor instability,20,43 Weibel instability,44,45 half-wavelength instability,46 and transverse instability.21–23 Although the 1D scheme cannot be directly compared with 2D/3D PIC, 1D theory still plays an important role in establishing the physical insight for the complex process of laser-driven proton acceleration. Hence, we expect that the NN surrogate model using sparse training data based on this paper will be a powerful tool to allow us to figure out the dependencies between the accelerated proton energy and each parameter.
After studying Fuchs' scaling by DL, we will switch the 2D scenarios for the hybrid acceleration scheme proposed by Isayama.47 This hybrid acceleration scheme is a combination acceleration scheme of RPA,8–11 laser wakefield acceleration (LWFA),48,49 and TNSA4–6 by the interaction of the tabletop dual pulses with a tandem solid density and a near-critical density target. By choosing an appropriate parameter set, the achievement of 100 MeV (or even higher) protons with a tabletop laser system is possible, e.g., NCU 100-TW laser system.50 In this hybrid acceleration scheme,47 multiple parameters are necessary to consider for the optimization of the maximum proton energy, such as the individual laser parameters (energy, duration, spot size), the pulse time delay between two lasers, the density and the size of the solid target and the near-critical density target, and the space between the two targets.
V. CONCLUSION
We developed a neural network (NN) surrogate model, constructed from an ensemble of 355 one-dimensional (1D) particle-in-cell (PIC) simulations, to investigate the phenomenon of phase-stable acceleration (PSA) in laser-driven ion acceleration. This work highlights that even with a sparse and discrete dataset, a well-trained NN model can effectively generate a continuous data map, enabling accurate predictions. Significantly, the widely accepted PSA theory serves as a benchmark for evaluating the reliability and robustness of the predictions made by the trained NN model. Moreover, our NN surrogate model not only validates the well-established PSA theory but also uncovers a noteworthy disparity between theory and predictions when considering targets with larger thicknesses. Through comprehensive 1D PIC simulations, we identify that the collapse of the electron layer occurs when the plasma skin depth surpasses the thickness of the pulse-compressed electron layer during the interaction process, causing the termination of the PSA process. Although our study predominantly focuses on the 1D effects, it is essential to underscore the significance of the skin depth in multi-dimensional simulations. The skin depth plays a pivotal role in determining the dynamics of laser–plasma interactions and the mechanisms governing ion acceleration in both one and higher-dimensional scenarios.
The integration of deep learning (DL) methodologies into multi-dimensional PIC simulations or experiments holds great promise for advancing the field of laser-driven ion acceleration. Future investigations will leverage DL techniques to explore intricate phenomena, such as Rayleigh–Taylor instability,20,43 Weibel instability,44,45 half-wavelength instability46 and transverse instability,21–23,51 measurement technique,52,53 target design,54,55 and novel acceleration schemes,56,57 especially the hybrid acceleration scheme proposed by Isayama as mentioned in future work.47 These breakthroughs will have far-reaching implications in areas, such as cancer therapy, fundamental science research, and other practical applications, relying on high-energy ion beams.
SUPPLEMENTARY MATERIAL
See the supplementary material for two GIF video files for n 0 = 1.24 n PSA and n 0 = 1.23 n PSA in Fig. 10. There are three subfigures in each GIF file: the first is the electrostatic field (Ex), second is the laser field (Ey), and third is the electron and the proton densities ( n / n c).
ACKNOWLEDGMENTS
This work was supported by the National Science and Technology Council (Ministry of Science and Technology), Taiwan, under Grant Nos. MOST 111-2112-M-006-011-MY3, 111-2111-M-006-005-MY2, and 112-2811-M-006-027. This work was also supported by JSPS KAKENHI under Grant Nos. 19H00668, 20KK0064, and 22H01195 and Core-to-Core Program (No. JPJSCCA2019002). We thank the National Center for High-performance Computing (NCHC) for providing computational and storage resources.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Yao-Li Liu: Conceptualization (lead); Data curation (lead); Formal analysis (lead); Funding acquisition (lead); Investigation (lead); Methodology (equal); Project administration (lead); Resources (lead); Software (equal); Supervision (equal); Validation (lead); Visualization (lead); Writing – original draft (lead); Writing – review & editing (lead). Yen-Chen Chen: Data curation (equal); Formal analysis (equal); Methodology (equal); Software (equal); Validation (equal); Visualization (equal); Writing – original draft (equal). Chun-Sung Jao: Formal analysis (supporting); Methodology (supporting); Validation (supporting); Visualization (supporting); Writing – original draft (supporting); Writing – review & editing (supporting). Mao-Syun Wong: Data curation (supporting); Methodology (supporting); Software (supporting); Visualization (supporting). Chun-Han Huang: Data curation (supporting); Formal analysis (supporting); Software (supporting); Validation (supporting); Visualization (supporting); Writing – original draft (supporting); Writing – review & editing (supporting). Han-Wei Chen: Data curation (supporting); Formal analysis (supporting); Methodology (supporting); Software (supporting); Visualization (supporting). Shogo Isayama: Writing – original draft (supporting); Writing – review & editing (supporting). Yasuhiro Kuramitsu: Conceptualization (supporting); Project administration (equal); Supervision (equal); Validation (supporting); Visualization (supporting); Writing – original draft (supporting); Writing – review & editing (equal).
DATA AVAILABILITY
Data sharing is not applicable to this article as no new data were created or analyzed in this study.