We present an inference scheme of long timescale, non-exponential kinetics from molecular dynamics simulations accelerated by stochastic resetting. Standard simulations provide valuable insight into chemical processes but are limited to timescales shorter than 1μs. Slower processes require the use of enhanced sampling methods to expedite them and inference schemes to obtain the unbiased kinetics. However, most kinetics inference schemes assume an underlying exponential first-passage time distribution and are inappropriate for other distributions, e.g., with a power-law decay. We propose an inference scheme that is designed for such cases, based on simulations enhanced by stochastic resetting. We show that resetting promotes enhanced sampling of the first-passage time distribution at short timescales but often also provides sufficient information to estimate the long-time asymptotics, which allows the kinetics inference. We apply our method to a model system and a peptide in an explicit solvent, successfully estimating the unbiased mean first-passage time while accelerating the sampling by more than an order of magnitude.

Transitions between metastable states are a key feature of many chemical and physical phenomena, such as chemical reactions and protein conformational changes. Molecular Dynamics (MD) simulations are an appealing computational tool for estimating the kinetic rates associated with such transitions. They track the dynamics of the system in microscopic detail, providing accurate predictions of both thermodynamic and kinetic properties. However, MD simulations require an integration time step on the order of 1fs, limiting the simulations to a timescale of 1μs. Processes that occur on longer timescales, such as protein folding and crystal nucleation, cannot be sampled efficiently in standard MD simulations.1–5 

Different enhanced sampling methods were developed to overcome this well-known timescale problem. An incomplete representative list includes umbrella sampling,6,7 milestoning,8–10 replica-exchange MD,11–13 Gaussian accelerated MD (GaMD),14–16 and metadynamics.1,17,18 These methods promote extensive sampling of the transitions between metastable states within the accessible timescales of MD simulations, allowing the calculation of both thermodynamic averages and dynamic properties. For kinetics inference, most schemes assume that the underlying process has an exponential first-passage time (FPT) distribution,1–4,13,16,19–22 relying on transition state theory (TST)23,24 or Kramers’ theory.4,25–27

Exponential FPT distributions arise when a high energy barrier separates two narrow metastable states. However, many processes in nature have non-exponential FPT distributions. For instance, protein dynamics are sometimes better described by a sum of exponents with different rates, or skewed exponents.28–31 Power-law kinetics were also found experimentally for transitions between two stable conformers of an enzyme.32 Even when the FPT distribution has an exponential tail, it can behave differently at short to intermediate times, for instance, obeying a power-law.33 Yet, kinetic inference schemes for processes with non-exponential FPT distributions are currently lacking. We propose such a scheme, designed for simulations enhanced by stochastic resetting (SR).

Resetting is the procedure of stopping random processes and restarting them subject to the sampling of independent and identically distributed initial conditions. It was shown to expedite different kinds of processes, including randomized computer algorithms,34–36 queuing systems,37,38 and various first-passage and search processes.39–47 We recently demonstrated the power of resetting for enhanced sampling of MD simulations.5,48 We showed that it can expedite rare events in molecular simulations when used as a standalone tool5 and in combination with metadynamics simulations, which also improved the kinetics inference.48 

The first two moments of the FPT distribution provide a sufficient condition for SR to be beneficial: if the ratio between the standard deviation and the mean, known as the coefficient of variation (COV), is greater than unity, introducing a small resetting rate is guaranteed to lower the mean FPT (MFPT).49 The COV of the exponential distribution is exactly equal to one, so processes with a broader FPT distribution can be accelerated with SR. However, they cannot be properly treated by most kinetics inference schemes from accelerated simulations since these assume an exponential FPT distribution.2–4,20–22 Resetting, on the other hand, can potentially provide accurate predictions, as it minimally perturbs the natural dynamics of the system between restart events.

In this paper, we present a method to extract the unbiased MFPT of processes with non-exponential FPT distributions from simulations accelerated by SR. We first present the theory underlying our method and use analytical FPT distributions to discuss its advantages and disadvantages. Next, we employ it to study a three-state, two-dimensional model system and the dynamics of a nine-residue alanine peptide in explicit water. We show that our method can consistently predict the MFPT with high accuracy, with speedups of more than an order of magnitude over brute-force unbiased simulations.

Different protocols of resetting were suggested in the literature.50 Here, we employ sharp resetting, where the waiting times between resetting events are constant, with some duration T, which is often called the timer. Sharp resetting was shown to provide greater acceleration than any other resetting protocol when performed with the optimal timer.51 The MFPT ⟨τT of a process with sharp resetting, as a function of the timer, is related to the unbiased FPT distribution through52 
(1)
with S(t) being the survival probability of the FPT τ at time t,
(2)
Note that Eq. (1) requires the survival function only at tT. As the dynamics between resetting events remain unperturbed, simulations with some timer T* sample the exact survival probability at times tT*. Given a sample of trajectories undergoing resetting with timer T*, Eq. (1) provides a practical tool to assess the MFPT with any timer T < T*, indicating whether it is possible to further enhance the sampling by choosing shorter timers.
Simulations with timer T* also sample the exact conditional average ⟨τ|τT*⟩, i.e., the MFPT of trajectories with a FPT τT*. Using the total expectation theorem, the unbiased MFPT can be expressed as
(3)
We note that simulations with timer T* provide all terms on the right-hand side of Eq. (3), except for ⟨τ|τ > T*⟩, the MFPT of trajectories with a FPT τ > T*. If we could accurately estimate ⟨τ|τ > T*⟩ through the behavior at times tT*, we would be able to extract the unbiased MFPT via Eq. (3). Fortunately, many FPT distributions have some distinctive decaying tail at t > t′, with t′ being some characteristic timescale. If we know that is the case and choose a timer T* > t′, we can sample part of this tail, fit it with the correct functional form, and obtain an accurate estimate of ⟨τ|τ > T*⟩.
For instance, if the FPT distribution decays exponentially with a rate k at t > t′, then logS(t) would be linear for t > t′, with a slope −k. A linear fit of logS(t) in the proximity of t = T* > t′ would provide k, and consequently,
(4)
Similarly, if the FPT distribution is governed by a power-law at t > t′, i.e., S(t) ∝ tα for t > t′ with some α > 1, then we can estimate53 
(5)
Once we estimate ⟨τ|τ > T*⟩, we substitute it in Eq. (3) and have an estimate of the unbiased MFPT. In practice, when using real data, for both exponential and power-law tails, we fit the log of the survival function for every choice of t′ < T* and choose the t′ that provides the best linear fit, i.e., with the highest Pearson correlation coefficient. Note that for an exponential and power law tail, the linear fit should be done on semi-log and log–log scales, respectively.

Ideally, T* should be greater than t′ but not too large so that sampling by resetting would still lead to acceleration over standard MD simulations. We imagine the following use case: First, be conservative. Given a few trajectories, roughly estimate a T* that would cut the far tail of the FPT distribution, leading to acceleration while leaving enough to fit. Choose the functional form that best fits the data. In the second step, use Eq. (1) to predict whether an even shorter timer will lead to greater acceleration. In doing so, one must still leave enough of the tail to fit it. Similar to other kinetic inference methods,3,4,22 there will be a trade-off between speedup and accuracy. In all the examples below, we find that it is possible to estimate the scaling of the distribution tails through this procedure while benefiting from significant speedups.

1. Hyperexponential distribution

We first employ our approach to an analytical, dimensionless FPT distribution, a hyperexponential distribution composed of two exponents with rates k1 and k2,
(6)
We take A = 0.5 for simplicity and k1 = 100 ≫ k2 = 0.1 for separation of timescales between the terms. The survival function is plotted in Fig. 1(a). It shows fast decay at short times and fits the slower rate k2 at longer times. The rate of decay is 1% off k2 at time T*, marked with a black dashed line. The gray dotted lines decay with rates k1 and k2.
FIG. 1.

(a) Exact survival function for a hyperexponential distribution [Eq. (6)]. The dashed gray lines show slopes of −k1 and −k2 for comparison. The dotted black line highlights a specific T* = 0.115. (b) Speedup (green) and estimated MFPT (blue) as a function of the timer using Eqs. (3) and (4) and the analytical survival function. The dotted black line highlights a timer of T* = 0.115.

FIG. 1.

(a) Exact survival function for a hyperexponential distribution [Eq. (6)]. The dashed gray lines show slopes of −k1 and −k2 for comparison. The dotted black line highlights a specific T* = 0.115. (b) Speedup (green) and estimated MFPT (blue) as a function of the timer using Eqs. (3) and (4) and the analytical survival function. The dotted black line highlights a timer of T* = 0.115.

Close modal

For this analytical example, we know the exact survival function and can use Eq. (1) to obtain the exact MFPT under resetting for any choice of timer T*. We can also estimate the MFPT with no resetting (=5.005) through Eqs. (3) and (4), using the exact values of ⟨τ|τT*⟩ and S(T*), and evaluating k=dlog(S(t))dt|t=T* with the exact derivative. Figure 1(b) shows the resulting estimations as a function of T* (blue, right y-axis). It also presents the speedup (green, left y-axis), with the speedup defined as the ratio between the MFPT without and with resetting, respectively. The dashed black line marks the results with the specific timer T* shown in Fig. 1(a). We observe that the MFPT estimation is within 1% of the true value for speedups up to 40. This is expected since the slope of the survival at T* is within 1% of k2 at T*. This means that, for a process with this FPT distribution, we could spend 40 times less computational time per first-passage event compared to tedious, unbiased simulations and obtain nearly the same accuracy.

However, in more realistic scenarios, the accuracy of the predicted MFPT would also be limited by the number of trajectories sampled and how well they capture the slope of the distribution tails. We investigate the sensitivity of our approach to sampling noise by numerical sampling from the distribution in Eq. (6) to estimate its survival function. To estimate the MFPT in this case, we sampled 1000 batches for every timer, each composed of 100 FPTs that were numerically sampled from the distribution. We constructed trajectories with resetting in the following way: We first sampled a time from the FPT distribution without resetting and compared it with the timer. If the timer was smaller, we tallied it up and sampled a new time from the FPT distribution without resetting. We repeated the process until the sample was smaller than the timer. In that case, we added the sample to the sum, and the sampling of the trajectory was completed. The total summed time was a sample of the FPT with resetting at that value of T*.

Figure 2 shows the MFPT estimation as a function of the selected timer T* (top panel). The boxes show the range between the first and third quartiles (interquartile range, IQR), and the whiskers show extreme values within 1.5 IQR below and above these quartiles. The circles and horizontal lines give the mean and median, respectively. The associated speedups are plotted with green squares in the bottom panel. We find that a timer of 0.115 gives a speedup of 40, as anticipated, but provides a poor estimate of the unbiased MFPT due to insufficient sampling in the proximity of t′. Longer timers provide better estimations, with accurate averages (<0.6% absolute deviation) for speedups of up to 6. Increasing the number of samples improves the estimations. Figure 1 of the supplementary material shows results equivalent to those of Fig. 2 but sampling 1000 first-passage events for each timer. We find that a timer of 0.2 already gives <4% absolute deviation, with a speedup of 24.

FIG. 2.

Estimated MFPT (top panel) and speedup (bottom panel) as a function of T* for the hyperexponential distribution. The circles, horizontal lines, boxes, and whiskers in the top panel show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The shaded region shows the IQR of 1000 batches of 100 unbiased simulations each. The speedup was calculated for a single processor (green squares) and 100 processors (orange triangles).

FIG. 2.

Estimated MFPT (top panel) and speedup (bottom panel) as a function of T* for the hyperexponential distribution. The circles, horizontal lines, boxes, and whiskers in the top panel show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The shaded region shows the IQR of 1000 batches of 100 unbiased simulations each. The speedup was calculated for a single processor (green squares) and 100 processors (orange triangles).

Close modal

2. Additional speedup by parallelization

So far, we assumed that all simulations are performed serially, each one initiated only when the former is done. This would be the case if only one processor was available. However, we can usually perform simulations in parallel on several processors. This becomes increasingly affordable with the improvement in computer power, as demonstrated, for instance, by the parallel replica dynamics (ParRep) method.54–56 As explained in Ref. 55, the improvement in computer power is mainly reflected in ever-greater levels of parallelization and not in per-processor speed. While it does not directly solve the timescale problem, it greatly benefits parallelizable enhanced sampling methods.

Consider, for instance, running 100 trajectories on 100 processors simultaneously; for unbiased statistics, one has to wait for all trajectories to end. Thus, the walltime of the sampling is the FPT of the longest trajectory, which is always larger than the empirical MFPT. For the hyperexponential distribution above, using 100 trajectories, it is 45, i.e., almost an order of magnitude greater than the MFPT [see Eq. (S1)]. The larger the COV, i.e., the larger the fluctuations in the FPT distribution, the larger the walltime compared to the MFPT. Resetting lowers the COV,57 reducing the ratio between the longest first-passage time and the MFPT. This, in turn, translates to greater speedups in parallelizable settings. We demonstrate this by plotting the walltime speedup with 100 processors for the hyperexponential distribution (orange triangles in Fig. 2). The walltime speedup is defined as the ratio between the sampling walltimes without and with resetting, respectively. We find that in this case, it is larger by up to 40% than the equivalent speedup with a single processor.

3. Power law distribution

We finish this section with a non-exponential analytical example, whose behavior is qualitatively different than the previous example: The Pareto distribution,
(7)
with α = 1.25 and τm = 1. It has power-law decay [Eq. (5) is thus used for MFPT estimations], an MFPT of 5, and its COV without resetting diverges. For this case, we find that the walltime speedup using 100 processors is 15, an order of magnitude larger than that using a single processor (2). Since the power law decay starts at τm, all FPT measurements with timers greater than τm sample the tail. Moreover, since the decay is uniform, different timers give very similar MFPT estimations. We thus present results for a single timer T* = 2.

Figure 3 shows the estimations of the MFPT with 100–2000 FPT samples (top panel). The median of the MFPT estimations is close to the true MFPT even with 100 samples with resetting. The mean is overestimated due to a few outlier values not shown. With 500 and 2000 samples, the mean is only 12% and 2% off the true MFPT, respectively. The bottom panel shows the estimates of α from the same data used in the top panel, demonstrating that we can accurately obtain power-law tails from simulations with resetting.

FIG. 3.

The Pareto distribution: Estimated MFPT (top) and estimated α (bottom) as a function of the number of first-passage samples. The dashed black lines give the analytic MFPT and α in the corresponding panels. The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The dotted lines and the shaded regions show the mean and IQR, respectively, for 1000 sets of 2000 unbiased measurements each.

FIG. 3.

The Pareto distribution: Estimated MFPT (top) and estimated α (bottom) as a function of the number of first-passage samples. The dashed black lines give the analytic MFPT and α in the corresponding panels. The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The dotted lines and the shaded regions show the mean and IQR, respectively, for 1000 sets of 2000 unbiased measurements each.

Close modal

We next apply our inference scheme for MD simulations of a particle diffusing over a two-dimensional three-state potential [Fig. 4(a) and Eq. (S2)]. This potential was introduced by Khan et al. to represent a simple kinetic model with two first-order reactions.20 The particle is initially positioned at state A. One reaction path leads to state I, with a kinetic rate of k1. It is a reversible reaction, with transitions from I to A governed by rate k−1 < k1. The second path leads to the product state B with rate k2 < k−1. As Khan et al. point out, the system should reach a quasi-equilibrium between states A and I at time k1+k11, after which transitions from states A and I to B follow an effective rate of κ=k1k1+k1k2. We assessed the values of k1, k−1, and k2 using unbiased simulations in which only a single reaction path was available, with the other blocked by a strong repulsive potential. The survival function shown in Fig. 4(b) confirms that transitions to state B follow rate κ at times larger than some t′, while faster at t < t′.

FIG. 4.

(a) The three-state system. The white dashed line marks the first-passage criterion. (b) The survival function at times <10ns. The dashed line marks the time t=1k1+k1 and the dotted line shows the decay at rate κ=k1k1+k1k2. (c) Estimated MFPT (top), timescale t′ (center), and speedup (bottom) as a function of the timer. Speedup was calculated for simulations on a single processor (green squares) or on 100 parallel processors (orange triangles). The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The dotted lines and shaded areas show the mean values and IQR for 100 bootstrapping sets of 100 unbiased simulations each, respectively (1000 independent unbiased trajectories were collected in total). The dashed line in the middle panel shows t=1k1+k1.

FIG. 4.

(a) The three-state system. The white dashed line marks the first-passage criterion. (b) The survival function at times <10ns. The dashed line marks the time t=1k1+k1 and the dotted line shows the decay at rate κ=k1k1+k1k2. (c) Estimated MFPT (top), timescale t′ (center), and speedup (bottom) as a function of the timer. Speedup was calculated for simulations on a single processor (green squares) or on 100 parallel processors (orange triangles). The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The dotted lines and shaded areas show the mean values and IQR for 100 bootstrapping sets of 100 unbiased simulations each, respectively (1000 independent unbiased trajectories were collected in total). The dashed line in the middle panel shows t=1k1+k1.

Close modal

Figure 4(c) presents estimations of the MFPT (top panel) and time t′ (center panel) as a function of T*, sampling 100 first-passage events per batch for each timer. The associated speedups are plotted in the bottom panel. The speedup is similar with 1 and 100 processors because the COV without resetting is 1.05, very close to unity. We find that even the shortest timer, providing speedups of 6, estimates the MFPT within an order of magnitude of the unbiased value. As expected, the accuracy improves with larger timers. In addition, unbiased simulations fail to provide any insight on the timescale t′: the IQR is very broad, and the mean is higher than the third quartile due to large outlier values. However, simulations with resetting correctly estimate its order of magnitude with almost any timer choice.

Many proteins are characterized by a rugged free-energy surface (FES), with multiple metastable states separated by low energy barriers.31,58,59 An extreme case is downward folding, where there are no energy barriers along the folding path.28–30 The assumption of Poisson statistics, which is usually acceptable in systems with high energy barriers, is often invalid in protein dynamics. Therefore, we expect protein dynamics to be a natural proving ground for our new method. To demonstrate it, we employ our method to study the dynamics of a nine-residue peptide of alanine [Fig. 5(a)], which is the shortest peptide known to form a stable α-helix.60 It also forms a stable “loop,” similar to other alanine chains.33 This system was chosen to benchmark our approach due to its multiple transitions between metastable states on a long timescale, but not too slow, allowing benchmarking against brute-force unbiased simulations.

FIG. 5.

(a) Ball-and-stick and cartoon representations of four stable configurations of a nine-residue alanine peptide. The white, gray, blue, and red spheres represent the hydrogen, carbon, nitrogen, and oxygen atoms, respectively. (b) Free energy along an end-to-end-based CV and an H-bond-based CV. The white dotted lines define the four metastable states. (c) Estimated MFPT (top) for different timers. The MFPT of 1000 unbiased trajectories is indicated with a black dotted line in the top panel, with the gray shading showing ±1/1000 the standard deviation—the estimated error. The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The speedup using a single or 100 processors is plotted in the bottom panel with green squares and orange triangles, respectively.

FIG. 5.

(a) Ball-and-stick and cartoon representations of four stable configurations of a nine-residue alanine peptide. The white, gray, blue, and red spheres represent the hydrogen, carbon, nitrogen, and oxygen atoms, respectively. (b) Free energy along an end-to-end-based CV and an H-bond-based CV. The white dotted lines define the four metastable states. (c) Estimated MFPT (top) for different timers. The MFPT of 1000 unbiased trajectories is indicated with a black dotted line in the top panel, with the gray shading showing ±1/1000 the standard deviation—the estimated error. The circles, horizontal lines, boxes, and whiskers show the mean, median, IQR, and extreme values within 1.5 IQR of the first and third quartiles, respectively. The speedup using a single or 100 processors is plotted in the bottom panel with green squares and orange triangles, respectively.

Close modal

Figure 5(b) shows the FES of the system along two collective variables (CVs), obtained from a continuous, 2 µs-long unbiased trajectory. One-dimensional FES along these CVs are provided in Fig. 2 of the supplementary material. The first CV is the end-to-end distance x, calculated using the center of mass of the two edge residues, which identifies the closed “loop” state (A, x < 0.52 nm). The second is the average of three distances between pairs of H-donor nitrogen and acceptor oxygen within the peptide. This average, denoted here as y, identifies the helix60 (C, y < 0.37 nm). We also identify a broad basin of metastable open configurations (B, x > 1.25 nm and y > 0.8 nm) and a metastable intermediate state, where two of the three H-bonds are formed (D, 1.42 < x < 1.67 nm and 0.39 < y < 0.44 nm). Representative configurations of the states are presented in Fig. 5(a).

We first sampled 100 configurations of state A, obtained in time intervals of 0.5 ns from a trajectory restricted to state A using a strongly repelling potential (see Sec. V) along the end-to-end-based CV. We then ran independent simulations with resetting, uniformly sampling the initial conditions from those configurations. The first-passage was defined as settling into any other stable state: B, C, or D. For comparison, we also performed 1000 brute-force unbiased simulations to determine that this process has a MFPT of 102ns and a COV of 1.46. The tail of the FPT distribution is exponential; hence, we use Eq. (4) for the MFPT estimations.

Figure 5(c) shows the MFPT estimations from simulations with resetting with different timers (top). The associated speedups are plotted in the bottom panel. The shortest timer provides speedups of 11 and 18 using 1 and 100 processors, respectively. Using this timer, we estimate the MFPT as 7.9 ns, about an order of magnitude lower than the true value. Larger timers give more accurate results and still lead to accelerations. For instance, using a timer of 20 ns we estimate the MFPT as 92ns with speedups of 2.2 and 3.3 using 1 and 100 processors, respectively. To achieve even higher speedups, resetting could be combined with metadynamics48 or other enhanced sampling methods. However, a kinetics inference scheme that does not assume an underlying exponential FPT distribution for metadynamics must be developed first and left for future work.

To conclude, we presented an inference scheme for non-exponential kinetics from MD simulations accelerated by resetting. Almost all kinetics inference methods for enhanced sampling simulations assume an underlying exponential FPT distribution, but in many cases of interest, this assumption does not hold. Resetting is an especially appealing tool for non-exponential systems since it is guaranteed to expedite processes whose FPT distribution is broader than exponential. Moreover, it does not affect the dynamics between restarts and samples the natural dynamics of the underlying process up to the resetting time. If the FPT distribution has a well-behaved asymptotic decay starting at times that are slightly shorter than the resetting time, we can estimate the tail of the distribution by its behavior at shorter timescales. This, in turn, allows estimating the unbiased MFPT of non-exponential processes using simulations accelerated by resetting.

Our approach becomes increasingly favorable as the number of available processors grows. With standard unbiased simulations, a few trajectories with long FPTs dominate the total walltime for all simulations to end, regardless of the number of available processors. By limiting the trajectories to short timescales using resetting, we use the available resources more efficiently. Compared to sets of unbiased trajectories, we obtain similar accuracy at much shorter simulation times. As acknowledged earlier by the developers of ParRep, parallelizability is an increasingly important quality: while the rapid improvement in computer power does not enable longer simulations per se, it makes trivially parallelizable methods much more appealing.55,56

We applied our method to several analytical FPT distributions, a three-state model potential, and an alanine peptide in explicit water. We obtain speedups of more than an order of magnitude with small statistical errors in the predicted MFPT. In all systems, we rely on the fact that the FPT distribution has either an exponential or power-law tail. However, our approach is much more general and can be employed for other kinds of distributions.

Samples from analytical distributions were obtained using Python. The script is available on the associated GitHub repository. Simulations of the three-state model system were carried out in the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS),61 following the motion of a single particle with a mass of 40 gr mol−1. They were performed in the canonical (NVT) ensemble at a temperature of 300 K, using a Langevin thermostat62 with a friction coefficient of 0.01 fs−1.

We used input files by Ayaz63 for the alanine peptide. The simulations were carried out in GROMACS 2019.6,64 using the Amber03 force field65 for the peptide and the extended simple point-charge (SPC/E) model66 for the solvent. They were performed in the NVT ensemble at 300 K using a stochastic velocity rescaling thermostat.67 Additional simulation settings are as reported in Ref. 60.

The integration time step was 1 fs for both systems. The progress of the systems in the CV space was measured using PLUMED 2.7.1.68–70 For the alanine peptide, we tested whether the system entered a new basin every 1 ps. A first-passage event was considered only if the system was observed in the new basin for at least 5 consecutive tests. We also used PLUMED to restrict the system to state A when obtaining initial configurations. We added a harmonic potential 0.5kx0.522 at x > 0.52 nm, with k = 100 kBT/nm2 and x being the end-to-end-based CV.

Finally, in Figs. 24, we report statistics over 1000 independent sample batches. In Figs. 5 and 6, we present statistics over 1000 bootstrapping sets. The total number of simulations used for bootstrapping and the total number of trajectories ending in first-passage for each timer are given in Tables 1 and 2 of the supplementary material.

The supplementary material contains the following: Results for the hyperexponential distribution with extensive sampling, discussion on the expected walltime with multiple processors, details of the three-state potential, one-dimensional FES of the alanine peptide, and metadata for Figs. 4 and 5.

B.H. acknowledges the support from the Israel Science Foundation (Grant Nos. 1037/22 and 1312/22) and the Pazy Foundation of the IAEC-UPBC (Grant No. 415-2023). This project has received the funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant Agreement No. 947731).

The authors have no conflicts to disclose.

Ofir Blumer: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (lead); Visualization (lead); Writing – original draft (lead). Shlomi Reuveni: Conceptualization (equal); Funding acquisition (supporting); Supervision (supporting); Writing – review & editing (equal). Barak Hirshberg: Conceptualization (equal); Funding acquisition (lead); Resources (lead); Supervision (lead); Writing – review & editing (equal).

Example input files, source data for all plots, and a Python class for the proposed inference method are available in the GitHub repository: github.com/OfirBlumer/nonExponentialKinetics.

1.
O.
Valsson
,
P.
Tiwary
, and
M.
Parrinello
, “
Enhancing important fluctuations: Rare events and metadynamics from a conceptual viewpoint
,”
Annu. Rev. Phys. Chem.
67
,
159
184
(
2016
).
2.
P.
Tiwary
and
M.
Parrinello
, “
From metadynamics to dynamics
,”
Phys. Rev. Lett.
111
,
230602
(
2013
).
3.
M.
Salvalaglio
,
P.
Tiwary
, and
M.
Parrinello
, “
Assessing the reliability of the dynamics reconstructed from metadynamics
,”
J. Chem. Theory Comput.
10
,
1420
1425
(
2014
).
4.
K.
Palacio-Rodriguez
,
H.
Vroylandt
,
L. S.
Stelzl
,
F.
Pietrucci
,
G.
Hummer
, and
P.
Cossio
, “
Transition rates and efficiency of collective variables from time-dependent biased simulations
,”
J. Phys. Chem. Lett.
13
,
7490
7496
(
2022
).
5.
O.
Blumer
,
S.
Reuveni
, and
B.
Hirshberg
, “
Stochastic resetting for enhanced sampling
,”
J. Phys. Chem. Lett.
13
,
11230
11236
(
2022
).
6.
G. M.
Torrie
and
J. P.
Valleau
, “
Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling
,”
J. Comput. Phys.
23
,
187
199
(
1977
).
7.
J.
Kästner
, “
Umbrella sampling
,”
Wiley Interdiscip. Rev.:Comput. Mol. Sci.
1
,
932
942
(
2011
).
8.
A. K.
Faradjian
and
R.
Elber
, “
Computing time scales from reaction coordinates by milestoning
,”
J. Chem. Phys.
120
,
10880
10889
(
2004
).
9.
R.
Elber
, “
Milestoning: An efficient approach for atomically detailed simulations of kinetics in biophysics
,”
Annu. Rev. Biophys.
49
,
69
85
(
2020
).
10.
R.
Elber
,
A.
Fathizadeh
,
P.
Ma
, and
H.
Wang
, “
Modeling molecular kinetics with milestoning
,”
Wiley Interdiscip. Rev.:Comput. Mol. Sci.
11
,
e1512
(
2021
).
11.
Y.
Sugita
and
Y.
Okamoto
, “
Replica-exchange molecular dynamics method for protein folding
,”
Chem. Phys. Lett.
314
,
141
151
(
1999
).
12.
U. H. E.
Hansmann
, “
Parallel tempering algorithm for conformational studies of biological molecules
,”
Chem. Phys. Lett.
281
,
140
150
(
1997
).
13.
L. S.
Stelzl
and
G.
Hummer
, “
Kinetics from replica exchange molecular dynamics simulations
,”
J. Chem. Theory Comput.
13
,
3927
3935
(
2017
).
14.
Y.
Miao
,
V. A.
Feher
, and
J. A.
McCammon
, “
Gaussian accelerated molecular dynamics: Unconstrained enhanced sampling and free energy calculation
,”
J. Chem. Theory Comput.
11
,
3584
3595
(
2015
).
15.
J.
Wang
,
P. R.
Arantes
,
A.
Bhattarai
,
R. V.
Hsu
,
S.
Pawnikar
,
Y.-m. M.
Huang
,
G.
Palermo
, and
Y.
Miao
, “
Gaussian accelerated molecular dynamics: Principles and applications
,”
Wiley Interdiscip. Rev.:Comput. Mol. Sci.
11
,
e1521
(
2021
).
16.
Y.
Miao
,
A.
Bhattarai
, and
J.
Wang
, “
Ligand Gaussian accelerated molecular dynamics (LiGaMD): Characterization of ligand binding thermodynamics and kinetics
,”
J. Chem. Theory Comput.
16
,
5526
5547
(
2020
).
17.
A.
Barducci
,
M.
Bonomi
, and
M.
Parrinello
, “
Metadynamics
,”
Wiley Interdiscip. Rev.:Comput. Mol. Sci.
1
,
826
843
(
2011
).
18.
L.
Sutto
,
S.
Marsili
, and
F. L.
Gervasio
, “
New advances in metadynamics
,”
Wiley Interdiscip. Rev.:Comput. Mol. Sci.
2
,
771
779
(
2012
).
19.
A. F.
Voter
, “
Hyperdynamics: Accelerated molecular dynamics of infrequent events
,”
Phys. Rev. Lett.
78
,
3908
3911
(
1997
).
20.
S. A.
Khan
,
B. M.
Dickson
, and
B.
Peters
, “
How fluxional reactants limit the accuracy/efficiency of infrequent metadynamics
,”
J. Chem. Phys.
153
,
054125
(
2020
).
21.
D.
Ray
and
M.
Parrinello
, “
Kinetics from metadynamics: Principles, applications, and outlook
,”
J. Chem. Theory Comput.
19
,
5649
5670
(
2023
)
22.
O.
Blumer
,
S.
Reuveni
, and
B.
Hirshberg
, “
Short-time infrequent metadynamics for improved kinetics inference
,”
J. Chem. Theory Comput.
20
,
3484
3491
(
2024
).
23.
E.
Wigner
, “
The transition state method
,”
Trans. Faraday Soc.
34
,
29
41
(
1938
).
24.
B.
Peters
, “
Chapter 10–Transition state theory
,” in
Reaction Rate Theory and Rare Events Simulations
, edited by
B.
Peters
(
Elsevier
,
Amsterdam
,
2017
), pp.
227
271
.
25.
H. A.
Kramers
, “
Brownian motion in a field of force and the diffusion model of chemical reactions
,”
Physica
7
,
284
304
(
1940
).
26.
B.
Peters
, “
Chapter 16–Kramers theory
,” in
Reaction Rate Theory and Rare Events Simulations
, edited by
B.
Peters
(
Elsevier
,
Amsterdam
,
2017
), pp.
435
450
.
27.
N.
Mazzaferro
,
S.
Sasmal
,
P.
Cossio
, and
G. M.
Hocky
, “
Good rates from bad coordinates: The exponential average time-dependent rate approach
,”
J. Chem. Theory Comput.
20
,
5901
5912
(
2024
).
28.
M.
Gruebele
, “
Downhill protein folding: Evolution meets physics
,”
C. R. Biol.
328
,
701
712
(
2005
).
29.
H.
Ma
and
M.
Gruebele
, “
Kinetics are probe-dependent during downhill folding of an engineered λ6–85 protein
,”
Proc. Natl. Acad. Sci. U. S. A.
102
,
2283
2287
(
2005
).
30.
F.
Liu
,
D.
Du
,
A. A.
Fuller
,
J. E.
Davoren
,
P.
Wipf
,
J. W.
Kelly
, and
M.
Gruebele
, “
An experimental survey of the transition between two-state and downhill protein folding scenarios
,”
Proc. Natl. Acad. Sci. U. S. A.
105
,
2369
2374
(
2008
).
31.
R.
Moulick
,
R. R.
Goluguri
, and
J. B.
Udgaonkar
, “
Ruggedness in the free energy landscape dictates misfolding of the prion protein
,”
J. Mol. Biol.
431
,
807
824
(
2019
).
32.
I.
Grossman-Haham
,
G.
Rosenblum
,
T.
Namani
, and
H.
Hofmann
, “
Slow domain reconfiguration causes power-law kinetics in a two-state enzyme
,”
Proc. Natl. Acad. Sci. U. S. A.
115
,
513
518
(
2018
).
33.
J.
Gowdy
,
M.
Batchelor
,
I.
Neelov
, and
E.
Paci
, “
Nonexponential kinetics of loop formation in proteins and peptides: A signature of rugged free energy landscapes?
,”
J. Phys. Chem. B
121
,
9518
9525
(
2017
).
34.
M.
Luby
,
A.
Sinclair
, and
D.
Zuckerman
, “
Optimal speedup of Las Vegas algorithms
,”
Inf. Process. Lett.
47
,
173
180
(
1993
).
35.
C.
Gomes
,
B.
Selman
, and
H.
Kautz
, “
Boosting combinatorial search through randomization
,” in
15th AAAI
(
The AAAI Press
,
Madison, WI
,
1998
), pp.
431
437
.
36.
A.
Montanari
and
R.
Zecchina
, “
Optimizing searches via rare events
,”
Phys. Rev. Lett.
88
,
178701
(
2002
).
37.
P. C.
Bressloff
, “
Queueing theory of search processes with stochastic resetting
,”
Phys. Rev. E
102
,
032109
(
2020
).
38.
O. L.
Bonomo
,
A.
Pal
, and
S.
Reuveni
, “
Mitigating long queues and waiting times with service resetting
,”
PNAS Nexus
1
,
pgac070
(
2022
).
39.
M. R.
Evans
and
S. N.
Majumdar
, “
Diffusion with stochastic resetting
,”
Phys. Rev. Lett.
106
,
160601
(
2011
).
40.
L.
Kuśmierz
and
E.
Gudowska-Nowak
, “
Optimal first-arrival times in Lévy flights with resetting
,”
Phys. Rev. E
92
,
052127
(
2015
).
41.
U.
Bhat
,
C. D.
Bacco
, and
S.
Redner
, “
Stochastic search with Poisson and deterministic resetting
,”
J. Stat. Mech.: Theory Exp.
2016
(
8
),
083401
.
42.
A.
Chechkin
and
I.
Sokolov
, “
Random search with resetting: A unified renewal approach
,”
Phys. Rev. Lett.
121
,
050601
(
2018
).
43.
S.
Ray
,
D.
Mondal
, and
S.
Reuveni
, “
Péclet number governs transition to acceleratory restart in drift-diffusion
,”
J. Phys. A: Math. Theor.
52
,
255002
(
2019
).
44.
T.
Robin
,
L.
Hadany
, and
M.
Urbakh
, “
Random search with resetting as a strategy for optimal pollination
,”
Phys. Rev. E
99
,
052119
(
2019
).
45.
M. R.
Evans
and
S. N.
Majumdar
, “
Run and tumble particle under resetting: A renewal approach
,”
J. Phys. A: Math. Theor.
51
,
475003
(
2018
).
46.
A.
Pal
,
L.
Kuśmierz
, and
S.
Reuveni
, “
Search with home returns provides advantage under high uncertainty
,”
Phys. Rev. Res.
2
,
043174
(
2020
).
47.
Y.
Luo
,
C.
Zeng
,
T.
Huang
, and
B.-Q.
Ai
, “
Anomalous transport tuned through stochastic resetting in the rugged energy landscape of a chaotic system with roughness
,”
Phys. Rev. E
106
,
034208
(
2022
).
48.
O.
Blumer
,
S.
Reuveni
, and
B.
Hirshberg
, “
Combining stochastic resetting with metadynamics to speed-up molecular dynamics simulations
,”
Nat. Commun.
15
,
240
(
2024
).
49.
A.
Pal
,
S.
Kostinski
, and
S.
Reuveni
, “
The inspection paradox in stochastic resetting
,”
J. Phys. A: Math. Theor.
55
,
021001
(
2022
).
50.
M. R.
Evans
,
S. N.
Majumdar
, and
G.
Schehr
, “
Stochastic resetting and applications
,”
J. Phys. A: Math. Theor.
53
,
193001
(
2020
).
51.
A.
Pal
and
S.
Reuveni
, “
First passage under restart
,”
Phys. Rev. Lett.
118
,
030603
(
2017
).
52.
I.
Eliazar
and
S.
Reuveni
, “
Mean-performance of sharp restart I: Statistical roadmap
,”
J. Phys. A: Math. Theor.
53
,
405004
(
2020
).
53.
C.
Forbes
,
M.
Evans
,
N.
Hastings
, and
B.
Peacock
, “
Pareto distribution
,” in
Statistical Distributions
(
John Wiley & Sons, Ltd.
,
2010
), Chap. 34, pp.
149
151
.
54.
A. F.
Voter
, “
Parallel replica method for dynamics of infrequent events
,”
Phys. Rev. B
57
,
R13985
(
1998
).
55.
D.
Perez
,
B. P.
Uberuaga
, and
A. F.
Voter
, “
The parallel replica dynamics method–Coming of age
,”
Comput. Mater. Sci.
100
,
90
103
(
2015
), part of the Special Issue: Advanced Simulation Methods.
56.
D.
Perez
,
E. D.
Cubuk
,
A.
Waterland
,
E.
Kaxiras
, and
A. F.
Voter
, “
Long-time dynamics through parallel trajectory splicing
,”
J. Chem. Theory Comput.
12
,
18
28
(
2016
).
57.
S.
Reuveni
, “
Optimal stochastic restart renders fluctuations in first passage times universal
,”
Phys. Rev. Lett.
116
,
170601
(
2016
).
58.
R.
Nevo
,
V.
Brumfeld
,
R.
Kapon
,
P.
Hinterdorfer
, and
Z.
Reich
, “
Direct measurement of protein energy landscape roughness
,”
EMBO Rep.
6
,
482
486
(
2005
).
59.
P.
Wolynes
,
Z.
Luthey-Schulten
, and
J.
Onuchic
, “
Fast-folding eriments and the topography of protein folding energy landscapes
,”
Chem. Biol.
3
,
425
432
(
1996
).
60.
C.
Ayaz
,
L.
Tepper
,
F. N.
Brünig
,
J.
Kappler
,
J. O.
Daldrop
, and
R. R.
Netz
, “
Non-Markovian modeling of protein folding
,”
Proc. Natl. Acad. Sci. U. S. A.
118
,
e2023856118
(
2021
).
61.
A. P.
Thompson
,
H. M.
Aktulga
,
R.
Berger
,
D. S.
Bolintineanu
,
W. M.
Brown
,
P. S.
Crozier
,
P. J.
in ’t Veld
,
A.
Kohlmeyer
,
S. G.
Moore
,
T. D.
Nguyen
,
R.
Shan
,
M. J.
Stevens
,
J.
Tranchida
,
C.
Trott
, and
S. J.
Plimpton
, “
LAMMPS–A flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales
,”
Comput. Phys. Commun.
271
,
108171
(
2022
).
62.
T.
Schneider
and
E.
Stoll
, “
Molecular-dynamics study of a three-dimensional one-component model for distortive phase transitions
,”
Phys. Rev. B
17
,
1302
1322
(
1978
).
63.
C.
Ayaz
(
2021
). “
Non-Markovian modeling of protein folding
,” Freie Universität Berlin. http://dx.doi.org/10.17169/refubium-29935
64.
M. J.
Abraham
,
T.
Murtola
,
R.
Schulz
,
S.
Páll
,
J. C.
Smith
,
B.
Hess
, and
E.
Lindahl
, “
GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
,”
SoftwareX
1–2
,
19
25
(
2015
).
65.
Y.
Duan
,
C.
Wu
,
S.
Chowdhury
,
M. C.
Lee
,
G.
Xiong
,
W.
Zhang
,
R.
Yang
,
P.
Cieplak
,
R.
Luo
,
T.
Lee
et al, “
A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations
,”
J. Comput. Chem.
24
,
1999
2012
(
2003
).
66.
H. J. C.
Berendsen
,
J. R.
Grigera
, and
T. P.
Straatsma
, “
The missing term in effective pair potentials
,”
J. Phys. Chem.
91
,
6269
6271
(
1987
).
67.
G.
Bussi
,
D.
Donadio
, and
M.
Parrinello
, “
Canonical sampling through velocity rescaling
,”
J. Chem. Phys.
126
,
014101
(
2007
).
68.
M.
Bonomi
,
D.
Branduardi
,
G.
Bussi
,
C.
Camilloni
,
D.
Provasi
,
P.
Raiteri
,
D.
Donadio
,
F.
Marinelli
,
F.
Pietrucci
,
R. A.
Broglia
, and
M.
Parrinello
, “
PLUMED: A portable plugin for free-energy calculations with molecular dynamics
,”
Comput. Phys. Commun.
180
,
1961
1972
(
2009
).
69.
G. A.
Tribello
,
M.
Bonomi
,
D.
Branduardi
,
C.
Camilloni
, and
G.
Bussi
, “
PLUMED 2: New feathers for an old bird
,”
Comput. Phys. Commun.
185
,
604
613
(
2014
).
70.
PLUMED consortium
, “
Promoting transparency and reproducibility in enhanced molecular simulations
,”
Nat. Methods
16
,
670
673
(
2019
).