Methods derived from the generalized quantum master equation (GQME) framework have provided the basis for elucidating energy and charge transfer in systems ranging from molecular solids to photosynthetic complexes. Recently, the nonperturbative combination of the GQME with quantum-classical methods has resulted in approaches whose accuracy and efficiency exceed those of the original quantum-classical schemes while offering significant accuracy improvements over perturbative expansions of the GQME. Here, we show that, while the non-Markovian memory kernel required to propagate the GQME scales quartically with the number of subsystem states, the number of trajectories required scales at most quadratically when using quantum-classical methods to construct the kernel. We then present an algorithm that allows further acceleration of the quantum-classical GQME by providing a way to selectively sample the kernel matrix elements that are most important to the process of interest. We demonstrate the utility of these advances by applying the combination of Ehrenfest mean field theory with the GQME (MF-GQME) to models of the Fenna-Matthews-Olson (FMO) complex and the light harvesting complex II (LHCII), with 7 and 14 states, respectively. This allows us to show that the MF-GQME is able to accurately capture all the relevant dynamical time scales in LHCII: the initial nonequilibrium population transfer on the femtosecond time scale, the steady state-type trapping on the picosecond time scale, and the long time population relaxation. Remarkably, all of these physical effects spanning tens of picoseconds can be encoded in a memory kernel that decays only after ∼65 fs.
I. INTRODUCTION
Theoretical modeling of processes ranging from energy transfer in photosynthetic light harvesting complexes and organic electronics to proton coupled electron transfer requires the development of methods that allow for the accurate and efficient calculation of the dynamics of open quantum systems, i.e., a subsystem of interest coupled to an external environment. Generalized quantum master equations (GQMEs) provide a formally exact way of rewriting the equation of motion of a set of degrees of freedom of particular interest via the use of the appropriate projection operator.1–3 For example, in open quantum systems, the projection operator is frequently chosen to encompass all the states of the electronic subsystem, although this choice need not align with simple partitions at the level of the Hamiltonian, i.e., the projector could contain only a subset of electronic states or even include some nuclear degrees of freedom. The effect of the remaining degrees of freedom on the dynamics of those of interest is encoded through the memory kernel.1,2,4 However, evaluating the memory kernel requires overcoming the complications that arise from the presence of the projected propagator.
A common approach to avoid the projected propagator is to treat the memory kernel using second order perturbation theory. This leads to methods including Markovian and non-Markovian Redfield theory and its variants,5–8 the noninteracting blip approximation (NIBA),9,10 Förster theory,11 and the polaronic quantum master equation,12–15 which have been used to treat a wide variety of multistate electronic energy transfer systems. In these approaches, the memory kernel depends only on correlation functions of the isolated system and bath and hence can be straightforwardly generated even for systems with many subsystem states.16,17 However, these approaches are limited in their applicability to systems where the perturbative parameter remains small.
More recently, it has been shown that the projected propagator can be exactly removed by employing the Dyson identity, leading to a representation of the full (nonperturbative and non-Markovian) memory kernel in terms of unprojected correlation functions of the system and bath.18 This allows one to gain accurate results over a much wider range of regimes than perturbative master equations, albeit at the expense of requiring cross-correlation functions of system and bath operators to generate the memory kernel. Evaluating these cross-correlation functions using trajectory-based quantum-classical approaches with the GQME formalism has been shown to give considerable increases in both accuracy and efficiency compared to using the same quantum-classical method alone.19–25
However, while these studies showed the efficacy of such a combination in systems containing two subsystem states, for a system with Ns subsystem states, the reduced density matrix (RDM) has elements and hence the size of the memory kernel needed to propagate it grows as . One might therefore imagine that combining quantum-classical methods with the GQME would be limited to problems with small numbers of subsystem states. Here, we demonstrate that for quantum-classical methods that treat the subsystem using wavefunctions, e.g., Ehrenfest mean field theory (MFT)26–28 and fewest switches surface hopping (FSSH),29–31 the full memory kernel can be obtained using only trajectories.
We then show how, in cases where one is interested in particular (or a small set of) subsystem initial conditions, one can further improve the efficiency by focusing resources on accurately capturing the matrix elements of the memory kernel that most strongly determine the evolution of a given initial condition. We achieve this by noting that, while the fully converged memory kernel contains the information necessary to propagate the GQME from any subsystem initial condition contained in the projector, often many of these may not be of interest. Here, we introduce an algorithm that exploits this observation to identify and selectively converge the kernel elements that are most important to capture the relaxation from a given subsystem initial condition. This allows one to minimize the computational effort expended on the elements that only exert a minor influence on the process of interest.
We then illustrate the scalability of the quantum-classical GQME to multistate systems by applying the mean field GQME (MF-GQME) method21 to treat the 7-state Fenna-Matthews-Olson (FMO) complex32,33 and the 14-state light harvesting complex II (LHCII).34–36 We show that the MF-GQME gives quantitative accuracy in regimes where direct MFT and modified Redfield theory fail, while remaining less expensive than a direct application of MFT even in these larger systems. This development thus opens the door to the application of the quantum-classical GQME approaches to multistate systems coupled to atomistic environments that lie beyond linear system-bath coupling and harmonic environments.22
II. THEORY
To show how the GQME formalism can be efficiently used with trajectory-based quantum dynamics methods in multistate systems, here, we give the general expression for the memory kernel for a system with arbitrary system-bath coupling. By analyzing the form of this general expression, we demonstrate that only trajectories are required to calculate the memory kernel in the MF-GQME method in a system with Ns subsystem states.
To begin, we consider a Hamiltonian of the general form,
where Ĥs is the isolated subsystem Hamiltonian, Ĥb is the isolated bath Hamiltonian, and Ĥsb contains all the terms that couple the system and the bath. For any Ĥsb, one can write it such that it is in the form of a sum of direct products of system and bath operators,
where is a pure subsystem operator, is a pure bath operator, and Nsb is the number of terms in the sum.
Here, we consider initial conditions for the full system density operator that are of the spectroscopic form, , where encodes the initial state of the subsystem and is the equilibrium bath density operator, . While this choice of initial condition simplifies the expressions for the memory kernel, we emphasize that the scaling arguments presented here do not depend on this choice and are straightforward to generalize to any factorizable initial condition.
The quantity of interest is the RDM, , from which any subsystem observable, such as the population relaxation or electronic spectra, can be generated. In a given basis, one can write the RDM as
where , which is a set of subsystem operators that span the subsystem Liouville space and denotes a particular matrix element of the RDM. These matrix elements can be expressed as quantum mechanical expectation values,
In the Mori-Nakajima-Zwanzig formalism, the dynamics of the RDM can be obtained by integrating the GQME,1–4,16
where ρs(t) is a row vector with time-dependent elements and and are matrices of size and , respectively. The static matrix = ensures that the GQME exactly recovers the free subsystem evolution when the memory kernel, , is zero, i.e., when the subsystem and bath do not interact. The memory kernel, which encodes the effect of the bath on the subsystem dynamics, can be constructed using correlation functions in the full system and bath Hilbert space. Using the Dyson decomposition of the propagator,37 one obtains18,23
Here, and are auxiliary kernels, which can be obtained using one’s choice of dynamical method, and the subscripts 1 and 3 are used to be consistent with earlier work.18–23 For any system-bath coupling, and can be written as linear combinations of correlation functions of system and bath operators of the form (see Appendix A),
Here, like the memory kernels, c(j,±), , and are matrices of size . The transformation matrices c(j,±) are time-independent and simple to evaluate analytically, with their elements given by
where [·, ·]− denotes the commutator and [·, ·]+ the anticommutator. The matrix elements of the correlation functions and are (see Appendix A)
Equations (7) and (8) provide a formal basis for the calculation of the memory kernel, in which one obtains the auxiliary kernels (t) and (t) by evaluating c(j,±) exactly and calculating and using one’s choice of exact or approximate dynamical method. These quantities are then used to obtain the auxiliary kernels and using Eqs. (7) and (8), and the full memory kernel is obtained using Eq. (6) (see Appendix B). In Sec. II A, we use these expressions to analyze the scaling with the number of subsystem states.
A. Scaling of the MF-GQME method with number of subsystem states
In the MF-GQME method, one approximates the correlation functions defined in Eqs. (10) and (11) using MFT (see Appendix C). The memory kernel can then be used to solve the GQME for the dynamics of the RDM, given by Eq. (5). In practice, the computational cost of generating the full memory kernel using Eqs. (6)–(8) and integrating the GQME is negligible compared to the cost of generating the MFT trajectories necessary to calculate and . The computational cost is therefore determined by the number of MFT trajectories required to evaluate and , and how this cost scales with the number of subsystem states.
The number of correlation functions that need to be calculated to evaluate and using Eqs. (10) and (11) can at first appear formidable, as the indices m and n in these equations each run over elements. This might lead one to erroneously conclude that the MF-GQME method scales as , which would severely limit its applicability to treat problems with many subsystem states. However, closer inspection of these expressions in the context of MFT dynamics reveals that one only needs to sample the distinct electronic initial conditions, labeled by the index m, to calculate all the necessary correlation functions. This is because the MFT trajectories generated from a given electronic initial condition, , provide the time-dependent observables necessary to calculate all of the correlation functions corresponding to that initial condition. Consequently, since only electronic initial conditions need to be sampled, only quantum-classical trajectories are required. In practice, for MFT, the number of initial conditions can be shown to be by following the procedure outlined in Appendix D, which exploits the fact that the correlation functions corresponding to a given off-diagonal initial condition and its Hermitian conjugate can be obtained from the same set of trajectories.
In general, evaluating forces dominates the overall cost, especially in atomistic systems or systems with a large number of bath degrees of freedom. However, the calculation of the correlation functions required scales as and integration of the GQME scales as which could dominate the cost for large Ns. However, even for the systems studied here which use very simple forms of the bath, the cost of integrating the GQME is negligible.
An additional concern regarding the scalability of the MF-GQME approach comes from the amount of memory required to store all elements in and . For instance, the number of time-dependent elements in can be calculated as the product of initial electronic conditions , Nsb bath operators measured at t = 0, and electronic operators Ân(t) and Nsb bath operators measured at finite times. This corresponds to time dependent elements that need to be stored in memory, and hence one might expect the amount of available memory to be a significant limitation. In practice, however, this problem can be easily circumvented because and do not themselves need to be stored in memory. Instead, one can evaluate their contributions to and on the fly using Eqs. (7) and (8) so that only and need to be stored. In addition, because not all of the matrix elements are independent, there are only unique elements in each, with the remainder being complex conjugates of these elements.18 The amount of memory required to store the kernels is therefore proportional to , where Nsteps is the number of time steps before the kernel decays. In practice, when the memory kernel decays quickly to zero Nsteps is a small number, meaning that negligible storage is required.
B. Accelerating convergence of the GQME using selective sampling of kernel matrix elements
Once the memory kernel has been calculated, solving the GQME, Eq. (5), provides access to the subsystem dynamics for any factorizable initial condition of the subsystem RDM. However, in many cases, one is only interested in a particular initial condition, or a set of them. For example, one may be interested in the relaxation dynamics resulting from the initial excitation of a particular chromophore in a molecular system consisting of multiple chromophores. Here it is important to make a distinction between the initial condition of interest at the RDM level and the electronic initial conditions that need to be sampled in the MFT calculation of the memory kernel. At the memory kernel level, all separable initial conditions (e.g., initial excitation of each chromophore and coherence between chromophores) need to be sampled to construct the full memory kernel. It is this complete sampling of the initial conditions at the memory kernel level that ensures that one can use Eq. (5) to propagate the RDM dynamics from any separable initial condition.
However, as in the case mentioned above, suppose that one is only interested in propagating the RDM starting from one particular initial condition (or a small subset of initial conditions) in a system with a large number of subsystem states. In this case, there may be many pathways that do not play a role in the relaxation dynamics from the RDM initial condition of interest. Hence, it is clearly unnecessary to have accurate knowledge of the memory kernel elements that describe these unimportant relaxation pathways.
One can exploit this realization by focusing resources into converging the elements of the memory kernel that are most important to describe the relaxation process of interest. When the memory kernel elements are generated using MFT, this corresponds to identifying and selectively converging the most important correlation functions from the sets and [defined in Eqs. (10) and (11)], which give rise to the memory kernel. As such, here we present an algorithm that allows one to use fewer trajectories to generate the less important correlation functions while minimizing the resulting loss in accuracy for the RDM.
Any algorithm that allows for the selective convergence of the memory kernel elements should satisfy certain requirements. In particular, after specifying the RDM-level initial condition that one wishes to propagate, the algorithm should provide a prescription for apportioning a fixed total number of quantum-classical trajectories, Ntot, among the different initial conditions that need to be sampled at the memory kernel level. Specifically, the algorithm should provide a set of normalized weights, , that determine the portion of trajectories Ntot assigned to a given memory kernel level initial condition, . These weights should be chosen such that, when the time-evolved RDM is propagated from the specified initial condition, the error that arises from the statistical error in the memory kernel is minimized. In addition, the calculation of these weights should take into account two important observations. First, correlation functions with certain memory kernel-level initial conditions may contribute more to the relevant relaxation pathways than others. Second, correlation functions arising from different memory kernel-level initial conditions may require different numbers of trajectories to achieve reasonable levels of convergence.
To calculate the optimal weights, some knowledge of the relaxation pathway and current level of convergence of the memory kernel is crucial. Here, we suggest an iterative approach, where increasingly improved estimates for the memory kernel and the RDM dynamics are used to recompute the weights at each iteration. An overview of the algorithm is shown in Fig. 1. We note that while here we present one particular form for the weights, the approach used here is general and does not depend on this particular choice.
To provide a form for the weights, , we consider how a given realization of the RDM dynamics, which we call the trial RDM dynamics, deviates from its converged form, . Although in practice one does not know the converged dynamics , considering how an unconverged dynamics may deviate from its converged form will allow us to elucidate how the convergence of the dynamics can be accelerated via selective sampling. Our goal is to reduce the absolute value of this difference from the converged dynamics for a given number of trajectories, Ntot. Because the dynamics, ρs(t), arise from integrating Eq. (5), one can minimize the absolute difference between the time-derivative of the trial RDM and its converged form, . Because this error metric is time dependent and different for each element of the density, it is convenient to consider the mean absolute deviation, which is the absolute value of δρs(t), averaged over time and elements of the density matrix,
where tmax is the maximum time that is of interest of the dynamics.
Because there is no error in the initial condition [i.e., δρs(0) = 0], one can minimize the error at subsequent times by minimizing the error in the time-derivative of the RDM, . Again, it is convenient to consider the mean absolute deviation in this quantity, which is
An approximate upper bound of this quantity is (see Appendix E)
where the index m runs over all initial conditions. The selective sampling error metric (see also Appendix E) is approximately proportional to the error in the dynamics at time t that arises from finite sampling of correlation functions with initial condition m. For the models considered here, this quantity is
where Nm is the number of trajectories that have been run in all previous iterations of the algorithm with memory kernel level initial condition m. The quantity is given by Eq. (E10) in Appendix E.
Motivated by the above considerations, the computed weights in our algorithm depend on ρs(t), as well as the statistical error of the correlation functions that contribute to a specific elements of the memory kernel. By distributing Ntot trajectories over nstep steps in increments of ΔN = Ntot/nstep to different GQME-level initial conditions, we refine our estimate for these weights iteratively at each step, as described below. We repeat this procedure either until the RDM converges to a desired threshold or until Ntot trajectories have been run.
Using the definitions provided above, we define the un-normalized weight, am, as
The normalized weight is then
Using these weights, we then apportion trajectories to each memory kernel level initial condition in the subsequent cycle so that
where j is the index of the current iteration of the algorithm (for example, in the first iteration of the algorithm j = 1). is therefore the number of trajectories that have been assigned to initial condition m at step j. The algorithm can be summarized as follows:
Initialize the algorithm by setting and .
Use to integrate the GQME [Eq. (5)] for some time range tmax, yielding ρs(t).
Use the current guess for and ρs(t) to compute the weights, , corresponding to a given memory kernel-level initial condition using Eqs. (15)–(17).
Distribute a number of MFT trajectories, ΔN, among all memory kernel-level initial conditions according to Eq. (18).
Using the correlation functions and obtained from all MFT trajectories that have been run so far, rebuild using Eqs. (6)–(11).
Calculate using the standard deviations in the correlation functions (see Appendix E).
Calculate the mean absolute deviation between the previous and the current iteration of ρs(t). If the maximum number of trajectories, Ntot is exceeded, or if the mean absolute deviation falls below the predetermined threshold, stop. Otherwise, return to step 2.
A plot of the number of trajectories assigned to each initial condition for the LHCII model discussed in Sec. IV is available in the supplementary material.
III. SIMULATION DETAILS
A. Hamiltonian
Here, we apply the MF-GQME to the Frenkel exciton Hamiltonian.32,33 This model Hamiltonian forms the basis of many studies of electronic energy transfer in photosynthetic light harvesting systems.38,39 In addition to providing physical insight into these systems, the Frenkel exciton model’s simplicity, i.e., its bilinear coupling to harmonic baths, allows for exact results to be calculated using, for example, the hierarchical equations of motion (HEOM) approach.40 Although GQMEs, and in particular MF-GQME, are not limited to any specific form of the Hamiltonian, the availability of exact results for this model system allows us to benchmark the accuracy and efficiency of the MF-GQME for multistate problems. The Frenkel exciton Hamiltonian is of the general form given in Eq. (1), with the subsystem Hamiltonian taking the form,
where E(k) is the energy of diabatic state |k⟩ and Δkk′ is the electronic coupling between states |k⟩ and |k′⟩. Each diabatic state is bilinearly coupled to a bath of Nosc independent harmonic oscillators,
where j indexes the bath degrees of freedom, cj,k is the coupling constant for a given bath oscillator, and is the position operator for that oscillator, and in the final equality we define to be the bath part of the system-bath coupling to state |k⟩. The bath therefore consists of Ns × Nosc independent harmonic oscillators,
where is the momentum operator of the bath mode, and ωj,k is its frequency, and the Hamiltonian is given in mass-weighted coordinates and momenta. The coupling of each diabatic state to its local bath is characterized by its spectral density,
For the FMO model, it is common to assume that all spectral densities are equivalent and take the Debye form,
where λ is the reorganization energy that defines the strength of the system-bath coupling and ωc is the inverse of the characteristic time scale of the bath. For the LHCII model, we adopt a sum of shifted Drude-Lorentz peaks,41
where we use the parameters of Ref. 41: Npeaks = 7, fs, , and .
B. Computational methods
For the FMO model, the spectral density was discretized with frequencies,
For the LHCII model, the spectral density was discretized by numerically solving for ωj that satisfies
where λtot = ∑kλk. For both Eqs. (25) and (26), ωj is obtained by taking j = 1, 2, …, Nosc. For both models, the couplings of the discrete oscillators are given by
For Ehrenfest trajectories, the equations of motion were solved using a split evolution method, where at each dynamical time step, the bath is evolved through a half time step of using mean field theory (see Appendix C) with the velocity Verlet algorithm.42 The system was then evolved for a full time step Δt by numerically diagonalizing Hs + Hsb(R, P), holding the bath positions and momenta constant. The bath was then evolved for another half time step of using velocity Verlet, keeping the state of the electronic subsystem constant. For the FMO model, the time step used was Δt = 0.25 fs, while for LHCII the time step was Δt = 0.05 fs.
The partial memory kernels were constructed using Eqs. (7)–(11), with the necessary correlation functions in Eqs. (10) and (11) approximated using mean field theory as described in Appendix C. The memory kernel was calculated at a lower time resolution (0.5 fs) than the direct MFT dynamics used to generate the underlying correlation functions, Eqs. (10) and (11), which was observed not to change the resulting GQME dynamics. Additional details on the specific form these equations in the context of the Frenkel exciton model are given in Appendix A. Plots of selective memory kernel matrix elements that relate to transitions between the states that have significant populations for FMO and LHCII are available in the supplementary material.
Typically, the memory kernel decays to zero after a short amount of time and hence can be truncated (set equal to zero) at all times greater than this time. For the FMO model, this kernel cutoff time, τc, was chosen by increasing the cutoff until the dynamics ceased to change, with τc between 125 and 300 fs giving very similar results. In practice, larger values of τc require more trajectories to converge. For the dynamics of the FMO model shown in this work, we used τc = 150 fs. For the LHCII model, the dynamics remain unchanged for τc between 50 and 65 fs. To stably integrate the MF-GQME dynamics with cutoff times greater than 65 fs requires a large number of trajectories to achieve stable dynamics. This is because the LHCII complex is in a more strongly coupled regime, which gives rise to highly oscillatory memory kernels that require a very large number of trajectories to converge. Such oscillatory memory kernels have been observed for the dynamics of the spin-boson model with the Debye spectral density in intermediate to strong coupling regimes.23 For all LHCII dynamics shown, we used τc = 65 fs. The instabilities observed for LHCII might be able to be alleviated by employing alternative closures23,43,44 or by using higher level quantum-classical methods to generate the memory kernel.20,24
After calculating the partial memory kernels, the GQME, Eq. (5), was integrated using Heun’s method,45 with the integral in Eq. (5) approximated using the trapezoid rule.
Because the sampling of bath initial conditions from which the correlation functions are generated is stochastic, there is variability in how many trajectories are required to converge the memory kernel for a given realization of uniform or selective sampling. To evaluate our selective sampling algorithm on the LHCII model, we performed 50 separate realizations of the selective sampling algorithm, as well as 50 separate realizations of uniform sampling. All GQME dynamics shown are from the realization with the median error in the dynamics, where error in the dynamics is quantified by the mean absolute deviation in the subsystem density matrix from its converged form, Eq. (12). The converged form for LHCII corresponds to 65 × 106 total trajectories, apportioned equally across all initial conditions. A plot that shows the distribution of errors of different realizations of the selective sampling procedure as a function of number of trajectories added is shown in the supplementary material. The selective sampling procedure was performed by adding a total of ΔN = 100 trajectories at each iteration.
IV. RESULTS AND DISCUSSION
Here, we apply the MF-GQME method to electronic energy transfer in Frenkel exciton models of the FMO and LHCII light harvesting complexes. The FMO model consists of Ns = 7 subsystem states, whereas Ns = 14 for LHCII. Therefore, for FMO, there are 74 = 2401 time dependent matrix elements in the memory kernel, while, for LHCII, there are 144 = 38 416. However, as described in Sec. II A, all of the matrix elements can be obtained using very short simulations starting from 28 initial conditions for FMO or 105 distinct initial conditions for LHCII.
Figure 2 compares the direct MFT and MF-GQME dynamics to exact HEOM results41 for the population relaxation following an initial excitation for the FMO model at low temperatures in a regime of high nonadiabaticity (ωc ≳ Δkk′ for most pairs of states). As observed in previous work for the FMO model,46 direct MFT gives qualitatively correct coherent oscillations at short times but fails to recover the correct long-time populations. This can be explained by the high nonadiabaticity and the low temperature bath, both of which are problematic for MFT. In contrast, the MF-GQME produces highly accurate results, consistent with previous observations that this scheme offers significant improvements over direct MFT in nonadiabatic regimes.21–23
For this nonadiabatic problem, the memory kernel decays by a cutoff time of τc = 0.15 ps, which is considerably shorter than the 1–2 ps lifetime of the population dynamics. However, because the MF-GQME requires sampling 28 distinct initial conditions to generate the memory kernel, uniform sampling of all initial conditions in this regime, i.e., when trajectories are apportioned equally to each initial condition, requires a total of 105 total trajectories (∼3500 per initial condition) to converge the memory kernel. This is in contrast to the ∼104 trajectories required to obtain a similar level of convergence with direct MFT. Hence, despite the shorter lifetime of the kernel, the larger number of trajectories required to generate the kernel means that the total cost of generating the MF-GQME dynamics shown in Fig. 2 is ∼1.5 times greater than that of MFT. Moreover, because the memory kernel provides the dynamics for all factorizable initial conditions of the subsystem RDM, the dynamics resulting from the two distinct initial subsystem excitations shown in Figs. 2(b) and 2(d) are generated using the same memory kernel. In contrast, Figs. 2(a) and 2(b), which correspond to direct MFT dynamics, require separate sets of MFT trajectories initialized from the two different subsystem initial conditions.
An additional advantage of the MF-GQME scheme is that, once the memory kernel is converged, one can obtain dynamics for arbitrarily long times by integrating Eq. (5). In practice, the GQME propagation can be done at negligible computational cost compared to the costs of generating MFT trajectories. Figure 3 shows the long time population dynamics for direct MFT and MF-GQME following the initial excitation of state 6 in the same parameter regime as Fig. 2. Because of the prohibitive cost of converging the exact dynamics for such long times, the arrows in Fig. 3 correspond to the Boltzmann equilibrium populations that would be expected based on the subsystem Hamiltonian, . As expected, due to the well-known detailed balance problems in methods that rely on a mean field description of the interaction between the electronic subsystem and the nuclei,47–49 MFT incorrectly predicts that all states have nearly equal population at long times, in qualitative disagreement with the expected result. In contrast, the MF-GQME accurately captures the expected long-time populations. Furthermore, to converge the MFT dynamics out to the 20 ps shown in Fig. 3 requires significant additional computational effort (at least 20 times more than for 1 ps), while in the GQME approach, Eq. (5) can be integrated for arbitrarily long times using the same memory kernel as Fig. 2. Because integrating the GQME for longer times incurs negligible additional cost, the dynamics shown in Fig. 3 are ∼13 times cheaper when generated with the MF-GQME as compared with direct MFT.
A more stringent test of the MF-GQME method is the LHCII model, which has significantly more subsystem states each of which is also more strongly coupled to the bath degrees of freedom. The combination of a large number of states, stronger system-bath coupling, and the complex form of the spectral density of LHCII presents a significant challenge to exact methods. Hence, very limited benchmarks are available using the HEOM method and these required substantial computational effort and some simplification of the spectral density.41 Here, we use the form and parameterization of the LHCII Hamiltonian of Ref. 41 with the J7 spectral density from that reference, and compare against the HEOM results presented therein. These HEOM results relied on the high temperature approximation,41 and hence, while they are expected to be highly accurate, it remains unclear whether they are numerically exact.
Figure 4(a) shows the dynamics for the sum of the populations of all Chl b chromophores for the LHCII model following the initial excitation of the highest energy eigenstate of the subsystem Hamiltonian. For this model, the exact HEOM dynamics exhibit an initial fast (subpicosecond) decay in the population of Chl b chromophores followed by a longer time component which decays over tens of picoseconds. Direct MFT fails to capture the magnitude of the short time drop in the population of Chl b chromophores but qualitatively reproduces the longer time scale of population transfer out of this set of chromophores. As previously observed, the modified Redfield dynamics, which is a major workhorse in the modeling and investigation of the excitation dynamics in photosynthetic systems, shows erroneous rapid decay within the first 1–2 ps.41 The MF-GQME, however, yields accurate short and long time behavior, in good agreement with the HEOM result. This behavior is mirrored in the populations of the Chl 604a and Chl 605b states in Figs. 4(b) and 4(c) which contribute to the Chl b decay shown in (a). Direct MFT underestimates the transfer into Chl 604a while overestimating that into Chl 605b, while modified Redfield theory leads to spuriously fast population decay of these states. In contrast, the MF-GQME accurately captures the HEOM result.
Figure 5 shows the long-time dynamics obtained from the MF-GQME using the same initial condition. In addition to capturing the exact HEOM dynamics at short times, the MF-GQME also obtains the intermediate time scale (picosecond) trapping in Chl 604a and Chl 605b bottleneck states, and the long time scale (tens of picoseconds) relaxation to the Chl 610a-611a-612a trimer, the Chl 613a-614a dimer, and the Chl 602a-603a dimer.34
Because of the stronger system-bath coupling of the LHCII model in comparison to the FMO complex, converging the memory kernel using uniform sampling of all 105 distinct initial conditions would require approximately 65 × 106 total trajectories. Although this number of trajectories is feasible for simple models, it is intractable when more sophisticated atomistic descriptions of the environment are used. The fact that LHCII consists of a large number of subsystem states makes it a prime candidate to benefit from using selective sampling to accelerate convergence. As such, we now consider the computational speedups that can be obtained by applying the selective sampling algorithm introduced in Sec. II B to this model. To demonstrate the utility of the selective sampling scheme, we thus compare the convergence of the GQME dynamics for a fixed total number of trajectories when using the selective sampling and the uniform sampling approaches to compute the memory kernel. See the supplementary material for a plot showing the average number of trajectories assigned to each memory-kernel level initial condition at each step of the algorithm.
Figure 6 shows the populations for the median realizations of selective and uniform sampling at 100 000 and 200 000 trajectories. Already, at 100 000 trajectories, the selective sampling yields qualitatively correct dynamics, while the uniformly sampled memory kernel is not numerically stable, with the populations several chromophores becoming negative by t = 1.0 ps, and the unphysically high frequency and large amplitude oscillations seen in (a) are caused by numerical instability. By 200 000 trajectories, selective sampling is already in semiquantitative agreement with the exact result, while uniform sampling produces negative populations only after ∼0.5 ps. In fact, the uniform sampling approach fully converges only after ∼65 000 000 trajectories. Hence, in this regime, the selective sampling algorithm is approximately 300 times more efficient than uniform sampling.
V. CONCLUSIONS
In this paper, we have shown how the MF-GQME approach can be efficiently extended to treat systems where the electronic subspace consists of many states. By analyzing the symmetries of the memory kernel, we have demonstrated that, for a system consisting of Ns subsystem states, the memory kernel, despite having matrix elements, can be generated using only quantum-classical trajectories. This scaling analysis is directly applicable to quantum-classical methods using wavefunction-based treatments of the electronic subsystem dynamics, such as MFT and FSSH. The same scaling arguments also apply to methods of generating the kernel using only the original dynamics23,24,50 and to other methods that involve a time convolution master equation such as the transfer tensor method.51,52 This is in contrast to methods53–60 based on the Meyer-Miller-Stock-Thoss (MMST) mapping,61,62 where the initial conditions necessary to generate all memory kernel elements could be recovered by an appropriate weighting of the variables drawn from a single Gaussian distribution of MMST variables.19,24
To further reduce the cost of generating the memory kernel necessary to propagate an RDM subject to specific initial conditions, we have outlined a general procedure for how one can select an optimal set of quantum-classical trajectories to converge the kernel. Employing a specific realization of a selective sampling procedure on the LHCII model that focuses resources on initial conditions that mostly contribute to the fast convergence of the memory kernel, we have demonstrated that one can obtain further 2 orders of magnitude speed-up. While the specific procedure used here is one possible choice, alternative ways of apportioning trajectories to specific initial conditions could lead to more efficient selective sampling schemes and represent a promising direction for future work.
By applying the MF-GQME to models of excitation energy transfer in the FMO and LHCII complexes, which consist of 7 and 14 electronic states, respectively, we have shown that this method can provide accurate results for multistate systems for both short-time dynamics and long-time populations at the computational cost of low-level quantum classical trajectory-based methods. Furthermore, when memory kernels are short-lived in comparison to the electronic dynamics of interest, the MF-GQME approach usually costs less than a direct application of MFT. Moreover, once the memory kernel is generated up to the required cutoff time, τc, the GQME can be propagated with linear scaling in time, making long-time dynamics easy to access.
To date, the MF-GQME has previously proven successful in treating the dynamics of systems coupled to harmonic21,23,25 or fully atomistic22 environments, both in25 and out21–23 of equilibrium. In the present work, we have demonstrated that this approach is also able to accurately and efficiently address the dynamics of systems where the electronic subspace consists of many states. Indeed, we have shown that the MF-GQME can provide quantitatively accurate results in LHCII where commonly invoked methods, such as Redfield theory5,6 and its modified variant,7,8 are known to fail.41 The MF-GQME thus provides a powerful tool to investigate charge and energy transfer in the condensed phase.
SUPPLEMENTARY MATERIAL
See supplementary material for plots of selective memory kernel matrix elements for FMO and LHCII, a plot of the distribution of errors of different realizations of the selective sampling procedure as a function of number of trajectories added, and a plot of the number of trajectories assigned to each initial condition of the LHCII model using the selective sampling approach.
ACKNOWLEDGMENTS
This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Award No. DE-SC0014437. T.E.M. also acknowledges support from the Camille Dreyfus Teacher-Scholar Awards Program. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility, supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. W.C.P. acknowledges support from the Melvin and Joan Lane Stanford Graduate Fellowship. We would also like to thank Stanford University and the Stanford Research Computing Center for providing computational resources and support that have contributed to these research results.
APPENDIX A: AUXILIARY KERNELS FOR A MULTILEVEL SYSTEM
In this section, we provide more details for the derivation of the expressions for the auxiliary kernels and for the GQME that describes the dynamics of the subsystem RDM subject to a spectroscopic initial condition. These expressions are applicable to a general Ns state system coupled to an arbitrary bath of the type described in Sec. II. We subsequently specialize our discussion to the Frenkel exciton model presented in Sec. III A given by Eqs. (19)–(21).
We start from the general form of the Hamiltonian, Eq. (1), for an Ns state electronic system coupled to a bath, which can take a harmonic [e.g., Eq. (20)] or atomistic form, with a general form of the system-bath coupling, given by Eq. (2).
As stated in Sec. II, we are interested in the nonequilibrium RDM dynamics arising from a spectroscopic or Franck-Condon initial condition, , where the initial condition of the electronic subsystem can be expanded in terms of the Liouville basis, , and the bath is in local equilibrium, . To derive a particular GQME, it is also necessary to choose a projection operator. Here, we choose the Ns state system generalization of the Redfield-type projection operator, shown in Ref. 23 in the context of a two-level system. In the more conventional Nakajima-Zwanzig form of the GQME for nonequilibrium RDM dynamics,1,2 this projector corresponds to the commonly used Argyres-Kelley projector,63 , which performs a partial trace over the bath and multiplies by the canonical density of the bath, i.e., , where Ôb and Ôs are bath and system operators, respectively.
With this form of the projection operator, and assuming that (which can always be enforced with no loss of generality by redefining the subsystem and bath parts of the Hamiltonian: and ), it is possible to obtain the general forms for the auxiliary kernels. Focusing first on ,
Using similar manipulations, one can show that
where the elements of the dynamical matrices, and , are given by Eqs. (11) and (10), the elements of the static transformation matrix c(j,±) take the form,
and we have used the fact that
Equations (A1) and (A2) are general, applicable to an Ns state system coupled to an arbitrary bath with system-bath coupling given by Eq. (2), and subject to a spectroscopic initial condition. We emphasize that such systems include electronic states coupled to harmonic or atomistic baths and system-bath interactions beyond linear coupling.
We now specialize our discussion to the Frenkel exciton Hamiltonian, Eqs. (19)–(21), where the bath is harmonic and can be decomposed into contributions that are local, in addition to the coupling being bilinear and local. Given the Frenkel exciton form of the Hamiltonian, the matrix elements of the static transformation matrix are given by
where δ is the Kronecker delta.
APPENDIX B: OBTAINING FROM AND BY SOLVING A SYSTEM OF LINEAR EQUATIONS
Here, we describe a procedure for obtaining from and by discretizing the self-consistent expansion and solving a system of linear equations, hence avoiding an iterative solution. The self-consistency relation for the memory kernel, Eq. (6), can be written in discrete time as
where we have discretized the integral using the trapezoid rule. Here, t0, t1, …, tn are equally spaced discrete times, and hence tj+1 − tj ≡ Δt, and tm − tj = tm−j (m > j). Equation (B1) can be solved to obtain an explicit expression for in terms of , , and (0 ≤ j ≤ n − 1),
where n > 0 and 1 is the identity matrix. Equation (B2) allows (n > 0) to be obtained from the (j < n) using a single, noniterative step where one solves for the unknown matrix in Eq. (B2). In practice, the system of linear equations implied by Eq. (B2) was solved by computing and storing the Cholesky factorization of , which can be used to solve for .
APPENDIX C: MFT TREATMENT OF THE DYNAMICAL MATRICES, AND
Here, we outline how one can calculate the dynamical matrices, and in Eqs. (11) and (10) using the Ehrenfest MFT method. To do this, we first review the partial transformation to Wigner phase space with respect to bath variables of a general correlation function. We then show the general MFT approximation to the dynamical matrices corresponding to a general Ns-state system and specialize our discussion to the Frenkel exciton model. Finally, we provide the equations of motion for the subsystem and bath variables.
To treat a general correlation function of a system which can be separated into a subsystem coupled to a bath,
using the Ehrenfest MFT method, we first perform a partial Wigner transformation64,65 with respect to the bath variables. Here, encodes the initial condition of the full system, and and are arbitrary operators. Upon performing the partial Wigner transformation, we can rewrite the correlation function as follows:
where X = (R, P) are the classical coordinates, R, and momenta, P, of the bath, and the Wigner transform of an operator, Ô, is defined as65,66
The Wigner transform of the product of two operators is65,66
where
is the Poisson bracket operator and the arrows indicate the directions in which the differential operators act.
The MFT approximation to this correlation function amounts to replacing the time-dependent operator with , where the classical bath degrees of freedom evolve according to the Poisson bracket subject to an additional time-dependent mean force arising from the subsystem dynamics and the quantum variables evolve according to the time-dependent Schrodinger equation subject to the time-dependent contribution of the bath. This allows one to rewrite the dynamical matrices, and , corresponding to a general Ns state system in the MFT approximation as
where
Here, we have explicitly included the dependence of operator Ân(t) on the time-dependent bath variables, Xt, to emphasize that the value of is dependent on the realization of the bath.
For the Frenkel exciton model, where the bath is harmonic and the bath part of the system-bath coupling is linear in the bath coordinates and given by , the Wigner transform of the anticommutator and commutator of the bath canonical distribution and take simple forms,
where, using Eq. (C4), one obtains
and
Application of the Ehrenfest MFT approximation to the dynamics of the Frenkel exciton model leads to the evolution of the subsystem wavefunction subject to the Hamiltonian of the subsystem modulated by the time-dependent system-bath coupling,
where the subsystem initial condition, ρs(0) = |ψ⟩⟨ψ|, is of pure form.
The bath, in turn, follows Hamilton’s equations,
where the Ehrenfest version of the bath Hamiltonian takes the form,
where is the mean potential arising from the subsystem dynamics.
The correlation functions given in Eqs. (C6a)–(C6d) are generated using MFT, and the memory kernel is then generated according to Eqs. (6)–(8).
APPENDIX D: CORRELATION FUNCTIONS WITH OFF-DIAGONAL INITIAL CONDITIONS USING EHRENFEST MEAN FIELD THEORY
In this section, we describe how the correlation functions in Eqs. (10) and (11) are evaluated using MFT for initial conditions that do not correspond to a single wavefunction.
If the subsystem initial condition is diagonal, i.e., Âj = |α⟩⟨α′|, where α = α′, then MFT is straightforward to apply because one can use the subsystem initial condition corresponding to pure state |α⟩, i.e., |ψ(0)⟩ = |α⟩. On the other hand, if Aj is an off-diagonal (if α ≠ α′), the procedure is less straightforward because Aj does not correspond to an initial condition that arises from a single wavefunction. Several possible approaches and subtle issues related to this are discussed in detail in the Appendices of Ref. 23. A simple and computationally efficient approach for calculating such correlation functions involves an auxiliary wavefunction,
where ϕ is a random number uniformly sampled from the interval [0, 2π). Then, consider the expression
where q(t) is a general memory kernel correlation function which is partially Wigner transformed over the bath degrees of freedom, such as those defined in Eqs. (C6a)–(C6d), and we have explicitly used that α ≠ α′. Equation (D2) thus provides a way to access two q correlation functions using the same initial wavefunction , in particular those for which Âm = |α⟩⟨α′| and Âm = |α′⟩⟨α|. The explicit procedure for calculating such correlation functions is
Initialize the wavefunction , sampling a new random number ϕ from a uniform distribution on the interval [0, 2π). Sample the bath initial conditions from the appropriate bath distribution, ρb,W, as in the diagonal case.
At t = 0, evaluate and store eiϕ and e−iϕ.
Run each trajectory using MFT to calculate the desired observables at time t in the same way as for a diagonal initial condition. Multiply the correlation function by eiϕ to obtain and by e−iϕ to obtain .
Repeat steps 1–4, averaging over an ensemble of trajectories with different bath initial conditions and different initial values of ϕ. As shown in Eq. (D2), the trajectory-averaged corresponds q(t) with Am = |α⟩⟨α′| and corresponds to q(t) with Aj = |α′⟩⟨α|.
This approach of phase averaging over a superposition state in order to obtain the correlation function of interest is similar to that employed in the recently proposed CFBT method,67 which extends Ehrenfest mean field theory.
APPENDIX E: CONVERGENCE ANALYSIS OF THE MEMORY KERNEL
Here, we describe how to motivate our selective sampling algorithm from the formal analysis of how the error in the memory kernel arising from the lack of convergence translates into error in the GQME dynamics.
Our analysis starts from the GQME corresponding to the fully converged RDM, , and the GQME corresponding to a (possibly unconverged) RDM ρs(t). The fully converged dynamics arise from a fully converged memory kernel, , while the unconverged dynamics arise from an unconverged memory kernel, . A simple metric for the error in ρs(t) is its deviation from the converged RDM,
Since this quantity can be positive or negative, it is convenient to consider its absolute value, . This deviation is time dependent and is different for each element of the density. A scalar metric for the total error is the mean absolute deviation of the RDM from its converged value, with the mean taken over time and elements of the density,
where tmax is the total amount of time over which one wants to know the RDM dynamics. Because there is no error in the initial condition [i.e., δρs(0) = 0], one can minimize the error at subsequent times by minimizing the error in the time-derivative of the RDM,
Equation (5) can be combined with Eq. (E3) in order to express the error in the time derivative, , in terms of δρs(t) and ,
where
Of the three terms, only E2 can be directly controlled by running more trajectories, since more trajectories lead to a smaller as the correlation functions that give rise to become more converged. In contrast, E1 and E3 cannot be directly controlled because they depend on δρs(t), a quantity which arises from and previous values of δρs(t) according to Eqs. (E3)–(E7). For our choice of selective sampling algorithm, we make the ansatz that the mean absolute deviation in the RDM dynamics is proportional to the mean absolute value of E2, i.e., . Figure 7(a), which is generated using data from LHCII, shows strong correlation between the error in the GQME dynamics, and |E2| suggests that this ansatz is valid.
In order to obtain an expression for the selective sampling weights, , we consider how ⟨|E2|⟩ is related to the error in specific elements of ,
Using this expression to determine the error in the dynamics still requires prior knowledge of the converged kernel. In order to use this expression to obtain an expression for the selective sampling weights, we make the additional approximation that so that
where is the standard deviation in ,
Here, it is useful to rewrite Eq. (E9) as a sum of terms arising from different initial conditions at the memory kernel level,
where m indexes the initial conditions at the memory kernel level and [ϵ(t)]m is the sum of all the terms in Eq. (E9) arising from initial condition m,
Equations (E11) and (E12) provide the basis for our choice of weights in our selective sampling algorithm. In particular, the weight associated with initial condition m is proportional to the time average of [ϵ(t)]m,
This metric is an upper bound parts of the |E2| error term that arises from initial condition m, as shown in Eq. (E9). As shown in Fig. 7(b), |E2(t)| and |ϵ(t)| track the error in the GQME dynamics, |δρs(t)|. Hence, a selective sampling algorithm that minimizes this term should also minimize the error in the GQME dynamics. For the models considered here, we note that [ϵ(t)]m takes a somewhat simpler form because if α ≠ m and hence