Determination of the optimum kinetic model is an essential prerequisite for characterizing dynamics and mechanism of a reaction. Here, we propose a simple method, termed as singular value decomposition-aided pseudo principal-component analysis (SAPPA), to facilitate determination of the optimum kinetic model from time-resolved data by bypassing any need to examine candidate kinetic models. We demonstrate the wide applicability of SAPPA by examining three different sets of experimental time-resolved data and show that SAPPA can efficiently determine the optimum kinetic model. In addition, the results of SAPPA for both time-resolved X-ray solution scattering (TRXSS) and transient absorption (TA) data of the same protein reveal that global structural changes of protein, which is probed by TRXSS, may occur more slowly than local structural changes around the chromophore, which is probed by TA spectroscopy.

Characterization of molecular structures of transient species formed during chemical and biologically relevant reactions is necessary for understanding their reaction mechanisms and functions. Over the last decade, time-resolved X-ray solution scattering (TRXSS), also known as time-resolved X-ray liquidography (TRXL), based on 3rd- and 4th-generation light sources has been used to investigate molecular structural dynamics of various solution-phase reactions.1–68 In our previous TRXSS studies,60–68 especially on proteins,60–65 we applied singular value decomposition (SVD) analysis and kinetic analysis to determine the optimum kinetic model that best describes the experimental data. As a result of this SVD-aided kinetic analysis, we obtained both time-dependent concentrations of transient intermediate species and time-independent difference X-ray scattering curves, which are directly associated with the structure of the intermediate species. These species-associated difference X-ray scattering curves (SACs) obeying the optimum kinetic model were further examined to reveal molecular structures of the intermediate species by performing structure refinement.61,62,65–67 Thus, determining the optimum kinetic model is an essential prerequisite for characterizing the dynamics of a reaction and molecular structures of transient species formed during the reaction.

As illustrated in Figure 1, SVD analysis provides model-independent kinetic information, for example, the number of structurally distinct intermediates (np) and their associated relaxation times (Λi, where i = 1,…, nΛ). Subsequently, kinetic analysis determines the optimum kinetic model among all possible candidate kinetic models that are considered based on the information obtained from the SVD analysis (termed as the C method in Figure 2(a)). For example, in our recent report on direct observation of the bond formation in a gold trimer complex,66 SVD analysis on the TRXSS data showed that there exist “three” intermediate states and “three” relaxation times, allowing us to consider only a simple sequential kinetic model in the kinetic analysis. In contrast, in our TRXSS study on wild-type sperm whale myoglobin (Mb),63 SVD analysis revealed that there exist “four” intermediates and “six” relaxation times. The fifth and sixth relaxation times correspond to nonexponential recovery of the ground-state Mb liganded with CO molecules from the last (fourth) intermediate, that is, bimolecular nongeminate CO recombination.69,70 Because the number of relaxation times is larger than that of intermediates, the optimum kinetic model must contain parallel (that is, biphasic) and/or bypass pathway(s) and thus we considered a total of eighteen candidate kinetic models.

FIG. 1.

Flowchart of the SVD analysis for TRXSS data. As an example, here we show the case of wild-type sperm whale MbCO where np and nΛ are 4 and 6, respectively (see the text for details). The model-independent kinetic information obtained from the SVD analysis, such as the number of intermediates (in this case, np = 4), the number of relaxation times (in this case, nΛ = 6), and their values (in this case, Λ1, Λ2, Λ3, Λ4, Λ5, and Λ6) can be used in the subsequent analysis for determining the optimum kinetic model as illustrated in Figures 2(a) and 2(b).

FIG. 1.

Flowchart of the SVD analysis for TRXSS data. As an example, here we show the case of wild-type sperm whale MbCO where np and nΛ are 4 and 6, respectively (see the text for details). The model-independent kinetic information obtained from the SVD analysis, such as the number of intermediates (in this case, np = 4), the number of relaxation times (in this case, nΛ = 6), and their values (in this case, Λ1, Λ2, Λ3, Λ4, Λ5, and Λ6) can be used in the subsequent analysis for determining the optimum kinetic model as illustrated in Figures 2(a) and 2(b).

Close modal
FIG. 2.

Comparison of three analysis methods for determining the optimum kinetic model from the experimental TRXSS data. For example, here we demonstrate the process of each method for the case of wild-type sperm whale MbCO. (a) In conventional SVD-aided kinetic analysis (termed as C method), we first generate all (in this case, eighteen) possible candidate kinetic models based on the model-independent kinetic information of the SVD analysis in the entire time range (np, nΛ, Λ1, Λ2, Λ3, Λ4, Λ5, and Λ6) as shown in Figure 1. Then, we performed the kinetic analysis for each of the eighteen candidate models. By comparing the minimized χ2 values of all the candidate models, we finally determined the optimum kinetic model that best fits the experimental data and extracted the species-associated difference X-ray scattering curves (SACs) of the intermediate species for the optimum model. (b) In the second method (termed as the V method), the number of candidate kinetic models can be reduced by additionally performing the SVD analyses in certain reduced time ranges whose upper limits are set to be close to the value of one of the relaxation components obtained from the SVD analysis in the entire time range. By doing so, we had to perform the kinetic analysis only for the four candidate models that are consistent with the SVD analysis in variable reduced time ranges. We note that both the C method and the V method require examination of candidate models against the data. (c) In contrast, SVD-aided pseudo PCA analysis (termed as the P method) can be used to extract the pseudo SACs for the optimum kinetic model directly from the experimental data based on the SVD analysis in the entire time range. Then, the PCA analysis using the pseudo SACs as the time-independent principal components provides the time-dependent population changes of each intermediate from the time-dependent coefficient of the corresponding pseudo SAC. Consequently, the optimum kinetic model can be determined without the need of any candidate kinetic model.

FIG. 2.

Comparison of three analysis methods for determining the optimum kinetic model from the experimental TRXSS data. For example, here we demonstrate the process of each method for the case of wild-type sperm whale MbCO. (a) In conventional SVD-aided kinetic analysis (termed as C method), we first generate all (in this case, eighteen) possible candidate kinetic models based on the model-independent kinetic information of the SVD analysis in the entire time range (np, nΛ, Λ1, Λ2, Λ3, Λ4, Λ5, and Λ6) as shown in Figure 1. Then, we performed the kinetic analysis for each of the eighteen candidate models. By comparing the minimized χ2 values of all the candidate models, we finally determined the optimum kinetic model that best fits the experimental data and extracted the species-associated difference X-ray scattering curves (SACs) of the intermediate species for the optimum model. (b) In the second method (termed as the V method), the number of candidate kinetic models can be reduced by additionally performing the SVD analyses in certain reduced time ranges whose upper limits are set to be close to the value of one of the relaxation components obtained from the SVD analysis in the entire time range. By doing so, we had to perform the kinetic analysis only for the four candidate models that are consistent with the SVD analysis in variable reduced time ranges. We note that both the C method and the V method require examination of candidate models against the data. (c) In contrast, SVD-aided pseudo PCA analysis (termed as the P method) can be used to extract the pseudo SACs for the optimum kinetic model directly from the experimental data based on the SVD analysis in the entire time range. Then, the PCA analysis using the pseudo SACs as the time-independent principal components provides the time-dependent population changes of each intermediate from the time-dependent coefficient of the corresponding pseudo SAC. Consequently, the optimum kinetic model can be determined without the need of any candidate kinetic model.

Close modal

As can be seen in these examples, when the number of relaxation times obtained from the SVD analysis exceeds that of intermediates, the number of candidate kinetic models to be considered in the kinetic analysis increases significantly, making the analysis complicated. Consequently, it is desirable to reduce the number of candidate kinetic models for fast and accurate determination of the optimum kinetic model. In the case of wild-type sperm whale MbCO discussed above,63 we devised a new method whereby the SVD analysis was performed in variable time ranges (termed as the V method in Figure 2(b)). By doing so, we identified the number of intermediates involved in specific time ranges of interest and used this additional information as a constraint to reduce the number of candidate kinetic models to be considered in the kinetic analysis. As a result, the number of candidate kinetic models was significantly reduced from eighteen to four.

In this work, we introduce a new method of extracting model-dependent kinetic information directly from the experimental data without considering any candidate kinetic model. In this method termed as the P method in Figure 2(c), from the SVD analysis in the entire time range, we identify stationary time zones where the amplitude of left singular vectors (lSVs) remains constant and define the experimental TRXSS data in such time ranges as pseudo SACs of reaction intermediates. Then, the principal-component analysis of the pseudo SACs provides time-dependent relative population of each intermediate species. In this way, we can determine the optimum kinetic model easily and accurately without considering any candidate kinetic model. We demonstrate the applicability of this SVD-aided pseudo principal-component analysis (SAPPA) by examining experimental TRXSS data for wild-type sperm whale MbCO63 and wild-type homodimeric hemoglobin liganded with CO molecules (HbI(CO)2).62 In addition, we show that the new method can be extended to transient absorption (TA) data on HbI(CO)2.

In order to determine the optimum kinetic model for a reaction, we need to examine how many distinct transient species are involved in the reaction and how fast the population of each species changes. For this purpose, we first apply the singular value decomposition (SVD) analysis60–68,71 to experimental TRXSS data as illustrated in Figure 1. From the experimental scattering curves measured at various time delays, we can build an nq × nt matrix A, where nq is the number of q points in the scattering curve at a given time-delay point and nt is the number of time-delay points. The matrix A can be decomposed while satisfying the relationship of A = USVT, where U is an nq × nt matrix whose columns are called left singular vectors (lSVs) (i.e., time-independent q spectra) of A, V is an nt × nt matrix whose columns are called right singular vectors (rSVs) (i.e., amplitude changes of U as time evolves) of A, and S is an nt × nt diagonal matrix whose diagonal elements are called singular values of A and can possess only non-negative values. The matrices U and V have the properties of UTU = Int and VTV = Int, respectively, where Int is the identity matrix. Since the diagonal elements (i.e., singular values) of S, which represent the weight of left singular vectors in U, are ordered so that s1 ≥ s2 ≥ ⋯ ≥ sn ≥ 0, lSVs and rSVs on more left are supposed to have larger contribution to the constructed experimental data. In this manner, we can extract the time-independent scattering intensity components from the lSVs and the time evolution of their amplitudes from the rSVs. The former, when combined together, can give the information on the scattering curves of distinct transient species, while the latter contains the information on the population dynamics of the transient species. Thus, the SVD analysis provides a model-independent estimation of the number of structurally distinguishable species and the population dynamics of each species.

Using the principal singular vectors with significant singular values obtained from the SVD analysis of the experimental data, we typically perform kinetic analysis (termed as the C method in Figure 2(a)) to determine the optimum kinetic model. Dimensionality-reduced matrices, U′, S′, and V′, which can be generated by removing non-significant singular components from U, S, and V, respectively, are illustrated in Figure 1. In other words, U′ is an nq × np matrix containing only the first np left singular vectors of U, S′ is an np × np diagonal matrix containing the first np singular values of S, and V′ is an nt × np matrix containing the first np right singular vectors of V. Here, we define a matrix C, of which the columns represent time-dependent concentrations of transiently formed intermediate species and can be described by a candidate kinetic model that can be generated on the basis of the SVD analysis. Then, the matrix C can be related to V′ by using a parameter matrix P that satisfies V′ = CP. In our analysis, C is an nt × np matrix containing the time-dependent concentrations of np intermediates involved in a reaction of interest, and P is an np × np matrix containing coefficients for the time-dependent concentrations so that the linear combination of concentrations of the np intermediates can form the np right singular vectors in V′. Once C is expressed using a set of variable kinetic parameters based on a candidate kinetic model, P and C can be optimized by minimizing the discrepancy between V′ (from the experiment) and CP (from the kinetic theory). We perform this optimization for each of the candidate kinetic models and compare the minimized discrepancies of all the kinetic models to determine the optimum kinetic model that best fits the experimental data.

However, standard deviations for V′ are not available from the experimental data and thus we instead use the following method to optimize P and C. Since V′ = CP, the following relationships hold:

(1)

where A is an nq × nt matrix that contains theoretical difference scattering curves, ΔSfit (qi, tj), at given q and t values. Theoretical difference scattering curves calculated by using Eq. (1) are compared with the experimental difference scattering curves, and the matrices P and C are optimized by minimizing the discrepancy (quantified by chi-square, χ2) between the theoretical and experimental difference scattering curves using the Minuit72 package

(2)

where ΔSexp (q, t) and ΔSfit (q, t) are the experimental and theoretical difference scattering intensities at a given point of (qi, tj), respectively, and σij is the experimental standard deviation at (qi, tj). As written in Eq. (1), we can define a matrix E as E = U′S′PT, that is, a linear combination of the np left singular vectors in U′ weighted by their singular values in S′ with their ratios determined by P. Then, the matrix E, an nq × np matrix, contains the np difference scattering curves directly associated with the np intermediate species involved in a reaction of interest. Therefore, by optimizing the matrices P and C for the optimum kinetic model, we obtain both the time-dependent concentrations (the columns of the optimized C for the optimum kinetic model) and the time-independent species-associated difference X-ray scattering curves (SACs) of the intermediate species (the columns of the optimized E for the optimum kinetic model).

The C method described in Section II B becomes complicated and time-consuming when the number of candidate kinetic models increases. Therefore, it is desirable to reduce the number of candidate kinetic models to be examined. Such a goal can be achieved by performing multiple SVD analyses in variable reduced time ranges instead of a single SVD analysis in the entire time range (termed as the V method in Figure 2(b)). An SVD analysis in a reduced time range gives the information on the number of intermediates in that specific time range. This additional information provides a constraint to exclude kinetic models that are not consistent with the SVD analyses in variable reduced time ranges, thus simplifying the determination of the optimum kinetic model. For example, suppose that we identified np distinct intermediates and np + 1 relaxation times (Λi, where i = 1,…, np + 1) from the SVD analysis in the entire time range of a photoreaction of interest. Assuming that the last relaxation component represents the recovery of the ground state from the last intermediate, the earlier np relaxation times must account for the transitions among the np intermediates. Since the minimum number of relaxation components required for transitions among np intermediates is np – 1, one of the relaxation times must be associated with either a parallel (that is, biphasic) pathway or a bypass pathway (to a non-adjacent intermediate). To identify which relaxation component is associated with such a pathway, we can additionally perform the SVD analysis in certain reduced time ranges whose upper limits are set to be close to one of the relaxation times obtained from the SVD analysis in the entire time range. If it turns out that there exist two distinct intermediates in the time range up to around Λ2, these two intermediates must be responsible for the first two relaxation times (Λ1 and Λ2), suggesting the existence of a biphasic or a bypass pathway involving the two intermediates. Consequently, in the subsequent kinetic analysis to determine the optimum kinetic model, we need to consider only the candidate kinetic models consistent with the SVD analyses in variable reduced time ranges, which are a subset of those consistent with the SVD analysis in the entire time range.

For example, in a previous TRXSS study on MbCO,63 two relaxation times (460 ps and 3.6 ns) were identified for the first two intermediates (termed B and C). In general, a transition between two intermediates would exhibit only a single exponential dynamics irrespective of how many relaxation times are assigned for the transition in a kinetic model (see Figure S3 in the supplementary material). However, in the case of MbCO, the first intermediate (B) was found to have two conformational substates (termed B1 and B2) due to the variation of interaction between CO ligand and distal histidine in the primary docking site. Since B1 and B2 have conformations that are only subtly different from each other, their TRXSS patterns are indistinguishable from each other. Despite the structural similarity of the two conformational substates, B1 and B2 transform to C with different rate constants. As a result, the transition from B to C exhibits biphasic dynamics characterized by two relaxation times (see Figure S4 in the supplementary material).

The V method outlined in Section II C still requires that each candidate kinetic model has to be tested against the experimental data and the one that gives the best agreement is chosen as the optimal kinetic model. Here, we introduce a new method, SVD-aided pseudo principal-component analysis (SAPPA), which speeds up and improves determination of the optimum kinetic model from time-resolved data by circumventing such consideration of candidate kinetic models. In principle, the principal lSVs and their time-dependent amplitude changes (that is, principal rSVs) obtained from the SVD analysis provide a basis for the time-independent SACs of reaction intermediates and the time-dependent population changes of those intermediates, respectively. The relationship (V′ = CP) between the model-independent information (V′ in the C method and V method) and the model-dependent information (C in the C method and V method) mediated by the matrix P in the C method and V method indicates that, if there exist stationary time zones where the amplitudes of all the principal lSVs remain constant, the amplitudes of all the time-independent SACs of the intermediates should also remain constant in each of those time zones. Especially, when the number of such time zones matches the number of intermediates (np) identified by the SVD analysis in the entire time range, the experimental time-resolved data in each of the stationary time zones can be directly regarded as the pseudo SAC of each intermediate species (see P method in Figure 2(c)). In other words, without the need of determining the optimum kinetic model, we can easily obtain the optimized matrix E whose columns are the SACs for the optimum kinetic model determined in the C method and V method by (i) determining stationary time zones based on the rSVs obtained from the SVD analysis in the entire time range and (ii) taking experimental time-resolved data at the selected stationary time zones as pseudo SACs. Specifically, to systematically determine the stationary time zones, we inspect the sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) and take its local minima as stationary time zones as shown in Figure S1 in the supplementary material. Since the original principal rSVs contain noise, the curves fitted to principal rSVs can be used for the calculation of derivatives and the summation of their absolute values. Then, time-resolved data at the selected stationary time zones (that is, pseudo SACs) are used as the columns of the matrix E in Eq. (1) and C can be obtained by fitting the experimental data at all time delays by linear combinations of the pseudo SACs. In the fitting, the coefficients of the pseudo SACs are determined by minimizing the χ2 value defined in Eq. (2), and these coefficients correspond to the time-dependent relative populations of the transient intermediate species. Then, by fitting these time-dependent populations with the relaxation times obtained from the SVD analysis in the entire time range, we can easily assign the relaxation components to specific transitions among the intermediates. In this way, we can determine the optimum kinetic model without considering any candidate kinetic model, in contrast to the C method or V method.

Time-resolved difference X-ray solution scattering curves, ΔSMb(q,t), measured following photoexcitation of a wild-type sperm whale MbCO solution,63 are shown in Figure 3(a). From SVD of the experimental data in the q range of 0.15–1.0 Å−1 and the entire time range (100 ps–10 ms), we identified four principal singular components, which correspond to four structurally distinct intermediates, and six relaxation times (in this case, four unimolecular time constants and one bimolecular time constant approximated by a combination of two latest unimolecular time constants69,70) as shown in Figures 3(b) and 3(c). If the C method is applied, a total of 18 candidate kinetic models have to be considered (see Figure 2(a)). In contrast, if the V method is applied, the number of candidate kinetic models is reduced down to four (see Figure 2(b)), as was done in our previous work on MbCO.63 In this work, instead of considering any candidate kinetic model, we applied the P method to determine the optimum kinetic model. Since the number of principal singular vectors was determined to be four from the SVD analysis, we selected four stationary time zones (see Figure 3(c)). Specifically, based on the time-dependent rSVs (black circles (experimental) and red curves (fit) in Figure 3(c)) and the sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve in Figure 3(c)), we selected 100 ps, 17.8 ns, 316 ns, and 10 μs as stationary time zones and, accordingly, the experimental curves measured at 100 ps, 17.8 ns, 316 ns, and 10 μs as the pseudo SACs corresponding to the four intermediates, as shown in Figure 3(d). We fitted the experimental curves at all time delays by linear combinations of the pseudo SACs and determined the time-dependent relative population of each intermediate from the coefficient of the corresponding pseudo SAC as shown in Figure 3(e). Then, we fitted the time-dependent relative population of each intermediate by the relaxation components obtained from the SVD analysis and assigned each relaxation component to a specific transition.

FIG. 3.

Example of the P method (SVD-aided pseudo PCA analysis) applied to TRXSS data. (a) Time-resolved difference X-ray solution scattering curves, ΔSMb(q,t), measured for a solution sample of wild-type sperm whale MbCO. The time delay after photoexcitation is indicated above each curve. Experimental curves (black curves) are compared with fit curves (red curves) that were generated from the P method. (b) Four principal lSVs. (c) Four principal rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by six exponentials sharing common relaxation times, yielding the relaxation times of 460 ± 160 ps, 3.6 ± 0.7 ns, 92 ± 25 ns, 1.4 ± 0.2 μs, 90 ± 20 μs, and 1.2 ± 0.2 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. Within each of the four time zones shaded in blue color, all the weighted principal rSVs remain stationary in their amplitudes, indicating that the experimental curves should not change in these time zones. We took the experimental curves within each of these time zones as pseudo SACs for the four intermediates of MbCO as shown in (d). (d) Pseudo SACs (blue curves) for the four intermediates corresponding to the experimental curves at four selected time delays, 100 ps, 17.8 ns, 316 ns, and 10 μs. These pseudo SACs are compared with the formal SACs (red curves) corresponding to B, C, D, and S intermediates of MbCO extracted from the SVD-aided kinetic analysis (C method or V method). (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (460 ps, 3.6 ns, 92 ns, 1.4 μs, 90 μs, and 1.2 ms) obtained in (c). (f) The optimum kinetic model that best describes the structural dynamics of MbCO. This optimum model determined by the P method is identical to the one determined by the C method or the V method.

FIG. 3.

Example of the P method (SVD-aided pseudo PCA analysis) applied to TRXSS data. (a) Time-resolved difference X-ray solution scattering curves, ΔSMb(q,t), measured for a solution sample of wild-type sperm whale MbCO. The time delay after photoexcitation is indicated above each curve. Experimental curves (black curves) are compared with fit curves (red curves) that were generated from the P method. (b) Four principal lSVs. (c) Four principal rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by six exponentials sharing common relaxation times, yielding the relaxation times of 460 ± 160 ps, 3.6 ± 0.7 ns, 92 ± 25 ns, 1.4 ± 0.2 μs, 90 ± 20 μs, and 1.2 ± 0.2 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. Within each of the four time zones shaded in blue color, all the weighted principal rSVs remain stationary in their amplitudes, indicating that the experimental curves should not change in these time zones. We took the experimental curves within each of these time zones as pseudo SACs for the four intermediates of MbCO as shown in (d). (d) Pseudo SACs (blue curves) for the four intermediates corresponding to the experimental curves at four selected time delays, 100 ps, 17.8 ns, 316 ns, and 10 μs. These pseudo SACs are compared with the formal SACs (red curves) corresponding to B, C, D, and S intermediates of MbCO extracted from the SVD-aided kinetic analysis (C method or V method). (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (460 ps, 3.6 ns, 92 ns, 1.4 μs, 90 μs, and 1.2 ms) obtained in (c). (f) The optimum kinetic model that best describes the structural dynamics of MbCO. This optimum model determined by the P method is identical to the one determined by the C method or the V method.

Close modal

As shown in Figure 3(e), the relative populations of the first (100 ps), the second (17.8 ns), the third (316 ns), and the fourth (10 μs) pseudo SACs were fit by multiple exponentials. The population of the first pseudo SAC, ΔSMb(q,100 ps), decays biphasically with time constants of 460 ps and 3.6 ns. Accordingly, the population of the second pseudo SAC, ΔSMb(q,17.8 ns), rises biphasically with time constants of 460 ps and 3.6 ns and decays with a time constant of 92 ns. Subsequently, the population of the third pseudo SAC, ΔSMb(q,316 ns), rises with a time constant of 92 ns and decays with a time constant of 1.4 μs. Then, the population of the fourth pseudo SAC, ΔSMb(q,10 μs), rises with a time constant of 1.4 μs and decays nonexponentially, which can be approximated by two dummy time constants of 90 μs and 1.2 ms.69,70 These time-dependent relative populations of pseudo SACs allow us to deduce the optimum kinetic model, which involves (1) biphasic transition from the first intermediate to the second one due to the existence of two conformational substates of the first intermediate and (2) bimolecular nongeminate CO recombination of the fourth intermediate as shown in Figure 3(f). In fact, the kinetic model shown in Figure 3(f) is identical to the optimum kinetic model determined by the V method in our previous work.63 

Time-resolved difference X-ray solution scattering curves, ΔSHbI(q,t), measured following photoexcitation of a wild-type HbI solution62 are shown in Figure 4(a). The measured data were analyzed by applying the P method to determine the optimum kinetic model. From SVD of the experimental data in the q range of 0.15–1.0 Å−1 and the entire time range (100 ps–56.2 ms), we identified three principal singular components (that is, three structurally distinct intermediates) and seven relaxation times (in this case, five unimolecular time constants and one bimolecular time constant approximated by a combination of two latest unimolecular time constants69,70) as shown in Figures 4(b) and 4(c). Since the number of principal singular vectors is three, we selected three stationary time zones (see Figure 4(c)), which are 108 ps, 17.7 ns, and 100 μs based on the time-dependent rSVs (black circles (experimental) and red curves (fit) in Figure 4(c)) and the sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve in Figure 4(c)). In other words, we selected the experimental curves measured at 108 ps, 17.7 ns, and 100 μs as the pseudo SACs corresponding to the three intermediates, as shown in Figure 4(d). We fitted the experimental curves at all time delays by linear combinations of the pseudo SACs and determined the time-dependent relative population of each intermediate from the coefficient of the corresponding pseudo SAC as shown in Figure 4(e). Then, we fitted the time-dependent relative population of each intermediate by the relaxation components obtained from the SVD analysis and assigned each relaxation component to a specific transition.

FIG. 4.

Example of the P method applied to TRXSS data of wild-type HbI(CO)2. (a) Time-resolved difference X-ray solution scattering curves, ΔSHbI(q,t), measured for a solution sample of wild-type HbI(CO)2. The time delay after photoexcitation is indicated above each curve. Experimental curves (black curves) are compared with fit curves (red curves) that were obtained from the P method. (b) Three principal time-independent lSVs. (c) Three principal time-dependent rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by seven exponentials sharing common relaxation times, yielding the relaxation times of 3.2 ± 0.2 ns, 93 ± 20 ns, 730 ± 120 ns, 5.6 ± 0.8 μs, 15.2 ± 8 μs, 1.8 ± 0.3 ms, and 9.1 ± 0.9 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. Within each of the three time zones shaded in blue color, all the weighted principal rSVs remain stationary in their amplitudes, and thus the experimental curves should not change in these time zones. We took the experimental curves within each of these time zones as pseudo SACs for the three intermediates of HbI(CO)2 as shown in (d). (d) Pseudo SACs (blue curves) for the three intermediates of HbI(CO)2 corresponding to the experimental curves at three selected time delays, 108 ps, 17.7 ns, and 100 μs. These pseudo SACs are compared with the formal SACs (red curves) corresponding to I1, I2, and I3 intermediates of HbI(CO)2 extracted from the C method or the V method. For clarity, we scaled the formal SAC of I3 species to consider the portion of geminate recombination obtained from C or V methods. (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (3.2 ns, 93 ns, 730 ns, 5.6 μs, 15.2 μs, 1.8 ms, and 9.1 ms) obtained in (c). (f) The optimum kinetic model that best describes the structural dynamics of HbI(CO)2. This optimum model determined by the P method is identical to the one determined by the C method or V method.

FIG. 4.

Example of the P method applied to TRXSS data of wild-type HbI(CO)2. (a) Time-resolved difference X-ray solution scattering curves, ΔSHbI(q,t), measured for a solution sample of wild-type HbI(CO)2. The time delay after photoexcitation is indicated above each curve. Experimental curves (black curves) are compared with fit curves (red curves) that were obtained from the P method. (b) Three principal time-independent lSVs. (c) Three principal time-dependent rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by seven exponentials sharing common relaxation times, yielding the relaxation times of 3.2 ± 0.2 ns, 93 ± 20 ns, 730 ± 120 ns, 5.6 ± 0.8 μs, 15.2 ± 8 μs, 1.8 ± 0.3 ms, and 9.1 ± 0.9 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. Within each of the three time zones shaded in blue color, all the weighted principal rSVs remain stationary in their amplitudes, and thus the experimental curves should not change in these time zones. We took the experimental curves within each of these time zones as pseudo SACs for the three intermediates of HbI(CO)2 as shown in (d). (d) Pseudo SACs (blue curves) for the three intermediates of HbI(CO)2 corresponding to the experimental curves at three selected time delays, 108 ps, 17.7 ns, and 100 μs. These pseudo SACs are compared with the formal SACs (red curves) corresponding to I1, I2, and I3 intermediates of HbI(CO)2 extracted from the C method or the V method. For clarity, we scaled the formal SAC of I3 species to consider the portion of geminate recombination obtained from C or V methods. (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (3.2 ns, 93 ns, 730 ns, 5.6 μs, 15.2 μs, 1.8 ms, and 9.1 ms) obtained in (c). (f) The optimum kinetic model that best describes the structural dynamics of HbI(CO)2. This optimum model determined by the P method is identical to the one determined by the C method or V method.

Close modal

As shown in Figure 4(e), the relative populations of the first (108 ps), the second (17.7 ns), and the third (100 μs) pseudo SACs were fit by multiple exponentials. The population of the first pseudo SAC, ΔSHbI(q,108 ps), decays with a time constant of 3.2 ns, rises with a time constant of 93 ns, and decays again with a time constant of 15.2 μs. The population of the second pseudo SAC, ΔSHbI(q,17.7 ns), rises with a time constant of 3.2 ns and decays with time constants of 93 ns, 730 ns, and 5.6 μs. Then, the population of the third pseudo SAC, ΔSHbI(q,100 μs), rises biphasically with time constants of 730 ns and 5.6 μs and decays nonexponentially, which can be approximated by two dummy time constants of 1.8 ms and 9.1 ms.69,70 These time-dependent relative populations of pseudo SACs allow us to determine the optimum kinetic model, which involves (1) biphasic transition from the second intermediate to the third one due to the existence of two conformational substates of the second intermediate, (2) geminate CO recombination of the second intermediate, and (3) bimolecular nongeminate CO recombination of the third intermediate as shown in Figure 4(f). In fact, the kinetic model shown in Figure 4(f) is identical to the optimum kinetic model determined by the C method or V method.62 We also tested how the selection of stationary time zones affects the analysis result by examining time-dependent relative populations of the three pseudo SACs extracted from seven different combinations of stationary time zones as tabulated in Figure S2 in the supplementary material. As the stationary time zones for the first, the second, and the third pseudo SACs deviate from 108 ps, 17.7 ns, and 100 μs, respectively, the fit to the experimental data becomes worse with increased χ2 value as shown in Figure S2d in the supplementary material. As long as the first, the second, and the third time zones are selected in the ranges of 108–492 ps, 10 ns–42.2 ns, and 17.8–422 μs, respectively, we found that the analysis result does not change significantly, indicating that the selection of proper stationary time zones should not be difficult (see Figure S1 in the supplementary material).

To show the wide applicability of SAPPA, we also applied the P method to analyze time-resolved spectra, ΔAHbI(λ,t), of photoexcited wild-type HbI solution measured by transient absorption (TA) spectroscopy (Figure 5(a)). From SVD of the TA spectra in the λ range of 360–500 nm and the entire time range (100 ns–46.4 ms), we identified two principal singular components (possibly I2 and I3 species considering the time range) and four relaxation times (possibly two unimolecular time constants and one bimolecular time constant approximated by a combination of two latest unimolecular time constants considering the time range and the results of previous studies on HbI(CO)269,70) as shown in Figures 5(b) and 5(c). There are two significant singular vectors, but we were able to identify only one stationary time zone (10 μs) based on the time-dependent rSVs (black circles (experimental) and red curves (fit) in Figure 5(c)) and the sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve in Figure 5(c)). The lack of stationary time zones compared with the significant singular vectors is due to limited time resolution (100 ns) of our TA measurement. As a result, we selected the TA spectra measured at 100 ns and 10 μs as the pseudo SACs corresponding to I2 and I3 intermediates. We note that the pseudo SAC of the I2 intermediate had to be selected from the TA data measured at much later time delay (in this case, 100 ns) than the case of the TRXSS data (17.7 ns). Accordingly, the pseudo SAC of the I2 intermediate, ΔAHbI(λ,100 ns), shown in Figure 5(d) can be regarded as a mixture of pseudo SACs of I2 and I3 intermediates.

FIG. 5.

Example of the P method applied to TA data of wild-type HbI(CO)2. (a) Transient absorption spectra, ΔAHbI(λ,t), measured for a solution sample of wild-type HbI(CO)2. (b) Two principal time-independent lSVs. (c) Two principal time-dependent rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by four exponentials sharing common relaxation times, yielding the relaxation times of 190 ± 100 ns, 1.1 ± 0.3 μs, 1.2 ± 0.4 ms, and 4.8 ± 0.2 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. We took the experimental curves within each of the two time zones shaded in blue color as pseudo SACs for two intermediates of HbI(CO)2 as shown in (d). (d) Pseudo SACs for two intermediates of HbI(CO)2 corresponding to the experimental TA spectra at two selected time delays, 100 ns (black curve) and 10 μs (red curve). Blue curve shows the difference between two pseudo SACs. (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (190 ns, 1.1 μs, 1.2 ms, and 4.8 ms) obtained from the weighted principal rSVs as shown in (c). (f) A kinetic model that well fits the experimental TA spectra of HbI(CO)2. This model is the truncated form of the optimum model shown in Figure 4(f).

FIG. 5.

Example of the P method applied to TA data of wild-type HbI(CO)2. (a) Transient absorption spectra, ΔAHbI(λ,t), measured for a solution sample of wild-type HbI(CO)2. (b) Two principal time-independent lSVs. (c) Two principal time-dependent rSVs weighted by singular values (black circles). These time-dependent singular components were fit (red curves) by four exponentials sharing common relaxation times, yielding the relaxation times of 190 ± 100 ns, 1.1 ± 0.3 μs, 1.2 ± 0.4 ms, and 4.8 ± 0.2 ms. Sum of the absolute values of the first derivatives of the principal rSVs weighted by singular values with respect to log10(time) (blue curve) was used as the selection criteria of the stationary time zones. We took the experimental curves within each of the two time zones shaded in blue color as pseudo SACs for two intermediates of HbI(CO)2 as shown in (d). (d) Pseudo SACs for two intermediates of HbI(CO)2 corresponding to the experimental TA spectra at two selected time delays, 100 ns (black curve) and 10 μs (red curve). Blue curve shows the difference between two pseudo SACs. (e) Time-dependent relative populations (black circles) of the corresponding pseudo SACs. These populations were fit (red curves) by the relaxation times (190 ns, 1.1 μs, 1.2 ms, and 4.8 ms) obtained from the weighted principal rSVs as shown in (c). (f) A kinetic model that well fits the experimental TA spectra of HbI(CO)2. This model is the truncated form of the optimum model shown in Figure 4(f).

Close modal

As shown in Figure 5(e), we found that the relative population of ΔAHbI(λ,100 ns) decays biphasically with time constants of 190 ns and 1.1 μs and then further decays nonexponentially, which can be approximated by two dummy time constants of 1.2 ms and 4.8 ms. Subsequently, the population of ΔAHbI(λ,10 μs) rises biphasically with time constants of 190 ns and 1.1 μs and then decays nonexponentially, which can be approximated by two dummy time constants of 1.2 ms and 4.8 ms.69,70 By considering that ΔAHbI(λ,100 ns) is the mixture of the pseudo SACs of I2 and I3, the results allow us to determine the optimum kinetic model that involves (1) biphasic transition from the second intermediate to the third one due to the existence of two conformational substates of the second intermediate and (2) bimolecular nongeminate CO recombination of the third intermediate as shown in Figure 5(f). The relaxation times (190 ns and 1.1 μs) determined by TA spectroscopy shown in Figure 5(f) are faster than those (730 ns and 5.6 μs) probed by TRXSS (see Figure 4(f)), indicating that global structural changes of HbI may occur more slowly than local structural changes around the heme chromophore.64,73 This discrepancy in the time scales of transitions may be explained by considering the time taken for light-triggered local structural perturbation around a chromophore to propagate over the entire protein. Our recent study on E46Q mutant of PYP using both TRXSS and TA spectroscopy64 also reported that the global conformational change, which is observed by TRXSS, involved in the transition to the signaling state of the protein is temporally delayed from the local structural change around the chromophore, which is observed by TA spectroscopy. Thus, to have a complete understanding of protein structural dynamics, it is desirable to apply both TRXSS (sensitive to global structural changes of protein) and TA spectroscopy (sensitive to local structural changes of chromophore) or other optical spectroscopic techniques.64 

In this work, we demonstrated the applicability of the SVD-aided pseudo principal-component analysis by examining the experimental TRXSS data of wild-type sperm whale MbCO and wild-type HbI(CO)2. In addition, we showed that SAPPA can be applied to time-resolved spectroscopic data as well by examining the experimental TA data of wild-type HbI(CO)2. This new method can be potentially used to easily determine the optimum kinetic model for various time-resolved data with high fidelity.

See supplementary material for information on the TA experiment, the selection criteria of the stationary time zones, the result of a simulation where an intermediate transforms to another intermediate with two relaxation times, and the results of simulations where an intermediate, which has two conformational substates, transforms to another intermediate with two relaxation times.

This work was supported by IBS-R004-A1. This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (NRF-2016R1E1A1A01941978). The TRXSS data on proteins, which were published previously, were collected at the BioCARS Sector 14 at the APS supported by NIH National Institute of General Medical Sciences grant R24GM111072 and by the U.S. Department of Energy, Basic Energy Sciences, Office of Science, under Contract No. DE-AC02-06CH11357.

1.
R.
Neutze
,
R.
Wouts
,
S.
Techert
,
J.
Davidsson
,
M.
Kocsis
,
A.
Kirrander
,
F.
Schotte
, and
N.
Wulff
,
Phys. Rev. Lett.
87
,
195508
(
2001
).
2.
S.
Bratos
,
F.
Mirloup
,
R.
Vuilleumier
, and
M.
Wulff
,
J. Chem. Phys.
116
,
10615
(
2002
).
3.
S.
Bratos
,
F.
Mirloup
,
R.
Vuilleumier
,
M.
Wulff
, and
A.
Plech
,
Chem. Phys.
304
,
245
(
2004
).
4.
A.
Plech
,
M.
Wulff
,
S.
Bratos
,
F.
Mirloup
,
R.
Vuilleumier
,
F.
Schotte
, and
P. A.
Anfinrud
,
Phys. Rev. Lett.
92
,
125505
(
2004
).
5.
A.
Plech
,
V.
Kotaidis
,
S.
Gresillon
,
C.
Dahmen
, and
G.
von Plessen
,
Phys. Rev. B
70
,
195423
(
2004
).
6.
H.
Ihee
,
M.
Lorenc
,
T. K.
Kim
,
Q. Y.
Kong
,
M.
Cammarata
,
J. H.
Lee
,
S.
Bratos
, and
M.
Wulff
,
Science
309
,
1223
(
2005
).
7.
J.
Davidsson
,
J.
Poulsen
,
M.
Cammarata
,
P.
Georgiou
,
R.
Wouts
,
G.
Katona
,
F.
Jacobson
,
A.
Plech
,
M.
Wulff
,
G.
Nyman
, and
R.
Neutze
,
Phys. Rev. Lett.
94
,
245503
(
2005
).
8.
A.
Plech
,
V.
Kotaidis
,
M.
Lorenc
, and
M.
Wulff
,
Chem. Phys. Lett.
401
,
565
(
2005
).
9.
M.
Cammarata
,
M.
Lorenc
,
T. K.
Kim
,
J. H.
Lee
,
Q. Y.
Kong
,
E.
Pontecorvo
,
M.
Lo Russo
,
G.
Schiro
,
A.
Cupane
,
M.
Wulff
, and
H.
Ihee
,
J. Chem. Phys.
124
,
124504
(
2006
).
10.
P.
Georgiou
,
J.
Vincent
,
M.
Andersson
,
A. B.
Wohri
,
P.
Gourdon
,
J.
Poulsen
,
J.
Davidsson
, and
R.
Neutze
,
J. Chem. Phys.
124
,
234507
(
2006
).
11.
T. K.
Kim
,
M.
Lorenc
,
J. H.
Lee
,
M.
Russo
,
J.
Kim
,
M.
Cammarata
,
Q. Y.
Kong
,
S.
Noel
,
A.
Plech
,
M.
Wulff
, and
H.
Ihee
,
Proc. Natl. Acad. Sci. U.S.A.
103
,
9410
(
2006
).
12.
V.
Kotaidis
,
C.
Dahmen
,
G.
von Plessen
,
F.
Springer
, and
A.
Plech
,
J. Chem. Phys.
124
,
184702
(
2006
).
13.
A.
Plech
,
V.
Kotaidis
,
M.
Lorenc
, and
J.
Boneberg
,
Nat. Phys.
2
,
44
(
2006
).
14.
M.
Wulff
,
S.
Bratos
,
A.
Plech
,
R.
Vuilleumier
,
F.
Mirloup
,
M.
Lorenc
,
Q.
Kong
, and
H.
Ihee
,
J. Chem. Phys.
124
,
034501
(
2006
).
15.
Q. Y.
Kong
,
M.
Wulff
,
J. H.
Lee
,
S.
Bratos
, and
H.
Ihee
,
J. Am. Chem. Soc.
129
,
13584
(
2007
).
16.
A.
Plech
,
R.
Cerna
,
V.
Kotaidis
,
F.
Hudert
,
A.
Bartels
, and
T.
Dekorsy
,
Nano Lett.
7
,
1026
(
2007
).
17.
A.
Plech
,
V.
Kotaidis
,
K.
Istomin
, and
M.
Wulff
,
J. Synchrotron Radiat.
14
,
288
(
2007
).
18.
M.
Cammarata
,
M.
Levantino
,
F.
Schotte
,
P. A.
Anfinrud
,
F.
Ewald
,
J.
Choi
,
A.
Cupane
,
M.
Wulff
, and
H.
Ihee
,
Nat. Methods
5
,
881
(
2008
).
19.
Q. Y.
Kong
,
J. H.
Lee
,
A.
Plech
,
M.
Wulff
,
H.
Ihee
, and
M. H. J.
Koch
,
Angew. Chem., Int. Ed.
47
,
5550
(
2008
).
20.
J. H.
Lee
,
J.
Kim
,
M.
Cammarata
,
Q.
Kong
,
K. H.
Kim
,
J.
Choi
,
T. K.
Kim
,
M.
Wulff
, and
H.
Ihee
,
Angew. Chem., Int. Ed.
47
,
1047
(
2008
).
21.
J. H.
Lee
,
T. K.
Kim
,
J.
Kim
,
Q.
Kong
,
M.
Cammarata
,
M.
Lorenc
,
M.
Wulff
, and
H.
Ihee
,
J. Am. Chem. Soc.
130
,
5834
(
2008
).
22.
A.
Plech
,
V.
Kotaidis
,
A.
Siems
, and
M.
Sztucki
,
Phys. Chem. Chem. Phys.
10
,
3888
(
2008
).
23.
S.
Ahn
,
K. H.
Kim
,
Y.
Kim
,
J.
Kim
, and
H.
Ihee
,
J. Phys. Chem. B
113
,
13131
(
2009
).
24.
M.
Andersson
,
E.
Malmerberg
,
S.
Westenhoff
,
G.
Katona
,
M.
Cammarata
,
A. B.
Wohri
,
L. C.
Johansson
,
F.
Ewald
,
M.
Eklund
,
M.
Wulff
,
J.
Davidsson
, and
R.
Neutze
,
Structure
17
,
1265
(
2009
).
25.
M.
Christensen
,
K.
Haldrup
,
K.
Bechgaard
,
R.
Feidenhans'l
,
Q. Y.
Kong
,
M.
Cammarata
,
M.
Lo Russo
,
M.
Wulff
,
N.
Harrit
, and
M. M.
Nielsen
,
J. Am. Chem. Soc.
131
,
502
(
2009
).
26.
K.
Haldrup
,
M.
Christensen
,
M.
Cammarata
,
Q. Y.
Kong
,
M.
Wulff
,
S. O.
Mariager
,
K.
Bechgaard
,
R.
Feidenhans'l
,
N.
Harrit
, and
M. M.
Nielsen
,
Angew. Chem., Int. Ed.
48
,
4180
(
2009
).
27.
28.
T. K.
Kim
,
J. H.
Lee
,
M.
Wulff
,
Q. Y.
Kong
, and
H.
Ihee
,
ChemPhysChem
10
,
1958
(
2009
).
29.
J.
Vincent
,
M.
Andersson
,
M.
Eklund
,
A. B.
Wohri
,
M.
Odelius
,
E.
Malmerberg
,
Q. Y.
Kong
,
M.
Wulff
,
R.
Neutze
, and
J.
Davidsson
,
J. Chem. Phys.
130
,
154502
(
2009
).
30.
H. S.
Cho
,
N.
Dashdorj
,
F.
Schotte
,
T.
Graber
,
R.
Henning
, and
P.
Anfinrud
,
Proc. Natl. Acad. Sci. U.S.A.
107
,
7281
(
2010
).
31.
M.
Christensen
,
K.
Haldrup
,
K. S.
Kjaer
,
M.
Cammarata
,
M.
Wulff
,
K.
Bechgaard
,
H.
Weihe
,
N. H.
Harrit
, and
M. M.
Nielsen
,
Phys. Chem. Chem. Phys.
12
,
6921
(
2010
).
32.
H.
Ihee
,
M.
Wulff
,
J.
Kim
, and
S.
Adachi
,
Int. Rev. Phys. Chem.
29
,
453
(
2010
).
33.
S.
Jun
,
J. H.
Lee
,
J.
Kim
,
J.
Kim
,
K. H.
Kim
,
Q. Y.
Kong
,
T. K.
Kim
,
M.
Lo Russo
,
M.
Wulff
, and
H.
Ihee
,
Phys. Chem. Chem. Phys.
12
,
11536
(
2010
).
34.
J.
Kim
,
K. H.
Kim
,
J. H.
Lee
, and
H.
Ihee
,
Acta Crystallogr., Sect. A
66
,
270
(
2010
).
35.
Q.
Kong
,
J. H.
Lee
,
K. H.
Kim
,
J.
Kim
,
M.
Wulff
,
H.
Ihee
, and
M. H. J.
Koch
,
J. Am. Chem. Soc.
132
,
2600
(
2010
).
36.
Q. Y.
Kong
,
J. H.
Lee
,
M.
Lo Russo
,
T. K.
Kim
,
M.
Lorenc
,
M.
Cammarata
,
S.
Bratos
,
T.
Buslaps
,
V.
Honkimaki
,
H.
Ihee
, and
M.
Wulff
,
Acta Crystallogr., Sect. A
66
,
252
(
2010
).
37.
J. H.
Lee
and
H.
Ihee
,
Struct. Chem.
21
,
37
(
2010
).
38.
L.
Salassa
,
E.
Borfecchia
,
T.
Ruiu
,
C.
Garino
,
D.
Gianolio
,
R.
Gobetto
,
P. J.
Sadler
,
M.
Cammarata
,
M.
Wulff
, and
C.
Lamberti
,
Inorg. Chem.
49
,
11240
(
2010
).
39.
S.
Westenhoff
,
E.
Malmerberg
,
D.
Arnlund
,
L.
Johansson
,
E.
Nazarenko
,
M.
Cammarata
,
J.
Davidsson
,
V.
Chaptal
,
J.
Abramson
,
G.
Katona
,
A.
Menzel
, and
R.
Neutze
,
Nat. Methods
7
,
775
(
2010
).
40.
S.
Westenhoff
,
E.
Nazarenko
,
E.
Malmerberg
,
J.
Davidsson
,
G.
Katona
, and
R.
Neutze
,
Acta Crystallogr., Sect. A
66
,
207
(
2010
).
41.
K.
Haldrup
,
T.
Harlang
,
M.
Christensen
,
A.
Dohn
,
T. B.
van Driel
,
K. S.
Kjaer
,
N.
Harrit
,
J.
Vibenholt
,
L.
Guerin
,
M.
Wulff
, and
M. M.
Nielsen
,
Inorg. Chem.
50
,
9329
(
2011
).
42.
S.
Ibrahimkutty
,
J.
Kim
,
M.
Cammarata
,
F.
Ewald
,
J.
Choi
,
H.
Ihee
, and
A.
Plech
,
ACS Nano
5
,
3788
(
2011
).
43.
J.
Kim
,
K. H.
Kim
,
J. G.
Kim
,
T. W.
Kim
,
Y.
Kim
, and
H.
Ihee
,
J. Phys. Chem. Lett.
2
,
350
(
2011
).
44.
E.
Malmerberg
,
Z.
Omran
,
J. S.
Hub
,
X. W.
Li
,
G.
Katona
,
S.
Westenhoff
,
L. C.
Johansson
,
M.
Andersson
,
M.
Cammarata
,
M.
Wulff
,
D.
van der Spoel
,
J.
Davidsson
,
A.
Specht
, and
R.
Neutze
,
Biophys. J.
101
,
1345
(
2011
).
45.
K.
Haldrup
,
G.
Vanko
,
W.
Gawelda
,
A.
Galler
,
G.
Doumy
,
A. M.
March
,
E. P.
Kanter
,
A.
Bordage
,
A.
Dohn
,
T. B.
van Driel
,
K. S.
Kjaer
,
H. T.
Lemke
,
S. E.
Canton
,
J.
Uhlig
,
V.
Sundstrom
,
L.
Young
,
S. H.
Southworth
,
M. M.
Nielsen
, and
C.
Bressler
,
J. Phys. Chem. A
116
,
9878
(
2012
).
46.
S.
Ibrahimkutty
,
P.
Wagener
,
A.
Menzel
,
A.
Plech
, and
S.
Barcikowski
,
Appl. Phys. Lett.
101
,
103104
(
2012
).
47.
J.
Kim
,
J. H.
Lee
,
J.
Kim
,
S.
Jun
,
K. H.
King
,
T. W.
Kim
,
M.
Wulff
, and
H.
Ihee
,
J. Phys. Chem. A
116
,
2713
(
2012
).
48.
A.
Spilotros
,
M.
Levantino
,
G.
Schiro
,
M.
Cammarata
,
M.
Wulff
, and
A.
Cupane
,
Soft Matter
8
,
6434
(
2012
).
49.
K. H.
Kim
,
J. H.
Lee
,
J.
Kim
,
S.
Nozawa
,
T.
Sato
,
A.
Tomita
,
K.
Ichiyanagi
,
H.
Ki
,
J.
Kim
,
S.
Adachi
, and
H.
Ihee
,
Phys. Rev. Lett.
110
,
165505
(
2013
).
50.
K. H.
Kim
,
H.
Ki
,
K. Y.
Oang
,
S.
Nozawa
,
T.
Sato
,
J.
Kim
,
T. K.
Kim
,
J.
Kim
,
S.
Adachi
, and
H.
Ihee
,
ChemPhysChem
14
,
3687
(
2013
).
51.
J. H.
Lee
,
M.
Wulff
,
S.
Bratos
,
J.
Petersen
,
L.
Guerin
,
J. C.
Leicknam
,
M.
Carnmarata
,
Q.
Kong
,
J.
Kim
,
K. B.
Moller
, and
H.
Ihee
,
J. Am. Chem. Soc.
135
,
3255
(
2013
).
52.
K. H.
Kim
,
J.
Kim
,
J. H.
Lee
, and
H.
Ihee
,
Struct. Dyn.
1
,
011301
(
2014
).
53.
K. Y.
Oang
,
K. H.
Kim
,
J.
Jo
,
Y. M.
Kim
,
J. G.
Kim
,
T. W.
Kim
,
S.
Jun
,
J.
Kim
, and
H.
Ihee
,
Chem. Phys.
442
,
137
(
2014
).
54.
K. H.
Kim
,
H.
Ki
,
J. H.
Lee
,
S.
Park
,
Q.
Kong
,
J.
Kim
,
J.
Kim
,
M.
Wulff
, and
H.
Ihee
,
Phys. Chem. Chem. Phys.
17
,
8633
(
2015
).
55.
E.
Malmerberg
,
P. H. M.
Bovee-Geurts
,
G.
Katona
,
X.
Deupi
,
D.
Arnlund
,
C.
Wickstrand
,
L. C.
Johansson
,
S.
Westenhoff
,
E.
Nazarenko
,
G. F. X.
Schertler
,
A.
Menzel
,
W. J.
de Grip
, and
R.
Neutze
,
Sci. Signaling
8
,
ra26
(
2015
).
56.
D.
Arnlund
,
L. C.
Johansson
,
C.
Wickstrand
,
A.
Barty
,
G. J.
Williams
,
E.
Malmerberg
,
J.
Davidsson
,
D.
Milathianaki
,
D. P.
DePonte
,
R. L.
Shoeman
,
D. J.
Wang
,
D.
James
,
G.
Katona
,
S.
Westenhoff
,
T. A.
White
,
A.
Aquila
,
S.
Bari
,
P.
Berntsen
,
M.
Bogan
,
T. B.
van Driel
,
R. B.
Doak
,
K. S.
Kjaer
,
M.
Frank
,
R.
Fromme
,
I.
Grotjohann
,
R.
Henning
,
M. S.
Hunter
,
R. A.
Kirian
,
I.
Kosheleva
,
C.
Kupitz
,
M. N.
Liang
,
A. V.
Martin
,
M. M.
Nielsen
,
M.
Messerschmidt
,
M. M.
Seibert
,
J.
Sjohamn
,
F.
Stellato
,
U.
Weierstall
,
N. A.
Zatsepin
,
J. C. H.
Spence
,
P.
Fromme
,
I.
Schlichting
,
S.
Boutet
,
G.
Groenhof
,
H. N.
Chapman
, and
R.
Neutze
,
Nat. Methods
11
,
923
(
2014
).
57.
S. E.
Canton
,
K. S.
Kjaer
,
G.
Vanko
,
T. B.
van Driel
,
S. I.
Adachi
,
A.
Bordage
,
C.
Bressler
,
P.
Chabera
,
M.
Christensen
,
A. O.
Dohn
,
A.
Galler
,
W.
Gawelda
,
D.
Gosztola
,
K.
Haldrup
,
T.
Harlang
,
Y. Z.
Liu
,
K. B.
Moller
,
Z.
Nemeth
,
S.
Nozawa
,
M.
Papai
,
T.
Sato
,
T.
Sato
,
K.
Suarez-Alcantara
,
T.
Togashi
,
K.
Tono
,
J.
Uhlig
,
D. A.
Vithanage
,
K.
Warnmark
,
M.
Yabashi
,
J. X.
Zhang
,
V.
Sundstrom
, and
M. M.
Nielsen
,
Nat. Commun.
6
,
6359
(
2015
).
58.
M.
Levantino
,
G.
Schiro
,
H. T.
Lemke
,
G.
Cottone
,
J. M.
Glownia
,
D. L.
Zhu
,
M.
Chollet
,
H.
Ihee
,
A.
Cupane
, and
M.
Cammarata
,
Nat. Commun.
6
,
6772
(
2015
).
59.
K. H.
Kim
,
J.
Kim
,
K. Y.
Oang
,
J. H.
Lee
,
D.
Grolimund
,
C. J.
Milne
,
T. J.
Penfold
,
S. L.
Johnson
,
A.
Galler
,
T. W.
Kim
,
J. G.
Kim
,
D.
Suh
,
J.
Moon
,
J.
Kim
,
K.
Hong
,
L.
Guerin
,
T. K.
Kim
,
M.
Wulff
,
C.
Bresslerg
, and
H.
Ihee
,
Phys. Chem. Chem. Phys.
17
,
23298
(
2015
).
60.
K. H.
Kim
,
K. Y.
Oang
,
J.
Kim
,
J. H.
Lee
,
Y.
Kim
, and
H.
Ihee
,
Chem. Commun.
47
,
289
(
2011
).
61.
T. W.
Kim
,
J. H.
Lee
,
J.
Choi
,
K. H.
Kim
,
L. J.
van Wilderen
,
L.
Guerin
,
Y.
Kim
,
Y. O.
Jung
,
C.
Yang
,
J.
Kim
,
M.
Wulff
,
J. J.
van Thor
, and
H.
Ihee
,
J. Am. Chem. Soc.
134
,
3145
(
2012
).
62.
K. H.
Kim
,
S.
Muniyappan
,
K. Y.
Oang
,
J. G.
Kim
,
S.
Nozawa
,
T.
Sato
,
S. Y.
Koshihara
,
R.
Henning
,
I.
Kosheleva
,
H.
Ki
,
Y.
Kim
,
T. W.
Kim
,
J.
Kim
,
S.
Adachi
, and
H.
Ihee
,
J. Am. Chem. Soc.
134
,
7001
(
2012
).
63.
K. Y.
Oang
,
J. G.
Kim
,
C.
Yang
,
T. W.
Kim
,
Y.
Kim
,
K. H.
Kim
,
J.
Kim
, and
H.
Ihee
,
J. Phys. Chem. Lett.
5
,
804
(
2014
).
64.
T. W.
Kim
,
C.
Yang
,
Y.
Kim
,
J. G.
Kim
,
J.
Kim
,
Y. O.
Jung
,
S.
Jun
,
S. J.
Lee
,
S.
Park
,
I.
Kosheleva
,
R.
Henning
,
J. J.
van Thore
, and
H.
Ihee
,
Phys. Chem. Chem. Phys.
18
,
8911
(
2016
).
65.
J. G.
Kim
,
S.
Muniyappan
,
K. Y.
Oang
,
T. W.
Kim
,
C.
Yang
,
K. H.
Kim
,
J.
Kim
, and
H.
Ihee
,
Struct. Dyn.
3
,
023610
(
2016
).
66.
K. H.
Kim
,
J. G.
Kim
,
S.
Nozawa
,
T.
Sato
,
K. Y.
Oang
,
T.
Kim
,
H.
Ki
,
J.
Jo
,
S.
Park
,
C.
Song
,
T.
Sato
,
K.
Ogawa
,
T.
Togashi
,
K.
Tono
,
M.
Yabashi
,
T.
Ishikawa
,
J.
Kim
,
R.
Ryoo
,
J.
Kim
,
H.
Ihee
, and
S.
Adachi
,
Nature
518
,
385
(
2015
).
67.
K. H.
Kim
,
J. G.
Kim
,
K. Y.
Oang
,
T. W.
Kim
,
H.
Ki
,
J.
Jo
,
J.
Kim
,
T.
Sato
,
S.
Nozawa
,
S.
Adachi
, and
H.
Ihee
,
Struct. Dyn.
3
,
043209
(
2016
).
68.
J. G.
Kim
,
K. H.
Kim
,
K. Y.
Oang
,
T. W.
Kim
,
H.
Ki
,
J.
Jo
,
J.
Kim
,
T.
Sato
,
S.
Nozawa
,
S.
Adachi
, and
H.
Ihee
,
J. Phys. B: At. Mol. Opt. Phys.
48
,
244005
(
2015
).
69.
A.
Ansari
,
C. M.
Jones
,
E. R.
Henry
,
J.
Hofrichter
, and
W. A.
Eaton
,
Biochemistry
33
,
5128
(
1994
).
70.
G.
Balakrishnan
,
M. A.
Case
,
A.
Pevsner
,
X. J.
Zhao
,
C.
Tengroth
,
G. L.
McLendon
, and
T. G.
Spiro
,
J. Mol. Biol.
340
,
843
(
2004
).
71.
E. R.
Henry
and
J.
Hofrichter
,
Methods Enzymol.
210
,
129
192
(
1992
).
72.
F.
James
and
M.
Roos
,
Comput. Phys. Commun.
10
,
343
(
1975
).
73.
J.
Choi
and
M.
Terazima
,
J. Phys. Chem. B
106
,
6587
(
2002
).

Supplementary Material